1. In the data analysis process, which of the following refers
to a phase of analysis? Select all that apply.
- Organize data
into understandable sections
- Visualize the data
- Format data using sorts and filters
- Get input from
others
2. During which of the four phases of analysis do you compare
your data to external sources?
- Transform data
- Format and
adjust data
- Get input from others
- Organize data
Explanation: Exploratory
data analysis (EDA) is the phase that often encompasses the process of
comparing your data to data obtained from other sources. During this phase, you
are not only investigating patterns inside your dataset, but you are also
examining how your data aligns or contrasts with external standards or
benchmarks. You might think of it as putting your results into a larger
framework in order to get a more profound comprehension.
3. You are performing a calculation during your analysis of a
dataset. Which phase of analysis are you in?
- Organize data
- Format and adjust data
- Transform data
- Get input from others
Explanation: During
the Exploratory Data Analysis (EDA) phase, calculations are often included as a
component. You are now running a variety of computations, obtaining statistics,
and crunching numbers in order to discover patterns, trends, or insights that
are included inside the dataset where you are currently working. It's the stage
of the analytical journey when you get your hands dirty and crunch some
numbers.
4. Fill in the blank: Filtering involves showing only the data
that meets a specific _____ while hiding the rest.
- model
- measure
- criteria
- observation
Explanation: Filtering is the process of
displaying just the data that satisfies a certain set of criteria while
concealing the other data. Imagine having a VIP access card for your data; only
those who have been selected will be allowed to pass through!
5. A data analyst is sorting spreadsheet data. They want to make
sure that, when they rearrange the data, data across rows is kept together.
What technique should they use to sort the data?
- Sort Together
- Sort Rows
- Sort Column
- Sort Sheet
Explanation: When
the data analyst is sorting the data, they should utilize a method that is
known as "Sort by Column." This will guarantee that the data from all
of the rows are kept together. Specifically, this entails picking the complete
dataset and sorting it according to a certain column, all the while preserving
the integrity of the row-wise connections.
6. A data analyst uses a function to sort a spreadsheet range
between cells H1 and K65. They sort in ascending order by the first column,
Column H. What is the syntax they are using?
- =SORT(H1:K65, 1, FALSE)
- =SORT(H1:K65, A, FALSE)
- =SORT(H1:K65, 1, TRUE)
- =SORT(H1:K65, A, TRUE)
Explanation: The syntax for sorting a range of cells between H1 and K65 in
ascending order by the first column (Column H) is dependent on the tool or
language that is being used. The syntax would normally require picking the
range and providing the sort criteria if we were talking about spreadsheet
software such as Microsoft Excel or Google Sheets. Such software is an example.
7. You are querying a database that contains data about music.
Each album is given an ID number. You are only interested in data related to
the album with ID number 6. The album IDs are listed in the album_id column.
You write the SQL query below. Add a WHERE
clause that will return only data about the album with ID number 6.
SELECT
*
FROM
track
How many tracks are on the album with ID
number 6?
- 20
- 13
- 5
- 8
8. You are working with a database that contains invoice data
about online music purchases. You are only interested in invoices sent to
customers located in the city of Chicago. You want to sort the invoices by
order total in ascending order. The order totals are listed in the total
column.
You write the SQL query below. Add an
ORDER BY clause that will sort the invoices by order total in ascending order.
What total appears in row 2 of your query
result?
- 1.98
- 7.96
- 15.86
- 5.94
9. During which of the four phases of analysis can you find a
correlation between two variables?
- Format and adjust data
- Organize data
- Transform data
- Get input from others
Explanation: It is
common practice to discover correlations between two variables during the
Exploratory Data Analysis (EDA) phase of the research process. During this
phase, you will be delving into your dataset, investigating the connections
between the elements, and discovering patterns. Analysis of correlation lets
you understand the degree to which two variables are connected to one another,
so offering insights into the effect that one variable has on the other. One of
the most important aspects of the EDA phase is having a solid grasp of the
structure that lies behind your data.
10. Typically, a data analyst uses filters when they want to
expand the amount of data they are working with.
- True
- False
Explanation: Certainly not! The quantity of data that you are dealing with
might be narrowed down or restricted with the use of filters. You are
effectively establishing criteria when you use filters, which allows you to
display just the data that satisfies specific circumstances while concealing
the remainder of the facts.
11. A data analyst sorts a spreadsheet range between cells F19
and G82. They sort in ascending order by the second column, Column G. What is
the syntax they are using?
- =SORT(F19:G82, 2, FALSE)
- =SORT(F19:G82, B, TRUE)
- =SORT(F19:G82, 2, TRUE)
- =SORT(F19:G82, B, FALSE)
Explanation: If we are discussing spreadsheet software such as Microsoft
Excel or Google Sheets, the syntax for sorting a range between cells F19 and
G82 in ascending order by the second column (Column G) would normally include
choosing the range and providing the sort criteria. This would be the case if
we were talking about the program.
12. Which phase of the data analysis process has the goal of
identifying trends and relationships?
- Analyze
- Process
- Act
- Prepare
Explanation: The
Exploratory Data Analysis (EDA) phase of the data analysis process has as its
major objective the identification of patterns and connections between the sets
of data. The objective of exploratory data analysis (EDA) is for analysts to
investigate the information in order to discover patterns, correlations, and
insights that may give a more in-depth knowledge of the data and help with
further analysis or decision-making.
13. Which of the following actions might occur when transforming
data? Select all that apply.
- Recognize relationships in your data
- Eliminate irrelevant info from your data
- Make calculations based on your data
- Identify a pattern in your data
14. Fill in the blank: Sorting ranks data based on a specific
_____ that you select.
- model
- calculation
- observation
- metric
15. A data analyst is sorting data in a spreadsheet. Which tool
are they using if all of the data is sorted by the ranking of a specific sorted
column and data across rows is kept together?
- Sort Sheet
- Sort Together
- Sort Document
- Sort Rank
Explanation: It is
quite probable that a data analyst is making use of the sorting tool that is
available inside the spreadsheet program when they are sorting data in a
spreadsheet. This is because the data is sorted according to the ranking of a
particular column, and the data that is included within rows is kept together.
Tools such as Microsoft Excel and Google Sheets often have this capability as a
standard feature. The tool gives users the ability to sort data based on the
values in a particular column while preserving the integrity of the connections
that exist between rows.
16. You are querying a database that contains data about music.
You are only interested in data related to the jazz musician Miles Davis. The
names of the musicians are listed in the composer column.
You write the SQL query below. Add a WHERE
clause that will return only data about music by Miles Davis.
What track by Miles Davis appears in row 1
of your query result?
- So What
- Summertime
- Compulsion
- Now’s The Time