ICDL Data Analytics - Foundation 1.0
ICDL Data Analytics - Foundation 1.0
DATA ANALYTICS -
FOUNDATION
Syllabus 1.0
Learning Material
(MS Excel, Power BI)
ICDL Foundation, ICDL Europe, ICDL, ECDL and related logos are registered business names and/or trademarks of ECDL Foundation.
This courseware may be used to assist candidates to prepare for the ECDL Foundation Certification Programme as titled on the
courseware. ICDL Foundation does not warrant that the use of this courseware publication will ensure passing of the tests for that ICDL
Foundation Certification Programme.
The material contained in this courseware does not guarantee that candidates will pass the test for the ICDL Foundation Certification
Programme. Any and all assessment items and / or performance-based exercises contained in this courseware relate solely to this
publication and do not constitute or imply certification by ICDL Foundation in respect of the ECDL Foundation Certification Programme or
any other ICDL Foundation test. This material does not constitute certification and does not lead to certification through any other process
than official ICDL Foundation certification testing.
Candidates using this courseware must be registered with the National Operator before undertaking a test for an ICDL Foundation
Certification Programme. Without a valid registration, the test(s) cannot be undertaken and no certificate, nor any other form of
recognition, can be given to a candidate. Registration should be undertaken at an Approved Test Centre.
Screen shots used with permission from Microsoft. Tool and application-specific details are correct as of October 2018. Online tools and
applications are subject to frequent update and change.
For details of the specific areas of the ICDL Data Analytics - Foundation syllabus covered in
each section of this book, refer to the ICDL Data Analytics - Foundation syllabus map at the
end of the book.
© 2019 Page v
6.1 Summarising Data Introduction ......................................................................................... 72
6.2 Changing Aggregation Methods ........................................................................................ 73
6.3 Displaying Multiple Aggregation Values ............................................................................ 76
6.4 Using Built-In Calculations ................................................................................................ 79
6.5 Review Exercise ............................................................................................................... 87
Page vi © 2019
12.3 Creating Matrix Visualizations ....................................................................................... 188
12.4 Adding Interactivity Using Slicers .................................................................................. 190
12.5 Review Exercise ........................................................................................................... 194
Organisations can collect and process vast amounts of data such as sales figures,
operating costs, logistics, customer satisfaction rates and online engagement
metrics.
This data is collected from an increasing range of sources - for example, mobile
devices, online platforms, payment systems, cameras, GPS systems, wireless
sensors, and legacy systems. And this business data is stored in a wide variety of
formats.
With vast amounts of data from different data streams and in different formats,
organisations are using data analytics to find meaningful and useful insights that
they can use to meet their organisational goals.
Data Analytics
Data analytics refers to the techniques and processes used to collect, organise
and examine data sets to create meaningful and useful information.
In recent times there has been a rapid rise in the use of data analytics across all
types of organisations and sectors due to, among other things, technological
advances in data analytics, increased computing power and better data storage
capabilities. The use of data analytics can provide many benefits – for example, in
business it can be used to gain competitive advantage, improve performance and
increase profits.
Page 2 © 2019
ICDL Data Analytics - Foundation Lesson 1 – Key Concepts
There are different categories and types of data analytics, which although
interrelated, have different purposes and provide different insights. In business
deciding which type of data analytics is appropriate depends on a variety of
factors, such as, the type of data being used, the stage of the workflow and your
business requirements and objectives.
Two broad categories of data analytics are quantitative analytics and qualitative
analytics.
In quantitative analytics the data set is assessed objectively, providing broad and
generalised insights. For example, you may calculate the percentage of
customers who returned a product, the percentage of products that were faulty, or
the number of sales in one store compared to another.
© 2019 Page 3
Lesson 1 – Key Concepts ICDL Data Analytics - Foundation
In qualitative analytics the data set is assessed subjectively, providing narrow and
specific insights. For example, you may need to analyse customer transcripts or
interview notes to determine whether a customer likes your product or is satisfied
with your service levels. You might have to pay close attention to the customer’s
language in order to draw conclusions, which will be subjective and depend on
your interpretation of the customer’s responses.
Qualitative analytics can also provide useful insights that you are unlikely to get
from quantitative analytics. For example, you can use qualitative analytics to delve
into the reasons for things, such as why some customers like your products and
others don’t.
Page 4 © 2019
ICDL Data Analytics - Foundation Lesson 1 – Key Concepts
Four types of data analytics, which can be broadly classified by their different
purposes, are descriptive analytics, which is the simplest type, followed by
diagnostic analytics, then predictive analytics and finally prescriptive analytics,
which is the most complex type.
01 02 03 04
Descriptive Diagnostic Predictive Prescriptive
Analytics: Analytics: Analytics: Analytics:
Descriptive Analytics
Diagnostic Analytics
© 2019 Page 5
Lesson 1 – Key Concepts ICDL Data Analytics - Foundation
Predictive Analytics
Prescriptive Analytics
The diagram below illustrates the value-added contribution and complexity of the
various types of data analytics in relation to each other.
Prescriptive
Analytics
Level of Value-added Contribution
Predictive
Analytics
Diagnostic
Analytics
Business Insights
Descriptive
Analytics
Page 6 © 2019
ICDL Data Analytics - Foundation Lesson 1 – Key Concepts
Case Example
The following is an online article that reports how a fashion retail company
leverages their business using data analytics.
The Wall Street Journal reports that “Zara has built its
business on RFID tags.” This gives Zara the ability to
manage inventory cheaply at the same time it allows the
company to observe the frequency of garments moving
in and out of dressing rooms.
The various data analytics activities described in the article can be identified as
belonging to one of the data analytics types - either descriptive, diagnostic,
predictive or prescriptive.
© 2019 Page 7
Lesson 1 – Key Concepts ICDL Data Analytics - Foundation
The table below outlines which data analytics activities belong to which type of
data analytics:
Page 8 © 2019
ICDL Data Analytics - Foundation Lesson 1 – Key Concepts
Knowing how to use and interpret data can help businesses reach their business
goals and stay ahead of their competition. The insights gained from data analytics
can provide businesses with the following benefits:
These benefits can generate positive outcomes for a business and its customers.
Identifying patterns/trends
Data analytics can be used to identify customer preferences and behaviour – for
example, data gathered through loyalty card systems and online shopping
platforms can provide information on purchasing patterns and trends in real time.
This information can be used to provide personalised recommendations and
targeted advertising to shoppers.
Data can also be analysed to identify market trends – for example, in an online
retail environment, popular products and those becoming popular can be
identified and this information used in supply chain management.
Organisations, such as social media platforms, can use data analytics as the
basis of new business models where access to services are free in exchange for
personal data. Data analytics are then used to generate revenue from the
personal data, for example, by categorising customers according to their product
preferences and using this information in targeted marketing campaigns.
© 2019 Page 9
Lesson 1 – Key Concepts ICDL Data Analytics - Foundation
Improving efficiency
For example, managers and decision-makers can make quicker and more
effective choices using regularly updated dashboards and reports that display
and/or summarise important performance metrics.
HR Service providers may use data analytics to help them decide whether to
introduce a new HR policy. They may forecast the impact of the proposed HR
policy on employee performance and only decide to introduce it if it has a positive
effect on employee performance and well-being.
Healthcare providers may use data analytics to predict patient’s medical needs.
This information will help them when deciding how to allocate resources most
effectively when planning for future demand.
Using visual data in reports and dashboards can make businesses more agile and
help them find revealing insights faster and make decisions more quickly.
Page 10 © 2019
ICDL Data Analytics - Foundation Lesson 1 – Key Concepts
The data analysis process and phases will vary depending on the type of data
analytics used and your business implementation. It can include tasks such as
defining the question you are trying to answer; deciding what data you need to
answer the question; determining if you have the data; collecting the data; getting
the data ready for analysis (cleaning and transforming the data); analysing the
data; interpreting the results; and using the results.
For example, a type of data analytics known as data mining, which explores large
data sets and generates predictions, typically follows a process with six phases.
6. 1. Business
Deployment understanding
5. 2. Data
Evaluation understanding
4. 3. Data
Modelling preparation
© 2019 Page 11
Lesson 1 – Key Concepts ICDL Data Analytics - Foundation
Phase Description
3. Data In the third phase the aim is to prepare the data for the
preparation modelling phase. You construct the final data set to use
in your analysis from the initial raw data. This involves
selecting, cleaning and transforming (constructing and
integrating) the data.
5. Evaluation In the fifth phase the aim is to ensure that the results are
useful for the business. To do this, you evaluate the
results of the model in relation to the business success
criteria.
6. Deployment In the sixth phase the aim is to use the results in the
business to meet the business goals. To deploy the
model, you integrate the results into the business. This
phase includes planning the deployment, reporting the
results and monitoring the results.
Note: In this Foundation module, the focus is on the data understanding and data
preparation phases and the associated tools in Microsoft Excel and Power BI.
Page 12 © 2019
ICDL Data Analytics - Foundation Lesson 1 – Key Concepts
Data protection is an area of law that is designed to protect the privacy rights of
individuals in relation to the processing of their personal data. Personal data is
typically defined as data that contains identifiers that can be used to identify an
individual. Data protection laws govern how organisations use personal data and
need to be complied with when analysing data that contains personal data.
© 2019 Page 13
Lesson 1 – Key Concepts ICDL Data Analytics - Foundation
a) Descriptive b) Diagnostic
c) Predictive d) Prescriptive
2. Which of the following data analytics types uses forecasting tools to predict
future trends?
a) Descriptive b) Diagnostic
c) Predictive d) Prescriptive
3. Which of the following data analytics types is the most complex and very
process-intensive?
a) Descriptive b) Diagnostic
c) Predictive d) Prescriptive
_____________________________________________________
_____________________________________________________
6. Identify the last 2 phases of the data analytics process in data mining:
__________________________________
__________________________________
Page 14 © 2019
LESSON 2 –
STATISTICAL ANALYSIS
To make sense of, and interpret large data sets of quantitative data, the data
needs to be summarised or described in some way. Statistical analysis is a way
of summarising or describing data to highlight typical values in the data, as well as
how these vary.
Each of these measures calculates the location of the central point within a data
set using a different method. Choosing the appropriate measure of central
tendency depends on the type of data set and what you want to find out.
Measures of variation
These are statistical analysis techniques for interpreting and summarising large
sets of quantitative data by examining the distribution of the data set. Rather than
showing how data are similar, as is the case with measures of central tendency,
measures of variation describe how the data varies or differs in a data set. The
three most common measures of variation are:
• Range
• Variance
• Quartiles
Using a measure of central tendency and the relevant measure of variation for a
data set data provides a better overview of the data than using one alone. In data
analysis, this type of statistical analysis can be a first step in describing the data,
before more complex analysis.
Page 16 © 2019
ICDL Data Analytics - Foundation Lesson 2 – Statistical Analysis
Case Example
Consider the following report to see if the measures of central tendency can be
applied to solve the question presented:
In response to the question in the report “Are our feet really getting bigger?” you
may want to check if feet have indeed grown bigger or is it that the sale of larger
sizes of shoes has increased?
You know that shoe sizes are based on some measurements. Some things you
may consider checking are:
• What is considered an average size of shoes?
• Is the average size a single value or a range of values?
• How far apart are the measurements of different sizes from what is
considered an average size?
© 2019 Page 17
Lesson 2 – Statistical Analysis ICDL Data Analytics - Foundation
Mean
The mean is the most popular measure of central tendency. This is commonly
referred to as the average of a data set. The mean is calculated by summing up
all the values in the data set and then dividing by the number of values in the data
set. Since it incorporates all the values in the data set, any changes to the values
will affect the mean.
Case Example
The following is the gross profit information for 10 fashion retail stores in a city:
As a potential investor in fashion retail, what representative profit value will you
use for these 10 retail stores?
The mean may not always reflect the central position, especially when the data
set has outliers. Outlier are values that are unusual compared to the rest of the
data set by being particularly small or large. The mean value will be skewed by
these values.
Case Example
Will a mean value of the service response time be representative of the service
quality of the restaurants in that city?
© 2019 Page 19
Lesson 2 – Statistical Analysis ICDL Data Analytics - Foundation
Chart Showing the Values in a Data set and the Skewed Mean
As in the example above, by inspecting the raw data, it suggests that the mean of
7.4 minutes, may not accurately reflect the typical response time, as most calls
are in the 2 to 8 minutes range. The outliers (19 and 13 minutes in this case) tend
to pull the mean away from the centre.
Page 20 © 2019
ICDL Data Analytics - Foundation Lesson 2 – Statistical Analysis
Concepts
Median
The median is the middle value in the data set distribution. It splits the data set in
half. In an ordered set of data, the median is the middle value such that 50% of all
data lies above or below it. Median is less affected by outliers and skewed data.
Case Example
The following sales information was collected for a small business over 10 days.
What is the median sales value?
Concepts
Mode
The mode is the value that occurs the most frequently in the data set. Mode is not
affected by extreme values. It can be used for either numerical or qualitative data.
If there are no repeating values in the data set, then the data set does not have a
mode. For some data sets, there can also be more than one mode. It is possible
to have two modes (bimodal), three modes (trimodal) or more modes within larger
data sets.
Case Example
Page 22 © 2019
ICDL Data Analytics - Foundation Lesson 2 – Statistical Analysis
• The MEDIAN function finds the median (middle number) of a set of values
in a data set. The MEDIAN function syntax is
• The MODE function finds the mode (the most frequently occurring number)
in a numeric data set. The MODE function syntax is
The arguments can be a number or cell reference that refers to numeric values.
Steps
© 2019 Page 23
Lesson 2 – Statistical Analysis ICDL Data Analytics - Foundation
The data set shows order details such as order date, amount and status.
2. Click in the cell G4 and type the formula =AVERAGE(C2:C16) to calculate the
mean amount value.
3. Click in the cell G6 and type the formula =MEDIAN(C2:C16) to calculate the
median amount value.
Note: In Excel, you do not have to sort the data range for calculating median.
4. Click in the cell G8 and type the formula =MODE(C2:C16) to calculate the
mode amount value.
5. The central tendency values of the data set are displayed in the respective
cells. Save the file as Orders1completed.xlsx workbook.
Page 24 © 2019
ICDL Data Analytics - Foundation Lesson 2 – Statistical Analysis
Case Example
In the example below there are sample data sets for two retail stores. The mean,
median, and mode of each store’s daily sales all equal $20 000. However, there is
significant difference between the two data sets. Store A’s daily sales are much
more consistent than those of Store B, which shows greater variation. This
illustrates the need for measures of variation.
Sample Data Set 1 – Store A Sales Sample Data Set 2 – Store B Sales
20 000 20 000
21 000 2 000
19 000 Mean = 160 000 / 8 40 000 Mean = 160 000 / 8
20 100 = 20 000 0 = 20 000
19 900 39 000
19 500 Median = 20 000 1 000 Median = 20 000
20 500 20 000
20 000 Mode = 20 000 38 000 Mode = 20 000
Total = Total =
160 000 160 000
21500 45000
21000 40000
35000
20500
30000
20000 25000
19500 20000
15000
19000
10000
18500
5000
18000 0
1 2 3 4 5 6 7 8 1 2 3 4 5 6 7 8
© 2019 Page 25
Lesson 2 – Statistical Analysis ICDL Data Analytics - Foundation
Concepts
Range
Range is one of the most basic measures of variation. It is the difference of the
maximum and minimum values where Maximum is the largest value in the data
set and Minimum is the smallest value in the data set. It is one way to describe
how the data is dispersed or spread out in a data set
Range = Maximum-Minimum
Case Example
The following is the sample data set for 10 retail stores. The calculation shows the
minimum, maximum, and range gross profit
Concepts
Variance
Variance measures how far a data set is spread out. Variance tells how
representative the mean is of each of the values in the data set.
• The closer each individual value in the data set is to the mean, the smaller
the variance will be.
• If each value is at the mean in a data set, the variance will equal zero,
which indicates that there is no variation from the mean across the entire
data set.
• The further each individual value in the data set is from the mean, the
greater the variance will be, which indicates that the mean is not as typical
of the individual values in the data set.
Page 26 © 2019
ICDL Data Analytics - Foundation Lesson 2 – Statistical Analysis
Technically variance is the average of the squared differences from the mean.
Case Example
The following is the new contracts volume handled by three sales representatives:
Sample Data
Sales Rep A Sales Rep B Sales Rep C
10 10 7
New
8 10 6
Contracts
6 6 6
Volume
4 1 6
2 3 5
Total 30 30 30
= 30 / 5 = 30 / 5 = 30 / 5
Mean
=6 =6 =6
In the data set above, each sales representative’s contracts volume has a mean
equal to 6. However, the contracts volume differs with respect to how much the
individual values vary from the mean.
Since the variance represents the average squared deviation from the mean, you
need to determine which sales representative’s contracts volume data set has the
greatest variance and which one has the smallest variance.
• The contracts volume of Sales Rep B seems to be further from the mean
value of 6, so it should have the largest variance.
• The mean value of 6 seems to be most typical of the contracts volume of
Sales Rep C, so it should have the smallest variance.
• The contracts volume for Sales Rep A appears to be in between, so it
should have a variance between Sales Rep B and Sales Rep C.
Sales Rep B
Sales Rep C
from Mean
from Mean
from Mean
from Mean
from Mean
from Mean
Difference
Difference
Difference
Difference
Difference
Difference
Contracts
Contracts
Contracts
Squared
Squared
Squared
Volume
Volume
Volume
10 4 16 10 4 16 7 1 1
8 2 4 10 4 16 6 0 0
6 0 0 6 0 0 6 0 0
4 -2 4 1 -5 25 6 0 0
2 -4 16 3 -3 9 5 -1 1
Total 40 Total 66 Total 2
Total of Difference from Total of Difference from Total of Difference from
Mean Squared = 40 Mean Squared = 66 Mean Squared = 2
© 2019 Page 27
Lesson 2 – Statistical Analysis ICDL Data Analytics - Foundation
The size of the variance does not have any special underlying standard
interpretation. Variance, along with the mean, allows you to judge how well the
mean serves as a measure of central tendency.
Using the summary, you can reach some conclusions about what each of these
data sets would look like. For example, you can see that all three data sets are of
the same size (5), and that all three have the same mean (6). The fact that the
variance for Sales Rep C is only 0.4 indicates that most of the individual values in
the data set should be very close to the value of 6. Similarly, you would assume
that the individual values for Sales Rep B must be more diverse or spread out
around the mean since the variance is much larger.
Concepts
Quartiles
Quartiles in statistics are values that divide the data set into quarters according to
where the values fall on the number line. They are often used in sales and
surveys to divide population into groups.
The three quartiles that divide a data set into quarters are:
• The median of the lower half of the data set (25th percentile)
• The median of the upper half of the data set (75th percentile)
The basic formula used to find a quartile by determining the value in the
appropriate position in the ranked discrete data set are outlined below, where n is
the number of values in the data set:
Page 28 © 2019
ICDL Data Analytics - Foundation Lesson 2 – Statistical Analysis
Quartile Positions
Case Example
The following calculates the quartiles for a new product’s online order quantities
for 12 days immediately after the first launch.
Online Order Quantities
Sample Data 16 14 18 4 17 10 11 12 4 3 4 18
The order quantities ranged from 3 to 18. How can we evaluate the variation
within this set of data?
3 4 4 4 7 10 11 12 14 16 17 18
© 2019 Page 29
Lesson 2 – Statistical Analysis ICDL Data Analytics - Foundation
• The MAX and MIN functions are used to calculate the range. The syntax is
MAX(number1, [number2],...)-MIN(number1,[number2],...)
• The VAR.P function finds the variance in an entire population. The syntax
is
• The QUARTILE function finds the quartile for a particular data set. The
syntax is
QUARTILE(array, quart)
The array argument refers to the numeric data you want to analyse and is the
range of cells containing the data set. The quart argument is the quartile value
you want to return. There are 5 values for the quart argument as follows:
Steps
2. The data set shows order details such as order date, amount and status. Click
in the cell G4 and type the formula =MAX(C2:C16)-MIN(C2:C16) to calculate
the range of the amount value.
3. Click in the cell G7 and type the formula =VAR.P(C2:C16) to calculate the
variance of the amount value.
Page 30 © 2019
ICDL Data Analytics - Foundation Lesson 2 – Statistical Analysis
Based on the variance value, complete the following statement to describe the
amount values compared to the mean:
Most of the amount values in the data set are
Answer: Most of amount values in the data set are more diverse or spread out
around the mean because the variance is much larger compared to the mean.
This indicates that in this example the mean is not the best measure to
determine the central tendency of the amount values.
5. Click in the cell G10 and type the formula =QUARTILE(C2:C16,2) to calculate
the second quartile value.
6. Click in the cell G11 and type the formula =QUARTILE(C2:C16,3) to calculate
the third quartile value.
Note: In Excel, you do not have to sort the data range for quartile calculations.
The range, variance and quartile values of the data set are displayed in the
respective cells.
© 2019 Page 31
Lesson 2 – Statistical Analysis ICDL Data Analytics - Foundation
2. The data set shows credit card transactions of a retail store for a period of 3
months. Enter a formula in I4 to calculate the mean amount of all the credit
card transactions.
3. Enter a formula in I6 to calculate the median amount of all the credit card
transactions.
4. Enter a formula in I8 to calculate the mode amount of all the credit card
transactions.
6. Enter a formula in I10 to calculate the range of all the credit card transactions.
Page 32 © 2019
LESSON 3 –
IMPORTING DATA SETS
The data sets that you want to analyse may be stored in various formats in
different programs and sources. They may contain a few rows of data or
thousands of rows of data. Common sources of data may include text files,
spreadsheets, the web and databases.
You can bring data stored in different formats in various programs into a
spreadsheet application such as Microsoft Excel, where you can perform further
analysis activities, generate reports and create dashboards. In some cases, you
will need to export data from the program or system where it resides before you
can import it into Excel. In other cases, you can import data from external sources
directly into Excel from within Excel.
When you are importing data into Excel you can make a permanent connection to
the external data source so that the imported data can be updated (refreshed).
Imported data, when refreshed, is updated with any changes made to the external
data source since the last refresh. You can also import data without making a
connection to the external data source, in which case the imported data remains
static regardless of any changes made to the external data source.
Connected Static
Page 34 © 2019
ICDL Data Analytics – Foundation Lesson 3 – Importing Data Sets
In many cases when data is exported from other programs or systems, it is saved
in a text file format. Two commonly used text file formats are:
You can import data from a text file into Microsoft Excel by importing the data as
an external data range. This makes a permanent connection to the source file that
allows the data to be refreshed. Or you can import data from a text file into
Microsoft Excel by opening the file in Excel. This doesn’t make a permanent
connection to the source file, which means the data is static and can’t be updated.
Steps
2. On the Data tab, in the Get External Data group, click From Text.
4. Click Import.
5. In the first dialog box of the Text Import Wizard, select Delimited under
Original data type and check the option My data has headers.
Note: If importing a .txt file, select the Fixed width radio button and follow the
steps in the Text Import Wizard.
© 2019 Page 35
Lesson 3 – Importing Data Sets ICDL Data Analytics - Foundation
6. Click Next.
7. In the second dialog box of the Text Import Wizard, check the option Comma
under Delimiters.
8. Click Next.
Page 36 © 2019
ICDL Data Analytics – Foundation Lesson 3 – Importing Data Sets
9. In the third dialog box of the Text Import Wizard, select the required data
format for the columns and specify the columns to skip.
a. Select the Contact column and select the option Do not import
column (skip).
b. Select the Phone Number column and select the option Do not import
column (skip).
c. Select the Contract Date column and select the option Date.
d. Ensure the date format is DMY.
11. In the Import Data dialog box, ensure the option Existing worksheet is
selected and click on cell A3.
© 2019 Page 37
Lesson 3 – Importing Data Sets ICDL Data Analytics - Foundation
Note: To refresh the data, right-click on the data set and choose Refresh.
If there is active content in the workbook, the Message Bar displays security
alerts. To proceed, click Enable Content.
Note: These steps apply to Microsoft Excel 2016 as part of Microsoft Office 2016
purchased as a stand-alone product. Other versions of Microsoft Excel 2016, such
as Microsoft Excel 2016 as part of an Office 365 subscription, may include
different functionality and steps. For example, the Get & Transform Data option
in Microsoft Excel 2016 in Microsoft Office 365 follows different steps for importing
data.
For versions of Excel 2016 that include the Get & Transform Data option, the
steps to import a text file are:
1. In the workbook, click From Text/CSV in the Get & Transform Data group on
the Data tab.
2. Browse to the location of the file and select the file Customer Data.csv.
3. Click Import.
4. Click Load.
The Get & Transform functionality automatically detects column delimiters
including identifying column names and data types.
Page 38 © 2019
ICDL Data Analytics – Foundation Lesson 3 – Importing Data Sets
When you are managing and analysing data you may want to import data from
one workbook into a different workbook in Microsoft Excel – for example, if you
have existing sales or finance reports. You can do this and create a dynamic
connection between the two workbooks. This allows you to refresh the connection
between the workbooks to ensure the imported data is up-to-date.
Steps
2. On the Data tab, in the Get External Data group, click Existing Connections.
5. Click Open.
8. In the Import Data dialog box, ensure the option Existing worksheet is
selected and click on cell A3.
© 2019 Page 39
Lesson 3 – Importing Data Sets ICDL Data Analytics - Foundation
9. Click OK.
Note: To refresh the data, right-click on the data set and choose Refresh.
Note: These steps apply to Microsoft Excel 2016 as part of Microsoft Office 2016
purchased as a stand-alone product. Other versions of Microsoft Excel 2016, such
as Microsoft Excel 2016 as part of an Office 365 subscription, may include
different functionality and steps. For example, the Get & Transform Data option
in Microsoft Excel 2016 in Microsoft Office 365 follows different steps for importing
data.
For versions of Excel 2016 that include the Get & Transform Data option, the
steps to import a spreadsheet are:
1. In the workbook, click Get Data in the Get & Transform Data group on the
Data tab.
2. Click Get File and then click Get Workbook.
3. Browse to the location of the file and select the file, Ticket Sales.xlsx.
4. Click Import.
5. Click Promotion Sales.
6. Click Load.
Page 40 © 2019
ICDL Data Analytics – Foundation Lesson 3 – Importing Data Sets
You may want to import information from web pages into Microsoft Excel for
analysis. For example, you may want to import stock information directly from a
web page or company sales figures from an internal company web page.
Depending on your needs, you can import data that you can refresh, or you can
import data and make it static.
To import a web table into Excel, you can use the Web Query feature in Excel.
This feature finds all the tables on a web page and allows you to select the tables
you want to import. The tables from the web page can be dynamically updated in
the spreadsheet.
Steps
2. On the Data tab, in the Get External Data group, click From Web.
3. In the New Web Query dialog box, click in the Address text box and type the
website address http://www.icdl.org/exchangerates and click Go.
© 2019 Page 41
Lesson 3 – Importing Data Sets ICDL Data Analytics - Foundation
6. In the Import Data dialog box, ensure the option Existing worksheet is
selected and click on cell B3.
7. Click OK.
Page 42 © 2019
ICDL Data Analytics – Foundation Lesson 3 – Importing Data Sets
Excel will attempt to import the web data as it appears in the web table. However,
the less structured the web data, the more formatting will be required in Excel to
prepare the data for analysis.
Note: To refresh the data, right-click on the data set and click Refresh.
Note: These steps apply to Microsoft Excel 2016 as part of Microsoft Office 2016
purchased as a stand-alone product. Other versions of Microsoft Excel 2016, such
as Microsoft Excel 2016 as part of an Office 365 subscription, may include
different functionality and steps. For example, the Get & Transform Data option
in Microsoft Excel 2016 in Microsoft Office 365 follows different steps for importing
data.
For versions of Excel 2016 that include the Get & Transform Data option, the
steps to import a web table are:
1. In the workbook, click From Web in the Get & Transform Data group on the
Data tab.
2. Enter the URL http://www.icdl.org/exchangerates and click OK.
3. Select the table to import.
4. Click Load.
© 2019 Page 43
Lesson 3 – Importing Data Sets ICDL Data Analytics - Foundation
You can import database tables from a DBMS such as Microsoft Access into
Microsoft Excel. In Excel you can more easily filter, chart, and analyse a data set
to produce meaningful reports. Many organisations use Microsoft Excel and
Access almost interchangeably, feeding data between the two applications.
Steps
2. On the Data tab, in the Get External Data group, click From Access.
4. Click Open.
Page 44 © 2019
ICDL Data Analytics – Foundation Lesson 3 – Importing Data Sets
6. In the Import Data dialog box, ensure the option Table is selected.
Note: To refresh the data, right-click on the data set and choose Refresh.
Note: These steps apply to Microsoft Excel 2016 as part of Microsoft Office 2016
purchased as a stand-alone product. Other versions of Microsoft Excel 2016, such
as Microsoft Excel 2016 as part of an Office 365 subscription, may include
different functionality and steps. For example, the Get & Transform Data option
in Microsoft Excel 2016 in Microsoft Office 365 follows different steps for importing
data.
For versions of Excel 2016 that include the Get & Transform Data option, the
steps to import a database table are:
1. Click Get Data in the Get & Transform Data group on the Data tab and select
From Database, From Microsoft Access Database.
2. Browse to the location of the file and select the file.
3. Click Import.
4. Select the table to import.
5. Click Load.
© 2019 Page 45
Lesson 3 – Importing Data Sets ICDL Data Analytics - Foundation
6. Accept all the other default settings and place the data from cell A1 onwards.
Page 46 © 2019
LESSON 4 –
SHAPING DATA SETS
To get meaningful information from data analysis you need to ensure that you are
working with a good quality data set. For example, data imported into Microsoft
Excel from external sources, such as text files, spreadsheets, web pages or
databases may contain errors or inconsistencies or may not be arranged as
needed.
You will often need to shape and prepare the data set before you can analyse it,
which happens in the data preparation phase of the data analytics process.
Before you begin shaping and preparing the data set it is good practice to save a
backup copy of the original data set. This includes cleaning the data to fix any
issues that may affect the quality of the analysis. The cleaning tasks will depend
on the type of data and its quality but may involve:
Spreadsheet applications contain built-in features for some of the cleaning tasks
and it is good practice to complete these types of tasks first. For example, in Excel
you can
• fix incorrect formatting or replace text using the Find and Replace tool.
For other tasks, such as extracting data, you may need to manipulate imported
data columns, which may involve changing imported values into new values using
formulas.
Page 48 © 2019
ICDL Data Analytics - Foundation Lesson 4 – Shaping Data Sets
Data duplication can occur in data sets when records are entered more than once
or consolidated from multiple sources. Duplicates may occur for one or more
fields. If you want to limit the records to unique data for some fields or unique
records for the entire data set, you can use Excel's built-in Remove Duplicates
tool. This tool finds and removes exact duplicates.
Steps
2. Copy the range A4:E11 and paste the copied range starting from cell G4
onwards.
3. Select any cell within the data set that starts from cell G4.
4. On the Data tab, in the Data Tools group, click Remove Duplicates.
© 2019 Page 49
Lesson 4 – Shaping Data Sets ICDL Data Analytics - Foundation
5. Click OK.
A message will appear to indicate how many duplicate values were removed,
or how many unique values remain.
6. Click OK.
Unique Records
When duplicates are removed, the first occurrence of the value in the data set is
kept, but other identical values are deleted.
Page 50 © 2019
ICDL Data Analytics - Foundation Lesson 4 – Shaping Data Sets
In Excel you can use the VLOOKUP function to validate data, which can save a
huge amount of time when reconciling data. The VLOOKUP function looks up a
value in the first column of a specified range of cells and returns a value from a
specified column in the same row.
Returned
Value
You can use the VLOOKUP function to look up given values to quickly find
specific values in a data set. You can use this to automatically verify that you are
using the correct data. For example, you can use the VLOOKUP function to verify
if a value, such as a country code matches a list of values in a reference data set,
such as a list of approved country codes.
And you can perform calculations or display results using the values returned. For
example, you can use the VLOOKUP function to find a commission rate in a table
based on a specified value in the data set. You can calculate the amount of
commission to pay based on the specified value multiplied by the rate returned by
the VLOOKUP function.
• lookup_value is the value you want to look for in a specified range of cells.
• table_array is the range of cells containing the value you want to return.
The VLOOKUP function searches vertically down the first column of the specified
table array to find the specified lookup value and then looks across the
corresponding row to find the specified column and returns the value from the cell
where the row and column meet.
© 2019 Page 51
Lesson 4 – Shaping Data Sets ICDL Data Analytics - Foundation
Steps
Page 52 © 2019
ICDL Data Analytics - Foundation Lesson 4 – Shaping Data Sets
The VLOOKUP function looks up the value specified in the cell B6 (IE) in the
first column of the specified range of cells (H6:I12). It finds IE in cell H10. It
returns the value from row 10 in column 1, specified by the column number
index. For the range H6:I12 this is column H.
Note: The $ sign before the row and column coordinates makes the cell
references absolute so they won't change.
If the imported country code exists in the first column of the reference data set,
the function will return the country code otherwise it will return #N/A to indicate
that the value is not found.
© 2019 Page 53
Lesson 4 – Shaping Data Sets ICDL Data Analytics - Foundation
To generate the correct country code for the countries listed in the range
A13:A19:
4. Check that the generated data is the same as that shown in the range A24
onwards.
Page 54 © 2019
ICDL Data Analytics - Foundation Lesson 4 – Shaping Data Sets
In Excel you can use the IF function to validate the data set – for example to
check whether values are in a particular range. The IF function is one of the most
popular and useful functions in Excel. You use an IF statement to evaluate a
condition and to return one value if the condition is met, and another value if the
condition is not met.
For example:
Check IF the value is
greater than 10,
IF it is greater than 10 11
return YES,
IF it is not greater than
10 then return NO
YES
When analysing data, you often need to evaluate several conditions at the same
time, which means you need to create more complex logical tests.
To create more complex logical tests, you can use AND or OR functions in the
logical test of an IF function. If the logical test contains the AND function, TRUE is
returned if all the conditions are met; otherwise FALSE is returned. If the logical
test contains the OR function, TRUE is returned if any of the conditions is met;
otherwise FALSE is returned. You can combine AND/OR functions and use
multiple logical functions to meet the requirements of your data analysis.
You can also use multiple IF functions together to create complex logical tests.
Multiple IF functions used together in one formula are known as nested IF
functions. They are useful for returning 3 or more different results. Additional IF
statements are included in the value_if_true and value_if_false arguments of the
IF function.
© 2019 Page 55
Lesson 4 – Shaping Data Sets ICDL Data Analytics - Foundation
Steps
The IF worksheet contains regional sales data in the range A5:B10. These
values need to be validated to check whether they are within the maximum
range specified in cell F6, which is 800,000.
If the sales amount exceeds the maximum range, the function will return the
message Error, otherwise it will return the sales amount itself.
In this example one location, Germany, has exceeded the maximum range.
Page 56 © 2019
ICDL Data Analytics - Foundation Lesson 4 – Shaping Data Sets
The IF worksheet also contains regional sales data in the range A15:B20.
These values need to be validated to check whether they are within the
minimum and maximum range specified in cells F16 and G16.
6. One way to check whether the values are in the minimum and maximum range
specified in cells F16 and G16 is to use the AND logical test.
If the sales amount is within the minimum and maximum range, the function
will return the sales amount, otherwise the message Error will be returned.
In this example the sales value for Germany is outside the minimum and
maximum range.
Note: Another way to check whether the values are in the minimum and
maximum range specified in cells F16 and G16 is to use a nested IF function.
For example, in cell D16, enter either of the following:
• =IF(B16>$G$16,"Error",IF(B16<$F$16,"Error", IF(B16<$G$16,B16)))
• =IF(B16>$G$16,"Error",IF(B16<$F$16,"Error", IF(B16>$F$16,B16)))
© 2019 Page 57
Lesson 4 – Shaping Data Sets ICDL Data Analytics - Foundation
One of the main tasks in cleaning data is to extract parts of data and parse it to
separate columns. A common task is to divide the text in a single cell into multiple
columns – for example, you may want to split the first name and last name
contained in one cell into two separate columns or split an address contained in
one cell into several columns.
The table below outlines the common functions in Excel used for extracting text.
For example:=LEFT("Ireland",3)
returns "Ire".
RIGHT RIGHT(text, [num_chars])
Page 58 © 2019
ICDL Data Analytics - Foundation Lesson 4 – Shaping Data Sets
The functions can be combined (nested) in Excel to perform particular tasks. The
table below outlines some common combinations of these functions for extracting
text.
Using these functions to clean data involves using the formula to extract the data
into a new column, filling down the new column, and converting the formulas in
the new column to values.
Steps
The cell range A4:A10 contains the imported country and its relevant country
codes. The data is however combined. You need to split the country and the
code into separate columns.
© 2019 Page 59
Lesson 4 – Shaping Data Sets ICDL Data Analytics - Foundation
Note: To extract the country into the cell C4, you need to extract all the entries
in cell A4 except for the last 2 characters. You can do this using the LEFT and
LEN functions. Use the LEN function to find the length of the entry in A4 and
then subtract 2.
5. To extract the country code into the cell D4, you need to extract the last 2
characters in cell A4 using the RIGHT function. Select the cell D4.
8. Copy the range C4:D10 and paste as values starting from cell F4 onwards.
Cleansed Data
Page 60 © 2019
ICDL Data Analytics - Foundation Lesson 4 – Shaping Data Sets
The data set shows order and shipping details for a period of seven days.
3. Enter a formula in H4 to extract the first number from Shipping Details in B4.
Hint: Use the LEN function to find the length of the entry in B4 and then
subtract 4.
7. Enter a formula in L4 to extract the 2 characters after the hyphen from Order
ID in D4.
8. Using the table range P6:Q10, calculate the commission amount payable
based on the Order Value (Order Value * Rate).
Hint: Use the VLOOKUP function in approximate match mode to find the rate
based on the Order Value.
9. Copy the formulas in row 4 (in columns H to M) to the rest of the rows in the
data set.
© 2019 Page 61
Lesson 4 – Shaping Data Sets ICDL Data Analytics - Foundation
Page 62 © 2019
LESSON 5 –
FILTERING DATA SETS
The visual design and layout of a table facilitates the summary and presentation
of key numerical information. You can use predefined or custom table styles to
quickly format table data in Excel and there are options to modify the table
display.
Before formatting a data set as table, you need to organise the data set. Some
guidelines include:
• Organise the data in rows and columns, with each row containing
information about one record, such as a sales entry, or inventory
transaction.
• Ensure each column contains a descriptive and unique heading in the first
row of the data set.
• Ensure each column contains one type of data, such as dates, currency, or
text.
Steps
Page 64 © 2019
ICDL Data Analytics - Foundation Lesson 5 – Filtering Data Sets
Note: The data range for the table is automatically highlighted, and the range
address is listed.
© 2019 Page 65
Lesson 5 – Filtering Data Sets ICDL Data Analytics - Foundation
6. On the Design tab, check or uncheck the desired options in the Table Style
Options group.
These options can affect the table style in various ways, depending on the
type of content in the table.
• Header Row - Apply or remove formatting from the first row in the table.
• Banded Rows - Display odd and even rows with alternating shading for
ease of reading.
• First Column - Apply or remove formatting from the first column in the
table.
• Last Column - Apply or remove formatting from the last column in the
table.
• Filter Button - Toggle AutoFilter on and off. With filtering enabled in the
header row, table data can quickly be sorted or filtered.
Page 66 © 2019
ICDL Data Analytics - Foundation Lesson 5 – Filtering Data Sets
Slicers are visual filters that provide buttons that you can quickly click to filter the
data in a table. Slicers show at a glance the current filter that is applied. This
makes it easy to understand the information that is shown in a filtered table.
Steps
3. On the Table Tools, Design tab, in the Tools group, click Insert Slicer.
4. In the Insert Slicers dialog box, check the fields (columns) for which you want
to create a slicer. In this case select the Region and Year fields (columns).
5. Click OK.
© 2019 Page 67
Lesson 5 – Filtering Data Sets ICDL Data Analytics - Foundation
Inserted Slicers
The selected buttons are shown in the slicers and the table columns are
filtered according to the selections.
Page 68 © 2019
ICDL Data Analytics - Foundation Lesson 5 – Filtering Data Sets
9. To select more than one item, press CTRL, and then click the items on which
you want to filter.
© 2019 Page 69
Lesson 5 – Filtering Data Sets ICDL Data Analytics - Foundation
2. Format the data set range A1:F58 as table using one of the medium styles.
6. Move and resize the slicers so that they appear at the right of the table.
7. Use the slicers to filter for credit transactions for the regions Europe and
United States.
Page 70 © 2019
LESSON 6 –
PIVOT TABLE DATA AGGREGATION
When you have prepared your data, you can start analysing it. In Excel, you can
perform simple analysis using statistical methods like sum, count, average,
maximum and minimum to summarise the data in a selected range. You can also
perform simple analysis using options like sorting, filtering and table slicers.
Sometimes you may need more powerful analysis to answer specific questions
and show comparisons, patterns, and trends. For example, you may need to find
out
• What percentage of the yearly sales took place in the first quarter?
• How did this year’s sales compare with last year’s sales?
In Excel, you can use a pivot table to calculate, summarise, and analyse large
data sets with thousands of rows quickly and easily. A pivot table is useful tool for
summarising data in two or more ways - for example, to find the number of units
sold both by store and by month. There are many ways you can work with pivot
tables to analyse data to find the information you are interested in.
Page 72 © 2019
ICDL Data Analytics – Foundation Lesson 6 – Pivot Table Data Aggregation
For example, if you are analysing the sales data across multiple stores and the
total sales amount by store is shown, you can change this to show the average
sales amount by store instead.
Steps
2. The PivotTable shows the total credit card transaction value for each
department. Click on any value in the Sum of Amount item in the PivotTable.
3. On the Analyze tab, in the Active Field group, click Field Settings.
4. Select the function Average in the Summarize value field by list box.
5. Click in the Custom Name text box and type Average Amount to change the
name.
Note: Every field in a PivotTable has a name. Fields in the row, column, and
filter areas inherit their names from the heading in the source data. Fields in
the value section are given default names such as Sum of Amount. Select
the function before renaming the field because the name will change to a
default name when a function is selected, e.g. when selecting Average, the
name will reflect Average of Amount.
© 2019 Page 73
Lesson 6 – Pivot Table Data Aggregation ICDL Data Analytics - Foundation
6. You can also change the format of the value. Click Number Format to set the
format of the value.
Page 74 © 2019
ICDL Data Analytics – Foundation Lesson 6 – Pivot Table Data Aggregation
© 2019 Page 75
Lesson 6 – Pivot Table Data Aggregation ICDL Data Analytics - Foundation
For an overview of different summary values on a data set, you can display
multiple summary calculations (aggregation values) in a pivot table, such as Sum,
Count, Average, Max, and Min.
For example, you may want to analyse the sales performance of multiple stores
and compare several aspects of their performance at one time. You may want to
show the total amount of sales by store, the total number of sales by store, the
average sale amount by store, the highest sale amount by store, and the lowest
sale amount by store. You can display all these aggregation values using the
Sum, Count, Average, Max, and Min calculations at the same time in a pivot
table.
Steps
3. To display Sum, Count, Max and Min aggregation values, drag and drop the
following fields into the Σ Values area in the PivotTable Fields panel:
Page 76 © 2019
ICDL Data Analytics – Foundation Lesson 6 – Pivot Table Data Aggregation
5. Select Summarize Values By and select Max. This changes the aggregate
values for the field Sum of Amount2 from total sum to maximum.
7. Select Summarize Values By and select Min. This changes the aggregate
values for the field Sum of Amount3 from sum to minimum.
8. To change the field names, right click each of the four fields. (Sum of
Amount, Count of Trans#, Max of Amount2, Min of Amount3).
10. Change the names in the Custom Name text box as follows:
© 2019 Page 77
Lesson 6 – Pivot Table Data Aggregation ICDL Data Analytics - Foundation
The four aggregation values are displayed in the pivot table with customised field
names.
The key data summary of the transactions for the different departments are
displayed in the pivot table to facilitate analysis.
Page 78 © 2019
ICDL Data Analytics – Foundation Lesson 6 – Pivot Table Data Aggregation
For example, when analysing the sales performance of multiple stores, you may
see the sales of each store by year, but you want to show the difference between
two years’ sales. Maybe you want to compare yearly sales totals to see if there
are any trends, maybe indicating problems that need to be addressed or
successes that need to be capitalised. You can do this using a built-in (custom)
calculation – in this example Difference From.
The table below outlines the commonly used built-in (custom) calculations in the
Show Value As option.
© 2019 Page 79
Lesson 6 – Pivot Table Data Aggregation ICDL Data Analytics - Foundation
Steps
3. On the Analyze tab, in the Active Field group, click Field Settings.
4. Select the Show Values As tab in the Value Field Settings dialog box.
5. Click the Show values as drop-down list and select % of Grand Total.
Page 80 © 2019
ICDL Data Analytics – Foundation Lesson 6 – Pivot Table Data Aggregation
The Pivot table shows the amount for each department as a percentage of the
Grand Total.
© 2019 Page 81
Lesson 6 – Pivot Table Data Aggregation ICDL Data Analytics - Foundation
To rank the number of transactions for each department from highest to lowest
using the Rank Largest to Smallest custom calculation (with the highest value
being ranked as 1):
1. Click on any value in the Count of Trans # item in the PivotTable in the
Pivot3 worksheet in the Cards Pivot workbook.
2. On the Analyze tab, in the Active Field group, click Field Settings.
3. Select the Show Values As tab in the Value Field Settings dialog box.
4. Click the Show values as drop-down list and select Rank Largest to
Smallest.
Ranking Values
Note: The default base field is Department as the ranking is done by comparing
the number of transactions across the departments.
6. Click OK.
Page 82 © 2019
ICDL Data Analytics – Foundation Lesson 6 – Pivot Table Data Aggregation
The number of transactions are ranked from highest to lowest (with the highest
value being ranked as 1).
To add another value item to show the cumulative total over a period of time –
in this example a cumulative daily total:
3. In the PivotTable Fields panel, drag and drop the Amount field into the Σ
Values area.
Note: If necessary, set the aggregation method for the field to Sum.
5. On the Analyze tab, in the Active Field group, click Field Settings.
6. Select the Show Values As tab in the Value Field Settings dialog box.
7. Click the Show values as drop-down list and select Running Total In.
9. Type Cumulative Total in the Custom Name box to modify the name.
© 2019 Page 83
Lesson 6 – Pivot Table Data Aggregation ICDL Data Analytics - Foundation
Running Total
The cumulative total up to each date is shown in the same row as the date.
Page 84 © 2019
ICDL Data Analytics – Foundation Lesson 6 – Pivot Table Data Aggregation
To add another value item to show the difference between one value and
another value in a pivot table - in this example, to show the difference between
the total amount of Visa card transactions and the other card types (Amex,
Debit and MasterCard):
3. In the PivotTable Fields panel, drag and drop the Amount field into the Σ
Values area.
Note: If necessary, set the aggregation method for the field to Sum.
5. On the Analyze tab, in the Active Field group, click Field Settings.
6. Select the Show Values As tab in the Value Field Settings dialog box.
7. Click the Show values as drop-down list and select Difference From.
9. Select Visa in the Base item list box to define Visa as the value to be
subtracted from the other values.
Difference From
© 2019 Page 85
Lesson 6 – Pivot Table Data Aggregation ICDL Data Analytics - Foundation
The difference between the total amount for each card and the Visa card is
shown.
The resulting pivot table shows Visa as the preferred type of payment in this
context and the gap in transactional values by other payment methods
compared to it.
Note: The Difference From calculation subtracts one value in the pivot table
from another and shows the result (that is the difference from the value of the
Base item in the Base field). To see the percentage difference, use %
Difference From. The percentage change figure is useful for analysing trend
data.
Page 86 © 2019
ICDL Data Analytics – Foundation Lesson 6 – Pivot Table Data Aggregation
2. Create a PivotTable in the build area provided from cell K4 onwards using the
appropriate fields and aggregation methods to summarise the average
amount of tickets sold for each office and the number of tickets sold for
each office.
3. Label the average amount as Average Cost and format the values as
currency without decimal.
6. Create a PivotTable in the build area provided from cell G4 onwards using the
appropriate fields and built-in calculations to show the daily total of the Order
Value and the running total of the Order Value.
© 2019 Page 87
Lesson 6 – Pivot Table Data Aggregation ICDL Data Analytics - Foundation
9. Create a PivotTable in cell G15 onwards using the data set in the range
A3:D20 to show the percentage of the order value for each shipping detail.
12. Create a pivot table in the build area provided from cell H4 onwards using the
appropriate fields and built-in calculations to rank 2018 Sales for each Market
from largest to smallest.
Page 88 © 2019
LESSON 7 –
PIVOT TABLE FREQUENCY ANALYSIS
When analysing large sets of data in pivot tables, you may want to look at subsets
of the data to find patterns and trends. You can do this by organizing the data into
groups.
If your pivot table shows how often something occurs for a particular field in a data
set, this is a frequency distribution table. You can put data into groups to create a
grouped frequency distribution table. This provides useful information on how data
in the data set is distributed across groups. The group with the highest frequency
for the data set is known as the modal group.
If the data in your pivot table has date or time fields, you can group the data by
specified time periods. The groups available are seconds, minutes, hours, day,
month, quarter, or year. In the example below, the first pivot table lists daily
transaction totals and numbers. The transaction totals and numbers in the second
pivot table have been grouped by week. In the second pivot table you can see at
a glance that the highest total occurred in week 30/4/2018-6/5/2018 and the
highest number (or frequency) of transactions occurred 11/6/2018-17/6/2018.
Note: You can also group date and time data into hierarchical groups such as by
week and day or by year and quarter.
If the data in your pivot table contains numeric values, you can group the data by
a specified interval. In the example below, the first pivot table shows individual
transaction amounts and the number of transactions, or frequency for each
amount. In the second pivot table the transaction amounts have been grouped
into intervals of 20. In the second pivot table you can see at a glance that the
most frequently occurring transaction amounts occurred in the 21-40 and the 101-
120 ranges.
Page 90 © 2019
ICDL Data Analytics - Foundation Lesson 7 – Pivot Table Frequency Analysis
Steps
Grouping Dates
4. In the Grouping dialog box, in the By list box, select the period of time to
group by – in this case, to view a weekly report deselect the Months option,
select Days and type 7 in the Number of Days box.
Note: There is no weekly option, so to group the data by week you need to
select Days and set the number of days as 7. The week range is determined
by the date in the Starting at box, so adjust this if necessary. By default, the
Starting at and Ending at dates are the first and last dates in the pivot table.
© 2019 Page 91
Lesson 7 – Pivot Table Frequency Analysis ICDL Data Analytics - Foundation
Grouping by a Week
5. Click OK.
Page 92 © 2019
ICDL Data Analytics - Foundation Lesson 7 – Pivot Table Frequency Analysis
2. Click on any numerical value in the Rows area in the pivot table.
4. In the Grouping dialog box, in the By text box, type the range or interval to
group by – in this case, 20 - and click OK.
Grouping Numbers
Note: By default, the starting and ending values are the lowest and highest
values in the pivot table, so adjust this if necessary.
© 2019 Page 93
Lesson 7 – Pivot Table Frequency Analysis ICDL Data Analytics - Foundation
There are many ways that pivot tables automatically group data, but you may also
want to manually create custom groups.
You will need to manually create custom groups when the data you want to group
are not date, time or numeric items, or the group you want to create does not
appear in the underlying data. For example, you may have a pivot table that
shows a breakdown of credit card transactions by various banks. You want to
further group the transactions into groups of banks, but those groups don't appear
in the underlying data. In this case. you will need to manually create your own
custom groups of banks.
When you group data items, the group is given a name by default. You can
rename groups to define names that are more relevant for your purposes.
Steps
2. Select the items to add to the first group – in this case, the issuers listed are to
be grouped by country, so select the following issuers to be part of a USA
group:
• American Express
• Bank of America
• CapitalOne
• Chase Manhattan
• CitiBank
• Seattle FirstBank
• Wells Fargo
Note: To select multiple non-contiguous items, press CTRL and click on the
items.
Page 94 © 2019
ICDL Data Analytics - Foundation Lesson 7 – Pivot Table Frequency Analysis
© 2019 Page 95
Lesson 7 – Pivot Table Frequency Analysis ICDL Data Analytics - Foundation
To rename groups:
1. For each manual grouping a new name is automatically assigned (for example
Group1, Group2 and Group3). To change the group name to a meaningful
label, click on the group name and type a new name (for example, USA).
2. To hide the details in each group, click the Collapse button next to the group
name.
Page 96 © 2019
ICDL Data Analytics - Foundation Lesson 7 – Pivot Table Frequency Analysis
Data that are grouped by applying selected criteria or manually can be easily
ungrouped.
Steps
3. On the Analyze tab, in the Group group, click Ungroup to show the original
data.
Ungrouping data
Note: Alternatively, you can right-click a group label and select Ungroup.
© 2019 Page 97
Lesson 7 – Pivot Table Frequency Analysis ICDL Data Analytics - Foundation
Note: If the Total Sales is not visible for the Year, set the Subtotals to
Automatic in the Years field settings.
6. Group the sales amount in the Row area by every thousand, starting from 0 to
8000.
7. Select the following labels in the Column area, group the selection and rename
the group accordingly.
8. Remove the original Region field from the Columns area in the Field List
pane.
Page 98 © 2019
LESSON 8 –
FILTERING PIVOT TABLES
When you have summarized data in a pivot table in Excel, you may want to create
reports that analyse certain parts of the data. You can create these reports using
one or more report filters. A report filter can be used to display selected items in
a pivot table. Items selected in the filter are displayed in the pivot table and items
that are not selected are hidden. Report filters are displayed above the pivot table
for easy access.
For example, you might want to show the sales amounts for a specific year
instead of all years or you might want to show the sales amounts for one or two
regions instead of all regions. The pivot table below has report filters for
Account Date and Region which are showing information for the year 2017 and
the region United States.
You may need to create a pivot table report for each unique item in a field. For
example, you may need to create a report for each region or each year. You can
do this by creating a report filter, filtering by each item in the report filter list, and
copying the output for each item into a new worksheet. A quicker way to do this is
to create a report filter and use the Show Report Filter Pages feature. This
feature automates the process of applying a filter for each item and copying the
results into separate worksheets. It also names the new worksheet after the item.
This saves lots of time, especially if you are creating a large number of reports.
Steps
2. Click on any cell in the pivot table to open the PivotTable Fields panel.
3. To add the account date as a filter, move the Account Date field from the
Columns area to the Filters area in the PivotTable Fields panel.
4. To add the Region as a filter, drag and drop the Region field to the Filters
area.
The applied report filters (Account Date and Region) appear above the pivot
table.
5. To use the report filters to view the summary for the United States in the year
2017, select the year 2017 from the Account Date drop-down list and click
OK.
6. And select the region United States from the Region drop-down list and click
OK.
7. To clear the filters from the pivot table, select All from the Account Date drop-
down list and click OK.
8. And select All from the Region drop-down list and click OK.
Note: To remove the report filters from the pivot table, remove the fields from
the Filters area.
1. Click on any cell in the pivot table (with report filters applied).
2. On the Analyze tab, in the PivotTable group, click the Options down-pointing
arrow, and click Show Report Filter Pages.
3. Select the required filter field, in this case Region, and click OK.
A worksheet is created for each item in the Region report filter list with a pivot
table filtered for that item. The worksheet is also renamed to match the item
name.
As well as using report filters, you can also use slicers to filter data visually in a
pivot table. The slicer feature provides buttons that you can click to quickly and
easily filter the pivot table data, making reports interactive.
For example, you may have a pivot table that shows the total amount spent by
different types of customers on different product categories and you create a slicer
that allows you to filter this information by different areas. The slicer below has
buttons that allow you to filter by the areas Central, East, North, South and West.
Slicers also indicate the current filtering state, which makes it easy to understand
what is shown in a filtered pivot table. When you select an item in a slicer, that
item is included in the filter and the data for that item is displayed in the report.
You can also customise slicers by changing slicer options. For example, you can
change the name of the slicer for formulas, the header name displayed in the
slicer, the order in which the items are displayed in the slicer, the colour of the
slicer, the number of columns, the button size and the slicer size.
You can also connect a slicer to multiple pivot tables to create useful cross-filtered
reports. A slicer can only be connected to multiple pivot tables when the pivot
tables have the same source data range.
Steps
2. The pivot table shows total amount spent by different types of customers on
different product categories. Click on any value in the pivot table.
4. Select the relevant field, in this case, select Area and click OK.
The slicer appears on the worksheet alongside the PivotTable (in a layered
display if you have more than one slicer).
5. Click and drag the slicer to position it, for example, to the right of the pivot
table. Resize the slicer as necessary using the sizing handles.
6. Use the slicer to filter the pivot table by doing one of the following:
7. Click the Clear Filter icon at the top right of the slicer to clear the filter.
2. Change the Slicer’s Name and display options for Header and Item Sorting
and Filtering if required.
3. On the Options tab, in the Slicer Styles group, select a style for the slicer – in
this case Light Orange, Slicer Style Light 2.
4. On the Options tab, in the Buttons group, select the number of columns – in
this case, type 5 in the Columns box, and change the button height and width
as necessary.
5. The slicer dimension may need to be adjusted to ensure all the buttons are
visible. On the Options tab, in the Size group, change the overall size of the
slicer height and width as necessary.
The slicer style is changed to light orange and the number of columns is
changed from 1 to 5.
Formatted Slicer
Note: To delete a slicer, click on the slicer to select it, and press Delete. Or
right-click the slicer and select Remove <Name of Slicer> from the context
menu.
4. In the Report Connections dialog box, check the option for PivotTable2 to
link the Profession slicer to the bottom pivot table
5. Click OK.
6. Repeat steps 3 and 4 to connect the Category slicer to the top pivot table
(PivotTable1).
The slicers (Profession and Category) are connected to both pivot tables.
Making a selection in either slicer filters the data in both pivot tables
accordingly.
To easily display data in a pivot table for different time periods, you can use the
Timeline option in Excel. Timelines are based on date fields, so to create a
timeline the pivot table needs to include at least one date-formatted field.
A Timeline has a scroll bar that allows you to focus on a time period before
selecting a specific time period in the date field to display. You can choose what
level the time period is shown as - years, quarters, months, or days.
Similar to slicers, you can customise and format Timelines. You can also use a
single Timeline to filter numerous pivot tables, if the pivot tables use the same
data source.
Steps
4. In the Insert Timelines dialog box select the date-formatted field on which to
base the timeline – in this case, click Date and click OK.
Timeline Filter
6. To change the time level, click the arrow next to the time level shown and
select the new time level – in this case, DAYS.
7. Drag the Timeline scroll bar to focus in on a particular time period - in this
case, starting from the 15th March.
8. To select the time period to display, click a period tile in the timespan control
and drag at either end of the tile to select the relevant tiles - in this case, from
16th to 20th Mar.
9. On the Options tab, in the Timeline Styles group, select a style for the
timeline.
10. On the Options tab, in the Size group, change the overall size of the timeline
height and width as required.
11. On the Options tab, in the Show group, set the options you want to display in
the timeline.
Timeline Elements
Note: To clear the Timeline, click the Clear Filter icon at the top right of
the timeline.
To delete a Timeline, select the Timeline and press Delete or right-click the
Timeline and select Remove Timeline.
To use one Timeline to filter multiple pivot tables, select the Timeline and in
the Options tab select Report Connections and select the pivot tables to
include. The pivot tables need to use the same data source.
3. Add a slicer using the field Category for the pivot table in cell A11:B14.
4. Move and resize the slicer so that it fits the range A1:B9.
5. Link the slicer with the rest of the PivotTables in the worksheet.
6. Add a timeline using the field OrderDate for the PivotTable in cell A11:B14.
8. Remove the display of the Time Level, Selection Label and Scrollbar in the
timeline.
9. Move and resize the slicer so that it fits the range D1:G5.
10. Link the timeline with the rest of the PivotTables in the worksheet.
11. Select all the categories except Camping in the Category slicer.
12. Set the time period from January to June in the OrderDate timeline.
You may want to add a visualization to an existing pivot table. Pivot charts are an
effective way to create data visualizations that also allow you to display different
views of your data.
You can create a pivot chart from an existing pivot table report. When you create
a pivot chart from an existing pivot table report, the pivot chart reflects the view of
the existing pivot table report. For example, the pivot table report below shows the
totals of the cheque and credit card transactions for different regions. The
corresponding pivot chart shows the same view.
The pivot chart uses the pivot table layout to determine the placement of fields.
• Row fields in the pivot table become category fields in the pivot chart.
• Column fields in the pivot table become series fields in the pivot chart.
• Pivot table fields are displayed as field buttons in the pivot chart. These
can be used for filtering.
• The row fields (Region) in the pivot table are shown on the category
axis in the pivot chart.
• The column fields (Cheque and Credit) in the pivot table are shown on
the series axis.
• The pivot table fields (Region and Category) are displayed as field
buttons in the pivot chart.
When you have created your pivot chart you can use it to further analyse your
data. You can manipulate pivot chart data in the same way you can manipulate
pivot table data. Changing the information displayed in a pivot chart automatically
changes the corresponding pivot table, and vice versa. You can also add slicers
to the pivot chart.
You can customize the visual display and layout of your pivot chart to suit your
needs – for example, you can change the chart type, adjust the chart layout and
formatting.
If you delete the corresponding pivot table, the pivot chart changes to an ordinary
chart.
Steps
4. In the Insert Chart dialog box, select the chart type – Column is selected be
default.
5. Select the chart variant – in this case, Stacked Column and click OK.
6. The pivot chart is placed on the same sheet as the pivot table it is based on.
Move and resize the chart as necessary.
Note: To move the pivot chart to a different worksheet (or to a Chart sheet), on
the Analyze tab in the Actions group, click on Move Chart.
When you have created the pivot chart you can customise the appearance and
layout to suit your needs.
7. To add a chart title above the chart, click on the PivotChart area and select
the Chart Elements button.
8. Select Chart Title, select Above Chart and enter the title – in this case, enter
Credit and Cheque Transactions
9. To hide field buttons, deselect the relevant option in the Field Buttons drop-
down menu, in the Show/Hide group, on the Analyze tab – in this case,
deselect Show Axis Field Buttons, Show Legend Field Buttons and Show
Value Field Buttons.
Note: Alternatively, to hide field buttons, right-click the field button and select
the relevant option.
10. To remove gridlines, click on the PivotChart area and select the Chart
Elements button
12. To move the legend below the chart title, click on the PivotChart area and
select the Chart Elements button.
As well as creating a pivot chart from an existing pivot table report, you can create
a pivot chart directly from data stored in a table.
When you create a pivot chart directly from fields in a table in Microsoft Excel, a
corresponding pivot table is automatically created. A pivot chart must have a
corresponding pivot table because the data used to plot the pivot chart comes
from a pivot table.
In the example below, the Region, Sales, and Category fields in the table are
used to create the corresponding pivot chart, and the matching pivot table is
automatically created in Excel.
When you create a pivot chart directly from fields in a table, a PivotChart build
area is displayed with the PivotChart Fields panel.
You build the pivot chart in the same way you would a pivot table.
The table below describes how the elements in a PivotChart correspond to the
elements in a PivotTable.
Steps
4. In the Create PivotChart dialog box, select the option Existing Worksheet.
Note: Clicking a cell in the table previously means the Select a table or range
option is selected by default.
6. Click OK.
7. In the PivotChart Fields pane, drag and drop the fields to the respective
areas as follows:
The pivot chart is created using a default chart type (a clustered column chart).
8. To change the chart type, on the Design tab, in the Type group, click Change
Chart Type.
9. In the Change Chart Type dialog box, select the chart type, for example,
select Bar in the chart type and 100% Stacked Bar in the chart variant
11. The chart type is changed to a 100% stacked bar chart. Move and resize the
chart as necessary.
12. You can change the information that is displayed in the pivot chart. For
example, the Legend field button (Category in this example) allows you to
choose the data series information to display and the Axis field button
(Region in this example) allows you to choose the axis information to display.
In this case, to exclude the Refund data series, select the Category field
button, deselect Refund and click OK.
13. You can customize the layout of the chart. To add a chart title, click on the
PivotChart area and select the Chart Elements button.
14. Select Chart Title and enter the chart title Popular Payment Methods above
the chart.
15. To hide field buttons, deselect the relevant option in the Field Buttons drop-
down menu, in the Show/Hide group, on the Analyze tab – in this case,
deselect Show Axis Field Buttons to remove the Region field button.
Note: Alternatively, to hide field buttons, right-click the field button and select
the relevant option.
16. To insert a slicer for the pivot chart, click Insert Slicer in the Filter group in the
Analyze tab.
19. To format the slicer to display the buttons in 2 columns, set the Columns field
to 2 in the Buttons group in the Options tab.
20. Move the slicer to the right of the pivot chart and resize it to show all the button
labels.
21. You can customize the appearance of the pivot chart. To change the style of
the chart, click on the PivotChart area and select the Chart Styles button.
23. To change the colour of the chart, click on the PivotChart area and select the
Chart Styles button.
The information displayed in the pivot chart, as well as the layout and
appearance have been updated and a slicer has been added.
2. Insert a 3D Pie pivot chart for the existing pivot table in the workbook.
3. Move the pivot chart and position it below the pivot table.
6. Add the data label to the outside end of each pie chart slice.
8. Add a slicer using the field Location for the pivot chart.
10. Move and resize the slicer so that it fits the range L1:P3.
11. Create a Clustered Column pivot chart directly from the data set fields in the
table and present it in a new worksheet. The chart should count the number of
personnel for each 10000 range of salary. To do this:
a. Add the fields Salary to the Axis area and Staff ID to the Values area.
b. In the corresponding pivot table, group the Salary from 10000 to 50000
by every 10000.
13. Modify the chart title to reflect Salary Range Frequency Distribution.
15. Add data labels to the outside end of the data series.
Data visualization provides a visual way to access data by displaying trends and
patterns within data in a visual form. Data visualization makes it simpler and more
efficient to analyse and understand textual and numerical data; it saves time; it
provides insights that contribute to business intelligence; and it helps with decision
making. It can be used in the earlier stages of data analytics to explore and make
sense of the data, as well as for reporting purposes.
There are lots of specialized data visualization tools designed to help businesses
easily perform sophisticated data analytics.
Key Features
These powerful tools allow you to explore data sets and find and present
information and insights from the data in a way that suits your business needs
through the creation of data visualizations. There are a wide range of data
visualizations provided that can go beyond traditional forms. Part of your data
analysis will include determining which types of visualizations provide the best
insights and business intelligence for your specific needs. Common examples of
data visualizations include:
• Charts (column, bar, line, pie) - Useful for displaying relationships among
numeric data by category or over time periods.
• Tables - Useful for presenting and comparing detailed data and exact
values.
Data visualization tools go beyond presenting information in visual forms and also
provide interactive capabilities that allow you to filter and drill into the data in
visualizations for querying and analysis.
Data visualization tools also enable production of insightful visual reports. A basic
report includes visualizations and results from the data analytic process. A report
can consist of a single page with one or more visualizations or it can consist of
multiple pages of visualizations. The aim of reports is to provide key insights from
your data that can be used to achieve business goals, such as summary
information, trends, and patterns in a visual format.
Data visualization tools also provide the ability to easily share the business
intelligence information produced in dashboards and reports. And they have the
ability to support collaboration.
Page 134 © 2019
ICDL Data Analytics - Foundation Lesson 10 – Data Visualization Tools
Microsoft Power BI
Microsoft Power BI provides powerful data analytics with user-friendly interfaces.
It consists of a desktop application called Power BI Desktop, an online service
called Power BI service, and mobile Power BI apps.
Power BI Desktop is free to download and install. You can use it to connect to
many types of data sources, explore your data using visualizations, and design
and create reports using visualizations.
Note: In order to share content created in Power BI Desktop, you will need to
publish it to Power BI service.
URL: https://powerbi.microsoft.com/
Tableau Public
Tableau Public is a free web-based platform used to create data visualizations,
which are uploaded to the Tableau Public website. With a large collection of data
connectors and visualizations with intuitive design, it is one of the leading tools in
data visualization. Each visual comes with shareable code that allows you to
tweet it, share it on Facebook, or embed it in your organization’s website. There is
also a paid version.
URL: https://public.tableau.com/
Steps
To download Microsoft Power BI Desktop to your device:
When you have installed the Microsoft Power BI Desktop application, you can
launch the application by double-clicking the Power BI Desktop app icon on your
desktop. This opens the Microsoft Power BI Desktop environment where you
can get data, create visualizations, build reports and publish reports to Power BI
service.
The main parts of the Microsoft Power BI Desktop environment are illustrated
below:
Power BI Desktop Screen Interface
Ribbon
Similar to Microsoft Excel and other Microsoft Office applications, the Ribbon
displays the toolbars associated with reports and visualizations.
Views
There are three main views as outlined below. You can change the view by
selecting the relevant icon.
The Report View, shown above, is where you build reports. The Report
View brings the Power BI screen to the report canvas page.
The Data View is where you inspect, explore, and understand the
underlying data in your Power BI Desktop model.
Pages tab
Each tab at the bottom of the canvas represents a page in a report. The tab lets
you select or add a report page.
Report canvas
The report canvas is where visualizations are displayed. You use it to display and
arrange visualizations and contents of reports.
In the Visualizations and Filters panes you select the visualizations and
determine the display of visualizations – for example you can set the colour and
format for the visualizations. And you can select the filters for the visualizations.
You can quickly create visualizations by selecting the icons under the
Visualizations pane.
You can import a data set from a spreadsheet in Microsoft Excel into Power BI
and save it in a Power BI file.
You can then use the data set in the Power BI file to create visualizations and
build reports.
Steps
To import a data set from a spreadsheet into Power BI Desktop:
2. On the Home tab, in the External Data group, click Get Data.
4. Locate and select the Sales Format.xlsx file and click Open.
5. The Navigator window is displayed. Check the Table2 option under Display
Options.
6. Click Load.
In the report View, the Fields pane displays all the column headings in the
data set.
9. The Save As dialog box is displayed. Type Report in the File name box and
click Save.
c) Map d) Report
2. Which of the following consists of a single screen that provides key metrics at
a glance?
a) Dashboard b) Report
c) KPI d) Summary
3. Which of the following is the main purpose for creating an analytical report?
a) To create precision graphics for mechanical design.
b) To solely represent data in rows and columns.
c) To create, publish and share graphical insights.
d) To store, organise and display data in a database format.
4. Which one of the following is a common type of data visualization used for
measuring progress against predefined conditions?
a) Table b) Map
7. Open Power BI Desktop, import the Sales data set in the Sales worksheet
from the file Sales overview.xlsx and save it as Sales overview
imported.pbxi.
Page 142 © 2019
LESSON 11 –
CREATING BASIC DATA VISUALIZATIONS
Creating data visualizations based on a data set is one of the key functions of
data visualization tools.
You can use visualizations to explore your data. And you can combine
visualizations to create meaningful and insightful reports.
There are many types of visualizations to choose from. As part of your data
analysis, you will determine which visualizations are appropriate for your needs.
A table is useful for comparing detailed quantitative data. In a table, related data
is laid out in a logical grid of columns and rows.
Steps
2. Select the relevant Power BI file – in this case, select TableChart.pbxi and
click Open.
All the fields are automatically selected as Values under the Visualizations
pane.
Note: Alternatively, drag and drop the fields to the Values area under
Visualizations pane.
A table comparing the sales performance between 2017 and 2018 is created.
Note: You may need to size the table to view all the information.
5. Select the table and click the Format icon below the visualization icons in
the Visualizations pane to show the formatting options for the table.
6. Click Field formatting to expand the options and set the font color of the
values in 2017 Sales to green and its background color to white.
7. Click the down arrow next to 2017 Sales to change the field to 2018 Sales.
Set the background color of 2018 Sales to yellow.
8. Save the file as TableChart Added Power BI file and close the file.
Charts are one of the most commonly used visualizations in data analytics. They
reveal relationships such as similarities, differences, and trends among multiple
values. Microsoft Power BI has a range of visualization options from basic charts
to more advanced visuals. Some popular types of charts include:
• Line chart: This is the most popular chart type and is used to present a
continuous data set. It is suited for trend-based visualizations of data over
time, with the emphasis on the continuation or the flow of the values (a
trend).
• Bar/column chart:
o A bar chart is a horizontal column chart. It is best for multiple
categories, especially those with lengthy labels or for displaying a
data set with negative numbers.
o A column chart, on the other hand, displays data in vertical
columns.
To facilitate comparison of multiple variables across categories of data,
Power BI includes Stacked bar and Stacked column charts, as well as
Clustered bar and Clustered column charts.
• The data points to be presented: How many are there for each category?
When you create your chart, you can then modify how it looks using the formatting
options in Power BI.
© 2019 Page 147
Lesson 11 – Basic Data Visualizations ICDL Data Analytics - Foundation
Steps
To create a Line chart visualization - in this case, to plot the sales and profit
values by month to visualise a trend:
1. Open the relevant Power BI file in Power BI Desktop – in this case, select
Sales Chart.pbxi.
2. In Report view, select the Line chart icon in the Visualizations pane.
3. Select the fields to add to the chart – in this case, select the Order Date,
Sales, and Profit fields under the Orders table in the Fields pane.
Note: Alternatively, drag and drop the fields to the respective options under
Visualizations pane.
The Order Date field is automatically selected as Axis, with Sales and Profit
automatically selected as Values under the Visualizations pane.
4. Select the time scale to display – in this case, to display the Order Date by
month, deselect Year, Quarter, Day listed under Order Date by clicking the x
symbols.
6. Click the Format icon below the visualization icons in the Visualizations pane
to show the formatting options for the line chart.
7. Click Legend to expand the options and set Position to Top Center.
Legend Options
8. Expand the Data colors section and set the colours for the data series – in
this case, set Sales: Purple and Profit: Green.
9. Expand the Shapes section and set the following options: Stroke width: 3,
Show marker: On, and Marker shape: Diamond.
10. Expand the Title section and set the following options: Font colour: Black
and Text size: 10.
11. The formatted line chart will be updated on the report canvas.
Hover over a data point on the line chart to highlight the details and drop line.
To create a Donut chart visualization - in this case, to show the sales distribution
percentage for the various regions:
1. In Report view in Power BI Desktop, with Sales Chart.pbxi still open, click
on an area outside the Line Chart in the Report Canvas area and select the
Donut chart icon in the Visualizations pane.
2. Select the fields to add to the chart – in this case, select the Region and Sales
fields under the Orders table in the Fields pane.
4. Expand the Detail labels section and set Label style to Category, percent of
total and Text size to 10.
5. Expand the Title section and set the Title Text to Sales Distribution by
Region, Font colour to Black and Text size to 10.
6. Position the formatted donut chart to the right of the line chart and resize it as
necessary.
Note: To create a pie chart, select the Pie chart icon in the Visualizations pane.
You can also change the donut chart to a pie chart by selecting the donut chart
and then clicking the Pie chart icon in the Visualizations pane.
1. In Report view in Power BI Desktop, with Sales Chart.pbxi still open, click
on an area outside the charts in the Report Canvas area and select the
Stacked column chart icon in the Visualizations pane.
2. Select the fields to add to the chart – in this case, select the Region, Sales
and Profit fields under the Orders table in the Fields pane.
The Region field is automatically selected to appear under Axis, with Sales
and Profit automatically selected to appear under Value under the
Visualizations pane.
3. To format the chart, click the Format icon below the visualization icons in the
Visualizations pane.
4. Expand the Legend section and set the Position to Top Center and Text size
to 10.
5. Slide the Y-Axis option to Off to turn off the y-axis display.
6. Expand the Data colours section and modify the series colour Sales to Purple
and Profit to Green.
8. Expand the Title section, set the Font colour to Black and Text size to 10.
9. Position the stacked column chart below the line chart and resize as
necessary.
To create a clustered bar chart – in this case, to compare sales figures across
four customer types:
1. In Report view in Power BI Desktop, with Sales Chart.pbxi still open click on
an area outside the stacked column chart in the Report Canvas area and
select the Clustered bar chart icon in the Visualizations pane.
2. Add fields to the chart – in this case, select the Customer Type and Sales
fields under the Orders table in the Fields pane.
3. Click the Format icon below the visualization icons in the Visualizations
pane.
5. Expand the Data colours section and set the data colour for Corporate to
Purple.
7. Expand the Data labels section and set Position to Inside Center.
8. Expand the Title section and set the following options Font colour to Black
and Text size to 10.
9. Position the clustered bar chart to the right of the stacked column chart and
resize as necessary.
The data series are by default sorted by the values in descending order.
10. To sort the data series by customer type, select the clustered bar chart
visualization, click More options (the ellipses) in the top right of the
visualization and select Sort by Customer Type.
11. Create a copy of the clustered bar chart visualization and position it to the right
of the copied clustered bar chart.
© 2019 Page 157
Lesson 11 – Basic Data Visualizations ICDL Data Analytics - Foundation
12. Uncheck Sales and check Profit to change the field under Value to Profit.
A Clustered bar chart displaying Profit by Customer Type is created.
13. Align the three chart visualizations in a row with equal spacing in between.
14. Save the file as SalesChart Added Power BI file and close the file.
After the creation of visualizations, you may want to enhance the visuals to note
key summary features and highlight important statistical characteristics. For
example, you may want to show maximum and minimum values or above
average and below average values.
Some conditional formatting options, for example, for a table visualization, include
setting the background color and font color for cells based on cell values or
rules and showing cell values with data bars.
Steps
To set conditional formatting by Background colour on values in a table
visualization:
3. Go to 2017 Sales within the Values area in the Visualizations pane and click
the field’s down arrow.
5. Specify colours for the lowest value and the highest value. In this case, set the
colour yellow in the Minimum area and the colour purple in the Maximum
area
Note: If you select the Diverging box, you can configure an optional Center
value as well.
Depending on the Power BI Desktop version, you can also specify colours for
specific value ranges by choosing the Rules Format by option. In the Rules
area, set one or more value ranges and related colours by entering the relevant
If value condition, the relevant and value condition and a related colour.
Table cells with values in the specified ranges will be filled with the specified
colour.
6. Click OK.
8. To edit the applied conditional formatting, select the field’s down arrow, select
Conditional Formatting and click Background color from the contextual
menu and edit as required.
9. To remove the conditional formatting, select the field’s down arrow, select
Remove Conditional Formatting. Select All or select the specific formatting
type to remove.
1. Go to 2018 Sales within the Values area in the Visualizations pane, click the
field’s down arrow and select Conditional Formatting.
3. Click the down arrow next to Color scale and set the Format by to Rules
option.
Note: You can use the Color scale option to specify colours for the minimum
value and the maximum value.
4. When you select the Format by Rules option, you can specify one or more
value ranges, each with a set colour. In each value range, set an If value
condition, an and value condition, and a colour.
In this case, set the If value to is greater than or equal to 0 and is less than
50000 in the Rules area:
Specifying Rules
Note: You can add more rules by clicking the Add button.
Values are highlighted in the table to bring attention to the range and
variation within the data set.
6. To edit the applied conditional formatting, select the field’s down arrow, select
Conditional Formatting and click Font color from the contextual menu and
edit as required.
7. To remove the conditional formatting, select the field’s down arrow, select
Remove Conditional Formatting. Select All or select Font color.
2. Select the fields to add to the table – in this case, select the Market and 2018
Sales fields under the Table2 table in the Fields pane.
3. Go to 2018 Sales within the Values area in the Visualizations pane, click the
field’s down arrow and select Conditional Formatting.
4. Click Data bars from the contextual menu.
6. Click the Positive bar down arrow and select the required colour (for example,
green) and click OK.
7. Place the table visualization to the right of the first table and resize if
necessary.
The data bars enhanced the table visualization to highlight the market share of
2018 sales.
Note: To edit data bars, select the down arrow next to the relevant field, select
Conditional Formatting and click Data bars and edit as required.
To remove data bars, select the down arrow next to the relevant field, select
Remove Conditional Formatting and select Data bars.
9. Save the file as Sales Chart Formatted Power BI file and close the file.
To visually inspect specific sections of a data set, you can apply filters to
visualizations so that the visualizations only display the required information.
In Power BI, you can specify the items to display within a given visualization using
Visual Level Filters.
Steps
To apply visual level filters on a table visualization:
2. In Report view, click on the table visualization to view the 2018 Sales listed by
Country.
3. Under Visual level filters in the Filters pane, the value fields 2018 Sales and
Country are automatically included. Click on the down arrow next to 2018
Sales under Visual level filters to show the visual level filter options for the
Sales 2018 field.
For numerical values, the visual level filter options allow for display of values
by setting a rule (for example, is less than or equal to) and specifying a value.
4. To show only sales volume of less than 30000 in the table visualisation, enter
the value under Show items when the value and the option is less than and
press Return.
5. Click on the down arrow next to Country under Visual level filters to show
the visual level filter options for the Country field.
For qualitative values, the items to display can be selected in a list displayed
under Basic filtering.
6. To display only 2018 sales for one country at each application of the visual
level filter, select Require single selection. This allows the sales value only
for a selected country to be shown.
8. Scroll down the list and select the countries – in this case, select Singapore
and Malaysia.
9. Save the file as Visual Levels Added Power BI file and close it.
The setting for the Visual level filters is saved with the file.
Steps
To create a visualization using a Map – in this case a map chart of sales volume
by state:
Map Template
3. Select the fields to add to the map – in this case, select the State and Sales
fields under the Orders table in the Fields pane.
4. Click the Format icon below the visualization icons in the Visualizations
pane.
5. Expand the Map styles section and change the Theme to Grayscale.
7. Expand the Title section and set the Font colour to Black and Text size to
10.
8. Expand the Data colours section and change the Default colour to Yellow.
9. To highlight the sales for a state (for example California) on the map, slide the
Show all option to On and change the colour for California to Green.
10. Expand the Bubbles section and set the Size to 5%.
The map now shows the sales volume by state in proportion to the size of its
bubble with a special highlight for California to facilitate quick comparison of
sales volumes across the states.
2. Select the fields to add to the map – in this case, select State and Customer
ID fields under Orders table in the Fields pane.
4. Click the Count of Customer ID field’s drop down list in the Color saturation
area and select the function Count (Distinct).
5. Click the Format icon below the visualization icons in the Visualizations
pane.
6. Expand the Map styles section and change the Theme to Grayscale.
7. Expand the Title section and set the Title Text to Customer Distribution by
State, Font colour to Black, and Text size to 10.
9. Move the filled map to beneath the bubble map and resize the filled map as
necessary.
The Filled map facilitates an overview on the count of customers by state using
concentration of colours.
11. Save the file as MapChart Added Power BI file and close it.
c) Table d) Map
2. Open the Chartstory Power BI file in Power BI Desktop and add a border to
the stacked column chart for sales and profit by month.
5. Place the clustered column chart below the stacked column chart.
6. Save the file as Chartstory Added Power BI file and close it.
8. Use the Background colour conditional formatting for the Amount column
and format all values according to the following range:
• Red fill for values between 0 and below 1500.
• Yellow fill for values between 1500 and 2000.
• Green fill for values above 2000.
9. Apply the Data bars conditional formatting for the Transactions column:
• Set the bar colour to purple.
• Change the bar direction to Right to left.
10. Save the file as PivotReport Added Power BI file and close it.
This displays the Base measure (current status), the Goal to be achieved and the
distance from the goal as a percentage.
For example, the Base measure may be the current number of customers and the
Goal may be the number of customers you’d like to achieve this year. The Base is
evaluated against the Goal.
• If the Base measure exceeds the Goal, this is a positive KPI shown in
green font with a tick.
• If the Base measure isn’t meeting the Goal, this is a negative KPI shown in
red font with an exclamation mark.
Positive KPI – Base Measure Exceeds Goal Negative KPI – Goal Not Yet Met
This is also known as a gauge chart. This has a circular arc measuring progress
toward a goal/target. All possible values are spread evenly along the arc, from the
minimum (left-most value) to the maximum (right-most value).
Progress (current status) toward the goal is represented by the shading. And a
single value shown in bold inside the arc represents the progress.
Steps
To create a KPI visualization – in this case, to create a Customer’s KPI (to show
the number of existing customers metric against the targeted number of
customers for the year) and an Orders KPI (to show the number of orders metric
against the targeted number of orders for the year):
2. To set the target or goal against which to measure progress, click New
Measure in the Calculations group on the Home tab.
3. In the Formula bar that appears at the top of the canvas, type Customers
Target = 1800. Press Enter to add the measure to the fields list in the Fields
pane.
4. In the Report view, click on a blank area in the Report Canvas area and
select KPI icon in the Visualizations pane.
KPI Template
5. Drag and drop the field Customer ID from the Fields pane into the Indicator
area (base) of the Visualizations pane.
The indicator controls the base value’s display units and decimal places.
6. Drag and drop the field Order Date from the Fields pane into the Trend axis
area of the Visualizations pane.
7. Click the Order Date drop down list in the Trend axis area and select Date
Hierarchy.
8. Drag and drop the measure Customers Target from the Fields pane into the
Target goals area (target) of the Visualizations pane.
The visualization displays the goal and the distance from the goal as a
percentage.
The indicator will be displayed in green with a check icon if the target exceeds
the goal.
The indicator will be displayed in red with an exclamation icon if the target is
not yet met. If the indicator matches the goal, the indicator will be displayed in
yellow.
10. Click the Format icon below the visualization icons in the Visualizations
pane.
11. Expand the Title section and set the following options:
a. Title Text: Customers KPI
b. Font colour: Black
c. Text size: 10
13. Position the KPI visual to the right of the bubble map and resize it as
necessary.
14. To create a KPI visualization to show the number of orders metric against
the targeted number of orders for the year, first create a copy of the
Customers KPI and position it below the Customers KPI.
15. To set the target or goal against which to measure progress, create a new
measure called Orders Target and assign the value 1300 to it.
16. Select the second KPI and set the following options in the Visualizations
pane:
a. Indicator: Order ID
b. Aggregation method: Count (Distinct)
c. Trend Axis: Order Date
d. Order Date Hierarchy: Year
e. Target goals: Orders Target
f. Title: Orders KPI
KPI Visuals
17. Save the file as KPI Added Power BI file and the KPI visualizations are saved
with the file.
To create a gauge chart - in this case to display the sales achieved against a
target sales figure:
Gauge Template
3. To set the target or goal against which to measure progress, create a new
measure called Sales Target and assign the value 2500000 to it.
4. Drag and drop the Sales field from the Fields pane into the Value area of the
Visualizations pane.
5. Drag and drop the fields Sales Target from the Fields pane into the Target
value area of the Visualizations pane.
Gauge Visualization
6. Click the Format icon below the visualization icons in the Visualizations
pane.
7. Expand the Gauge axis section and set the Max option to 3000000.
9. Expand the Title section and set the Title Text to Sales Target, Font colour
to Black, and Text size to 10.
10. Expand the Target section, ensure the option is set to On and set the value
decimal places as 2.
11. Position the gauge below the Orders KPI and resize as necessary.
12. Save the file as Gauge added Power BI file and the visualizations with gauge
chart is saved with the file.
Sometimes you may want to highlight a single number in a report - for example,
total sales, market share year over year, or total number of customers. Microsoft
Power BI offers Card as a type of data highlighting for the purposes of reporting.
Steps
To create a card – in this case to create three cards to display the number of
orders, the number of customers and the total sales in a report:
2. In the Report view, select the Card icon in the Visualizations pane.
3. Select the Order ID field under the Orders table in the Fields pane.
The first value of the Order ID field is displayed by default as the field consists
of text values. The aggregation method for the field needs to be changed to
count the number of unique orders.
4. Click the First Order ID field’s down arrow in the Fields area and select
Count (Distinct).
The card displays the number of unique orders in the data set.
5. To format the card visualization, click the Format icon below the visualization
icons in the Visualizations pane.
8. Expand the Title section and type Number of Orders in the Title Text box.
9. Select a blue font colour, center align the title and set the font size to 14.
10. Adjust the height of the card visualization so that the title and the value are
displayed without too much white space.
11. In the Data label section, set the Text size to 28.
12. To add additional cards, copy the number or orders visualization and paste
two sets of it on the right.
13. Edit the two new card visualizations to display the Number of Customers and
Total Sales as follows:
Fields with numerical values are displayed as summed value and large
numbers are displayed as truncated values, e.g. 1.98M. You can use the
Display units and Value decimal places settings in the Data label section to
format them.
15. On the Format tab, choose Align in the Arrange group, and choose Align
Top to align the cards to align the cards to in a single row to the card at the
highest level.
16. If necessary, use Distribute in the Arrange group to distribute the cards into
equal spacing in between them.
Matrix Icon
Steps
To create a matrix visualisation – in this case to display the total sales for each
customer type and product category:
5. Adjust the width of the matrix so that it spans approximately the width of the
first two cards.
6. Click the Format icon below the visualization icons in the Visualizations
pane.
7. Expand the Matrix style section and set the style to Minimal.
9. Expand the Conditional formatting section and slide the Data bars option to
On.
11. Adjust the width and height of the matrix so that all the values are visible,
without scrollbars.
A report can include interactive features that allow users to view different aspects
of the data and derive other insights from the report. A slicer provides such
interactivity features through the display of filters on the report.
Microsoft Power BI offers slicers, with similar functions as that in Microsoft Excel
Pivot Tables.
• Select the Slicer icon in the Visualizations pane and then select the data
field and drag it to the Fields box in the Visualizations pane.
Slicer Template
• Drag the data field from the Fields pane into the report canvas area, and
then select the Slicer icon in the Visualizations panel to turn the
visualization into a slicer.
Power BI allows for different types of slicers, with different effects and options.
Some examples of slicers include field slicers and the date slicers.
Steps
To insert a slicer - in this case for the Region field in a sales summary report:
3. Drag and drop the Region field from the Fields pane into the Field area of the
Visualizations pane.
The slicer is displayed as a list with options displayed with check boxes.
Create more focused reports by putting slicers next to important visuals.
4. Position the slicer visualization next to the matrix visualization and resize the
slicer as necessary.
Note: List slicer items are sorted in ascending alphanumerical order by default.
To reverse the sort order to descending, select the ellipses (...) in the top right
corner of the slicer and choose Sort descending in the drop-down list.
5. Click the Format icon below the visualization icons in the Visualizations
pane.
6. Expand the Selection Controls section and slide the Show “Select all”
option to On.
8. Slide the Title option to On, expand the section and set the following options:
a. Title Text: View by region:
b. Font color: Black
c. Text size: 10
1. Drag and drop the Order Date field from the Fields pane into the report
canvas area.
2. With the new visualization selected, select the Slicer icon in the
Visualizations pane to convert the new visualization to a slicer.
Date fields are automatically displayed as a slider control with the date range
populated. Numeric and date/time data types produce range slider slicers by
default.
3. Position the slicer next to the Region slicer and resize as required.
Note: The slider resizes with the slicer size, but it disappears and the dates
are cut off if the slicer is too small.
Note: The date data field produces a Between range slider slicer type by
default. To change the slicer type, with the slicer selected, hover over the
upper-right area of the slicer, click the drop down arrow that appears, and
choose one of the other options, such as Before or After.
4. Click the Format icon below the visualization icons in the Visualizations
pane.
6. Slide the Title option to On, expand the section and set the following options:
a. Title Text: Select order date range: 1 January to 30 June 2018
b. Font color: Black
c. Text size: 10
Select different date ranges with the slider, or select a date field to type in a
value or pop up a calendar for more precise selection.
The file shows a table and a map visualization of 2018 sales figures for Brazil,
Chile and Columbia.
3. On the Report view create a summary of the sales performance for Brazil,
Chile and Columbia by adding three card visualizations to display the
Number of Orders, Number of Customers and Total Sales for 2018 using
the following fields and aggregation methods:
8. Centre align the titles and set their text size to 18.
10. Align the three cards in a single straight row with equal spacing in between.
11. Add a slicer for all three cards to show the information for Brazil, Chile and
Columbia.
In Power BI Desktop you can connect to data sources and build reports based
on your data set by adding appropriate visualizations in Report View over one or
more pages.
Alternatively, you can connect to data sources and create reports based on your
data set in Power BI service - the Power BI online platform.
Once you have created and saved your report you may want to share it with
others. To do this, you need to use Power BI service.
Before you can share reports from Power BI service you need to set up the
appropriate access. To access Power BI service as an individual user, there are
two main licensing options:
• A Power BI Pro license – this enables you to access all content and
capabilities of the Power BI service including connecting to data sets,
creating reports, and sharing content and collaborating with others. To
share and collaborate, the other users will also need to have Power BI Pro
licenses.
• A Power BI free license – this allows you to access some of the Power BI
service capabilities, but it doesn’t allow you to share content or collaborate.
Share Reports
Power BI Pro license Power BI Pro license
As an individual you can purchase a Power BI Pro license, or you can sign up for
a Power BI Pro trial license. With a trial license, you are automatically moved to
a Power BI free license when the trial expires. If you are in an organisation that
uses Office 365, you should check with your IT administrator to find out your
licensing arrangements.
Steps
To set up Power BI service – in this case to sign up for a free 60-day trial of the
Power BI service using a Power BI Pro license:
2. On the get started page, select Try Free > under Power BI.
3. Type a work email address (not personal email) in the text box provided, and
then click Sign up.
5. Check your email for a link or verification code email from Power BI.
Verification Code
6. Select the link within the email to verify your email address or key in the
verification code and click Start.
You will be taken to https://app.powerbi.com and you can begin using Power
BI service.
You can share reports created in Power BI Desktop by first publishing them to
Power BI service. When you publish a Power BI Desktop file to Power BI
service, the data set and any reports you created in Report view are published to
a workspace in Power BI service. A workspace is a dedicated space used to
collaborate and share content with others. You can also share reports you created
in Power BI service.
Two options for sharing a report from a workspace in Power BI service are:
• Emailing a link: You can share a report with a few people by sending an
email with a link to the report. This can be done from within Power BI
service or by copying the link into another messaging app. You and your
recipients will need to have a Power BI Pro license to do this.
Recipients of the report can view and interact with it, but they can’t edit it.
Users within your organisation can share the report with others within your
organisation, if granted the appropriate permission, but users outside your
organisation won’t be able to share it.
Email a link
Note: You can also allow someone to edit content if you both have Power BI Pro
licenses – for example by adding them to the relevant workspace in Power BI
service.
Steps
1. In Power BI Desktop, open the report you have created – in this case, Orders
Final - Solution Power BI file.
Note: Multiple app workspaces can be created by Power BI Pro users in the
Power BI service to organise related content.
6. Once the publishing is completed, you receive a link to your report. Click the
link Open ‘Orders Final – Solution.pbix’ in Power BI to open the report in
your Power BI site.
Publishing Completed
2. Select My Workspace on the left menu pane and click the heading Reports.
3. The published report will be listed in the window. Click on the report name to
view it.
2. To share the report, select My Workspace, click Reports and click the Share
icon .
Note: Alternatively, you can share the report from within the Report view by
clicking on the Share icon.
3. Enter the email addresses of other users in the Share report window and click
Share.
Note: Alternatively, copy the URL provided under Report Link and send via
other messaging apps to other users.
4. To exit the report, click the Profile icon at the top right of the browser
window and select Sign Out.
6. Review the warning shown in the window and confirm that the data is okay to
embed in a public website and click Publish.
7. A window appears that provides a link that can be sent in email or HTML that
can be pasted directly into your web page or blog.
8. Click Close.
Dashboards are a visual display of key vital information needed to highlight one or
more objectives. Information is consolidated and arranged on a single screen thus
allowing information to be monitored at a glance. Dashboards provide an overview
and allow organisations to monitor essential performance metrics. A dashboard is
interactive and highly customisable, and the tiles update as the underlying data
changes.
You can create dashboards in the Microsoft Power BI service but not in Power
BI Desktop.
A dashboard A tile
Note: If the original visualization used to create the tile changes, the tile does not
change. For example, if you pinned a column chart from a report and then you
changed the chart to a line chart, the dashboard tile continues to show a column
chart. The data refreshes, but the visualization type does not.
Steps
2. Select My Workspace on the left menu pane and select Orders Final -
Solution under the Reports heading.
3. On Page 1, hover over the Number of Orders visualization and click on the
pushpin icon .
4. On the Pin to dashboard window, select New dashboard and type Orders
Dashboard in the text box.
6. From page 1 of the report, repeat the steps to pin Number of Customers card
and Total Orders Value card visualizations to the existing dashboard (Orders
Dashboard).
7. From page 2 of the report, repeat the steps to pin Sales and Profit by Month
line chart and Sales Distribution by Region donut chart to the existing
dashboard (Orders Dashboard).
1. Select My Workspace on the left menu pane and select Orders Dashboard
under the Dashboards heading.
2. Hover over the Number of Orders Card tile and click on the resize handle (
) at the bottom right corner.
The tiles can be resized to a variety of sizes -- from 1x1 tile units up to 5x5.
Resized Tiles
Tiles also can be renamed for more descriptive and meaningful headings.
4. Hover over the line chart and click on the More options icon (...) to open the
tile action menu.
6. In the Tile details window, enter the Title as Sales and Profit Distribution
and Subtitle as Half Yearly (Jan to Jun) and confirm the setting by clicking
Apply.
When you create and finalise dashboards in the Microsoft Power BI service, you
can share them with others. One option is to share them via a link.
Share
via
a link
Note: You and the intended recipients will need to have Power BI Pro licenses.
Steps
2. Select My Workspace on the left pane and select Orders Dashboard under
the Dashboards heading.
3. Click the Share icon ( ) under the Actions column.
Note: Alternatively, from within an open Dashboard click the Share icon.
4. In the Share Dashboard window, type the full email addresses of the
recipients, separating each with a space.
© 2019 Page 211
Lesson 13 – Publishing and Sharing ICDL Data Analytics - Foundation
Share Options
6. Click Share.
Power BI sends an email invitation to the individuals, with a link to the shared
content. A message indicating the dashboard has been shared will be
displayed.
Sharing Message
3. Select the ellipsis (...) next to Read and reshare for a recipient and select:
a. Read to keep the recipient from sharing with anyone else.
b. Remove access to keep that recipient from seeing the shared content
at all.
If you remove items with a warning icon, it is best to remove related content
because it will not display properly.
Note: You can modify the access rights for recipients of a report by following
similar steps for a report.
Page 212 © 2019
ICDL Data Analytics - Foundation Lesson 13 – Publishing and Sharing
When you are creating reports and dashboards your aim is to present key insights
and actionable and useful information that can be understood at a glance. To do
this, you need to determine the data that you need to gather and what type of
visualisations to use. You also need to consider the best way to present the
information to your audience. You will need to think about the design and layout of
your reports and dashboards.
Incorporating good design practices into your reports and dashboards will make
them easier to understand and therefore more effective. Good design practices
simplify the visual representation of complex data, helping others understand the
key insights and information more easily. Some tips for good design include:
Use a clean and uncluttered layout on reports and dashboards so that the
reader can see the key insights easily. Present only the most relevant data as
a cluttered layout can be confusing and can distract the user from key
insights.
Use descriptive titles to help readers understand the data visualizations. Also
consider adding descriptive labels using text boxes to describe the page,
grouping of visuals, or to describe an individual visual.
Use consistent fonts and colours to help the reader understand the
information being presented. In general use one or two font types and up to
three font sizes. Using different font sizes helps to create a visual hierarchy in
the content. For example, use different font sizes for titles, labels, and data.
Use colour appropriately to help highlight critical data and improve the
reader’s understanding of the visuals. The appropriate use of colour can help
the reader make connections between visuals and make data processing
easier. When applying colours to reports and dashboards consider:
• Displaying most of the data using natural colours.
• Using bright or dark colours to highlight outliers or critical data.
• Using colour shades or gradients to show the spread of values across
the data.
• Using colour to highlight variance around a central value. For example,
use green to show positive values and red to show negative values.
With the results of the data analytical processes easily accessible through reports
and dashboards, you and your team can collaborate on the evaluation of the
results in the evaluation phase of the data analysis process.
6. 1. Business
Deployment understanding
5. 2. Data
Evaluation understanding
4. 3. Data
Modelling preparation
• Do the insights address the problem you set out to solve or the goals you
set out to achieve?
• What are the practical implementation steps that can be taken to address
the problem or achieve the goals?
If you find that you haven’t solved the original problem or met the original
objectives, you can revisit and review the business understanding and data
understanding phases to make the most of the on-going data analysis. The key to
gaining insights is not always getting more data but gathering the right data to
address your needs. And the dynamic nature of business and its continuous
development means that process of data analysis is always ongoing.
3. Which one of the following should you avoid to achieve a clean, uncluttered
layout in your reports and dashboards?
a) Divide the visualizations over multiple pages.
b) Use borders or white space to section the visualizations.
c) Add decorative elements such as background images.
d) Align and distribute the visualizations.
4. Why should you use consistent fonts and colours to design your reports and
dashboards?
a) To differentiate between the headings and actual data.
b) To accentuate a hierarchical structure.
c) To emphasize critical data.
d) To show the availability of various fonts.
Ref ICDL Task Item Location Ref ICDL Task Item Location
1.1.1 Identify the main types of 1.1 Types of Data 2.1.1 Import data into a 3.1 Importing
data analytics: descriptive, Analytics. spreadsheet application: Data Sets
diagnostic, predictive, .csv file, spreadsheet, Introduction
prescriptive, quantitative, website table, database
qualitative. table. 3.2 Importing
Data from Text
1.1.2 Outline the business 1.2 Business Files.
benefits of data analytics: Benefits.
identifies patterns/trends, 3.3 Importing
improves efficiency, Data from
supports decision making, Spreadsheets.
presents information 3.4 Importing
effectively. Data from
1.1.3 Identify the main phases of Website Tables.
1.3 Data Analysis
data analysis: business Process. 3.5 Importing
understanding, data Data from
understanding, data Database Tables.
preparation, modelling,
evaluation, deployment. 2.1.2 Remove duplicate data. 4.2 Removing
1.1.4 Recognise data protection Duplicate Data.
1.4 Data
considerations when Protection 2.1.3 Validate that given values 4.3 Validating
analysing data like: Considerations. belong to a reference data Data Using
anonymise personal data if
set using the VLOOKUP VLOOKUP.
possible, comply with
function.
applicable data protection
regulations. 2.1.4 Validate that given values 4.4 Validating
belong to a specified range Data Using IF
1.2.1 Describe measures of 2.1 Summary using one or more if Functions.
central tendency of a data Statistics functions.
set: mean, median, mode. Introduction
2.1.5 Extract values from a string 4.5 Extracting
2.2 Measures of using text functions: left, Values Using
Central right, len, mid, find. Text Functions.
Tendency.
2.2.1 Format a data set as a 5.1 Formatting
1.2.2 Calculate the central 2.3 Calculating built-in table. Data Sets as
tendency value of a data Central
set using a function: mean, Tables
Tendency.
median, mode.
2.2.2 Insert and use table slicers. 5.2 Using Table
1.2.3 Describe measures of 2.1 Summary Slicers.
variation of a data set: Statistics
quartiles, variance, range. Introduction 3.1.1 Change the method of 6.2 Changing
aggregation for a value: Aggregation
2.4 Measures of sum, average, count, Methods.
Variation. minimum, maximum.
1.2.4 Calculate the variation of a 2.5 Calculating 3.1.2 Display multiple 6.3 Displaying
data set: quartile, variance, Variation. aggregation values.
Multiple
range.
Aggregation
Values.
3.1.3 Display values as: % 6.4 Using Built-In
calculation, difference from
Calculations.
specific values, running
total, ranked.
3.3.2 Insert and use slicers to 4.2.1 Create tables in a report. 11.1 Creating
8.2 Using Slicers.
filter single, multiple pivot Table
tables. Visualizations.
3.3.3 Insert and filter a timeline. 8.3 Using 4.2.2 Visualize data as a chart: 11.2 Creating
Timelines. column, bar, line, pie. Chart
Visualizations.
3.4.1 Insert a pivot chart for an 9.1 Inserting Pivot
existing pivot table. Charts from Pivot 4.2.3 Apply, edit font and 11.3 Enhancing
Tables. background conditional Visualizations
formatting to show: high/low Using Conditional
3.4.2 Create a pivot chart from 9.2 Creating Pivot values, above/below Formatting.
fields in a data set. Charts from average values.
Tables. 4.2.4 Apply, edit data bars. 11.3 Enhancing
4.1.1 Understand the concept of 10.1 Key Visualizations
data visualization using Features of Data Using Conditional
reports and dashboards. Visualization Formatting.
Outline common Tools
visualizations like: charts, 4.2.5 Apply, edit visual level 11.4 Enhancing
key performance indicators 11.2 Creating filters. Visualizations
(KPIs), maps. Chart Using Visual
Visualizations. Level Filters.
11.5 Creating 4.3.1 Publish a report.
Visualizations 13.1 Creating and
using Maps Preparing to
Share Reports.
12.1 Creating
Visualizations to 13.2 Publishing
Measure and Sharing
Progress Reports.