0% found this document useful (0 votes)
6 views65 pages

Business Analytics Notes

The document outlines the process of data preparation and visualization using MS-Excel, detailing steps such as data collection, cleansing, structuring, and validation. It also covers essential Excel features like filtering, sorting, conditional formatting, and functions for data analysis, including covariance and correlation. The importance of addressing outliers and ensuring data quality for effective decision-making is emphasized throughout.

Uploaded by

Swati kumari
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
6 views65 pages

Business Analytics Notes

The document outlines the process of data preparation and visualization using MS-Excel, detailing steps such as data collection, cleansing, structuring, and validation. It also covers essential Excel features like filtering, sorting, conditional formatting, and functions for data analysis, including covariance and correlation. The importance of addressing outliers and ensuring data quality for effective decision-making is emphasized throughout.

Uploaded by

Swati kumari
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 65

Business Analytics

Unit - II
Visualisation Using Spreadsheet
Data Preparation
Data preparation is the process of
gathering, combining, structuring,
and organising data using various
analytical or business intelligence
tools and techniques. It acts as a
bridge between raw data and
insights and enhances data quality
and usability.
Applications of Data Preparation
1. Enhanced Result Reliability
2. Identification and Resolution of Data Issues
3. Informed Decision-Making
4. Cost Reduction
5. Time and Resource Savings
6. Financial Growth and Return on Investment (ROI)
Data Preparation Process

1) Data Collection
Gathering relevant data from various internal and external sources, Here the data professionals who have
actually collected the data must ensure that the data aligns with the objectives of the planned analytics
applications.

2) Data Discovery and Profiling


Second step is about understanding the collected data Le., What it contains? It is concerned about
identifying inconsistencies, missing values, and anomalies etc.

3) Data Cleansing
In this step anomalies are corrected to create complete data set. It involves handling missing values,
removing duplicates, resolving inconsistencies, and addressing any data quality problems.
Data Preparation Process

4) Data Structuring
In this step, the data is organised or modelled to meet the requirements of the analytics applications,
converting formats (e.g., CSV to tables)

5) Data transformation and Enrichment


Data transformation and enrichment involve modifying and enhancing the data to make it more suitable for
analysis. It includes aggregating data, applying mathematical operations, creating derived variables,
adding external information. The goal is to transform the data into a unified and usable format that
supports the desired analytics tasks.

6) Data Validation and Publishing


Lastly, the transformed data obtained from the above step is validated for its complete-ness and
consistency. It helps identify any remaining errors or inconsistencies before the data is published or stored
in a data repository or data warehouse. Once the data iş validated, it is made available for analysis by
relevant stakeholders within the organization.
GETTING STARTED WITH
MS-EXCEL
MS-Excel is a software program
developed by Microsoft. This software
program makes use of spreadsheets
to arrange the data using formulas and
functions.
Welcome
Page

Welcome page is the first page that viewers come across when they open MS-Excel. This page provides
quick access to various options and templates, allowing users to begin their work efficiently.
Ribbon
A Ribbon is known as a toolbox of MS-Excel. It is located at the top of the screen. A Ribbon has
three main components. These are: Tabs, Groups, and Commands.

Tabs

Group Commands

a. Tabs : Tab is the activity area of the MS-Excel. For example, 'File' tab has the tools used to store or save the data:
'Page layout' tab enables the user to set the layout of the Excel worksheet.

b. Groups : Groups show related tools together. For example: 'Font' group shows all the tools related to text
formatting, and 'Alignment' group shows all tools related to alignment settings

c. Commands : A command is a tool that is used to carry out operations. A command can be in the form of a button,
a box to enter information, or an expandable menu to choose option from.
Quick Access Toolbar
Quick Access Toolbar is a customizable
toolbar. It is located at the top of the MS-Office
window. It contains a set of commands that are
frequently used.

Basic Excel Functions


A basic excel formula to add, subtract, multiply or divide can be created in the work-sheet.
An 'Equal to' (=) sign is used to start a formula, followed by numeric values and calculation operators like plus
(+), minus (-), asterisk (*), or slash (/) signs etc.

Alternatively, the following formulas can be used:

SUM Function : Used to fund the sum of given numbers.


MIN Function : Returns the smallest number form the set of given numbers.
MAX Function : Returns the largest number form the set of given numbers.
COUNT Function : Used to count the number of cells containing test within in a specified range or array of a dataset.
DATA PREPARATION AND CLEANING
USING MS-EXCEL
• Filter and Sort
Filtering and Sorting are essential operations used to organize and analyse data efficiently.

• Filtering enables users to display only the data that meets certain criteria, such as
showing records for specific regions or dates. This functionality is crucial for narrowing
down large datasets to focus on relevant information.

The following steps are used to perform filter in MS-Excel:


Step 1: Click anywhere inside the table for which filter is to be applied.
Step 2: Click on Data Tab Filter option (Drop downs on top of each column appear).
Step 3: Click on the dropdown of that column for which the data is to be filtered
(Checkboxes consisting distinct values of that particular column will appear).
Step 4: Choose that value for which the filter is required.
• Sorting arranges data in a specific order, either ascending or descending, based on one
or more columns. For example, sorting sales data by date helps identify trends over
time.

One can sort data in any of the following ways:

(a) Alphabetically, ie, A to Z or Z to A.


(b) Numerically, ie., highest to lowest or lowest to highest.
(c) Date and Time wise, Le, newest to oldest or oldest to newest.

Sorting can be classified into three levels viz single level, multi-level, and custom
sorting:
Shorting > A. Single level sorting

• In single level sorting, the data is sorted based on a single criterion or column in ascending or
descending order.
Step 1: Select the columns from table whose data is to sorted.
Step 2: Click on the Data Tab.
Step 3: Under Data Tab SORT (A dialogue box will appear).
Step 4: Select all the three fields (Column, Sort on, Order).
Step 5: Click OK.

Shorting > B. Multi-level sorting


• In multi-level sorting, the data is sorted hierarchically, first by one criterion and then by additional
criteria, allowing for a more detailed and refined sorting order.
Step 1: Select the columns from table whose data is to be sorted.
Step 2: Click on the Data Tah
Step 3: Under Data Tab select SORT (A dialogue box will appear).
Step 4: Select all the three fields (Column, Sort on, Order).
Step 5: Add another level of SORT.
Step 6: Select all the three fields (Column, Sort on, Order)
Step 7: Click OK.
Shorting > C. Custom sorting

• Custom sorting enables the user to specify the order in which data should be arranged based on his
specific requirements.
Step 1: Select the columns from table whose data is to be sorted.
Step 2: Click on the Data Tab.
Step 3: Under Data Tab SORT (A dialogue box will appear).
Step 4: Select two fields (Column, Sort on).
Step 5: For the third field (Order) Select Custom list option.
Step 6: From the given custom lists, select either from the given custom lists or create a NEW LIST as
per needs and select it.
Step 7: Click OK.
CONDITIONAL FORMATTING
Conditional formatting is a special feature of Excel used to find unique and duplicate
values by formatting the cells. It allows users to apply specific formatting styles (like
colour fills, font styles, or icons) to cells based on predefined criteria or conditions.
It’s a powerful tool for visually analysing and interpreting data since it highlights
important information, trends or outliers, making patterns easier to identify.

Steps involved in conditional formatting are as follows:


Step 1: Select the column to which you want to apply conditional formatting.
Step 2: Navigate to the Home tab and click on the Conditional Formatting option.
Step 3: Choose the New Rule option from the dropdown menu.
Step 4: Create the desired formatting rule.
Step 5: Click OK to confirm the rule.
REMOVING DUPLICATES
In MS Excel, a user can store large amount of data, and this often leads to the
problem of duplicate values, which can result in inaccurate data analysis and
compromise data integrity. Duplicate values clutter the dataset and disrupt formulas,
making it difficult to extract meaningful insights. However, Excel offers tools to
efficiently identify and remove duplicates, saving time and effort. By eliminating
duplicates, users can ensure a cleaner, more organized dataset, enhancing the
reliability and usefulness of their spreadsheets.
The following steps are to be followed to remove duplicates in MS-Excel:
Step 1: Select the cell range that has duplicate values.
Step 2: Click on 'Data tab, select 'Remove Duplicates, and then.
Step 3: Under columns tab, check or uncheck the column where the duplicates are
to be removed.
Step 4: Click OK
DATA VALIDATION
Data Validation is an important feature of MS-Excel. In other words, it enables the
user to define the type of data that can be entered in each cell of a worksheet. It helps
a user by restricting entries in the selected cells. Data validation allow users to set a
validation rule to the worksheet.
Some data validation restrictions are as follows:
❖ Put numeric or text entries only
❖ Enter specified range of numbers
❖ Specific length of data inputs
❖ Set a predefined drop-down list
❖ Restrict date and time entries outside or within a specific range
Additionally, data validation can display an input or warning message whenever a
wrong data has been entered.
Data validation through MS-Excel involves the following steps:
Step 1: Select the cell.

Step 2: Go to 'Data tools' choose 'Data Validation’

Step 3: In data validation box, click on the 'Settings tab, then under 'Allow' option, select
an option:

(a) Whole Number - to restrict the cell to accept only whole numbers.
(b) Decimal - to restrict the cell to accept only decimal numbers.
(c) List - to pick data from the drop-down list.
(d) Date - to restrict the cell to accept only date.
(e) Time - to restrict the cell to accept only time.
(f) Text Length - to restrict the length of the text.
(g) Custom - for custom formula.
Step 4: In the data validation box, under the 'settings tab' click on 'Data' option, select a
condition:
❖ If Custom is chosen, then choose the formula according to which data needs to be
validated.
❖ Click the Input Message tab and write a custom message that will appear on entering
the wrong data.
❖ Select the Show input message when cell is selected checkbox to display the
message when the user selects or hovers over the selected cell.
❖ Select the Error Alert tab to customize the error message.
Step 5: Click OK.

Now, if a user enters a wrong or invalid value in the worksheet, an error message will pop
up on the screen.
IDENTIFYING OUTLIERS IN THE DATA
An outlier can be defined as a data value that distinctively varies from the average value
of the distribution or pattern of the dataset. It is a data point that lies far away from most
of the observations and is significantly different in value or nature. It is essential to
address outliers to prevent skewness, bias, and misinterpretation of data insights.

Causes: Experimental anomalies, measurement errors, extreme values, data entry


mistakes.
Impact: Skews statistical analysis results and affects validity of insights.
Identification Methods: - Graphical: Scatter plots, box plots, Statistical: Tukey's fences,
Z-tests.
Treatment Methods: - Removal from the dataset, Transformation using data averages,
Use Statistical techniques less sensitive to extreme values.
MS- Excel offers the 'quartile formula' to identify outliers and remove them. In this
technique, the quartile acts as a dividing point which splits the data into three quarters:

➢ The First quartile (Q1) is the lower portion of the data with smaller values.
➢ The Second quartile (Q2) is the middle value.
➢ The Third quartile (Q3) is the upper quarter with higher values.

The range of values from smaller quartile (Q1) to higher quartile (Q3) is the inter-quartile
range (IQR) ie., Q3- Q1. Outliers lies outside the IQR. Outliers lies 1.5 times below or
above the IQR.
Steps to calculate outliers using IQR:
Step 1: Determine the range of data (array) for which the quartile is to be calculated.
Step 2: Use the formula QUARTILE (array, 1) to calculate the first quartile. This will give
the value that separates the lowest 25% of the data from the rest.
Step 3: Use the formula QUARTILE (array, 3) to calculate the third quartile. This will give
the value that separates the lowest 75% of the data from the rest.
Step 4: Subtract the first quartile (Q1) from the third quartile (Q3) to calculate the inter-
quartile range. The IQR represents the range of the middle 50% of the data.
Step 5: Multiply the inter quartile range (IQR) by 1.5 and add it to the third quartile (Q3).
This will give the upper bound value, which is used to identify potential outliers.
Step 6: Multiply the inter-quartile range (IQR) by 1.5 and subtract it from the first quartile
(Q1). This will give the lower bound value, which is used to identify potential outliers.

Value lying outside the upper and lower bounds are the outliers.
By following these steps, quartiles can be calculated and used to analyse the
distribution and identify potential outliers in the dataset
1. Find the first quartile (Q1) using = QUARTILE(range, 1).
2. Find the third quartile (Q3) using = QUARTILE(range, 3).
3. Calculate IQR as = Q3 – Q1.
4. Define the lower boundary as = Q1 – 1.5*IQR and the upper boundary
as = Q3 + 1.5*IQR.
Any data points outside this range are considered outliers.
Conditional Formatting. Spreadsheets allow us to apply condit
TEXT TO COLUMN
The Text to Columns feature is used to split a single column of text data into
multiple columns based on a specified delimiter, such as commas, spaces or tabs.
This allows columns to have atomic values and facilitates easier data analysis
and manipulation by allowing users to work with individual data elements more
effectively.
For example, if a column contains first name, last name and profession in a single column, then
this information can be separated in different columns.

Steps to split data in MS Excel.


Step 1: Select the cell or column that contains the text to be split.
Step 2: Select Data > Text to Columns.
Step 3: In the Convert Text to Columns Wizard displayed on the screen, select Delimited > Next.
Step 4: Select the Delimiters for your data.
Step 5: Select Next.
Step 6: Preview the split and select Finish.
COVARIANCE
Covariance is a statistical function that calculates the joint variability of two random
variables, given two sets of data.
Covariance is used to measure the direction of the relationship between the two
variables which means they tend to move together or in the opposite directions. It
means if the covariance coefficient is positive then both variables will move in the same
direction ie, if one increases/decreases, other also increases/decreases. While, a
negative coefficient implies an inverse relationship between the two.

To calculate covariance in Excel, use the covariance.p functions.


The syntax is = COVARIANCE.P (array1, array2)
where Array1 is a range or array of integer values Array2 is a second range or array of
integer values.
Note the following points:
• If the given arrays contain text or logical values, they are ignored by the COVARIANCE
in Excel function.
• The data should contain numbers, names, arrays, or references that are numeric. If
some cells do not contain numeric data, they are ignored.
• The data sets should be of the same size, with the same number of data points.
• The data sets should be neither empty nor should the standard deviation of their
values be zero.
CORRELATION MATRIX
Correlation is a statistical measure that describes the extent to which
two or more variables are related to each other. It indicates the strength
and direction of a relationship between variables. When two variables
are correlated, a change in one variable is associated with changes in
another—either positively or negatively.
The coefficient's value lies between 1 and -1, where +1 signifies positive
correlation and -1 implies negative correlation between the variables.

In Excel, the CORREL function returns the Pearson correlation coefficient


for two sets of values.
Its syntax is CORREL (array1, array2)
Where Array1 is a range of independent values and Array2 is a range of dependent
values. However, the two arrays should have equal length.
Note the following points:
• If cells in an array contain text, logical values or blanks, then they are ignored.
• If the arrays are of different lengths, N/A error is returned.
• If either of the arrays is empty or if the standard deviation of their values equals zero,
then divide by zero occurs.
MOVING AVERAGE

Moving averages are statistical calculations used to analyse data points by creating
averages of different subsets of the dataset over a specified period.
The calculation involves taking
the average of a set number of consecutive data points, then “moving” one data
point forward to calculate the next average.
Moving averages are widely used in time series analysis, finance and forecasting to
identify trends, cyclical patterns and potential reversals in data.

A moving average can be calculated using the AVERAGE function or SUM function.
Its syntax is AVERAGE(number1, number2,…) or SUM(number1, number2,…)/
FINDING MISSING VALUES

Finding and addressing missing values is crucial for maintaining data integrity, Quality
Assurance and Efficient Analysis.
Excel does not have any particular function to list missing values analysts commonly
use functions or tools that highlight blank cells or count the number of missing entries.

Functions like IF and ISNUMBER and commands like filter can be used to identify and
list missing values in Excel. Once identified, analysts can address missing values
through various methods: imputation (filling in missing values with mean, median or
mode), deleting rows or columns with excessive missing data, or utilizing predictive
modelling techniques to estimate missing values based on available data. Choosing an
appropriate method depends on the context and significance of the missing data.
Data Summarization
Data summarization is the process of transforming a given large dataset into a
smaller form, usually presentable, for reporting, analysis, and further examination.
It involves condensing a dataset into meaningful insights, making
it easier to understand and analyse, extracting central insights and patterns from
data without losing vital information. It allows quick realization of an overview of
the structure and general features of the dataset, hence facilitates further analysis
and inference.
Key Techniques in Data Summarization
1. Descriptive Statistics
• Measures of Central Tendency: Summarize data using mean, median, and mode,
which describe the central point of dataset.
• Measures of Dispersion: This include the range, variance and standard Deviation
which describe dispersion or variability in data.
3. Percentiles and Quartiles: The former provide insight into the distribution by telling
about the relative standing of the data points.
4. Frequency Distributions: Summarizing data by counting how often each value or
category appears.
5. Data Aggregation: This would involve combining many data points into summary
values. For example, the addition of the sales data by month or the average score
across different categories.
6. Data Grouping: The grouping of data into categories or segments and summarizing
each group in isolation. This can be done using techniques like pivot tables which
summarize data based on different dimensions.
7. Visualization: Charts and Graphs Setting up trends and distributions of data with bar
charts, histograms, pie charts, and line graphs. Box Plots can be used to visualize
distribution, central value, variability of the data, and possible outliers.
8. Data Profiling: It contains information about the structure of a dataset, the count of
missing values, or the data types, or on the distribution of categorical variables.
Importance of Data Summarization:
Simplifies large datasets: It makes large, complex datasets easier to understand and
interpret.
Identifies patterns: It helps identify key trends, outliers, and relationships between
variables.
Facilitates decision-making: Summarized data is useful for making quick decisions and
drawing meaningful conclusions.

By reducing a large dataset to its key characteristics, data summarization provides an


efficient way to understand the data and helps guide further analysis or reporting.
Descriptive Statistics

Measures of Central Tendency Measures of Dispersion


• Mean • Standard Deviation
• Median • Variance
• Mode • Range
Measures of Central Tendency

Mean
Mean is used to calculate the numerical average
of a dataset. Arithmetic mean is calculated by
adding all the values of the given dataset and
dividing it by the by number of items therein.

The following steps are used to calculate mean using


MS-Excel:-
Step 1: Click on an empty cell
Step 2: Type ‘=AVERAGE(cell range)' for example:
(B2:B20)
Step 3: Press 'ENTER' and mean will be displayed.
Measures of Central Tendency

Median
Median refers to the middle value of the series
when arranged in ascending or descending
order.

The following steps are used to calculate median


using MS-Excel:
Step 1: Click on an empty cell
Step 2: Type '=MEDIAN(cell range)' for example:
(B2:B20)
Step 3: Press 'ENTER' and MEDIAN will be
displayed.
Measures of Central Tendency

Mode
Mode refers to the most recurring value in the
sample. In other words, it refers to the most
frequent number of the given dataset.

The following steps are used to calculate mode using


MS-Excel:
Step 1: Click on an empty cell
Step 2: Type ‘=MODE(cell range) for example:
(B2:B20)
Step 3: Press 'ENTER' and MODE will be displayed.
Measures of Dispersion

Standard Deviation (SD)


Standard deviation is used in normally distributed
data and shows how spread the values are from
the mean. Thus, gives an understanding of how
scattered a data is.

The following steps are used to calculate SD using


MS-Excel:
Step 1: Click on an empty cell
Step 2: Type '=STDEV.S(cell range)' for example:
(B2: B20)
Step 3: Press 'ENTER' and SD will be displayed.
Measures of Dispersion

Variance
Variance is like standard deviation however it
measures how firmly or free-floating values are
scattered around the mean.

The following steps are used to calculate Variance


using MS-Excel:
Step 1: Click on an empty cell
Step 2: Type “=VAR.S(cell range)” for example:
(B2:B20)
Step 3: Press 'ENTER" and SD will be displayed.
Measures of Dispersion

Range
Range refers to the difference between the
largest and the smallest value of the dataset, This
measure indicates the distance between the
maximum and the minimum values of the sample.

The following steps are used to calculate Range


using MS-Excel:
Step 1: Click on an empty cell
Step 2: Type “=MIN(cell range)”
Step 3: Select empty cell say,
Step 4: Type “=MAX(cell range)”
Step 5: Subtract E2E1 and ENTER.
Descriptive Statistics some
other functions

SUM = Calculate the total

COUNT = Count the number of cell with numbers

COUNTA = Count the number of cell with any


type of data in given range
Data Visualization
Visualizations are a powerful tool in data analysis, enabling analysts to interpret and
communicate insights more effectively. Its is a graphical representation of data or
information. Data visualization convert large and small data sets into visuals that enables
them to spot trends in data, convey trends, distributions, relationships in the data and
communicate complex information effectively. By using various visual elements like charts,
graphs, and maps, Data visualization tools provide an accessible way to understand
patterns, outliers, or trends etc. in the given data.
Steps to draw
charts in excel

Microsoft Excel provides different


types of charts to visualize data in
the spreadsheet

DATA
Steps to draw
charts in excel 1

Step 1: Organize the data in rows


and columns within the Excel sheet.
Every row and column should be
labelled clearly to identify the data
to be visualized.
Steps to draw
charts in excel 1 2

Step 2: Select the data by clicking


and dragging mouse to highlight
the data to be visualized. In this
selection, include the row and
column headers.
(as shown in the figure)
Steps to draw
charts in excel 1 2 3

Step 3: Choose a chart type by


clicking on the “Insert” tab. In the
“Charts” section, select the
required chart option (Column,
Line, Pie, Bar, Area, Scatter, etc.)
by clicking on the dropdown arrow
below the chart type.
Steps to draw
charts in excel 1 2 3 4

Step 4: Insert the chart. Once the


desired chart is selected, it is
automatically created and inserted
in the worksheet. Now, it can be
clicked and dragged to change its
position or resized by using the
sizing handles at the corners.
Steps to draw
charts in excel 1 2 3 4 5

Step 5: Customize the chart.


For this, click on the chart to select it. Now,
you would be able to see two additional
tabs: “Design” and “Format”. Use these
tabs to customize the chart’s appearance,
style and layout. Important information like
chart title, axis labels, legend, data labels,
etc. can be added to enhance visualization
and data interpretation
Steps to draw
charts in excel 1 2 3 4 5 6

Step 6: Edit the data (optional). In


case you wish to make changes to
the data, simply edit it in the
worksheet. Excel will automatically
update the chart to reflect the
changes.
Types of Data Visualizations in Excel
Excel offers a variety of charts to visualize data. Some commonly used
charts for visualizing data are:
Column Chart Bar Chart
It displays data using vertical bars. Each bar It is similar to a column chart, but instead of
represents a category. A column chart is vertical bars, it has horizontal bars. They are
preferred when certain values have to be usually used to compare values across
compared across categories or to visualize categories when the category names are long
trends over time. or there are several categories.
Histogram Example: Health Studies: Reveals common
ranges of blood pressure levels in a
A histogram is a type of data visualization population.
used to represent the distribution of a
dataset by grouping data points into bins or
intervals. It is a graphical representation of the
frequency distribution of a continuous or
discrete variable, where the x-axis represents
the data ranges (bins), and the y-axis
represents the frequency.

It is used to Understanding Data Distribution,


Identifying Patterns, Identifying Outliers,
Visualizing Data Range and Frequency
Line Chart
The line chart plots data points and then
connects these points by lines. These lines
show trends or change in values over time.
Line charts are widely used for continuous
data like stock prices or temperature
measurements.
They are particularly useful for displaying
changes in data, identifying patterns, compare
multiple series, detecting relationships,
predicting future trends and handling large
datasets.
Pie Chart Scatter Plot
A scatter plot displays the relationship
A pie chart plots data as slices of a circle. Size
between two continuous variables through
of each slice is proportional to the value it
data points on a Cartesian coordinate system,
represents. That is, it represents the proportion
with each axis representing a variable. These
of each category within a whole.
charts identify patterns or correlations between
two variables, Detect Outliers and to Visualize
Data Distribution .

Sale (Kg)
Pivot Tables
Pivot tables are an important part of MS Excel
that allows users to quickly summarize large
amounts of data, analyze numerical data in detail
and answer unanticipated questions about the
data. Such a table is specifically designed to
query data in user-friendly and interactive way.
For example, consider the dataset given in figure.
There are 7 columns and 91 rows.

Step 1: Click any single cell inside the data set.


Step 2: Click on the Insert tab, in the Tables group.

Step 3: Click on Pivot Table.

Step 4: From the dialog box that appears,


Excel automatically selects the data and the
default location set for a new pivot table is
New Worksheet.

Step 5: Click on OK.


Step 5
Step 6: Now, drag the fields.
• Rows: Define the categories to be displayed in rows (e.g., Category,
Product).
• Columns: Define the categories to be displayed as column headers
(e.g., Date).
• Values: Contain the numerical data to be summarized (e.g.,
Quantity, Profit).
• Filters: Allow filtering by specific criteria (e.g., showing only data for
a certain date or product).
• Analyze the Results: Once the fields are set, the pivot table will
automatically summarize the data based on our selections.

For example, to get the total sale of each product, drag the Product
field to the Rows area, Amount field to the Values area.
Pivot Chart
Pivot Chart is a dynamic visualization tool that helps users summarize and analyze large
datasets. Trends and patterns can be easily identified by pivot charts. It is a visual
representation of a pivot table, automatically synchronizing with it to reflect any changes.

It allows interactive visualization, enabling users to drill down into specific data points and filter
data directly from the chart. Pivot charts can be created in various formats such as bar, line,
pie, and area charts.

They are widely used in sales performance analysis, financial reporting, inventory
management, and survey analysis to visualize summarized data and make informed decisions.
Steps to insert a pivot chart using data from the pivot table:

Step 1: Click any cell inside the pivot table.

Step 2: On the PivotTable Analyze tab, click on PivotChart in the Tools group.
Step 3: Click OK on the Insert Chart dialog box.

The pivot chart will appear on the screen. Any


changes made in the pivot chart are
immediately reflected in the pivot table and vice
versa.
Interactive Dashboard
Creating interactive dashboards in spreadsheets is an effective way to visualize data and
provide a dynamic, user-friendly interface for data exploration. Microsoft Excel support various
built-in features that allow for interactive elements such as drop-down lists, buttons and charts,
which can be linked to data and updated dynamically.

In Microsoft Excel, an interactive dashboard is usually a one-pager report that allows business
users to track and measure crucial business KPIs and metrics under one roof. It combines
charts, figures, and tables to help users visualize complicated data in an easy-to-understand
format.
Step 1: Define the Purpose of the Dashboard. For this, you must be clear with answers for two
questions - “Why is the dashboard being created ?” and “For whom it is created ?”.
Different stakeholders or departments within the organization want to analyze different facts
and figures. So, it is important to understand the purpose of the dashboard and then collect
data around it for accurate and effective decision-making.

Step 2: Gather Data in the form of a table and then convert this table into a pivot table.
If you want 3 pivot charts on the interactive dashboard then you must have 3 pivot tables. So,
you can simply duplicate the pivot table sheet in the Excel workbook.

Step 3: Create Charts using the Pivot Table.


For example: The first chart would represent every product’s monthly sales. For this chart, we
need 3 data entries - Sales, Product, and Month. In the Pivot table sheet, drag and drop the
Month data in the rows area, product in the columns area, and Sales in the values area.
Step 4: In the PivotTable Analyze group, click on PivotChart and select a suitable chart from
the chart drop-down.
Step 5: Click on Ok. The pivot chart will be created.
Once the chart is created, you can style it using formatting options. Just
click on the + sign that besides the chart and then, tick the following items.
Chart Title to change the title of the chart.
Legend to enable, disable, or edit the legend.
Axes to edit horizontal axis and vertical axis of the chart.
Data Table to insert a table representing all values in the data table.

After formatting the chart, it can be moved to the Interactive dashboard sheet. Repeat the
same steps to create other pivot charts and place them on the interactive dashboard.
Step 6: Add Interactive Features to the dashboard design. For this, select any chart and click
on PivotChart Analyze.

Step 7: Click on Insert Timeline. However, to insert a timeline to any pivot chart, there must be
a Date column in the data. Make sure that the Date checkbox is ticked before you press OK.

Step 8: Like timeline, slicer can also be added on the interactive dashboard. A slicer is just a
fancy name for a filter. To add a slicer, perform the following steps:
(i) Click on one of the pivot charts to activate it.
(ii) In the Insert tab, click on the Slicer option.
(iii) From the list of all the variables, perform slice and dice operations.
But before performing these operations, you need to connect the Slicer to the Charts. To
connect the slicer:
(a) Click on the slicer to activate it.
(b) In the Slicer tab, click on the Report Connections button.
(c) From the list of pivot tables, check all the boxes.

You might also like