Excel Data Analysis Tutorial
Excel Data Analysis Tutorial
The tutorial has plenty of screenshots that explain how to use a particular feature, in a
step-by-step manner.
Audience
This tutorial has been designed for all those readers who depend heavily on MS-Excel to
prepare charts, tables, and professional reports that involve complex data. It will help all
those readers who use MS-Excel regularly to analyze data.
Prerequisites
The readers of this tutorial are expected to have a good prior understanding of the basic
features available in Microsoft Excel.
All the content and graphics published in this e-book are the property of Tutorials Point (I)
Pvt. Ltd. The user of this e-book is prohibited to reuse, retain, copy, distribute or republish
any contents or a part of contents of this e-book in any manner without written consent
of the publisher.
We strive to update the contents of our website and tutorials as timely and as precisely as
possible, however, the contents may contain inaccuracies or errors. Tutorials Point (I) Pvt.
Ltd. provides no guarantee regarding the accuracy, timeliness or completeness of our
website or its contents including this tutorial. If you discover any errors on our website or
in this tutorial, please notify us at [email protected]
i
Data Analysis with Excel
Table of Contents
About the Tutorial ............................................................................................................................................ i
Audience ........................................................................................................................................................... i
Prerequisites ..................................................................................................................................................... i
Copyright & Disclaimer ..................................................................................................................................... i
Table of Contents ............................................................................................................................................ ii
5. Tables ..................................................................................................................................................... 31
Difference between Tables and Ranges ........................................................................................................ 31
Create Table .................................................................................................................................................. 32
Table Name ................................................................................................................................................... 35
Managing Names in a Table .......................................................................................................................... 36
Table Headers replacing Column Letters ....................................................................................................... 38
Propagation of a Formula in a Table ............................................................................................................. 39
Resize Table ................................................................................................................................................... 41
Remove Duplicates ........................................................................................................................................ 42
Convert to Range ........................................................................................................................................... 45
Table Style Options ........................................................................................................................................ 45
Table Styles .................................................................................................................................................... 46
ii
Data Analysis with Excel
v
Data Analysis with Excel
vi
Data Analysis with Excel
vii
Data Analysis with Excel
viii
Data Analysis with Excel
9
1. DATA ANALYSIS − OVERVIEW Data Analysis with Excel
Data Analysis is a process of inspecting, cleaning, transforming and modeling data with the
goal of discovering useful information, suggesting conclusions and supporting decision-
making.
Data Mining
Business Intelligence
Statistical Analysis
Predictive Analytics
Text Analytics
Data Mining
Data Mining is the analysis of large quantities of data to extract previously unknown,
interesting patterns of data, unusual data and the dependencies. Note that the goal is the
extraction of patterns and knowledge from large amounts of data and not the extraction of
data itself.
Data mining analysis involves computer science methods at the intersection of the artificial
intelligence, machine learning, statistics, and database systems.
The patterns obtained from data mining can be considered as a summary of the input data
that can be used in further analysis or to obtain more accurate prediction results by a decision
support system.
Business Intelligence
Business Intelligence techniques and tools are for acquisition and transformation of large
amounts of unstructured business data to help identify, develop and create new strategic
business opportunities.
The goal of business intelligence is to allow easy interpretation of large volumes of data to
identify new opportunities. It helps in implementing an effective strategy based on insights
that can provide businesses with a competitive market-advantage and long-term stability.
10
Data Analysis with Excel
Statistical Analysis
Statistics is the study of collection, analysis, interpretation, presentation, and organization
of data.
Inferential statistics: It uses patterns in the sample data to draw inferences about
the represented population or accounting for randomness. These inferences can be-
o answering yes/no questions about the data (hypothesis testing)
o estimating numerical characteristics of the data (estimation)
o describing associations within the data (correlation)
o modeling relationships within the data (E.g. regression analysis)
Predictive Analytics
Predictive Analytics use statistical models to analyze current and historical data for forecasting
(predictions) about future or otherwise unknown events. In business, predictive analytics is
used to identify risks and opportunities that aid in decision-making.
Text Analytics
Text Analytics, also referred to as Text Mining or as Text Data Mining is the process of deriving
high-quality information from text. Text mining usually involves the process of structuring the
input text, deriving patterns within the structured data using means such as statistical pattern
learning, and finally evaluation and interpretation of the output.
Thus, data analysis is a process for obtaining large, unstructured data from various sources
and converting it into information that is useful for-
Answering questions
11
Data Analysis with Excel
Test hypotheses
Decision-making
Disproving theories
You will be learning these data analysis techniques with Excel as part of two parts-
12
2. DATA ANALYSIS PROCESS Data Analysis with Excel
Data Analysis is a process of collecting, transforming, cleaning, and modeling data with the
goal of discovering the required information. The results so obtained are communicated,
suggesting conclusions, and supporting decision-making. Data visualization is at times used
to portray the data for the ease of discovering the useful patterns in the data. The terms Data
Modeling and Data Analysis mean the same.
Data Analysis Process consists of the following phases that are iterative in nature-
The data required for analysis is based on a question or an experiment. Based on the
requirements of those directing the analysis, the data necessary as inputs to the analysis is
identified (e.g., Population of people). Specific variables regarding a population (e.g., Age and
Income) may be specified and obtained. Data may be numerical or categorical.
Data Collection
Data Collection is the process of gathering information on targeted variables identified as data
requirements. The emphasis is on ensuring accurate and honest collection of data. Data
Collection ensures that data gathered is accurate such that the related decisions are valid.
Data Collection provides both a baseline to measure and a target to improve.
Data is collected from various sources ranging from organizational databases to the
information in web pages. The data thus obtained, may not be structured and may contain
irrelevant information. Hence, the collected data is required to be subjected to Data
Processing and Data Cleaning.
Data Processing
The data that is collected must be processed or organized for analysis. This includes
structuring the data as required for the relevant Analysis Tools. For example, the data might
have to be placed into rows and columns in a table within a Spreadsheet or Statistical
Application. A Data Model might have to be created.
Data Cleaning
The processed and organized data may be incomplete, contain duplicates, or contain errors.
Data Cleaning is the process of preventing and correcting these errors. There are several
types of Data Cleaning that depend on the type of data. For example, while cleaning the
financial data, certain totals might be compared against reliable published numbers or defined
thresholds. Likewise, quantitative data methods can be used for outlier detection that would
be subsequently excluded in analysis.
Data Analysis
Data that is processed, organized and cleaned would be ready for the analysis. Various data
analysis techniques are available to understand, interpret, and derive conclusions based on
the requirements. Data Visualization may also be used to examine the data in graphical
format, to obtain additional insight regarding the messages within the data.
Statistical Data Models such as Correlation, Regression Analysis can be used to identify the
relations among the data variables. These models that are descriptive of the data are helpful
in simplifying analysis and communicate results.
The process might require additional Data Cleaning or additional Data Collection, and hence
these activities are iterative in nature.
Communication
14
Data Analysis with Excel
The results of the data analysis are to be reported in a format as required by the users to
support their decisions and further action. The feedback from the users might result in
additional analysis.
The data analysts can choose data visualization techniques, such as tables and charts, which
help in communicating the message clearly and efficiently to the users. The analysis tools
provide facility to highlight the required information with color codes and formatting in tables
and charts.
15
3. DATA ANALYSIS WITH EXCEL – OVERVIEW Data Analysis with Excel
Excel provide commands, functions and tools that make your data analysis tasks easy. You
can avoid many time consuming and/or complex calculations using Excel. In this tutorial, you
will get a head start on how you can perform data analysis with Excel. You will understand
with relevant examples, step by step usage of Excel commands and screen shots at every
step.
However, there are certain operations that are more effective when data is in tables rather
than in ranges. There are also operations that are exclusively for tables.
You will understand the ways of analyzing data in ranges and tables as well. You will
understand how to name ranges, use the names and manage the names. The same would
apply for names in the tables.
Conditional Formatting
Excel provides you conditional formatting commands that allow you to color the cells or font,
have symbols next to values in the cells based on predefined criteria. This helps one in
visualizing the prominent values. You will understand the various commands for conditionally
formatting the cells.
As you are aware, PivotTable is normally used to summarize data. However, Subtotals with
Ranges is another feature provided by Excel that will allow you to group / ungroup data and
summarize the data present in ranges with easy steps.
Quick Analysis
With Quick Analysis tool in Excel, you can quickly perform various data analysis tasks and
make quick visualizations of the results.
PivotTables
With PivotTables you can summarize the data, prepare reports dynamically by changing the
contents of the PivotTable.
Data Visualization
You will learn several Data Visualization techniques using Excel Charts. You will also learn
how to create Band Chart, Thermometer Chart, Gantt chart, Waterfall Chart, Sparklines and
PivotCharts.
Data Validation
It might be required that only valid values be entered into certain cells. Otherwise, they may
lead to incorrect calculations. With data validation commands, you can easily set up data
validation values for a cell, an input message prompting the user on what is expected to be
entered in the cell, validate the values entered with the defined criteria and display an error
message in case of incorrect entries.
Financial Analysis
Excel provides you several financial functions. However, for commonly occurring problems
that require financial analysis, you can learn how to use a combination of these functions.
Formula Auditing
17
Data Analysis with Excel
When you use formulas, you might want to check whether the formulas are working as
expected. In Excel, Formula Auditing commands help you in tracing the precedent and
dependent values and error checking.
Inquire
Excel also provides Inquire add-in that enables you compare two workbooks to identify
changes, create interactive reports, and view the relationships among workbooks,
worksheets, and cells. You can also clean the excessive formatting in a worksheet that makes
Excel slow or makes the file size huge.
18
4. WORKING WITH RANGE NAMES Data Analysis with Excel
While doing Data Analysis, referring to various data will be more meaningful and easy if the
reference is by Names rather than cell references – either a single cell or a range of cells. For
example, if you are calculating Net Present Value based on a Discount Rate and a series of
Cash Flows, the formula
With Excel, you can create and use meaningful names to various parts of your data. The
advantages of using range names include-
Entering a name is less error prone than entering a cell or range address.
If you type a name incorrectly in a formula, Excel will display a #NAME? error.
You can quickly move to areas of your worksheet by using the defined names.
With Names, your formulas will be more understandable and easier to use. For
example, a formula Net_Income = Gross_Income – Deductions is more intuitive than
C40 = C20 – B18.
Creating formulas with range names is easier than with cell or range addresses. You
can copy a cell or range name into a formula by using formula Autocomplete.
19
Data Analysis with Excel
Deleting names.
Applying names.
Using names in a formula.
Viewing names in a workbook.
Using paste names and paste list.
Using names for range intersections.
Copying formulas with names.
20
Data Analysis with Excel
You can use any combination of letters, numbers and the symbols - underscores,
backslashes, and periods. Other symbols are not allowed.
A name cannot begin with a number (example- 1stQuarter) or resemble a cell address
(example- QTR1).
If you prefer to use such names, precede the name with an underscore or a backslash
(example- \1stQuarter, _QTR1)
Names cannot contain spaces. If you want to distinguish two words in a name, you
can use underscore (example- Cash_Flows instead of Cash Flows)
Your defined names should not clash with Excel’s internally defined names, such as
Print_Area, Print_Titles, Consolidate_Area, and Sheet_Title. If you define the
same names, they will override the Excel’s internal names and you will not get any
error message. However, it is advised not to do so.
Keep the names short but understandable, though you can use up to 255 characters
21
Data Analysis with Excel
3. Click Define Name in the Defined Names group. The New Name dialog box appears.
5. Check that the range that is selected and displayed in the Refers box is correct. Click OK.
22
Data Analysis with Excel
Create a Range Name using the Create Names from Selection dialog box
You can also create Range names using the Create Names from the Selection dialog box
from Formulas tab, when you have Text values that are adjacent to your range.
1. Select the range for which you want to define a name along with the row / column that
contains the name.
3. Click Create from Selection in the Defined Names group. The Create Names from
Selection dialog box appears.
4. Select top row as the Text appears in the top row of the selection
5. Check the range that got selected and displayed in the box next to Refers to be correct.
Click OK.
23
Data Analysis with Excel
Now, you can find the largest value in the range with =Sum (Student Name), as shown
below-
24
Data Analysis with Excel
You can create names with multiple selection also. In the example given below, you can name
the row of marks of each student with the student’s name.
Now, you can find the total marks for each student with =Sum (student name), as shown
below.
The Name Savings_Bank_Interest_Rate is set to a constant 5%. You can verify this in
Name Manager. You can see that the value is set to 0.05 and in the Refers to =0.05 is placed.
26
Data Analysis with Excel
Managing Names
An Excel Workbook can have any number of named cells and ranges. You can manage these
names with the Name Manager.
Click Name Manager in the Defined Names group. The Name Manager dialog box
appears. All the names defined in the current workbook are displayed.
27
Data Analysis with Excel
The List of Names are displayed with the defined Values, Cell Reference (including Sheet
Name), Scope and Comment.
28
Data Analysis with Excel
Scope of a Name
The Scope of a name by default is the workbook. You can find the Scope of a defined names
from the list of names under the Scope column in the Name Manager.
You can define the Scope of a New Name when you define the name using New Name
dialog box. For example, you are defining the name Interest_Rate. Then you can see that the
Scope of the New Name Interest_Rate is the Workbook.
29
Data Analysis with Excel
Suppose you want the Scope of this interest rate restricted to this Worksheet only.
1. Click the down-arrow in the Scope Box. The available Scope options appear in the drop-
down list.
30
Data Analysis with Excel
The Scope options include Workbook, and the sheet names in the workbook.
2. Click the current worksheet name, in this case NPV and click OK. You can define / find the
sheet name in the worksheet tab.
3. To verify that Scope is worksheet, click Name Manager. In the Scope column, you wil find
NPV for Interest_Rate. This means you can use the Name Interest_Rate only in the Worksheet
NPV, but not in the other Worksheets.
31
Data Analysis with Excel
Note: Once you define the Scope of a Name, it cannot be modified later.
Clear Filter
Names Scoped to Worksheet
Names Scoped to Workbook
Names with Errors
Names without Errors
Defined Names
Table Names
You can apply Filter to the defined Names by selecting one or more of these options.
2. Select Names with Errors. Names that contain error values will be displayed.
32
Data Analysis with Excel
3. From the obtained list of Names, select the ones you want to delete and click Delete.
Editing Names
You can use the Edit option in the Name Manager dialog box to-
2. Click the Name you want to edit in the Name Manager dialog box. In this case, Array1.
34
Data Analysis with Excel
4. Change the Name by typing the new name that you want in the Name Box.
5. Click the Range button to the right of Refers to Box and include the new cell references.
35
Data Analysis with Excel
Applying Names
Consider the following example-
36
Data Analysis with Excel
As you observe, names are not defined and used in PMT function. If you place this function
somewhere else in the worksheet, you also need to remember where exactly the parameter
values are. You know that using names is a better option.
In this case, the function is already defined with cell references that do not have names. You
can still define names and apply them.
2. Select the cell containing the formula. Click next to Define Name in the Defined
Names group on the Formulas tab. From the drop-down list, click Apply Names.
3. The Apply Names dialog box appears. Select the Names that you want to Apply and
click OK.
37
Data Analysis with Excel
38
Data Analysis with Excel
39