Excel DataAnalysis
Excel DataAnalysis
https://www.depts.ttu.edu/itts/training/shortcourses/handouts.php
OR
Google "TTU Short Course", click the link and then "Handouts"
Benjamin Chamness
Accessibility & Data Analyst
ITTS Technology Support, IT Team Web
Material Notes
• Information here may apply to other spreadsheet software tools
(LibreOffice, OpenOffice, Google Sheets, etc.).
• These other tools are not approved for use with TTU data.
Things that describe our data. These The values we care about for our
can help “Group” our data for insights. business needs. These values are often
summarized.
• Date
• Weekday • Foot Traffic
• Location • Customer Count
• Social Media Posts • Total Sales
• High Temp • Customer Conversion (*careful
when summarizing
• Rain percentages)
• Foot Traffic
Pivot Tables and Charts
Outline
• Our Test Data
• Pivot Tables and Charts
• Analysis ToolPak
Pivot Tables and Charts
• Pivot Tables are an easy way to build reports on your data in
Excel
• Allow you to group your data by category (date, location,
temperature, college, major, residence hall, etc)
• Allows you to summarize the data by some calculation (total
sales, average traffic, count of students, etc.)
What reports may be relevant with our
data?
• Total sales and customers by date and location
• Average sales and customers by day of week and location
• Average sales by temperature
Pivot Table Creation
1. Select all your data (ideally by selecting the column letters)
2. Go to the “Insert” menu and select “PivotTable”
3. Confirm that the “Table/Range” field is the correct value (for
our test data, it should be “SampleData!$A:$J”)
4. State where you want the PivotTable to be created. The
default of “New Worksheet” is fine for our purposes, but you
could also tell Excel to crate the PivotTable in a specific cell.
PivotTable Parts
• Field List shows all of
your available data
fields.
• Filters will filter your
data by a category.
• Rows and Columns
break data out by
categories.
• Values will hold your
data summaries.
About Our Data
• Categories: • Facts:
• Date • Foot Traffic
• Weekday • Customer Count
• Location • Total Sales
• Social Media Posts • Customer Conversion (*careful
• High Temp when summarizing
• Rain percentages)
• Foot Traffic
Use these in “Rows”, Use these in “Values”
“Columns”, and “Filters”
Rows and Columns
• Will identify distinct category values in your dataset
• Will separate your data points by the categories
• Will NOT fill in any gaps (days of the week not represented)
Values
• Perform some summary (count, average, sum, etc) on a "fact" in your
dataset
• Will NOT fill in any gaps (data intersections not represented in your data)
• Do not summarize on percentage data
Report 1: Total sales and customers by
time and location
• As with other charts in Excel, you have all of the same tools to
adjust labels, titles, colors, and other chart elements
Bar and Pie Chart Examples
Stacked Area Chart Example
• Set "Sum of
Total Sales" to
be a "Running
Total In > Date"
• Notice date
gap: Jan 9 - 20
• Solution: Build
a list of dates
and use
GetPivotData
to fetch values
Analysis ToolPak
Outline
• Our Test Data
• Pivot Tables and Charts
• Analysis ToolPak
To Enable:
1. Click on "File" in the top left
and then "Options" towards
the bottom.
2. In the "Excel Options" window
that opens, click on "Add-ins"
on the left, and then click the
button that says "Go…"
3. Check the box next to
"Analysis ToolPak" and then
click "OK".
4. When enabled, it should be in
the "Data" ribbon menu.
Analysis Limitations: Text
• String data is hard to analyze
• Binary strings (yes/no, true/false) can be mapped to 0 and 1
• Categories with some defined order can sometimes be mapped
to integers
• Freshman, Sophomore, Junior, Senior
• Strongly Agree, Agree, Disagree, Strongly Disagree
• Our “Rain” column is a binary yes/no, so it is coded as 1s and
0s
• Weekday and Location can’t be mapped to numbers as easily
Correlation
• Helps describe a relationship between variables in your dataset.
• A positive correlation means two variables will change in the
same direction (increase/decrease together).
• A negative correlation means two variables will change in
opposite directions (increase in one, decrease in the other).
• Values close to +/-1 show a strong correlation, values close to 0
show weak / no correlation
Skewness > 0
Skewness < 0
Daily Sales
Descriptive Statistics
1. Click the "Data Analysis" tool in the "Data" menu.
2. Select "Descriptive Statistics"
3. Select your data for the "Input Range" (entering in the column
letters)
4. Check the "Labels in First Row" box and the "Summary
Statistics" box, then "OK"
Descriptive Statistics
Total Sales