LEARN - Statistics For Data Analysis
• Understand what your sample data looks like
• If the sample data fits a probability distribution, use it as a model for the entire population
• If the sample doesn’t fit a distribution, use the central limit theorem to make estimates about population parameters
• Continue to leverage the central limit theorem to draw conclusions about what a population looks like based on a sample
• Use additional variables to increase the accuracy of your estimates and make predictions based on their relationships
• Represents the frequency of each value
• Represents the middle of the values
• Represents the dispersion of the values
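The three descriptions above (frequency, middle, dispersion) can be sketched in a few lines of Python. This is only an illustration: the course demos use Excel, and the `scores` values below are made up for the example.

```python
from collections import Counter
import statistics

# Hypothetical sample of scores (illustrative values, not course data)
scores = [70, 75, 75, 80, 80, 80, 85, 90]

# Frequency of each value
frequency = Counter(scores)

# Middle of the values (two common measures)
mean_score = statistics.mean(scores)
median_score = statistics.median(scores)

# Dispersion of the values (sample standard deviation)
spread = statistics.stdev(scores)

print(frequency.most_common(1), mean_score, median_score, round(spread, 2))
```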
Maven Analytics
PROBABILITY DISTRIBUTIONS
CONFIDENCE INTERVALS
A confidence interval is an estimate of an unknown population value using a sample
• It is a range defined by a point estimate, like the sample mean, plus/minus a margin of error
• It includes a confidence level: how likely a range built this way is to include the population value (you can’t be certain!)
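As a rough sketch of "point estimate plus/minus a margin of error", the function below builds an interval around a sample mean. The name `confidence_interval` and the sample values are hypothetical, and the critical value comes from the normal distribution, which is an assumption: for small samples a t-based critical value would give a somewhat wider interval.

```python
import math
import statistics
from statistics import NormalDist


def confidence_interval(sample, confidence=0.95):
    """Return (lower, upper) for the population mean.

    Assumption: uses a normal (z) critical value as an approximation.
    """
    n = len(sample)
    point_estimate = statistics.mean(sample)
    standard_error = statistics.stdev(sample) / math.sqrt(n)
    # Critical value that leaves (1 - confidence) / 2 in each tail
    z = NormalDist().inv_cdf(0.5 + confidence / 2)
    margin_of_error = z * standard_error
    return point_estimate - margin_of_error, point_estimate + margin_of_error


# Hypothetical sample values for illustration
sample = [238.0, 241.0, 240.0, 239.0, 242.0]
lower, upper = confidence_interval(sample, confidence=0.95)
print(f"95% interval: {lower:.2f} to {upper:.2f}")
```

Raising the confidence level widens the interval: more certainty about capturing the population value costs precision.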
𝜇 = ?
Remember, the sample means are normally distributed around the population mean
[Figure: distribution of sample means around the unknown population mean 𝜇, with the value 239.9 marked]
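The claim that sample means cluster around the population mean can be checked with a small simulation. This is a sketch under stated assumptions: the population is synthetic (a skewed exponential distribution), and the sample size of 50 and the 2,000 repetitions are arbitrary choices for illustration.

```python
import random
import statistics

random.seed(0)  # fixed seed so the run is repeatable

# Synthetic, deliberately skewed population (an assumption for the demo)
population = [random.expovariate(1.0) for _ in range(50_000)]
population_mean = statistics.mean(population)

# Draw many samples and record each sample's mean
sample_means = [
    statistics.mean(random.sample(population, 50))
    for _ in range(2_000)
]

# The sample means center on the population mean and vary far less
# than the individual values do
mean_of_sample_means = statistics.mean(sample_means)
spread_of_sample_means = statistics.stdev(sample_means)

print(round(population_mean, 3), round(mean_of_sample_means, 3))
```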
HYPOTHESIS TESTING
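[Figure: three t-distributions centered on μ₀: a two-tailed test with p/2 in each tail (beyond t_lower and t_upper), and two one-tailed tests with p in a single tail (beyond t)]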
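The two-tailed and one-tailed cases can be sketched as a one-sample test of a sample mean against a hypothesized value μ₀. The function name `one_sample_test` and the data are hypothetical. The p-values use a normal approximation to the t-distribution, which is an assumption that is only reasonable for larger samples; an exact test would take its tail areas from the t-distribution instead.

```python
import math
import statistics
from statistics import NormalDist


def one_sample_test(sample, mu0):
    """Return the test statistic and approximate p-values.

    Assumption: tail areas come from the standard normal distribution
    rather than the t-distribution.
    """
    n = len(sample)
    standard_error = statistics.stdev(sample) / math.sqrt(n)
    t = (statistics.mean(sample) - mu0) / standard_error
    standard_normal = NormalDist()
    return {
        "t": t,
        # Two-tailed: p/2 in each tail
        "p_two_tailed": 2 * (1 - standard_normal.cdf(abs(t))),
        # One-tailed: all of p in a single tail
        "p_left_tailed": standard_normal.cdf(t),
        "p_right_tailed": 1 - standard_normal.cdf(t),
    }


# Hypothetical data: is the mean different from 0?
result = one_sample_test([1, 2, 3, 4, 5], mu0=0)
print(result)
```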
REGRESSION ANALYSIS
The goal of regression is to predict a dependent variable using independent variables
• This is achieved by fitting a line through the sample data points that models the population
This is the dependent variable (y), which is what you’re trying to predict
This is the independent variable (x), which helps you predict the dependent variable
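Fitting a line through sample points can be sketched with ordinary least squares for a single independent variable. The helper name `fit_line` and the data are hypothetical; least squares is one common way to fit the line and is assumed here, since the slide does not name a method.

```python
def fit_line(x, y):
    """Least-squares slope and intercept for y = intercept + slope * x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Co-variation of x and y, and variation of x, around their means
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical data: x is the independent variable, y the dependent variable
x = [1, 2, 3, 4]
y = [3, 5, 7, 9]
slope, intercept = fit_line(x, y)

# Predict y for a new x using the fitted line
predicted = intercept + slope * 5
print(slope, intercept, predicted)
```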
NEW COURSE: STATS FOR DATA ANALYSIS!
Why Statistics?
Discuss the role of statistics in the context of business intelligence and decision-making, and introduce the statistics workflow
You have data from the first graduating class of their MBA program, including
details & scores from their application, the program itself, and their employment
status 2 months later
Your goal is to leverage statistics to evaluate the results of this class, predict the
performance of future classes, and propose changes in recruitment to improve
graduate outcomes
COURSE EXPECTATIONS
This course is about introducing & demystifying essential statistics concepts
• Our goal is to break down seemingly complex techniques using simple, clear
explanations that will help you develop an intuition for when, why, and how to
use them in the real world
We’ll be using Excel for Office 365 on a PC for the course demos
• What you see on your screen may not always match what you see on ours, especially
if you are running a different operating system or following along with an older
version of Excel