Statistics Cheatsheet

1. This document provides a cheat sheet on key statistics concepts including: fundamentals like populations, samples, and variables; measures of central tendency and dispersion; distributions; bivariate relationships; experimental design; and probability concepts. 2. It defines important terms like mean, median, variance, standard deviation, and correlation and outlines statistical procedures like hypothesis testing, confidence intervals, and linear regression. 3. Key probability topics covered include sample spaces, empirical and theoretical probability, independent and mutually exclusive events, and conditional probability.

Uploaded by

naticool1115906

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

555 views

Statistics Cheatsheet

Uploaded by

naticool1115906

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

You are on page 1/ 3

1 011333 0 56677

2 etc 1
Statistics Cheat Sheet q. Mean: x = ∑xi / n
Mr. Roth , Mar 2004 r. Median: M: If odd – center, if even - mean of 2
1. Fundamentals s. Boxplot:
Min Q1 M Q3 Max
a. Population – Everybody to be analysed
 Parameter - # summarizing Pop

b. Sample – Subset of Pop we collect data on

Variance: s 2 = ∑( x − x ) /( n −1) = SS x /( n −1) ,
2
t.
 Statistics - # summarizing Sample

c. Quantitative Variables – a number u. p78: standard deviation, s = √s2

 Discrete – countable (# cars in family)
v. SS x = ∑( x − x ) 2 = ∑ x 2 − (∑ x ) 2 / n
 Continuous – Measurements – always #

between w. Density curve – relative proportion within classes –

d. Qualitative area under curve = 1
 Nominal – just a name
x. Normal Distribution: 68, 95, 99.7 % within 1, 2, 3 std
 Ordinal – Order matters (low, mid, high)
deviations.

Choosing a Sample y. p98: z-score z = ( x − x ) / s or ( x − µ) / σ

z. Standard Normal: N(0,1) when N(μ,σ)
• Sample Frame – list of pop we choose sample from
• Biased – sampling differs from pop characteristics. 3. Bivariate - Scatterplots & Correlation
• Volunteer Sample – any of below three types may a. Explanatory – independent variable
end up as volunteer if people choose to respond. b. Response – dependent variable
Sample Designs c. Scatterplot: form, direction, strength, outliers
e. Judgement Samp: Choose what we think represents d. – form is linear negative, …
 Convenience Sample – easily accessed people e. – to add categorical use different color/symbol
f. Probability Samp: Elements selected by Prob f. p147: Linear Correlation- direction & strength of
 Simple random sample – every element = linear relationship
chance g. Pearsons Coeff: {-1 ≤ r ≤ 1} 1 is perfectly linear +
 Systematic sample – almost random but we slope, -1 is perfectly linear – slope.
choose by method 1 (x − x) ( y − y) SS xy
g. Census – data on every everyone/thing in pop h. r= *∑ = ,
n −1 sx sy SS x SS y
Stratified Sampling
i. r = zxzy / (n - 1),
Divide pop into subpop based upon characteristics
h. Proportional: in proportion to total pop j. SS xy = ∑xy −
∑x ∑ y
n
i. Stratified Random: select random within substrata
j. Cluster: Selection within representative clusters 4. Regression
Collect the Data k. least squares – sum of squares of vertical error
minimized
k. Experiment: Control the environment 
l. p154: y = b0 + b1x, or y = a + bx ,
l. Observation:
m. (same as y = mx + b)
2. Single Variable Data - Distributions
m. Graphing Categorical: Pie & bar chart) n. b1 =
∑( x − x )( y − y ) = SS xy
= r (sy / sx)
n. Histogram (classes, count within each class) ∑( x − x ) 2
SS x

o. – shape, center, spread. Symmetric, skewed right, o. Then solving knowing lines thru centroid (
skewed left ( x , y ); a = y −bx
p. Stemplots
0 11222 0 112233 p. b0 =
∑ y − (b ∑ x)1

n
Statistics Cheat Sheet
q. r^2 is proportion of variation described by linear c. 2) Theoretical: Relative frequency/proportion of a
relationship given event given all possible outcomes (Sample

r. residual = y - y = observed – predicted. Space)
s. Outliers: in y direction -> large residuals, in x d. Event: outcome of random phenomenon
direction -> often influential to least squares line. e. n(S) – number of points in sample space
t. Extrapolation – predict beyond domain studied f. n(A) – number of points that belong to A
u. Lurking variable g. p 183: Empirical: P'(A) = n(A)/n = #observed/
v. Association doesn't imply causation #attempted.
h. p 185: Law of large numbers – Exp -> Theoret.
5. Data – Sampling
i. p. 194: Theoretical P(A) = n(A)/n(S) ,
a. Population: entire group favorable/possible
b. Sample: part of population we examine j. 0 ≤ P(A) ≤ 1, ∑ (all outcomes) P(A) = 1
c. Observation: measures but does not influence k. p. 189: S = Sample space, n(S) - # sample points.
response Represented as listing {(, ), …}, tree diagram, or grid
d. Experiment: treatments controlled & responses l. p. 197 Complementary Events P(A) + P( A ) = 1
observed
m. p200: Mutually exclusive events: both can't happen
e. Confounded variables (explanatory or lurking) when
at the same time
effects on response variable cannot be distinguished
n. p203. Addition Rule: P(A or B) = P(A) + P(B) – P(A
f. Sampling types: Voluntary response – biased to
and B) [which = 0 if exclusive]
opinionated, Convenience – easiest
o. p207: Independent Events: Occurrence (or not) of A
g. Bias: systematically favors outcomes
does not impact P(B) & visa versa.
h. Simple Random Sample (SRS): every set of n
p. Conditional Probability: P(A|B) – Probability of A
individuals has equal chance of being chosen
given that B has occurred. P(B|A) – Probability of B
i. Probability sample: chosen by known probability given that A has occurred.
j. Stratified random: SRS within strata divisions q. Independent Events iff P(A|B) = P(A) and P(B|A) =
k. Response bias – lying/behavioral influence P(B)
6. Experiments r. Special Multiplication. Rule: P(A and B) = P(A)*P(B)
a. Subjects: individuals in experiment s. General mult. Rule: P(A and B) = P(A)*P(B|A) =
P(B)*P(A|B)
b. Factors: explanatory variables in experiment
t. Odds / Permutations
c. Treatment: combination of specific values for each
factor u. Order important vs not (Prob of picking four
numbers)
d. Placebo: treatment to nullify confounding factors
v. Permutations: nPr, n!/(n – r)! , number of ways to
e. Double-blind: treatments unknown to subjects &
pick r item(s) from n items if order is important :
individual investigators
Note: with repetitions p alike and q alike = n!/p!q!.
f. Control Group: control effects of lurking variables
w. Combinations: nCr, n!/((n – r)!r!) , number of ways
g. Completely Randomized design: subjects allocated to pick r item(s) from n items if order is NOT
randomly among treatments important
h. Randomized comparative experiments: similar x. Replacement vs not (AAKKKQQJJJJ10) (a) Pick an
groups – nontreatment influences operate equally A, replace, then pick a K. (b) Pick a K, keep it, pick
i. Experimental design: control effects of lurking another.
variables, randomize assignments, use enough y. Fair odds - If odds are 1/1000 and 1000 payout. May
subjects to reduce chance take 3000 plays to win, may win after 200.
j. Statistical signifi: observations rare by chance
8. Probability Distribution
k. Block design: randomization within a block of
individuals with similarity (men vs women) a. Refresh on Numb heads from tossing 3 coins. Do
grid {HHH,….TTT} then #Heads vs frequency
7. Probability & odds chart{(0,1), (1,3), (2,3), (4,1)} – Note Pascals triangle
a. 2 definitions: b. Random variable – circle #Heads on graph above.
b. 1) Experimental: Observed likelihood of a given "Assumes unique numerical value for each outcome
outcome within an experiment in sample space of probability experiment".

42010936.doc -2- Printed 10/15/2010

Statistics Cheat Sheet
c. Discrete – countable number a. Statistical Inference: methods for inferring data
d. Continuous – Infinite possible values. about population from a sample
e. Probability Distribution: Add next to coins frequency b. If x is unbiased, use to estimate μ
chart a P(x) with 1/8, 3/8, 3/8, 1/8 values c. Confidence Interval: Estimate+/- error margin
f. Probability Function: Obey two properties of prob. (0 d. Confidence Level C: probability interval captures
≤ P(A) ≤ 1, ∑ (all outcomes) P(A) = 1. true parameter value in repeated samples
g. Parameter: Unknown # describing population e. Given SRS of n & normal population, C confidence
h. Statistic: # computed from sample data interval for μ is: x ± z * σ / n
Sample Population f. Sample size for desired margin of error – set +/-
Mean x μ - mu value above & solve for n.
2
Variance s σ2
Standard
12. Tests of significance
s σ - sigma
deviation g. Assess evidence supporting a claim about popu.

Base: x = ∑x / n , s 2 = ∑
(x − x)
2 h. Idea – outcome that would rarely happen if claim
i. were true evidences claim is not true
( n −1)
i. Ho – Null hypothesis: test designed to assess
Frequency Dist Probability Distribution evidence against Ho. Usually statement of no effect
Me x = ∑xf / ∑ f µ = ∑[ xP ( x )]
j. Ha – alternative hypothesis about population
an
parameter to null
Var
∑( x − x ) f
2
σ = ∑[( x − µ) P ( x )]
2 2

s2 = k. Two sided: Ho: μ = 0, Ha: μ ≠ 0

(∑ f −1)
l. P-value: probability, assuming Ho is true, that test
Std s = √s2 σ= σ 2
statistic would be as or more extreme (smaller P-
Dv value is > evidence against Ho)
j. Probability acting as an f / ∑f . Lose the -1 x −µ
m. z=
9. Sampling Distribution σ/ n
a. By law of large #'s, as n -> population, x → µ n. Significance level α : if α = .05, then happens no
more than 5% of time. "Results were significant (P
b. Given x as mean of SRS of size n, from pop with μ
< .01 )"
and σ. Mean of sampling distribution of x is μ and
o. Level α 2-sided test rejects Ho: μ = μo when uo falls
standard deviation is σ / n outside a level 1 – α confidence int.
c. If individual observations have normal distribution a. Complicating factors: not complete SRS from
N(μ,σ) – then x of n has N(μ, σ / n ) population, multistage & many factor designs,
d. Central Limit Theorem: Given SRS of b from a outliers, non-normal distribution, σ unknown.
population with μ and σ. When n is large, the b. Under coverage and nonresponse often more
x is approx normal. serious than the random sampling error accounted
sample mean
for by confidence interval
10. Binomial Distribution c. Type I error: reject Ho when it's true – α gives
a. Binomial Experiment. Emphasize Bi – two possible probability of this error
outcomes (success,failure). n repeated identical d. Type II error: accept Ho when Ha is true
trials that have complementary P(success) + e. Power is 1 – probability of Type II error
P(failure) = 1. binomial is count of successful trials
where 0≤x≤n
b. p : probability of success of each observation
c. Binomial Coefficient: nCk = n!/(n – k)!k!
n  k n −k
d. Binomial Prob: P(x = k) =   p (1 − p )
 
k
e. Binomal μ = np
f. Binomal σ = np (1 − p )

11. Confidence Intervals

42010936.doc -3- Printed 10/15/2010

3141b86-6fd4-7726-D8ad-20a1516bcd Statistics Interview Cheat Sheet - Emmading - Com. All Rights Reserved.
No ratings yet
3141b86-6fd4-7726-D8ad-20a1516bcd Statistics Interview Cheat Sheet - Emmading - Com. All Rights Reserved.
10 pages
Statistics For Experimenters - Box and Hunter
91% (11)
Statistics For Experimenters - Box and Hunter
655 pages
The Power of Data Storytelling by Sejal Vora 2019 9789353282905 9789353282912 Compress
No ratings yet
The Power of Data Storytelling by Sejal Vora 2019 9789353282905 9789353282912 Compress
249 pages
Chapter 9
No ratings yet
Chapter 9
126 pages
Data Analysis With Python by IBM: - (On Coursera)
No ratings yet
Data Analysis With Python by IBM: - (On Coursera)
3 pages
Data Analysis Using Spss
100% (2)
Data Analysis Using Spss
131 pages
Final Note
No ratings yet
Final Note
23 pages
Compiled Notes: Mscfe 610 Econometrics
No ratings yet
Compiled Notes: Mscfe 610 Econometrics
44 pages
ExcelData Analysis Manual
No ratings yet
ExcelData Analysis Manual
19 pages
Data Collection
No ratings yet
Data Collection
104 pages
Powerpoint Workshop Introduction To Deep Learning - Statistics and Data Analysis
No ratings yet
Powerpoint Workshop Introduction To Deep Learning - Statistics and Data Analysis
26 pages
Model Perf Cheat Sheet
No ratings yet
Model Perf Cheat Sheet
2 pages
15 Statistical Hypothesis Tests in Python (Cheat Sheet)
No ratings yet
15 Statistical Hypothesis Tests in Python (Cheat Sheet)
11 pages
Data Visualization Using Seaborn - Towards Data Science
No ratings yet
Data Visualization Using Seaborn - Towards Data Science
31 pages
Statistics For Data Analysis Lec 1 Introduction and Visualization
No ratings yet
Statistics For Data Analysis Lec 1 Introduction and Visualization
8 pages
Introduction To Statistics
0% (1)
Introduction To Statistics
19 pages
Sheet4ProbPower PDF
No ratings yet
Sheet4ProbPower PDF
2 pages
Statistic Cheat Sheet
No ratings yet
Statistic Cheat Sheet
3 pages
100 Days Data Analyst Learning Roadmap
No ratings yet
100 Days Data Analyst Learning Roadmap
6 pages
Approaches To The Analysis of Survey Data PDF
No ratings yet
Approaches To The Analysis of Survey Data PDF
28 pages
Introduction To Tableau
No ratings yet
Introduction To Tableau
18 pages
Visualization Types - Introduction To Data Visualization - LibGuides at Duke University
No ratings yet
Visualization Types - Introduction To Data Visualization - LibGuides at Duke University
6 pages
Forecast Time Series With R Language
No ratings yet
Forecast Time Series With R Language
98 pages
Stratified Sampling
No ratings yet
Stratified Sampling
3 pages
Data Science Bootcamp
No ratings yet
Data Science Bootcamp
26 pages
Excel Shortcuts 2007 To 2013 Plus
No ratings yet
Excel Shortcuts 2007 To 2013 Plus
4 pages
Parametric and Nonparametric Machine Learning Algorithms
No ratings yet
Parametric and Nonparametric Machine Learning Algorithms
16 pages
Data Entry, Coding & Cleaning: SPSS Training
No ratings yet
Data Entry, Coding & Cleaning: SPSS Training
25 pages
Python Primer: Patrice Koehl Modified by Xin Liu in Apr., 2011
No ratings yet
Python Primer: Patrice Koehl Modified by Xin Liu in Apr., 2011
33 pages
Statistical Infrences Lec 1
No ratings yet
Statistical Infrences Lec 1
35 pages
Programming For Data Science
100% (1)
Programming For Data Science
4 pages
Cheat Sheet Stats For Exam Cheat Sheet Stats For Exam
No ratings yet
Cheat Sheet Stats For Exam Cheat Sheet Stats For Exam
3 pages
Chapter 1-Database System Introduction
No ratings yet
Chapter 1-Database System Introduction
58 pages
Quantitative Techniques & Operations Research: Ankit Sharma Neha Rathod Suraj Bairagi Vaibhav Thamman
No ratings yet
Quantitative Techniques & Operations Research: Ankit Sharma Neha Rathod Suraj Bairagi Vaibhav Thamman
12 pages
Statistics Notes
No ratings yet
Statistics Notes
15 pages
A Beginner's Guide To Getting Your First Data Science Job: 2019 Edition
No ratings yet
A Beginner's Guide To Getting Your First Data Science Job: 2019 Edition
63 pages
Medians and Order Statistics: CLRS Chapter 9
No ratings yet
Medians and Order Statistics: CLRS Chapter 9
19 pages
Excel For Data Analysis
No ratings yet
Excel For Data Analysis
9 pages
Data Mart Info
No ratings yet
Data Mart Info
5 pages
Everything You Need For Clear and Efficient Data Visualization
No ratings yet
Everything You Need For Clear and Efficient Data Visualization
41 pages
Practical List Ip
100% (1)
Practical List Ip
10 pages
Query Optimiation
No ratings yet
Query Optimiation
39 pages
Tableau Syllabus
No ratings yet
Tableau Syllabus
13 pages
Data Analytics Tableau & Python
No ratings yet
Data Analytics Tableau & Python
15 pages
Prefixes That Will Make You Better at Life: Prefix Meaning Examples
No ratings yet
Prefixes That Will Make You Better at Life: Prefix Meaning Examples
5 pages
Getting Started With Tableau Prep
No ratings yet
Getting Started With Tableau Prep
3 pages
Exploratory Data Analysis
100% (1)
Exploratory Data Analysis
20 pages
Advanced Statistical Inference
No ratings yet
Advanced Statistical Inference
7 pages
SQL Tutorial
No ratings yet
SQL Tutorial
28 pages
Combining Like Terms Worksheet PDF
No ratings yet
Combining Like Terms Worksheet PDF
2 pages
ST2195 Complete
No ratings yet
ST2195 Complete
430 pages
Predictive Analytics: A Survey, Trends, Applications, Oppurtunities & Challenges
No ratings yet
Predictive Analytics: A Survey, Trends, Applications, Oppurtunities & Challenges
5 pages
Introduction To IBM SPSS Statistics
No ratings yet
Introduction To IBM SPSS Statistics
85 pages
Data Analysis Project
No ratings yet
Data Analysis Project
18 pages
Statistics Cheat Sheet
100% (1)
Statistics Cheat Sheet
4 pages
Chapter 5
No ratings yet
Chapter 5
58 pages
The Nature of Statistics (Statistics - A Universal Guide To The Unknown Book 1)
No ratings yet
The Nature of Statistics (Statistics - A Universal Guide To The Unknown Book 1)
184 pages
Unit3 160420200647 PDF
No ratings yet
Unit3 160420200647 PDF
146 pages
Chapter 1-Data and Statistics: Multiple Choice
No ratings yet
Chapter 1-Data and Statistics: Multiple Choice
20 pages
A Lesson 1 Introduction To Statistics & SPSS
100% (1)
A Lesson 1 Introduction To Statistics & SPSS
8 pages
Probability Cheatsheet
100% (2)
Probability Cheatsheet
10 pages
Statistics Notes
No ratings yet
Statistics Notes
17 pages
AP Stat Review
No ratings yet
AP Stat Review
23 pages
A-level Maths Revision: Cheeky Revision Shortcuts
From Everand
A-level Maths Revision: Cheeky Revision Shortcuts
Scool Revision
3.5/5 (8)
Performance of BPSK Over Rayleigh Fading Channel
No ratings yet
Performance of BPSK Over Rayleigh Fading Channel
5 pages
Probabilistic physics-guided machine learning for fatigue data analysis
No ratings yet
Probabilistic physics-guided machine learning for fatigue data analysis
31 pages
Fag Erland 2009
No ratings yet
Fag Erland 2009
7 pages
The End of History Illusion
No ratings yet
The End of History Illusion
10 pages
Normal Distribution Review
No ratings yet
Normal Distribution Review
22 pages
Binomial Distribution
No ratings yet
Binomial Distribution
3 pages
Murphy Gaussians
No ratings yet
Murphy Gaussians
15 pages
Module 1A Notes Introduction To Statistical Analysis For Chemistry Students
No ratings yet
Module 1A Notes Introduction To Statistical Analysis For Chemistry Students
5 pages
Risk Aggregation and EC
No ratings yet
Risk Aggregation and EC
22 pages
LAb Act 5
100% (1)
LAb Act 5
15 pages
Chapter 3
No ratings yet
Chapter 3
36 pages
Week 8 Lecture New
No ratings yet
Week 8 Lecture New
49 pages
Statistical Methods For Assessing Agreement Between Two Methods of Clinical Measurement
No ratings yet
Statistical Methods For Assessing Agreement Between Two Methods of Clinical Measurement
9 pages
Robust Control Charts For Times Series Data
No ratings yet
Robust Control Charts For Times Series Data
6 pages
Bayes SolutionsPublic
No ratings yet
Bayes SolutionsPublic
37 pages
Lab #2
No ratings yet
Lab #2
5 pages
Estudio de Keros Inka
No ratings yet
Estudio de Keros Inka
8 pages
Ecological Methodology 1st Edition Charles J Krebs - Own the ebook now and start reading instantly
100% (1)
Ecological Methodology 1st Edition Charles J Krebs - Own the ebook now and start reading instantly
56 pages
ACC 324 - L Statistical Analysis With Computer Application
No ratings yet
ACC 324 - L Statistical Analysis With Computer Application
13 pages
STAT Midterm (Solution)
No ratings yet
STAT Midterm (Solution)
14 pages
Introducing Credibility Theory Into Glms For Ratemaking On Auto Portfolio
No ratings yet
Introducing Credibility Theory Into Glms For Ratemaking On Auto Portfolio
105 pages
Economics of European Integration Take Home. FINAL 2
No ratings yet
Economics of European Integration Take Home. FINAL 2
23 pages
Section 1 - Section 1 Question No.1 Bookmark: Examination: M.Sc. Statistics
No ratings yet
Section 1 - Section 1 Question No.1 Bookmark: Examination: M.Sc. Statistics
22 pages
Assign I P 050209
100% (2)
Assign I P 050209
2 pages
Normal Distribution
No ratings yet
Normal Distribution
16 pages
T Roshavelov Skopie 2014 PDF
No ratings yet
T Roshavelov Skopie 2014 PDF
41 pages

Statistics Cheatsheet

Uploaded by

Statistics Cheatsheet

Uploaded by

1 011333 0 56677

b. Sample – Subset of Pop we collect data on

c. Quantitative Variables – a number u. p78: standard deviation, s = √s2

between w. Density curve – relative proportion within classes –

Choosing a Sample y. p98: z-score z = ( x − x ) / s or ( x − µ) / σ

42010936.doc -2- Printed 10/15/2010

s2 = k. Two sided: Ho: μ = 0, Ha: μ ≠ 0

11. Confidence Intervals

42010936.doc -3- Printed 10/15/2010

You might also like