Formulae

This fact sheet summarizes key concepts for statistics exams, including:
• Notation for population and sample quantities such as means, variances, and proportions
• Assumptions and pivotal quantities for confidence intervals and hypothesis tests on one and two population means, variances, and proportions, and on differences between means
• Examples of a confidence interval for a normal population mean with unknown variance and of a lower-tail test for the same parameter
• Formulas for the sample covariance and correlation and for the slope and intercept estimates in simple linear regression

The fact sheet is organized by statistical test or method and lists the assumptions, the pivotal quantity and its distribution, and examples for each. It covers one- and two-sample tests and intervals for both normal and non-normal populations.

Uploaded by

silvia.jmez.glez


Statistics II

Fact sheet for exams

Confidence intervals and hypothesis testing in one and two populations.

Notation:
• $\mu_X$ and $\sigma_X^2$: population mean and variance of a random variable/population $X$; $\bar{X}$ and $s_X^2$: sample mean and quasi-variance
• $p_X$: population proportion if $X \sim \text{Bernoulli}(p_X)$; $\hat{p}_X$: sample proportion
• $X_n$: simple random sample (SRS) of size $n$ from $X$
• $(1 - \alpha)$: confidence level; $\alpha$: significance level
• $z_\alpha$: the upper $\alpha$ quantile of the $N(0,1)$ distribution; $t_{n-1;\alpha}$: the upper $\alpha$ quantile of a $t_{n-1}$ distribution

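| Parameter | Assumptions: SRS(s) and | Pivotal quantity and distribution |
|---|---|---|
| $\mu_X$ | Normal population, known variance | $\dfrac{\bar{X} - \mu_X}{\sigma_X/\sqrt{n}} \sim N(0,1)$ |
| $\mu_X$ | Normal population, unknown variance | $\dfrac{\bar{X} - \mu_X}{s_X/\sqrt{n}} \sim t_{n-1}$ |
| $\mu_X$ | Nonnormal population, large sample size | $\dfrac{\bar{X} - \mu_X}{s_X/\sqrt{n}} \overset{\text{approx.}}{\sim} N(0,1)$ |
| $p_X$ | Bernoulli population, large sample size | $\dfrac{\hat{p}_X - p_X}{\sqrt{\hat{p}_X(1 - \hat{p}_X)/n}} \overset{\text{approx.}}{\sim} N(0,1)$ (for hypothesis testing, replace $\hat{p}_X$ with $p_X$ in the standard error) |
| $\sigma_X^2$ and $\sigma_X$ | Normal population | $\dfrac{(n-1)s_X^2}{\sigma_X^2} \sim \chi^2_{n-1}$ |
| $\mu_X - \mu_Y$ | Normal difference $D_i = X_i - Y_i$, matched pairs | $\dfrac{\bar{D} - \mu_D}{s_D/\sqrt{n}} \sim t_{n-1}$ |
| $\mu_X - \mu_Y$ | Normal populations, common variance | $\dfrac{\bar{X} - \bar{Y} - (\mu_X - \mu_Y)}{s_p\sqrt{\frac{1}{n_X} + \frac{1}{n_Y}}} \sim t_{n_X+n_Y-2}$, where $s_p^2 = \dfrac{(n_X - 1)s_X^2 + (n_Y - 1)s_Y^2}{n_X + n_Y - 2}$ |
| $\mu_X - \mu_Y$ | Normal populations, known variances | $\dfrac{\bar{X} - \bar{Y} - (\mu_X - \mu_Y)}{\sqrt{\frac{\sigma_X^2}{n_X} + \frac{\sigma_Y^2}{n_Y}}} \sim N(0,1)$ |
| $\mu_X - \mu_Y$ | Nonnormal populations, large sample sizes | $\dfrac{\bar{X} - \bar{Y} - (\mu_X - \mu_Y)}{\sqrt{\frac{s_X^2}{n_X} + \frac{s_Y^2}{n_Y}}} \overset{\text{approx.}}{\sim} N(0,1)$ |
| $p_X - p_Y$ | Bernoulli populations, large sample sizes | $\dfrac{\hat{p}_X - \hat{p}_Y - (p_X - p_Y)}{\sqrt{\hat{p}_0(1 - \hat{p}_0)\left(\frac{1}{n_X} + \frac{1}{n_Y}\right)}} \overset{\text{approx.}}{\sim} N(0,1)$, where $\hat{p}_0 = \dfrac{n_X\hat{p}_X + n_Y\hat{p}_Y}{n_X + n_Y}$ |
| $\sigma_X^2/\sigma_Y^2$ and $\sigma_X/\sigma_Y$ | Normal populations | $\dfrac{s_X^2/\sigma_X^2}{s_Y^2/\sigma_Y^2} \sim F_{n_X-1,\,n_Y-1}$ |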
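
As an illustration of how one row of the table is used, the following minimal sketch applies the large-sample Bernoulli pivotal quantity. The function names and the data (60 successes in 100 trials) are hypothetical and not part of the fact sheet; the normal quantile comes from Python's standard `statistics.NormalDist`.

```python
from math import sqrt
from statistics import NormalDist


def proportion_ci(successes, n, alpha=0.05):
    """Large-sample (1 - alpha) confidence interval for a Bernoulli proportion."""
    p_hat = successes / n
    # Upper alpha/2 quantile of N(0, 1), i.e. z_{alpha/2} in the sheet's notation.
    z = NormalDist().inv_cdf(1 - alpha / 2)
    half_width = z * sqrt(p_hat * (1 - p_hat) / n)
    return p_hat - half_width, p_hat + half_width


def proportion_z_statistic(successes, n, p0):
    """Test statistic for H0: p = p0; the standard error uses p0, not p_hat."""
    p_hat = successes / n
    return (p_hat - p0) / sqrt(p0 * (1 - p0) / n)


# Hypothetical data: 60 successes in 100 trials.
low, high = proportion_ci(60, 100, alpha=0.05)
z_stat = proportion_z_statistic(60, 100, p0=0.5)
print(f"95% CI: ({low:.4f}, {high:.4f}), z = {z_stat:.2f}")
```

The two functions differ only in the standard error, which mirrors the note in the table: the interval plugs in the sample proportion, while the test plugs in the hypothesized value.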

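Example: To construct a $(1 - \alpha)$ confidence interval for $\mu_X$ if $X \sim N(\mu_X, \sigma_X^2)$ with $\sigma_X^2$ unknown, we have:
$$CI_{1-\alpha}(\mu_X) = \left( \bar{x} - t_{n-1;\alpha/2}\,\frac{s_x}{\sqrt{n}} \;;\; \bar{x} + t_{n-1;\alpha/2}\,\frac{s_x}{\sqrt{n}} \right)$$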
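
The interval above can be sketched in a few lines of Python. This is only an illustration: the function name and the sample are hypothetical, and the critical value $t_{n-1;\alpha/2}$ is passed in as an argument, as if read from a t table, because the standard library has no t quantile function.

```python
from math import sqrt
from statistics import mean, stdev


def t_confidence_interval(sample, t_crit):
    """CI for a normal mean with unknown variance.

    t_crit is t_{n-1; alpha/2}, looked up in a t table for n - 1 degrees of freedom.
    """
    n = len(sample)
    x_bar = mean(sample)
    s_x = stdev(sample)  # square root of the quasi-variance (divides by n - 1)
    half_width = t_crit * s_x / sqrt(n)
    return x_bar - half_width, x_bar + half_width


# Hypothetical sample of size n = 5; t_{4; 0.025} is approximately 2.776.
data = [4.0, 5.0, 6.0, 7.0, 8.0]
low, high = t_confidence_interval(data, t_crit=2.776)
print(f"95% CI for the mean: ({low:.3f}, {high:.3f})")
```
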
To perform a lower-tail test $H_0 : \mu_X \geq \mu_0$ versus $H_1 : \mu_X < \mu_0$, the rejection region at significance level $\alpha$, $RR_\alpha$, is:
$$RR_\alpha = \left\{ t : \overbrace{\frac{\bar{x} - \mu_0}{s_x/\sqrt{n}}}^{t} < t_{n-1;1-\alpha} \right\}$$
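
A minimal sketch of this lower-tail test follows. It relies on the symmetry of the t distribution, $t_{n-1;1-\alpha} = -t_{n-1;\alpha}$, so the caller supplies the usual tabulated upper quantile. The function name, the sample, and the hypothesized value are assumptions made for illustration.

```python
from math import sqrt
from statistics import mean, stdev


def lower_tail_t_test(sample, mu0, t_upper_alpha):
    """Lower-tail test of H0: mu >= mu0 against H1: mu < mu0.

    t_upper_alpha is the upper-alpha quantile t_{n-1; alpha} from a t table.
    By symmetry, t_{n-1; 1-alpha} = -t_{n-1; alpha}, so H0 is rejected
    when the statistic falls below -t_upper_alpha.
    """
    n = len(sample)
    t_stat = (mean(sample) - mu0) / (stdev(sample) / sqrt(n))
    return t_stat, t_stat < -t_upper_alpha


# Hypothetical sample of size n = 5; t_{4; 0.05} is approximately 2.132.
data = [4.0, 5.0, 6.0, 7.0, 8.0]
t_stat, reject = lower_tail_t_test(data, mu0=8.0, t_upper_alpha=2.132)
print(f"t = {t_stat:.3f}, reject H0: {reject}")
```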

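Sample covariance and correlation based on bivariate observations $(x_1, y_1), \ldots, (x_n, y_n)$:
$$s_{xy} = \operatorname{cov}(x, y) = \frac{\sum_{i=1}^{n} (x_i - \bar{x})(y_i - \bar{y})}{n - 1} = \frac{\sum_{i=1}^{n} x_i y_i - n\bar{x}\bar{y}}{n - 1}$$
$$r_{(x,y)} = \operatorname{cor}(x, y) = \frac{\operatorname{cov}(x, y)}{s_x s_y} = \frac{\sum_{i=1}^{n} x_i y_i - n\bar{x}\bar{y}}{\sqrt{\sum_{i=1}^{n} x_i^2 - n\bar{x}^2}\,\sqrt{\sum_{i=1}^{n} y_i^2 - n\bar{y}^2}}$$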
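
The shortcut forms of the covariance and correlation can be checked numerically with a short sketch such as the one below. The function names and the five data points are hypothetical.

```python
from math import sqrt


def sample_cov(x, y):
    """Sample covariance via the shortcut form (sum x_i y_i - n x_bar y_bar) / (n - 1)."""
    n = len(x)
    x_bar, y_bar = sum(x) / n, sum(y) / n
    return (sum(a * b for a, b in zip(x, y)) - n * x_bar * y_bar) / (n - 1)


def sample_cor(x, y):
    """Sample correlation cov(x, y) / (s_x s_y)."""
    s_x = sqrt(sample_cov(x, x))  # cov(x, x) is the quasi-variance of x
    s_y = sqrt(sample_cov(y, y))
    return sample_cov(x, y) / (s_x * s_y)


# Hypothetical bivariate observations.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 4.0, 5.0, 4.0, 5.0]
print(f"cov = {sample_cov(x, y):.3f}, cor = {sample_cor(x, y):.3f}")
```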

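Slope and intercept estimates in the simple linear regression model $y_i = \beta_0 + \beta_1 x_i + u_i$, where $u_i \sim$ iid $N(0, \sigma^2)$, to obtain the fitted line $\hat{y}_i = \hat{\beta}_0 + \hat{\beta}_1 x_i$:
$$\hat{\beta}_1 = \frac{\operatorname{cov}(x, y)}{s_x^2} = \frac{\frac{1}{n-1}\sum_{i=1}^{n} (x_i - \bar{x})(y_i - \bar{y})}{\frac{1}{n-1}\sum_{i=1}^{n} (x_i - \bar{x})^2} = \frac{\sum_{i=1}^{n} x_i y_i - n\bar{x}\bar{y}}{\sum_{i=1}^{n} x_i^2 - n\bar{x}^2}, \qquad \hat{\beta}_0 = \bar{y} - \hat{\beta}_1 \bar{x}$$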
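
The slope and intercept estimates can be computed directly from the centered sums, as in this minimal sketch. The function name and the data are hypothetical.

```python
def fit_simple_regression(x, y):
    """Least-squares estimates (beta0_hat, beta1_hat) for y = beta0 + beta1 x + u."""
    n = len(x)
    x_bar, y_bar = sum(x) / n, sum(y) / n
    s_xy = sum((a - x_bar) * (b - y_bar) for a, b in zip(x, y))  # (n - 1) cov(x, y)
    s_xx = sum((a - x_bar) ** 2 for a in x)                      # (n - 1) s_x^2
    beta1_hat = s_xy / s_xx  # the 1 / (n - 1) factors cancel
    beta0_hat = y_bar - beta1_hat * x_bar
    return beta0_hat, beta1_hat


# Hypothetical bivariate observations.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 4.0, 5.0, 4.0, 5.0]
b0, b1 = fit_simple_regression(x, y)
print(f"fitted line: y_hat = {b0:.2f} + {b1:.2f} x")
```
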
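Pivotal quantities for $\beta_1$, $\beta_0$, $\sigma^2$, with residuals $e_i = y_i - \hat{y}_i$ and residual variance $s_R^2 = \dfrac{\sum_{i=1}^{n} e_i^2}{n - 2}$:
$$\frac{\hat{\beta}_1 - \beta_1}{\sqrt{\dfrac{s_R^2}{(n-1)s_X^2}}} \sim t_{n-2}, \qquad \frac{\hat{\beta}_0 - \beta_0}{\sqrt{s_R^2\left(\dfrac{1}{n} + \dfrac{\bar{x}^2}{(n-1)s_X^2}\right)}} \sim t_{n-2}, \qquad \frac{(n-2)\,s_R^2}{\sigma^2} \sim \chi^2_{n-2}$$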
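
As a sketch of how the pivotal quantity for $\beta_1$ yields a confidence interval, the code below computes the residual variance, the standard error of the slope, and the interval $\hat{\beta}_1 \pm t_{n-2;\alpha/2}\, s(\hat{\beta}_1)$. The function name and data are hypothetical, and the critical value is assumed to come from a t table.

```python
from math import sqrt


def slope_inference(x, y, t_crit):
    """Return (beta1_hat, standard error, CI) for the slope.

    t_crit is t_{n-2; alpha/2}, taken from a t table.
    """
    n = len(x)
    x_bar, y_bar = sum(x) / n, sum(y) / n
    s_xx = sum((a - x_bar) ** 2 for a in x)  # equals (n - 1) s_X^2
    beta1 = sum((a - x_bar) * (b - y_bar) for a, b in zip(x, y)) / s_xx
    beta0 = y_bar - beta1 * x_bar
    residuals = [b - (beta0 + beta1 * a) for a, b in zip(x, y)]
    s2_r = sum(e ** 2 for e in residuals) / (n - 2)  # residual variance
    se_beta1 = sqrt(s2_r / s_xx)
    ci = (beta1 - t_crit * se_beta1, beta1 + t_crit * se_beta1)
    return beta1, se_beta1, ci


# Hypothetical data with n = 5; t_{3; 0.025} is approximately 3.182.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 4.0, 5.0, 4.0, 5.0]
beta1, se, (low, high) = slope_inference(x, y, t_crit=3.182)
print(f"beta1_hat = {beta1:.2f}, se = {se:.4f}, 95% CI = ({low:.3f}, {high:.3f})")
```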

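Confidence intervals for the mean and individual response for $y_0$ given $X = x_0$:
$$\hat{y}_0 \pm t_{n-2;\alpha/2}\sqrt{s_R^2\left(\frac{1}{n} + \frac{(x_0 - \bar{x})^2}{(n-1)\,s_X^2}\right)}, \qquad \hat{y}_0 \pm t_{n-2;\alpha/2}\sqrt{s_R^2\left(1 + \frac{1}{n} + \frac{(x_0 - \bar{x})^2}{(n-1)\,s_X^2}\right)}$$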
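
The two intervals differ only by the extra $1$ under the square root, which the following sketch makes explicit. The function name, the data, and the choice $x_0 = 3$ are assumptions for illustration; the critical value is supplied by the caller.

```python
from math import sqrt


def response_intervals(x, y, x0, t_crit):
    """Intervals at x0 for the mean response and for an individual response.

    t_crit is t_{n-2; alpha/2}, taken from a t table.
    """
    n = len(x)
    x_bar, y_bar = sum(x) / n, sum(y) / n
    s_xx = sum((a - x_bar) ** 2 for a in x)  # equals (n - 1) s_X^2
    beta1 = sum((a - x_bar) * (b - y_bar) for a, b in zip(x, y)) / s_xx
    beta0 = y_bar - beta1 * x_bar
    s2_r = sum((b - beta0 - beta1 * a) ** 2 for a, b in zip(x, y)) / (n - 2)
    y0_hat = beta0 + beta1 * x0
    h0 = 1 / n + (x0 - x_bar) ** 2 / s_xx  # term shared by both intervals
    hw_mean = t_crit * sqrt(s2_r * h0)
    hw_indiv = t_crit * sqrt(s2_r * (1 + h0))
    return (y0_hat - hw_mean, y0_hat + hw_mean), (y0_hat - hw_indiv, y0_hat + hw_indiv)


# Hypothetical data with n = 5; t_{3; 0.025} is approximately 3.182.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 4.0, 5.0, 4.0, 5.0]
mean_ci, indiv_ci = response_intervals(x, y, x0=3.0, t_crit=3.182)
print(f"mean response: {mean_ci}, individual response: {indiv_ci}")
```

The interval for an individual response is always wider, since it also accounts for the variability of a single new observation around the regression line.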

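ANOVA table for the simple linear regression model (R-squared $R^2 = SSM/SST$):

| Source of variability | SS | DF | Mean | F ratio |
|---|---|---|---|---|
| Model | $SSM = \sum_{i=1}^{n} (\hat{y}_i - \bar{y})^2$ | $1$ | $SSM/1$ | $SSM/s_R^2$ |
| Residuals/errors | $SSR = \sum_{i=1}^{n} (y_i - \hat{y}_i)^2 = \sum_{i=1}^{n} e_i^2$ | $n - 2$ | $SSR/(n-2) = s_R^2$ | |
| Total | $SST = SSM + SSR$ | $n - 1$ | | |

To test $H_0 : \beta_1 = 0$ vs. $H_1 : \beta_1 \neq 0$, the test statistic is $F = SSM/s_R^2 \sim F_{1,n-2}$ under $H_0$, and $RR_\alpha = \{F > F_{1,n-2;\alpha}\}$.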
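
The ANOVA decomposition can be reproduced with a few sums, as in this minimal sketch. The function name, the dictionary keys, and the data are hypothetical; the critical value $F_{1,n-2;\alpha}$ would still have to come from an F table.

```python
def anova_simple_regression(x, y):
    """Return SSM, SSR, SST, R^2 and the F ratio for a simple linear regression."""
    n = len(x)
    x_bar, y_bar = sum(x) / n, sum(y) / n
    s_xx = sum((a - x_bar) ** 2 for a in x)
    beta1 = sum((a - x_bar) * (b - y_bar) for a, b in zip(x, y)) / s_xx
    beta0 = y_bar - beta1 * x_bar
    fitted = [beta0 + beta1 * a for a in x]
    ssm = sum((f - y_bar) ** 2 for f in fitted)          # model sum of squares
    ssr = sum((b - f) ** 2 for b, f in zip(y, fitted))   # residual sum of squares
    sst = ssm + ssr
    s2_r = ssr / (n - 2)                                 # residual mean square
    return {"SSM": ssm, "SSR": ssr, "SST": sst, "R2": ssm / sst, "F": ssm / s2_r}


# Hypothetical bivariate observations.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 4.0, 5.0, 4.0, 5.0]
table = anova_simple_regression(x, y)
print(table)
```

For these data the F ratio is about 4.5, below the tabulated $F_{1,3;0.05} \approx 10.13$, so $H_0 : \beta_1 = 0$ would not be rejected at the 5% level.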

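Model formulation, estimates, fitted model and residuals in the multiple linear regression model
$y_i = \beta_0 + \beta_1 x_{i1} + \beta_2 x_{i2} + \cdots + \beta_k x_{ik} + u_i$, where $u_i \sim$ iid $N(0, \sigma^2)$, in matrix notation:
$$y = X\beta + u, \qquad \hat{\beta} = (X^T X)^{-1} X^T y, \qquad \hat{y} = X\hat{\beta}, \qquad e = y - \hat{y},$$
where
$$y = \begin{pmatrix} y_1 \\ y_2 \\ \vdots \\ y_n \end{pmatrix}, \quad
X = \begin{pmatrix} 1 & x_{11} & x_{12} & \cdots & x_{1k} \\ 1 & x_{21} & x_{22} & \cdots & x_{2k} \\ \vdots & \vdots & \vdots & \ddots & \vdots \\ 1 & x_{n1} & x_{n2} & \cdots & x_{nk} \end{pmatrix}, \quad
\beta = \begin{pmatrix} \beta_0 \\ \beta_1 \\ \beta_2 \\ \vdots \\ \beta_k \end{pmatrix}, \quad
u = \begin{pmatrix} u_1 \\ u_2 \\ \vdots \\ u_n \end{pmatrix}$$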
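Pivotal quantities for $\sigma^2$ and $\beta_j$, $j = 0, 1, \ldots, k$, with residual variance $s_R^2 = \sum_{i=1}^{n} e_i^2 / (n - k - 1)$:
$$\frac{(n - k - 1)\, s_R^2}{\sigma^2} \sim \chi^2_{n-k-1}, \qquad \frac{\hat{\beta}_j - \beta_j}{s(\hat{\beta}_j)} \sim t_{n-k-1},$$
where $s(\hat{\beta}_j) = \sqrt{s^2(\hat{\beta}_j)}$ and $s^2(\hat{\beta}_j)$ is the $j$-th diagonal element (counting from $j = 0$) of the (estimated) variance-covariance matrix of $\hat{\beta}$,
with the matrix defined as $S_{\hat{\beta}} = s_R^2 (X^T X)^{-1}$.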
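
The matrix formulas above can be traced step by step in a dependency-free sketch. All helper names and the example data are hypothetical; the small Gauss-Jordan inverse is used only to keep the example self-contained, and in practice a linear algebra library would normally replace it.

```python
from math import sqrt


def transpose(a):
    return [list(col) for col in zip(*a)]


def matmul(a, b):
    bt = transpose(b)
    return [[sum(p * q for p, q in zip(row, col)) for col in bt] for row in a]


def inverse(a):
    """Invert a small nonsingular matrix by Gauss-Jordan elimination with partial pivoting."""
    n = len(a)
    aug = [list(row) + [float(i == j) for j in range(n)] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        p = aug[col][col]
        aug[col] = [v / p for v in aug[col]]
        for r in range(n):
            if r != col:
                f = aug[r][col]
                aug[r] = [v - f * w for v, w in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def fit_multiple_regression(predictors, y):
    """Return (beta_hat, s2_R, standard errors) for y = X beta + u."""
    X = [[1.0] + list(row) for row in predictors]  # prepend the intercept column
    n, k = len(X), len(X[0]) - 1
    xtx_inv = inverse(matmul(transpose(X), X))     # (X^T X)^{-1}
    xty = matmul(transpose(X), [[v] for v in y])   # X^T y as a column vector
    beta = [row[0] for row in matmul(xtx_inv, xty)]
    residuals = [yi - sum(b * xij for b, xij in zip(beta, xi)) for xi, yi in zip(X, y)]
    s2_r = sum(e ** 2 for e in residuals) / (n - k - 1)
    # Square roots of the diagonal of S = s2_R (X^T X)^{-1}.
    std_errors = [sqrt(s2_r * xtx_inv[j][j]) for j in range(k + 1)]
    return beta, s2_r, std_errors


# Hypothetical data with k = 2 predictors and n = 5 observations.
predictors = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [2.0, 1.0]]
y = [1.1, 2.9, 4.2, 5.8, 8.1]
beta_hat, s2_r, se = fit_multiple_regression(predictors, y)
print("beta_hat:", beta_hat, "s2_R:", s2_r, "standard errors:", se)
```

Each $t$ statistic is then `(beta_hat[j] - beta_j) / se[j]`, to be compared with a $t_{n-k-1}$ quantile from a table.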

ANOVA table for the multiple linear regression model:

| Source of variability | SS | DF | Mean | F ratio |
|---|---|---|---|---|
| Model | $SSM$ | $k$ | $SSM/k$ | $(SSM/k)/s_R^2$ |
| Residuals/errors | $SSR$ | $n - k - 1$ | $SSR/(n - k - 1) = s_R^2$ | |
| Total | $SST$ | $n - 1$ | | |
