Stat Review Continued
Probability
• Random Variable
– Has some probability p_j of taking on a specific numeric value x_j each time it is drawn.
– Has a “support”: the set of possible values it can take on.
– Examples: flipping a coin, ages in our classroom, level in school.
• Notation:
P(X = x_j) = p_j
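As a minimal sketch of this notation (Python with numpy; the names support and probs are just illustrative), a fair coin as a discrete random variable:

```python
import numpy as np

# A fair coin: support x_j in {0, 1}, probabilities p_j = {0.5, 0.5}.
support = np.array([0, 1])
probs = np.array([0.5, 0.5])

# P(X = x_j) = p_j for each point in the support.
for x, p in zip(support, probs):
    print(f"P(X = {x}) = {p}")

# Each draw takes the value x_j with probability p_j.
draws = np.random.choice(support, size=10, p=probs)
print(draws)
```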
Probability Continued
• P(a < X < b)
– Draw what it looks like for a normal curve with
arbitrary points a and b
• Cumulative Distribution Function
– P(X ≤ x) = F(x)
– Bounded between 0 and 1
– Draw it for Binomial, Uniform, and Normal
distributions
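A short sketch of the CDF for the three distributions named above, assuming scipy is available; each value of F is bounded between 0 and 1:

```python
from scipy import stats

# F(x) = P(X <= x)
print(stats.binom.cdf(3, n=10, p=0.5))  # Binomial(10, 0.5): P(X <= 3)
print(stats.uniform.cdf(0.25))          # Uniform(0, 1):     P(X <= 0.25)
print(stats.norm.cdf(1.96))             # Standard normal:   P(X <= 1.96), ~0.975

# P(a < X < b) for a normal curve is the area between a and b: F(b) - F(a).
a, b = -1.0, 1.0
print(stats.norm.cdf(b) - stats.norm.cdf(a))  # ~0.683
```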
More Stats
• Joint Distributions
– f_{X,Y}(x, y) = P(X = x, Y = y)
• Examples: Return and event announcement
(merger).
• What does it mean for distributions to be
independent?
– f_{X,Y}(x, y) = f_X(x) f_Y(y)
• Example:
– Big negative return on the S&P 500, the price of green tea in
China.
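A small sketch of the independence factorization: a 2x2 joint table (values made up for illustration) and a check that f_{X,Y}(x, y) = f_X(x) f_Y(y) in every cell:

```python
import numpy as np

# Joint distribution f_{X,Y}(x, y) as a table: rows index x, columns index y.
joint = np.array([[0.25, 0.25],
                  [0.25, 0.25]])

# Marginals: f_X sums over y, f_Y sums over x.
f_x = joint.sum(axis=1)
f_y = joint.sum(axis=0)

# Independent iff the joint equals the outer product of the marginals.
print(np.allclose(joint, np.outer(f_x, f_y)))  # True for this table
```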
Covariance
• Cov(X, Y) = E[(X - μ_X)(Y - μ_Y)]
• If X and Y are independent, then Cov(X, Y) = 0.
• Var[aX + bY] = a^2 Var[X] + b^2 Var[Y] + 2ab Cov(X, Y)
• What is Var(X - Y) if X and Y are independent?
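For the question above: independence gives Cov(X, Y) = 0, so Var(X - Y) = Var(X) + Var(Y) (take a = 1, b = -1 in the identity). A quick simulation sketch of the variance identity:

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(size=100_000)  # X and Y drawn independently
y = rng.normal(size=100_000)
a, b = 1.0, -1.0              # a = 1, b = -1 gives Var(X - Y)

lhs = np.var(a * x + b * y)
rhs = a**2 * np.var(x) + b**2 * np.var(y) + 2 * a * b * np.cov(x, y)[0, 1]
print(lhs, rhs)  # nearly equal; the Cov term is ~0 by independence
```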
Correlation Coefficient ρ
ρ = Cov(X, Y) / (Var(X) Var(Y))^0.5

Regression Coefficient β
β = Cov(X, Y) / Var(X)
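A minimal sketch computing both quantities from the same sample moments (numpy only; the data are simulated):

```python
import numpy as np

rng = np.random.default_rng(1)
x = rng.normal(size=10_000)
y = 0.5 * x + rng.normal(size=10_000)  # Y depends on X

cov_xy = np.cov(x, y)[0, 1]
rho = cov_xy / np.sqrt(np.var(x, ddof=1) * np.var(y, ddof=1))  # correlation
beta = cov_xy / np.var(x, ddof=1)                              # regression slope
print(rho, beta)  # beta ~ 0.5; rho rescales beta by sd(X)/sd(Y)
```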
Correlation versus Regression
• Correlation between random variables implies nothing about causation.
• We may investigate the correlation of
sunspots and stock market crashes, but that
does not mean we are determining causation
- i.e. that sunspots cause stock market
crashes, or vice versa.
Correlation and Regression
Continued
• Causation, or dependency, has us positing a model:
– given the value of one variable, X, we expect another variable, Y, to take on a particular value, i.e., that X causes Y.
• We estimate these relationships with
techniques like least squares.
• R^2 is a typical measure of goodness-of-fit.
Dummy Variables in Regressions
• An intercept dummy allows the constant to
be different in the regression depending on
whether the dummy is 1 or 0.
• For instance, we may expect the mean return to be lower on Mondays.
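A hedged sketch of the Monday intercept dummy using statsmodels OLS; the returns here are simulated, not real data:

```python
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(2)
n = 500
monday = (np.arange(n) % 5 == 0).astype(float)              # 1 on "Mondays", else 0
ret = 0.05 - 0.10 * monday + rng.normal(scale=1.0, size=n)  # lower mean return on Mondays

X = sm.add_constant(monday)
fit = sm.OLS(ret, X).fit()
print(fit.params)  # const ~ 0.05; the dummy shifts the intercept by ~ -0.10
```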
Slope Dummy Variables
• We may also expect that the effect of
explanatory variables on the dependent
variable will change with other conditions,
and for this we need a slope dummy which
allows the slope parameter to be different
depending on the condition.
• For instance, the impact of mood on a Monday may be exaggerated by a soccer defeat.
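A sketch of a slope dummy as an interaction term (simulated data; mood and defeat are illustrative variable names):

```python
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(3)
n = 500
mood = rng.normal(size=n)            # explanatory variable
defeat = rng.integers(0, 2, size=n)  # condition dummy (soccer defeat)
# Slope on mood is 0.5 normally and 1.0 after a defeat.
y = 0.5 * mood + 0.5 * defeat * mood + rng.normal(size=n)

X = sm.add_constant(np.column_stack([mood, defeat, defeat * mood]))
fit = sm.OLS(y, X).fit()
print(fit.params)  # [const, mood, defeat, mood*defeat]; the last is the slope shift
```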
Dummy Variables Continued
• Dummy variables may be used to test for a
change in the intercept or the slope
parameters (test the Monday effect).
• We can include dummies for fewer categories than exist in the data (or else exclude another term, such as the intercept).
– For instance, dummies may be used to model quarterly seasonality, but we can’t include a dummy for each quarter as well as an intercept term.
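A sketch of the quarterly-seasonality example: with an intercept, include dummies for only three of the four quarters (pandas' drop_first handles this); all four plus an intercept would be perfectly collinear:

```python
import numpy as np
import pandas as pd
import statsmodels.api as sm

rng = np.random.default_rng(4)
quarters = pd.Series(["Q1", "Q2", "Q3", "Q4"] * 25)
y = rng.normal(size=len(quarters))

# drop_first=True omits the Q1 dummy; its mean is absorbed by the intercept.
dummies = pd.get_dummies(quarters, drop_first=True).astype(float)
X = sm.add_constant(dummies)
print(sm.OLS(y, X).fit().params)  # const = Q1 mean; Q2-Q4 are shifts from Q1
```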
Multicollinearity
• One or more regressors are nearly linear
combinations of the other regressors.
• Symptoms:
– High R^2 and F-statistic for the significance of a group of regressors jointly, but all individual variables in the group have low t-statistics.
• Consequences:
– Estimates of coefficients become imprecise,
sensitive to sample window.
• Solutions? More data, simpler questions.
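A simulation sketch of the symptom: two nearly collinear regressors that jointly fit y well (high R^2 and F) while each individual t-statistic is weak:

```python
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(5)
n = 200
x1 = rng.normal(size=n)
x2 = x1 + rng.normal(scale=0.01, size=n)  # x2 is nearly a copy of x1
y = x1 + x2 + rng.normal(size=n)

fit = sm.OLS(y, sm.add_constant(np.column_stack([x1, x2]))).fit()
print(fit.rsquared, fit.fvalue)  # high R^2, large joint F-statistic
print(fit.tvalues[1:])           # individual t-stats are small and unstable
```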
Specification Error
• Basically, use of the wrong model:
– Faulty inclusion or exclusion of variables.
– Mis-measured variables.
– Incorrect form of model.
• This can be pretty serious.
– Faulty omission can lead to invalid inference
and biased estimates.
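A sketch of why faulty omission biases estimates: when an omitted variable z is correlated with an included regressor x, the coefficient on x absorbs part of z's effect (simulated example):

```python
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(6)
n = 5_000
x = rng.normal(size=n)
z = 0.8 * x + rng.normal(size=n)  # omitted variable, correlated with x
y = 1.0 * x + 1.0 * z + rng.normal(size=n)

full = sm.OLS(y, sm.add_constant(np.column_stack([x, z]))).fit()
omit = sm.OLS(y, sm.add_constant(x)).fit()
print(full.params[1])  # ~1.0: unbiased when z is included
print(omit.params[1])  # ~1.8: biased upward when z is omitted
```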
Other Complications
• Heteroskedasticity:
– Can lead to invalid inference.
• Autocorrelation:
– Can lead to invalid inference and biased estimation.
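One common remedy, sketched here: keep the OLS point estimates but report robust standard errors, e.g. White (HC) errors for heteroskedasticity or Newey-West (HAC) errors for autocorrelation, both available through statsmodels' cov_type option:

```python
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(7)
n = 300
x = rng.normal(size=n)
y = 1.0 + 0.5 * x + rng.normal(scale=1 + np.abs(x), size=n)  # heteroskedastic errors

X = sm.add_constant(x)
plain = sm.OLS(y, X).fit()                                       # classical SEs
white = sm.OLS(y, X).fit(cov_type="HC1")                         # heteroskedasticity-robust
hac = sm.OLS(y, X).fit(cov_type="HAC", cov_kwds={"maxlags": 5})  # autocorrelation-robust
print(plain.bse, white.bse, hac.bse, sep="\n")
```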
S&P 100 Index and Volatility
[Figure: plot of the S&P 100 index and its volatility. Heteroskedasticity: residuals predictably large in magnitude. Autocorrelation: residuals predictably negative.]