Homework 1
This homework is due at 11:59 pm (Eastern Time) on Jan 30, 2025. The homework is to be
submitted on Gradescope.
Exercise 1
(a) (2 + 3 + 3 points)
Fix two real numbers $a$ and $b$. Consider the new variable $z := a x^{(1)} + b$. In other words, the value of the variable $z$ for the $j$-th observation is given by $z_j = a x_j^{(1)} + b$. Show that
$$\bar{z}_n = a\bar{x}_n^{(1)} + b, \qquad \mathrm{var}(z) = a^2\,\mathrm{var}\big(x^{(1)}\big), \qquad \mathrm{cov}\big(z, x^{(2)}\big) = a\,\mathrm{cov}\big(x^{(1)}, x^{(2)}\big).$$
Use the definitions of the sample mean, variance, and covariance.
$$\bar{z}_n = \frac{1}{n}\sum_{j=1}^{n} z_j = \frac{1}{n}\sum_{j=1}^{n}\big(a x_j^{(1)} + b\big)$$
$$\bar{z}_n = a \cdot \frac{1}{n}\sum_{j=1}^{n} x_j^{(1)} + \frac{1}{n}\sum_{j=1}^{n} b$$
$$\bar{z}_n = a\bar{x}_n^{(1)} + b$$
$$\mathrm{var}(z) = \frac{1}{n-1}\sum_{j=1}^{n}\big(z_j - \bar{z}_n\big)^2$$
Substitute $z_j = a x_j^{(1)} + b$ and $\bar{z}_n = a\bar{x}_n^{(1)} + b$:
$$\mathrm{var}(z) = \frac{1}{n-1}\sum_{j=1}^{n}\Big[a x_j^{(1)} + b - \big(a\bar{x}_n^{(1)} + b\big)\Big]^2$$
$$\mathrm{var}(z) = \frac{1}{n-1}\sum_{j=1}^{n} a^2\big(x_j^{(1)} - \bar{x}_n^{(1)}\big)^2$$
$$\mathrm{var}(z) = a^2 \cdot \frac{1}{n-1}\sum_{j=1}^{n}\big(x_j^{(1)} - \bar{x}_n^{(1)}\big)^2 = a^2\,\mathrm{var}\big(x^{(1)}\big)$$
$$\mathrm{cov}\big(z, x^{(2)}\big) = \frac{1}{n-1}\sum_{j=1}^{n}\big(z_j - \bar{z}_n\big)\big(x_j^{(2)} - \bar{x}_n^{(2)}\big)$$
Substitute $z_j = a x_j^{(1)} + b$ and $\bar{z}_n = a\bar{x}_n^{(1)} + b$:
$$\mathrm{cov}\big(z, x^{(2)}\big) = \frac{1}{n-1}\sum_{j=1}^{n} a\big(x_j^{(1)} - \bar{x}_n^{(1)}\big)\big(x_j^{(2)} - \bar{x}_n^{(2)}\big)$$
$$\mathrm{cov}\big(z, x^{(2)}\big) = a \cdot \frac{1}{n-1}\sum_{j=1}^{n}\big(x_j^{(1)} - \bar{x}_n^{(1)}\big)\big(x_j^{(2)} - \bar{x}_n^{(2)}\big) = a\,\mathrm{cov}\big(x^{(1)}, x^{(2)}\big)$$
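As a quick numerical sanity check (not part of the proof), these identities can be verified in R on simulated data; the variable names below are invented for the illustration:
set.seed(1)
x1 <- rnorm(50); x2 <- rnorm(50)   # stand-ins for x^(1) and x^(2)
a <- 2.5; b <- -1
z <- a * x1 + b
mean(z) - (a * mean(x1) + b)       # ~0: zbar = a*xbar + b
var(z) - a^2 * var(x1)             # ~0: var(z) = a^2 var(x1)
cov(z, x2) - a * cov(x1, x2)       # ~0: cov(z, x2) = a cov(x1, x2)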
(b) (5 points)
Show that
$$\mathrm{var}\big(x^{(1)} + x^{(2)}\big) = \mathrm{var}\big(x^{(1)}\big) + \mathrm{var}\big(x^{(2)}\big) + 2\,\mathrm{cov}\big(x^{(1)}, x^{(2)}\big).$$
$$\mathrm{var}\big(x^{(1)} + x^{(2)}\big) = \frac{1}{n-1}\sum_{j=1}^{n}\big(x_j^{(1)} + x_j^{(2)} - \bar{x}_n^{(1)} - \bar{x}_n^{(2)}\big)^2$$
Expanding the square,
$$\big(x_j^{(1)} + x_j^{(2)} - \bar{x}_n^{(1)} - \bar{x}_n^{(2)}\big)^2 = \big(x_j^{(1)} - \bar{x}_n^{(1)}\big)^2 + \big(x_j^{(2)} - \bar{x}_n^{(2)}\big)^2 + 2\big(x_j^{(1)} - \bar{x}_n^{(1)}\big)\big(x_j^{(2)} - \bar{x}_n^{(2)}\big)$$
Summing over $j$ and dividing by $n-1$, the three terms become the two sample variances and the sample covariance:
$$\mathrm{var}\big(x^{(1)} + x^{(2)}\big) = \mathrm{var}\big(x^{(1)}\big) + \mathrm{var}\big(x^{(2)}\big) + 2\,\mathrm{cov}\big(x^{(1)}, x^{(2)}\big)$$
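The identity in (b) can be checked numerically the same way; a minimal illustrative sketch in R, reusing x1 and x2 from the sketch above:
# ~0: var of the sum equals the sum of variances plus twice the covariance
var(x1 + x2) - (var(x1) + var(x2) + 2 * cov(x1, x2))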
Exercise 2 (7 points)
Consider a dataset of $n$ observations collecting data on a variable $Y$. The value of the numerical variable $Y$ for the $i$-th observation is given by $Y_i$, for $i = 1, \dots, n$. Establish the following sum-of-squares decomposition. For any real number $c$,
$$\sum_{i=1}^{n}(Y_i - c)^2 = \sum_{i=1}^{n}\big(Y_i - \bar{Y}_n\big)^2 + n\big(\bar{Y}_n - c\big)^2,$$
where $\bar{Y}_n$ is the sample mean of the variable $Y$. Observe that the above identity shows the LHS to be minimized uniquely at $c = \bar{Y}_n$.
Write $(Y_i - c)$ on the left-hand side as
$$(Y_i - c) = (Y_i - \bar{Y}_n) + (\bar{Y}_n - c),$$
and use the expansion $(a+b)^2 = a^2 + b^2 + 2ab$ to expand $(Y_i - c)^2$.
$$(Y_i - c)^2 = \big[(Y_i - \bar{Y}_n) + (\bar{Y}_n - c)\big]^2 = (Y_i - \bar{Y}_n)^2 + (\bar{Y}_n - c)^2 + 2(Y_i - \bar{Y}_n)(\bar{Y}_n - c).$$
Summing over $i$,
$$\sum_{i=1}^{n}(Y_i - c)^2 = \sum_{i=1}^{n}(Y_i - \bar{Y}_n)^2 + n(\bar{Y}_n - c)^2 + 2(\bar{Y}_n - c)\sum_{i=1}^{n}(Y_i - \bar{Y}_n).$$
The cross term vanishes because $\sum_{i=1}^{n}(Y_i - \bar{Y}_n) = \sum_{i=1}^{n} Y_i - n\bar{Y}_n = 0$, which gives the identity.
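An illustrative R check of the decomposition (the names Y and c0 are made up for the example):
set.seed(1)
Y <- rnorm(40); c0 <- 0.7
# ~0: sum of squares about c0 equals spread about the mean plus n*(mean - c0)^2
sum((Y - c0)^2) - (sum((Y - mean(Y))^2) + length(Y) * (mean(Y) - c0)^2)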
Exercise 3
(a) (2 points)
Use Exercise 2 to show that for any $(\beta_0, \beta_1)$,
$$\sum_{i=1}^{n}(Y_i - \beta_0 - \beta_1 x_i)^2 = \sum_{i=1}^{n}\big(Y_i - \bar{Y}_n - \beta_1 x_i + \beta_1 \bar{x}_n\big)^2 + n\big(\bar{Y}_n - \beta_1 \bar{x}_n - \beta_0\big)^2.$$
(b) (2 points)
Deduce from Part (a) that any solution $(\hat\beta_0, \hat\beta_1)$ to the least-squares optimization problem $\min_{\beta_0, \beta_1} \sum_{i=1}^{n}(Y_i - \beta_0 - \beta_1 x_i)^2$ satisfies the relation
$$\hat\beta_0 = \bar{Y}_n - \hat\beta_1 \bar{x}_n.$$
(c) (2 points)
Deduce from Part (b) that the solution $\hat\beta_1$ solves the following optimization problem:
$$\min_{\beta_1} \sum_{i=1}^{n}\big((Y_i - \bar{Y}_n) - \beta_1 (x_i - \bar{x}_n)\big)^2.$$
(d) (4 points)
Show that
$$\frac{1}{n-1}\sum_{i=1}^{n}\big((Y_i - \bar{Y}_n) - \beta_1 (x_i - \bar{x}_n)\big)^2 = \beta_1^2\,\mathrm{Var}(x) - 2\beta_1\,\mathrm{Cov}(x, Y) + \mathrm{Var}(Y).$$
(e) (3 points)
Use Part (d) to deduce that the problem in Part (c) is solved at
$$\hat\beta_1 = \frac{r_{xY}\, s_Y}{s_x}.$$
Use the fact that for any positive real number $a$ and real numbers $b, c$, the quadratic expression $a t^2 + b t + c$ is minimized at $t = -b/(2a)$.
a. Apply Exercise 2 to the variable $W_i := Y_i - \beta_1 x_i$, whose sample mean is $\bar{W}_n = \bar{Y}_n - \beta_1 \bar{x}_n$, with the constant $c = \beta_0$. The residual splits as
$$Y_i - \beta_0 - \beta_1 x_i = \big(Y_i - \bar{Y}_n - \beta_1 (x_i - \bar{x}_n)\big) + \big(\bar{Y}_n - \beta_1 \bar{x}_n - \beta_0\big),$$
and the sum-of-squares decomposition gives
$$\sum_{i=1}^{n}(Y_i - \beta_0 - \beta_1 x_i)^2 = \sum_{i=1}^{n}\big(Y_i - \bar{Y}_n - \beta_1 (x_i - \bar{x}_n)\big)^2 + n\big(\bar{Y}_n - \beta_1 \bar{x}_n - \beta_0\big)^2.$$
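A quick illustrative check of this identity in R (all names below are invented for the example):
set.seed(2)
x <- rnorm(30); Y <- 1 + 2 * x + rnorm(30)
b0 <- 0.5; b1 <- 1.8                 # arbitrary (beta_0, beta_1)
lhs <- sum((Y - b0 - b1 * x)^2)
rhs <- sum((Y - mean(Y) - b1 * (x - mean(x)))^2) +
  length(Y) * (mean(Y) - b1 * mean(x) - b0)^2
lhs - rhs                            # ~0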
b. The second term $n\big(\bar{Y}_n - \beta_1 \bar{x}_n - \beta_0\big)^2$ is nonnegative and equals zero exactly when $\beta_0 = \bar{Y}_n - \beta_1 \bar{x}_n$, so any minimizer satisfies $\hat\beta_0 = \bar{Y}_n - \hat\beta_1 \bar{x}_n$.
c. Substituting this relation makes the second term vanish:
$$\sum_{i=1}^{n}\big(Y_i - \hat\beta_0 - \beta_1 x_i\big)^2 = \sum_{i=1}^{n}\big((Y_i - \bar{Y}_n) - \beta_1 (x_i - \bar{x}_n)\big)^2.$$
Hence $\hat\beta_1$ minimizes the right-hand side over $\beta_1$.
d. Expanding the square,
$$\big((Y_i - \bar{Y}_n) - \beta_1 (x_i - \bar{x}_n)\big)^2 = (Y_i - \bar{Y}_n)^2 - 2\beta_1 (Y_i - \bar{Y}_n)(x_i - \bar{x}_n) + \beta_1^2 (x_i - \bar{x}_n)^2.$$
Summing over $i$ and dividing by $n-1$,
$$\frac{1}{n-1}\sum_{i=1}^{n}\big[\cdots\big] = \beta_1^2\,\mathrm{Var}(x) - 2\beta_1\,\mathrm{Cov}(x, Y) + \mathrm{Var}(Y).$$
e. $\beta_1^2\,\mathrm{Var}(x) - 2\beta_1\,\mathrm{Cov}(x, Y) + \mathrm{Var}(Y)$ is a quadratic in $\beta_1$ with positive leading coefficient $\mathrm{Var}(x)$. Applying the fact above with $a = \mathrm{Var}(x)$ and $b = -2\,\mathrm{Cov}(x, Y)$, it is minimized at
$$\hat\beta_1 = \frac{\mathrm{Cov}(x, Y)}{\mathrm{Var}(x)} = \frac{r_{xY}\, s_Y}{s_x},$$
where $r_{xY} = \frac{\mathrm{Cov}(x, Y)}{s_x s_Y}$, $s_x = \sqrt{\mathrm{Var}(x)}$, and $s_Y = \sqrt{\mathrm{Var}(Y)}$.
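As an illustrative check that the closed form matches lm(), reusing the simulated x and Y from the sketch above (names are for the example only):
fit <- lm(Y ~ x)
unname(coef(fit)["x"])       # least-squares slope from lm()
cov(x, Y) / var(x)           # Cov/Var formula
cor(x, Y) * sd(Y) / sd(x)    # r * s_Y / s_x -- all three agree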
Exercise 4
(a) (5 + 5 points)
Fit separate simple linear regression models with weight being the response variable and height being the predictor variable on the sub-datasets of male and female children. Report the estimated coefficients and a scatter plot (along with the fitted line) for each sub-population.
nepal <- read.csv("/Users/macbook/Desktop/STOR455/nepal.csv")
males <- subset(nepal, sex == 1)
females <- subset(nepal, sex == 2)
male_model <- lm(weight ~ height, data = males)
female_model <- lm(weight ~ height, data = females)
male_model$coefficients
## (Intercept) height
## -9.0869252 0.2393433
female_model$coefficients
## (Intercept) height
## -8.3712108 0.2281936
summary(male_model)
##
## Call:
## lm(formula = weight ~ height, data = males)
##
## Residuals:
## Min 1Q Median 3Q Max
## -2.7192 -0.5064 -0.0510 0.4496 3.2427
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) -9.086925 0.288998 -31.44 <2e-16 ***
## height 0.239343 0.003341 71.63 <2e-16 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 0.8373 on 453 degrees of freedom
## Multiple R-squared: 0.9189, Adjusted R-squared: 0.9187
## F-statistic: 5131 on 1 and 453 DF, p-value: < 2.2e-16
summary(female_model)
##
## Call:
## lm(formula = weight ~ height, data = females)
##
## Residuals:
## Min 1Q Median 3Q Max
## -2.82127 -0.57982 -0.02652 0.50813 3.15115
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) -8.371211 0.303580 -27.57 <2e-16 ***
## height 0.228194 0.003551 64.26 <2e-16 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 0.8916 on 420 degrees of freedom
## Multiple R-squared: 0.9077, Adjusted R-squared: 0.9075
## F-statistic: 4129 on 1 and 420 DF, p-value: < 2.2e-16
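The requested scatter plots with fitted lines can be produced in base R; a minimal sketch (plot titles, axis labels, and colors are illustrative choices):
plot(weight ~ height, data = males, main = "Male children",
     xlab = "height", ylab = "weight")
abline(male_model, col = "red")        # fitted line for the male sub-dataset
plot(weight ~ height, data = females, main = "Female children",
     xlab = "height", ylab = "weight")
abline(female_model, col = "red")      # fitted line for the female sub-dataset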
Examining the regression outputs for both subpopulations, the male model has a lower residual standard error (0.8373 vs. 0.8916) and a higher R-squared (0.9189 vs. 0.9077) than the female model. These results suggest that the simple linear regression model provides a slightly better fit for the male subpopulation: it explains a greater proportion of the variance in weight and yields slightly more precise predictions.
Exercise 5 (3 + 2 points)
Consider a bivariate dataset consisting of $n$ observations on 2 numerical variables $x^{(1)}$ and $x^{(2)}$. We first fit a simple linear regression model with $x^{(1)}$ as the response variable and $x^{(2)}$ as the explanatory variable. Let $b_{12}$ denote the (least-squares) estimated slope coefficient. Next we fit a simple linear regression model with $x^{(2)}$ as the response variable and $x^{(1)}$ as the explanatory variable. Let $b_{21}$ denote the (least-squares) estimated slope of this new fitted line. Show that
$$b_{12}\, b_{21} = \big(r_{x^{(1)} x^{(2)}}\big)^2,$$
where $r_{x^{(1)} x^{(2)}}$ is the sample correlation coefficient between the variables $x^{(1)}$ and $x^{(2)}$. Argue that both estimated slopes have the same sign and that at most one of them can have absolute value greater than 1.
By Exercise 3(e), the least-squares slope estimates are
$$b_{12} = \frac{\mathrm{Cov}\big(x^{(1)}, x^{(2)}\big)}{\mathrm{Var}\big(x^{(2)}\big)}, \qquad b_{21} = \frac{\mathrm{Cov}\big(x^{(1)}, x^{(2)}\big)}{\mathrm{Var}\big(x^{(1)}\big)},$$
so their product is
$$b_{12} \cdot b_{21} = \frac{\big[\mathrm{Cov}\big(x^{(1)}, x^{(2)}\big)\big]^2}{\mathrm{Var}\big(x^{(1)}\big)\,\mathrm{Var}\big(x^{(2)}\big)}.$$
The squared correlation coefficient is
$$\big(r_{x^{(1)} x^{(2)}}\big)^2 = \frac{\big[\mathrm{Cov}\big(x^{(1)}, x^{(2)}\big)\big]^2}{\mathrm{Var}\big(x^{(1)}\big)\,\mathrm{Var}\big(x^{(2)}\big)}.$$
Thus $b_{12} \cdot b_{21} = \big(r_{x^{(1)} x^{(2)}}\big)^2$.
Both slopes $b_{12}$ and $b_{21}$ share the sign of $\mathrm{Cov}\big(x^{(1)}, x^{(2)}\big)$, which matches the sign of $r_{x^{(1)} x^{(2)}}$. Since $\big|r_{x^{(1)} x^{(2)}}\big| \le 1$, we have $|b_{12}|\,|b_{21}| = \big(r_{x^{(1)} x^{(2)}}\big)^2 \le 1$; if both slopes had absolute value greater than 1, their product would exceed 1, a contradiction. Hence at most one of $|b_{12}|$ and $|b_{21}|$ can exceed 1.
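An illustrative R verification of this product identity on simulated data (the names u and v are invented for the example):
set.seed(3)
u <- rnorm(100); v <- 0.6 * u + rnorm(100)
b12 <- coef(lm(u ~ v))[2]            # slope with u as response
b21 <- coef(lm(v ~ u))[2]            # slope with v as response
unname(b12 * b21 - cor(u, v)^2)      # ~0: product equals squared correlation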