0% found this document useful (0 votes)

16 views3 pages

Assignment 3

Uploaded by

hsarpong15

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

16 views3 pages

Assignment 3

Uploaded by

hsarpong15

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

RMI 8300

Assignment 3
Please show your work clearly to get full credit

1. We will now consider the Boston housing data set, from the MASS library.

(a) Based on this data set, provide an estimate for the population mean of medv.
Call this estimate 𝜇̂ .

(b) Provide an estimate of the standard error of 𝜇̂ .Interpret this result.

Hint: We can compute the standard error of the sample mean by dividing the sample
standard deviation by the square root of the number of observations.

(d) Based on your bootstrap estimate from (c), provide a 95 % conﬁdence interval
for the mean of medv. Compare it to the results obtained using t.test(Boston$medv)
Hint: You can approximate a 95 % conﬁdence interval using the formula [𝜇̂ − 2SE(𝜇̂ ), 𝜇̂
+2SE(𝜇̂ )].

(e) Based on this data set, provide an estimate, 𝜇̂ 𝑚𝑒𝑑 , for the median value of medv
in the population.

(f) We now would like to estimate the standard error of 𝜇̂ 𝑚𝑒𝑑 . Unfortunately, there
is no simple formula for computing the standard error of the median. Instead, estimate
the standard error of the median using the bootstrap. Comment on your ﬁndings.

(g) Based on this data set, provide an estimate for the tenth percentile of medv in
Boston suburbs. Call this quantity 𝜇̂ 0.1 .(You can use the quantile() function.)

(h) Use the bootstrap to estimate the standard error of 𝜇̂ 0.1.Comment on your
ﬁndings.

2. Consider the use of a logistic regression model to predict the probability of

default using income and balance on the Default data set. In particular, we will now
compute estimates for the standard errors of the income and balance logistic regression
coefficients in two different ways: (1) using the bootstrap, and (2) using the standard
formula for computing the standard errors in the glm() function. Do not forget to set a
random seed before beginning your analysis.
(a) Using the summary() and glm() functions, determine the estimated standard
errors for the coefficients associated with income and balance in a multiple
logistic regression model that uses both predictors.

(b) Write a function, boot.fn(), that takes as input the Default data set as well as an
index of the observations, and that outputs the coeﬃcient estimates for income
and balance in the multiple logistic regression model.

(c) Use the boot() function together with your boot.fn() function to estimate the
standard errors of the logistic regression coeﬃcients for income and balance.

(d) Comment on the estimated standard errors obtained using the glm() function
and using your bootstrap function.

3. We saw that the cv.glm() function can be used in order to compute the LOOCV test
error estimate. Alternatively, one could compute those quantities using just the glm()
and predict.glm() functions, and a for loop. You will now take this approach in order to
compute the LOOCV error for a simple logistic regression model on the Weekly data set.

(a) Fit a logistic regression model that predicts Direction using Lag1 and Lag2.

(b) Fit a logistic regression model that predicts Direction using Lag1 and Lag2 using
all but the ﬁrst observation.

(c) Use the model from (b) to predict the direction of the first observation. You can
do this by predicting that the first observation will go up if P
(Direction="Up"|Lag1, Lag2) > 0.5. Was this observation correctly classified?

(d) Write a for loop from i =1 to i = n, where n is the number of observations in the
data set, that performs each of the following steps:

i. Fit a logistic regression model using all but the ith observation to predict
Direction using Lag1 and Lag2.
ii. Compute the posterior probability of the market moving up for the ith
observation.
iii. Use the posterior probability for the ith observation in order to predict
whether or not the market moves up.
iv. Determine whether or not an error was made in predicting the direction
for the ith observation. If an error was made, then indicate this as a 1, and
otherwise indicate it as a 0.
(e) Take the average of the n numbers obtained in (d)iv in order to obtain the
LOOCV estimate for the test error. Comment on the results.

4. Perform cross-validation on a simulated data set.

(a) Generate a simulated data set as follows:

>set.seed(100)
>rnorm(100)
>y=x-2*x^2+rnorm(100)
In this data set, what is n and what is p? Write out the model used to generate the data
in equation form.

(b) Create a scatterplot of X against Y . Comment on what you ﬁnd.

(c) Set a random seed, and then compute the LOOCV errors that result from ﬁtting
the following four models using least squares:

Note you may ﬁnd it helpful to use the data.frame() function to create a single
data set containing both X and Y .

(d) Repeat (c) using another random seed, and report your results. Are your results
the same as what you got in (c)? Why?

(e) Which of the models in (c) had the smallest LOOCV error? Is this what you
expected? Explain your answer.

(f) Comment on the statistical significance of the coefficient estimates that results
from fitting each of the models in (c) using least squares. Do these results agree with the
conclusions drawn based on the cross-validation results?

Statistics in Engineering With Examples in MATLAB® and R Second Edition
100% (1)
Statistics in Engineering With Examples in MATLAB® and R Second Edition
811 pages
Quantified Trading Strategies
No ratings yet
Quantified Trading Strategies
34 pages
Stat 5700 HW 2
No ratings yet
Stat 5700 HW 2
15 pages
Estimating A Regression Line: F. Chiaromonte 1
No ratings yet
Estimating A Regression Line: F. Chiaromonte 1
13 pages
Tutorial 1-13 Answer Intermediate Macro
No ratings yet
Tutorial 1-13 Answer Intermediate Macro
40 pages
H-311 Linear Regression Analysis With R
100% (1)
H-311 Linear Regression Analysis With R
71 pages
Analyst Training 1747893285
No ratings yet
Analyst Training 1747893285
162 pages
Linear Regression
No ratings yet
Linear Regression
22 pages
Kock2016 Minimum Sample Size Estimation in PLS-SEM
No ratings yet
Kock2016 Minimum Sample Size Estimation in PLS-SEM
35 pages
222BDA35 Activity2
No ratings yet
222BDA35 Activity2
5 pages
강준혁 회귀분석 과제 4
No ratings yet
강준혁 회귀분석 과제 4
10 pages
m565A3-24F
No ratings yet
m565A3-24F
22 pages
Lab 3 - Logistic Regression: Part B
No ratings yet
Lab 3 - Logistic Regression: Part B
7 pages
Activity 7
No ratings yet
Activity 7
5 pages
Simple Linear Regression
No ratings yet
Simple Linear Regression
7 pages
10 - 4 - ML - SUP - Linear Regression
No ratings yet
10 - 4 - ML - SUP - Linear Regression
59 pages
Assignment 2
No ratings yet
Assignment 2
3 pages
Assignment 1
No ratings yet
Assignment 1
3 pages
Docx
No ratings yet
Docx
7 pages
Linear Regression
No ratings yet
Linear Regression
17 pages
Graded Homework 1 Solutions
No ratings yet
Graded Homework 1 Solutions
19 pages
Problem-Set - 1 Practise Problems From Textbook
No ratings yet
Problem-Set - 1 Practise Problems From Textbook
2 pages
Fundamentals of Data Mining in Genomics and Proteomics (Dubitzky, Granzow & Berrar 2006-12-19)
No ratings yet
Fundamentals of Data Mining in Genomics and Proteomics (Dubitzky, Granzow & Berrar 2006-12-19)
300 pages
Sample Exam For ML YSZ Sample For Machine Lerning - CMNKNVMNCS."NMD, MN, MVN, MDNV, MNDV MC, MDN, MDCNVM, NDV, M Ccwdmnbnbew, Mwbe
No ratings yet
Sample Exam For ML YSZ Sample For Machine Lerning - CMNKNVMNCS."NMD, MN, MVN, MDNV, MNDV MC, MDN, MDCNVM, NDV, M Ccwdmnbnbew, Mwbe
4 pages
Mathematical statistics basic ideas and selected topics Volume II Bickel P.J. instant download
100% (2)
Mathematical statistics basic ideas and selected topics Volume II Bickel P.J. instant download
66 pages
10 - 4 - ML - SUP - Linear Regression
No ratings yet
10 - 4 - ML - SUP - Linear Regression
59 pages
IE 451 Fall 2023-2024 Homework 4 Solutions
No ratings yet
IE 451 Fall 2023-2024 Homework 4 Solutions
19 pages
Analysis Course HW5
No ratings yet
Analysis Course HW5
7 pages
CS1B April 2024
No ratings yet
CS1B April 2024
9 pages
TP MSDC 3
No ratings yet
TP MSDC 3
6 pages
pastPaper2024Spring_Assm02
No ratings yet
pastPaper2024Spring_Assm02
24 pages
Introduction To Econometrics With R
No ratings yet
Introduction To Econometrics With R
18 pages
21BCS5999 - Ankit Kumar (Assignment 2)
No ratings yet
21BCS5999 - Ankit Kumar (Assignment 2)
16 pages
SDSC3006_Assignment 2
No ratings yet
SDSC3006_Assignment 2
3 pages
Project A (1)
No ratings yet
Project A (1)
7 pages
Stat Modelling Notes
No ratings yet
Stat Modelling Notes
49 pages
Michaelis Menten Equation
No ratings yet
Michaelis Menten Equation
19 pages
Lec10 PSet
No ratings yet
Lec10 PSet
4 pages
Lab 3. Linear Regression 230223
100% (1)
Lab 3. Linear Regression 230223
7 pages
Assignment 1
No ratings yet
Assignment 1
16 pages
Linear Regression
No ratings yet
Linear Regression
59 pages
Test Your Knowledge of Linear Regression and PCA in R
No ratings yet
Test Your Knowledge of Linear Regression and PCA in R
7 pages
Partial Least Squares. PAUL BENJAMIN LOWRY AND JAMES GASKIN
100% (1)
Partial Least Squares. PAUL BENJAMIN LOWRY AND JAMES GASKIN
36 pages
L10 Multiple Regression
No ratings yet
L10 Multiple Regression
14 pages
Homework 3 R Tutorial: How To Use This Tutorial
No ratings yet
Homework 3 R Tutorial: How To Use This Tutorial
8 pages
ECON1203 HW Solution Week11
No ratings yet
ECON1203 HW Solution Week11
7 pages
Linear Regression With LM Function, Diagnostic Plots, Interaction Term, Non-Linear Transformation of The Predictors, Qualitative Predictors
100% (1)
Linear Regression With LM Function, Diagnostic Plots, Interaction Term, Non-Linear Transformation of The Predictors, Qualitative Predictors
15 pages
ISYE6501-Homework-5
No ratings yet
ISYE6501-Homework-5
5 pages
Complete Download Intermittent Demand Forecasting: Context, Methods and Applications Syntetos PDF All Chapters
100% (4)
Complete Download Intermittent Demand Forecasting: Context, Methods and Applications Syntetos PDF All Chapters
76 pages
ch03 Regression
No ratings yet
ch03 Regression
10 pages
Assignment 2
No ratings yet
Assignment 2
5 pages
A Comparison of PLS and ML Bootstrapping Techniques in SEM: A Monte Carlo Study
No ratings yet
A Comparison of PLS and ML Bootstrapping Techniques in SEM: A Monte Carlo Study
9 pages
Which Test When: 1 Exploratory Tests
No ratings yet
Which Test When: 1 Exploratory Tests
5 pages
Series 1
No ratings yet
Series 1
2 pages
Assignment 3 (2023)
No ratings yet
Assignment 3 (2023)
9 pages
HW3
No ratings yet
HW3
2 pages
MIT 302 - Statistical Computing II - Tutorial 03
No ratings yet
MIT 302 - Statistical Computing II - Tutorial 03
16 pages
Informed Trade in Spot Foreign Exchange Markets
No ratings yet
Informed Trade in Spot Foreign Exchange Markets
23 pages
Time Series Analysis By State Space Methods Second Edition 2nd Edition James Durbin download
No ratings yet
Time Series Analysis By State Space Methods Second Edition 2nd Edition James Durbin download
77 pages
A Study of Cross-Validation and Bootstrap For Accuracy Estimation and Model Selection
No ratings yet
A Study of Cross-Validation and Bootstrap For Accuracy Estimation and Model Selection
8 pages
Applied Biostatistics for the Health Sciences 2nd Edition Richard J. Rossi - The complete ebook set is ready for download today
100% (1)
Applied Biostatistics for the Health Sciences 2nd Edition Richard J. Rossi - The complete ebook set is ready for download today
62 pages
Using R For Linear Regression
No ratings yet
Using R For Linear Regression
9 pages
Notes 23 Regression R
No ratings yet
Notes 23 Regression R
5 pages
Sample Exam For ML YSZ: Question 1 (Linear Regression)
No ratings yet
Sample Exam For ML YSZ: Question 1 (Linear Regression)
4 pages
100 Investment Banking Technical Questions 1747740993
No ratings yet
100 Investment Banking Technical Questions 1747740993
12 pages
KNN Algorithm For Conditional Mean and Variance Estimation With Automated Uncertainty Quantification and Variable Selection
No ratings yet
KNN Algorithm For Conditional Mean and Variance Estimation With Automated Uncertainty Quantification and Variable Selection
31 pages
Homework 2
100% (1)
Homework 2
14 pages
DCF
No ratings yet
DCF
15 pages
Structural and Functional Equivalence of The Eysenck Personality Questionnaire Within and Between Countries
No ratings yet
Structural and Functional Equivalence of The Eysenck Personality Questionnaire Within and Between Countries
32 pages
Structural Model Tests
No ratings yet
Structural Model Tests
50 pages
Faktor Keprilakuan Organisasi Dalam Implementasi Sistem Akuntansi Keuangan Daerah
No ratings yet
Faktor Keprilakuan Organisasi Dalam Implementasi Sistem Akuntansi Keuangan Daerah
30 pages
Inext
No ratings yet
Inext
18 pages
R Code Default Data PDF
No ratings yet
R Code Default Data PDF
10 pages
Statistical Methods For Clinical Validation of Follow-On Companion Diagnostic Devices Via An External Concordance Study
No ratings yet
Statistical Methods For Clinical Validation of Follow-On Companion Diagnostic Devices Via An External Concordance Study
25 pages
Quick Interpretation of The Data
No ratings yet
Quick Interpretation of The Data
5 pages
Dept of Eco Ets Course Content Mphil Econometrics
No ratings yet
Dept of Eco Ets Course Content Mphil Econometrics
18 pages
Eilander, Dkk. 2020 PDF
No ratings yet
Eilander, Dkk. 2020 PDF
13 pages
The Smell of Us - Crowdsourcing Human Body Odor Evaluation: December 2016
No ratings yet
The Smell of Us - Crowdsourcing Human Body Odor Evaluation: December 2016
20 pages
Multinomial Logistic Regression - R Data Analysis Examples - IDRE Stats
No ratings yet
Multinomial Logistic Regression - R Data Analysis Examples - IDRE Stats
8 pages
Testing For Granger Causality in Panel Data: 17, Number 4, Pp. 972-984
No ratings yet
Testing For Granger Causality in Panel Data: 17, Number 4, Pp. 972-984
13 pages
Econometrics Mock Exam - Solutions
No ratings yet
Econometrics Mock Exam - Solutions
3 pages
Shanghai Jiaotong University Shanghai Advanced Institution of Finance
No ratings yet
Shanghai Jiaotong University Shanghai Advanced Institution of Finance
3 pages
A Decision-Directed Bayesian Equalizer: California Santa Clara University
No ratings yet
A Decision-Directed Bayesian Equalizer: California Santa Clara University
5 pages
Tajmouati Samya Publications 09 08 2022 10 08 16 55
No ratings yet
Tajmouati Samya Publications 09 08 2022 10 08 16 55
6 pages
Theil-Sen No R
No ratings yet
Theil-Sen No R
5 pages
Dissolution Similarity Requirements How Similar or Dissimilar Are The Global Regulatory Expectations
No ratings yet
Dissolution Similarity Requirements How Similar or Dissimilar Are The Global Regulatory Expectations
8 pages
Eviews VAR Mit
No ratings yet
Eviews VAR Mit
5 pages
MODULE 3 Classification
No ratings yet
MODULE 3 Classification
5 pages
Rules
No ratings yet
Rules
1 page
Best Practices in Exploratory Factor Analysis PDF
No ratings yet
Best Practices in Exploratory Factor Analysis PDF
2 pages
IGNOU BCA Statistical Techniques Previous Year Unsolved Papers BCS 040
From Everand
IGNOU BCA Statistical Techniques Previous Year Unsolved Papers BCS 040
Manish Soni
No ratings yet
Quant Developers' Tools and Techniques: Quant Books, #1
From Everand
Quant Developers' Tools and Techniques: Quant Books, #1
Manfred Hindering
No ratings yet
Multi-dimensional Monte Carlo Integrations Utilizing Mathematica
From Everand
Multi-dimensional Monte Carlo Integrations Utilizing Mathematica
SUJAUL CHOWDHURY
No ratings yet
AP Computer Science Principles: Student-Crafted Practice Tests For Excellence
From Everand
AP Computer Science Principles: Student-Crafted Practice Tests For Excellence
Sama Alshatali
No ratings yet
Random Sample Consensus: Robust Estimation in Computer Vision
From Everand
Random Sample Consensus: Robust Estimation in Computer Vision
Fouad Sabry
No ratings yet

Assignment 3

Uploaded by

Assignment 3

Uploaded by

RMI 8300

(b) Provide an estimate of the standard error of 𝜇̂ .Interpret this result.

2. Consider the use of a logistic regression model to predict the probability of

4. Perform cross-validation on a simulated data set.

(a) Generate a simulated data set as follows:

(b) Create a scatterplot of X against Y . Comment on what you ﬁnd.

You might also like