Assignment 2

This document contains a summary of key points from a machine learning lecture on regression techniques: 1. Linear regression parameters can take any value in the real space. 2. Regressing a dependent variable Y on an independent variable X1 that has a correlation coefficient of -0.005 with Y mostly does not explain away Y. 3. Subset selection methods are computationally expensive for large datasets.

Uploaded by

abdul.azeez

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

101 views

Assignment 2

Uploaded by

abdul.azeez

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

Assignment 2

Introduction to Machine Learning

Prof. B. Ravindran
1. The parameters obtained in linear regression
(a) can take any value in the real space
(b) are strictly integers
(c) always lie in the range [0,1]
(d) can take only non-zero values
Sol. (a)
2. Suppose that we have N independent variables (X1, X2, . . . Xn) and the dependent variable
is Y . Now imagine that you are applying linear regression by fitting the best fit line using
the least square error on this data. You found that the correlation coefficient for one of its
variables (Say X1) with Y is -0.005.

(a) Regressing Y on X1 mostly does not explain away Y .

(b) Regressing Y on X1 explains away Y .
(c) The given data is insufficient to determine if regressing Y on X1 explains away Y or not.

Sol. (a)
The absolute value of the correlation coefficient denotes the strength of the relationship. Since
absolute correlation is significantly less, regressing Y on X1 mostly does not explain away Y .
3. Which of the following is a limitation of subset selection methods in regression?
(a) They tend to produce biased estimates of the regression coefficients.
(b) They cannot handle datasets with missing values.
(c) They are computationally expensive for large datasets.
(d) They assume a linear relationship between the independent and dependent variables.
(e) They are not suitable for datasets with categorical predictors.

Sol. (c)
They are computationally expensive for large datasets.
4. The relation between studying time (in hours) and grade on the final examination (0-100) in
a random sample of students in the Introduction to Machine Learning Class was found to be:
Grade = 30.5 + 15.2 (h)
How will a student’s grade be affected if she studies for four hours?

(a) It will go down by 30.4 points.

(b) It will go down by 30.4 points.
(c) It will go up by 60.8 points.

1
(d) The grade will remain unchanged.
(e) It cannot be determined from the information given

Sol. (c)
The slope of the regression line gives the average increase in grade for every hour increase in
studying. So, if studying is increased by four hours, the grade will increase by 4(15.2) = 60.8.
5. Which of the statements is/are True?

(a) Ridge has sparsity constraint, and it will drive coefficients with low values to 0.
(b) Lasso has a closed form solution for the optimization problem, but this is not the case
for Ridge.
(c) Ridge regression does not reduce the number of variables since it never leads a coefficient
to zero but only minimizes it.
(d) If there are two or more highly collinear variables, Lasso will select one of them randomly.

Sol. (c),(d)
Refer to the lecture
6. Find the mean of squared error for the given predictions:
Y f(x)
1 2
2 3
4 5
8 9
16 15
32 31
Hint: Find the squared error for each prediction and take the mean of that.
(a) 1
(b) 2
(c) 1.5
(d) 0
Sol. (a)

Σ(Y − f (x))2
Mean squared error =
6
(−1)2 + (−1)2 + (−1)2 + (−1)2 + 12 + 12
=
6
6
=
6
=1

2
7. Consider the following statements:
Statement A: In Forward stepwise selection, in each step, that variable is chosen which has the
maximum correlation with the residual, then the residual is regressed on that variable, and it
is added to the predictor.
Statement B: In Forward stagewise selection, the variables are added one by one to the previ-
ously selected variables to produce the best fit till then

(a) Both the statements are True.

(b) Statement A is True, and Statement B is False
(c) Statement A is False and Statement B is True
(d) Both the statements are False.

Sol. (d)
Refer to the lecture
8. The linear regression model y = a0 +a1x1 +a2x2 +...+apxp is to be fitted to a set of N training
data points having p attributes each. Let X be N × (p + 1) vectors of input values (augmented
by 1‘s), Y be N × 1 vector of target values, and θ be (p + 1) × 1 vector of parameter values(a0,
a1, a2, ..., ap). If the sum squared error is minimized for obtaining the optimal regression model,
which of the following equation holds?

(a) XT X = XY
(b) Xθ = XT Y
(c) XT Xθ = Y
(d) XT Xθ = XT Y

Sol. (d)
This comes from minimizing the sum of the least squares.
RSS(θ) = (Y — XθT )(Y − Xθ) (in matrix form)
If we take the derivative and equate it to 0, then we get,
XT (Y − Xθ) = 0
So,
XT Xθ = XT Y, θ = (XT X)−1X T Y.
9. Which of the following statements is true regarding Partial Least Squares (PLS) regression?
(a) PLS is a dimensionality reduction technique that maximizes the covariance between the
predictors and the dependent variable.
(b) PLS is only applicable when there is no multicollinearity among the independent variables.
(c) PLS can handle situations where the number of predictors is larger than the number of
observations.
(d) PLS estimates the regression coefficients by minimizing the residual sum of squares.
(e) PLS is based on the assumption of normally distributed residuals.
(f) All of the above.
(g) None of the above.

3
Sol. (a)
PLS is a dimensionality reduction technique that maximizes
the covariance between the predictors and the dependent variable.
10. Which of the following statements about principal components in Principal Component Re-
gression (PCR) is true?
(a) Principal components are calculated based on the correlation matrix of the original pre-
dictors.
(b) The first principal component explains the largest proportion of the variation in the
dependent variable.
(c) Principal components are linear combinations of the original predictors that are uncorre-
lated with each other.
(d) PCR selects the principal components with the highest p-values for inclusion in the re-
gression model.
(e) PCR always results in a lower model complexity compared to ordinary least squares
regression.
Sol. (c)
Principal components are linear combinations of the original predictors
that are uncorrelated with each other.

Exam Final
100% (1)
Exam Final
21 pages
Mcqs Econometric
74% (19)
Mcqs Econometric
25 pages
Econ MIdterm 2 Practise
No ratings yet
Econ MIdterm 2 Practise
11 pages
Assignment 2
No ratings yet
Assignment 2
3 pages
30-questions-to-test-a-data-scientist-on-linear-regression
No ratings yet
30-questions-to-test-a-data-scientist-on-linear-regression
10 pages
ML Unit 03 MCQ
No ratings yet
ML Unit 03 MCQ
20 pages
Introduction To Machine Learning Week 2 Assignment
100% (1)
Introduction To Machine Learning Week 2 Assignment
8 pages
Unit 3 MCQ
No ratings yet
Unit 3 MCQ
20 pages
Assignment 1-12 ML
No ratings yet
Assignment 1-12 ML
54 pages
Test 1 With Key 10-3
No ratings yet
Test 1 With Key 10-3
16 pages
ML U3 MCQ
No ratings yet
ML U3 MCQ
20 pages
linear regression
No ratings yet
linear regression
37 pages
Economterics Final 2024.
No ratings yet
Economterics Final 2024.
32 pages
Int 354 ML-1
No ratings yet
Int 354 ML-1
4 pages
Regression DPP 01 Discussion Notes664745df1b2c900018f5ac7e
No ratings yet
Regression DPP 01 Discussion Notes664745df1b2c900018f5ac7e
32 pages
ML Question bank
No ratings yet
ML Question bank
13 pages
Regression Analysis mcq1 PDF
No ratings yet
Regression Analysis mcq1 PDF
9 pages
RGRSSN Assgnmnt
No ratings yet
RGRSSN Assgnmnt
11 pages
Part A Multiple Choice (10 Marks)
No ratings yet
Part A Multiple Choice (10 Marks)
16 pages
Regression _ DPP 01
No ratings yet
Regression _ DPP 01
13 pages
Wa0006.
No ratings yet
Wa0006.
4 pages
BA 182 Regression MC Samplex With Answer
No ratings yet
BA 182 Regression MC Samplex With Answer
4 pages
MS2301
No ratings yet
MS2301
7 pages
Quiz_2_2021_sol
No ratings yet
Quiz_2_2021_sol
8 pages
Machine 2020 Jul-Dec
No ratings yet
Machine 2020 Jul-Dec
45 pages
Introduction To Machine Learning - Unit 4 - Week 2
No ratings yet
Introduction To Machine Learning - Unit 4 - Week 2
4 pages
SDSC3006_Assignment 3
No ratings yet
SDSC3006_Assignment 3
4 pages
Linear Regression
No ratings yet
Linear Regression
15 pages
STAT741 Regression Analysis: Quiz #1 9pm, Wednesday, 1/31: y X y X y X y X
No ratings yet
STAT741 Regression Analysis: Quiz #1 9pm, Wednesday, 1/31: y X y X y X y X
3 pages
PA QB Units 1-5
No ratings yet
PA QB Units 1-5
25 pages
Exam SRM Sample Questions
No ratings yet
Exam SRM Sample Questions
71 pages
Econometrics 2
No ratings yet
Econometrics 2
9 pages
Exam SRM Sample Questions
No ratings yet
Exam SRM Sample Questions
69 pages
Review Questions and Key Oct 4 11
No ratings yet
Review Questions and Key Oct 4 11
3 pages
Pajares, Allan Mark L. - MLR.
No ratings yet
Pajares, Allan Mark L. - MLR.
2 pages
Machine Learning,( CS-3035), Online Spring End Semester Examination 2021
No ratings yet
Machine Learning,( CS-3035), Online Spring End Semester Examination 2021
8 pages
Sample Questions
No ratings yet
Sample Questions
8 pages
12
No ratings yet
12
16 pages
Economterics Final 2024 10
No ratings yet
Economterics Final 2024 10
104 pages
Machine 2021 Jan-Apr
No ratings yet
Machine 2021 Jan-Apr
45 pages
Midterm
No ratings yet
Midterm
9 pages
Dsce PP
No ratings yet
Dsce PP
3 pages
Assignment - Week 2 - Final
No ratings yet
Assignment - Week 2 - Final
3 pages
Machine Learning Test Regression
No ratings yet
Machine Learning Test Regression
6 pages
CHW 4
No ratings yet
CHW 4
7 pages
ML Afawerquestions
No ratings yet
ML Afawerquestions
5 pages
Aff700 1000 220401
No ratings yet
Aff700 1000 220401
8 pages
Linear Regression Basics QUIZS
No ratings yet
Linear Regression Basics QUIZS
13 pages
Multiple Choice Test Bank Questions No Feedback - Chapter 4: y + X + X + X + U
No ratings yet
Multiple Choice Test Bank Questions No Feedback - Chapter 4: y + X + X + X + U
6 pages
Exam All Questions
No ratings yet
Exam All Questions
566 pages
2024_PCS_24P2CSC04_Question Bank ML
No ratings yet
2024_PCS_24P2CSC04_Question Bank ML
7 pages
quiz2 (1)
No ratings yet
quiz2 (1)
3 pages
Econometric Mod L
No ratings yet
Econometric Mod L
8 pages
Chapter 1 and 2 Mcqs Econometrics
No ratings yet
Chapter 1 and 2 Mcqs Econometrics
10 pages
ECON 6001 Assignment1 2023
No ratings yet
ECON 6001 Assignment1 2023
9 pages
DS100-2-Grp#4 Chapter 6 Advanced Analytical Theory and Methods Regression (CADAY, CASTOR, CRUZ, SANORIA, TAN)
No ratings yet
DS100-2-Grp#4 Chapter 6 Advanced Analytical Theory and Methods Regression (CADAY, CASTOR, CRUZ, SANORIA, TAN)
4 pages
DA UNIT-III
No ratings yet
DA UNIT-III
14 pages
A-level Maths Revision: Cheeky Revision Shortcuts
From Everand
A-level Maths Revision: Cheeky Revision Shortcuts
Scool Revision
3.5/5 (8)
Numerical Analysis II Essentials
From Everand
Numerical Analysis II Essentials
The Editors of REA
No ratings yet
Student's Solutions Manual and Supplementary Materials for Econometric Analysis of Cross Section and Panel Data, second edition
From Everand
Student's Solutions Manual and Supplementary Materials for Econometric Analysis of Cross Section and Panel Data, second edition
Jeffrey M. Wooldridge
No ratings yet
2019 Dip + 5TH Module 7TH Sem Ec For Ug
No ratings yet
2019 Dip + 5TH Module 7TH Sem Ec For Ug
63 pages
Ecesch
No ratings yet
Ecesch
4 pages
Active Constellation Extension With Enlipping Technique For PAPR Reduction in FBMC-OQAM Systems
No ratings yet
Active Constellation Extension With Enlipping Technique For PAPR Reduction in FBMC-OQAM Systems
5 pages
Novel Approach For PAPR Reduction in FBMC-OQAM Using Enlipping
No ratings yet
Novel Approach For PAPR Reduction in FBMC-OQAM Using Enlipping
4 pages
Python For Data Science - Unit 6 - Week 4
No ratings yet
Python For Data Science - Unit 6 - Week 4
5 pages
A Survey On LDPC Decoding Techniques: Varsha Vimal Sood, Dr. H.P.Sinha, Alka Kalra
No ratings yet
A Survey On LDPC Decoding Techniques: Varsha Vimal Sood, Dr. H.P.Sinha, Alka Kalra
7 pages
17CS664 Module3 Lists Dictionaries Tuples (Python)
No ratings yet
17CS664 Module3 Lists Dictionaries Tuples (Python)
31 pages
17CS664 Module-1 (Python)
No ratings yet
17CS664 Module-1 (Python)
38 pages
2 PB
No ratings yet
2 PB
6 pages
An Introduction To Wavelet Transform
No ratings yet
An Introduction To Wavelet Transform
80 pages
AISC Proceedings Proposal Form: Book Format
No ratings yet
AISC Proceedings Proposal Form: Book Format
4 pages
MIMO Channel Models
No ratings yet
MIMO Channel Models
24 pages
LDPC
No ratings yet
LDPC
15 pages
Number Systems
No ratings yet
Number Systems
12 pages
Open For Feedback Till 24-07-2017: Visvesvaraya Technological University, Belagavi B.E (CBCS) Open Electives Lists
No ratings yet
Open For Feedback Till 24-07-2017: Visvesvaraya Technological University, Belagavi B.E (CBCS) Open Electives Lists
5 pages
PAPR Reduction in OFDM Systems With Various Point FFTs
No ratings yet
PAPR Reduction in OFDM Systems With Various Point FFTs
5 pages
Allpass Example
No ratings yet
Allpass Example
8 pages
MATLAB Programs: % Program For The Generation of Unit Impulse Signal
No ratings yet
MATLAB Programs: % Program For The Generation of Unit Impulse Signal
96 pages
Ecsyll5-Updated On 03-08-2017
No ratings yet
Ecsyll5-Updated On 03-08-2017
21 pages
080123-1-OFDM (A) - Competence Development-partI-final
No ratings yet
080123-1-OFDM (A) - Competence Development-partI-final
22 pages
Reference: Digital Signal Processing Laboratory Using Matlab Author: Sanjit K. Mitra
No ratings yet
Reference: Digital Signal Processing Laboratory Using Matlab Author: Sanjit K. Mitra
16 pages
Peak To Average Power Ratio (Papr) PDF
No ratings yet
Peak To Average Power Ratio (Papr) PDF
68 pages
BER AND PAPR ANALYSIS OF 8X8 MIMO OFDM SYSTEM - USING SLM TECHNIQUE - Thesis - M.Tech PDF
No ratings yet
BER AND PAPR ANALYSIS OF 8X8 MIMO OFDM SYSTEM - USING SLM TECHNIQUE - Thesis - M.Tech PDF
82 pages
Peak To Average Power Ratio
No ratings yet
Peak To Average Power Ratio
25 pages
saes-h-101v
No ratings yet
saes-h-101v
380 pages
Yu Ye PDF
No ratings yet
Yu Ye PDF
40 pages
NDA NA National Defence Academy Naval Academy Entrance Examination Mathematics Chapterwise Previous Years 3000 Objective Questions Manjul Tyagi
100% (6)
NDA NA National Defence Academy Naval Academy Entrance Examination Mathematics Chapterwise Previous Years 3000 Objective Questions Manjul Tyagi
62 pages
Cancer
No ratings yet
Cancer
16 pages
14 - GoGetter L1 TB Skills Revision 7and8
No ratings yet
14 - GoGetter L1 TB Skills Revision 7and8
2 pages
MR Jashim Uddin
No ratings yet
MR Jashim Uddin
2 pages
Pavement Markings
No ratings yet
Pavement Markings
4 pages
2021 Platform Experience and Design - Team Overview - External
No ratings yet
2021 Platform Experience and Design - Team Overview - External
31 pages
Helicopteros 00m
No ratings yet
Helicopteros 00m
17 pages
Greatest Books of All Time Original
No ratings yet
Greatest Books of All Time Original
95 pages
Method of Statement (Concrete Repair)
No ratings yet
Method of Statement (Concrete Repair)
3 pages
DELTA-SC 2030 tds_eng
No ratings yet
DELTA-SC 2030 tds_eng
2 pages
J-349 (G+M+1) Dip-02
No ratings yet
J-349 (G+M+1) Dip-02
18 pages
Exercise 6.2
100% (2)
Exercise 6.2
2 pages
Radpol Heat Shrink Tubes 2024
No ratings yet
Radpol Heat Shrink Tubes 2024
34 pages
Threaded Joints in Steam Piping - Pipelines, Piping and Fluid Mechanics Engineering - Eng-Tips
No ratings yet
Threaded Joints in Steam Piping - Pipelines, Piping and Fluid Mechanics Engineering - Eng-Tips
3 pages
FSMM 5 Stuart
No ratings yet
FSMM 5 Stuart
5 pages
Optical Destruction Devices: NSA/CSS Evaluated Products List For
No ratings yet
Optical Destruction Devices: NSA/CSS Evaluated Products List For
5 pages
NP4Fparts103 A05
No ratings yet
NP4Fparts103 A05
4 pages
Caterpillar Cat D8N TRACK-TYPE TRACTOR Dozer Bulldozer (Prefix 7TK) Service Repair Manual Instant Download
No ratings yet
Caterpillar Cat D8N TRACK-TYPE TRACTOR Dozer Bulldozer (Prefix 7TK) Service Repair Manual Instant Download
25 pages
Itb05-042 (Procedure To Complete Iavl When Idle Speed Needs To Be Reduced)
No ratings yet
Itb05-042 (Procedure To Complete Iavl When Idle Speed Needs To Be Reduced)
5 pages
RET615 PG 756891 ENe PDF
No ratings yet
RET615 PG 756891 ENe PDF
72 pages
Gasuzifinekozewik
No ratings yet
Gasuzifinekozewik
2 pages
Survivable Network Design
No ratings yet
Survivable Network Design
23 pages
Iso 8653 2016
100% (1)
Iso 8653 2016
9 pages
FORM 08 - Intervention Report - CRANK SHEAR - Copie - Copie
No ratings yet
FORM 08 - Intervention Report - CRANK SHEAR - Copie - Copie
3 pages
LOLLYPOPS
No ratings yet
LOLLYPOPS
3 pages
The Power of Worship
No ratings yet
The Power of Worship
9 pages
ComAp India
No ratings yet
ComAp India
38 pages
Comprehensive System Assessment of Cancer Patient
No ratings yet
Comprehensive System Assessment of Cancer Patient
7 pages