0% found this document useful (0 votes)
251 views1 page

Assignment-Based Subjective Questions

The document contains assignment-based and general subjective questions related to linear regression analysis. The assignment-based questions ask about inferring the effect of categorical variables, the purpose of drop_first=True in dummy variable creation, identifying the highest correlated numerical variable, validating linear regression model assumptions, and identifying the top features. The general questions ask about explaining linear regression, Anscombe's quartet, Pearson's R, scaling methods, reasons for infinite VIF values, and the use of Q-Q plots.

Uploaded by

Nitish Gupta
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
251 views1 page

Assignment-Based Subjective Questions

The document contains assignment-based and general subjective questions related to linear regression analysis. The assignment-based questions ask about inferring the effect of categorical variables, the purpose of drop_first=True in dummy variable creation, identifying the highest correlated numerical variable, validating linear regression model assumptions, and identifying the top features. The general questions ask about explaining linear regression, Anscombe's quartet, Pearson's R, scaling methods, reasons for infinite VIF values, and the use of Q-Q plots.

Uploaded by

Nitish Gupta
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 1

Assignment-based Subjective Questions

1. From your analysis of the categorical variables from the dataset, what could you infer about
their effect on the dependent variable? (3 marks)
2. Why is it important to use drop_first=True during dummy variable creation? (2 mark)
3. Looking at the pair-plot among the numerical variables, which one has the highest correlation
with the target variable? (1 mark)
4. How did you validate the assumptions of Linear Regression after building the model on the
training set? (3 marks)
5. Based on the final model, which are the top 3 features contributing significantly towards
explaining the demand of the shared bikes? (2 marks)

General Subjective Questions


1. Explain the linear regression algorithm in detail. (4 marks)
2. Explain the Anscombe’s quartet in detail. (3 marks)
3. What is Pearson’s R? (3 marks)
4. What is scaling? Why is scaling performed? What is the difference between normalized scaling
and standardized scaling? (3 marks)
5. You might have observed that sometimes the value of VIF is infinite. Why does this happen?
(3 marks)
6. What is a Q-Q plot? Explain the use and importance of a Q-Q plot in linear regression.
(3 marks)

You might also like