W4 Assgn2
Week 4 : Assignment 4
The due date for submitting this assignment has passed.
Due on 2024-08-21, 23:59 IST.
3) If a linear regression model achieves zero training error, can we say that all the data points
lie on a straight line in the feature space? (1 point)
Yes
No
Data Description:
An automotive service chain is launching its new grand service station this weekend.
They offer to service a wide variety of cars. The current capacity of the station is to check 315
cars thoroughly per day. As an inaugural offer, they claim to freely check all cars that arrive on
their launch day, and report whether they need servicing or not!
Unexpectedly, they get 450 cars. The servicemen will not work longer than the working hours,
but the data analysts have to!
Can you save the day for the new service station?
Now for the cars they cannot check in detail, they measure those attributes and store them in
‘ServiceTest.csv’ (https://drive.google.com/file/d/1h_Va9tkMB6UDSuqD6MzeYqgdph6yhtmy/view?usp=drive_link).
Problem Statement:
Use machine learning techniques to identify whether the cars require service or not.
4) Which of the following machine learning techniques would NOT be appropriate to solve
the problem given in the problem statement? (1 point)
kNN
Random Forest
Logistic Regression
Linear regression
Prepare the data by following the steps given below, and answer questions 5 and 6.
Encode the categorical variable Service (Yes as 1 and No as 0) in both the train and test datasets.
Separate the independent features from the dependent feature in both the train and test datasets.
Set random_state to 0 for the instance of the logistic regression class.
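The preparation steps above can be sketched as follows. This is a minimal illustration, assuming pandas and scikit-learn are available. Only the Service column name comes from the assignment; the feature names (OilQual, EnginePerf) and all values are hypothetical stand-ins for the course CSV files.

```python
import pandas as pd
from sklearn.linear_model import LogisticRegression

# Hypothetical stand-in rows; the real data would come from the course CSV files.
# Only the "Service" column name is taken from the assignment text.
train = pd.DataFrame({
    "OilQual": [10, 80, 15, 90, 20, 85, 12, 88],
    "EnginePerf": [20, 90, 25, 95, 30, 85, 22, 92],
    "Service": ["Yes", "No", "Yes", "No", "Yes", "No", "Yes", "No"],
})
test = pd.DataFrame({
    "OilQual": [18, 82],
    "EnginePerf": [28, 88],
    "Service": ["Yes", "No"],
})

# Step 1: encode Service (Yes -> 1, No -> 0) in both datasets.
for df in (train, test):
    df["Service"] = df["Service"].map({"Yes": 1, "No": 0})

# Step 2: separate the independent features (X) from the dependent feature (y).
X_train, y_train = train.drop(columns="Service"), train["Service"]
X_test, y_test = test.drop(columns="Service"), test["Service"]

# Step 3: logistic regression instance with random_state set to 0.
model = LogisticRegression(random_state=0)
model.fit(X_train, y_train)
predictions = model.predict(X_test)
```

With the real files, the two DataFrames would instead be loaded with pd.read_csv before the encoding step.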
5) After applying logistic regression, what is/are the correct observations from the
resultant confusion matrix? (1 point)
6) The logistic regression model built between the input and output variables is
checked for its prediction accuracy on the test data. What is the accuracy range (in %) of the
predictions made over the test data? (1 point)
60 - 79
90 - 95
30 - 59
80 - 89
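For reference, a confusion matrix and an accuracy percentage can be computed from test labels and predictions roughly as below. The label lists are made up for illustration and are not taken from ServiceTest.csv; the sketch assumes scikit-learn's confusion_matrix and accuracy_score.

```python
from sklearn.metrics import accuracy_score, confusion_matrix

# Hypothetical labels: 1 = needs service, 0 = does not need service.
y_test = [1, 0, 1, 1, 0, 0, 1, 0]
predictions = [1, 0, 0, 1, 0, 1, 1, 0]

# Rows are actual classes and columns are predicted classes, in label order (0, 1).
cm = confusion_matrix(y_test, predictions)
tn, fp, fn, tp = cm.ravel()

# Accuracy is the share of correct predictions, expressed here in percent.
accuracy_pct = 100 * accuracy_score(y_test, predictions)
```

The same accuracy can be read off the matrix as (tn + tp) divided by the total number of test rows.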
Standardization
Dummy variables
Correlation
None of the above
8) A regression model with the function 𝑦 = 80 + 4.5𝑥 was built to understand the
impact of temperature 𝑥 on ice cream sales 𝑦. The temperature this month is 10 degrees higher
than in the previous month. What is the predicted difference in ice cream sales? (1 point)
56 units
45 units
80 units
None of the above
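As a general reminder of how the slope of a simple linear model is read, the sketch below compares two predictions from a line with deliberately different, hypothetical coefficients (intercept 2.0, slope 3.0). The intercept cancels in the difference, so the change in the prediction depends only on the slope and the change in the input.

```python
def predict(x, intercept=2.0, slope=3.0):
    """Prediction of a simple linear model y = intercept + slope * x (hypothetical coefficients)."""
    return intercept + slope * x

x_before = 20.0
delta_x = 5.0

# The intercept appears in both predictions and cancels in the difference,
# leaving slope * delta_x.
difference = predict(x_before + delta_x) - predict(x_before)
```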
9) 𝑋 and 𝑌 are two variables that have a strong linear relationship. Which of the
following statements are incorrect? (1 point)
The Global Happiness Index report contains the Happiness Score data with multiple
features (namely the Economy, Family, Health, and Freedom) that could affect the target
variable value.
Prepare the data by following the steps given below, and answer question 10.
Split the given dataset into the set of independent features and the dependent feature.
Create training and testing data from the independent features and the dependent
feature by splitting the original data in the ratio 3:1 (train:test), and set
random_state of the train/test split method to 1.
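A 3:1 train/test split corresponds to holding out a quarter of the rows. The sketch below assumes scikit-learn's train_test_split and uses randomly generated stand-in data; the feature names come from the description above, while the target column name "Happiness Score" is an assumption about the CSV header.

```python
import numpy as np
import pandas as pd
from sklearn.model_selection import train_test_split

# Random stand-in data; the real rows would be read from the course CSV file.
rng = np.random.default_rng(0)
columns = ["Economy", "Family", "Health", "Freedom", "Happiness Score"]
data = pd.DataFrame(rng.random((40, 5)), columns=columns)

# Separate the independent features from the dependent feature.
# "Happiness Score" is an assumed column name.
X = data.drop(columns="Happiness Score")
y = data["Happiness Score"]

# A 3:1 train:test ratio means test_size=0.25; random_state is set to 1 as instructed.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=1
)
```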
10) A multiple linear regression model is built on the Global Happiness Index dataset
‘GHI Report.csv’ (https://drive.google.com/file/d/1c7UeZMZuYYfOXMMagI4UpvC-VrJ7MXc8/view?usp=drive_link).
What is the RMSE of the baseline model? (1 point)
2.00
0.50
1.06
0.75
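The assignment does not define "baseline model". One common reading, assumed here, is a model that predicts the mean of the training target for every test row; the course may define it differently (for example, using the mean of the test target). Under that assumption, the baseline RMSE can be sketched with made-up target values as:

```python
import numpy as np

# Made-up target values for illustration; not taken from the GHI dataset.
y_train = np.array([5.0, 6.0, 7.0, 6.0])
y_test = np.array([5.0, 9.0])

# Assumed baseline: predict the training mean for every test row.
baseline_prediction = y_train.mean()

# RMSE is the square root of the mean squared error on the test rows.
errors = y_test - baseline_prediction
baseline_rmse = np.sqrt(np.mean(errors ** 2))
```

Comparing this value with the RMSE of the fitted regression model shows how much the features improve on a constant prediction.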