Revision: High Variance
High variance
• Indicated by a large gap between the training set error and the test set error.
• The algorithm has overfit the training set (see the sketch below).
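A minimal sketch of this diagnosis, assuming Python with numpy and scikit-learn (not part of the course notes, which use Octave) and synthetic data:

import numpy as np
from sklearn.model_selection import learning_curve
from sklearn.linear_model import Ridge

# Synthetic regression data (placeholder for a real training set).
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 20))
y = X @ rng.normal(size=20) + rng.normal(scale=0.5, size=200)

# Train on growing subsets; score each subset on held-out CV folds.
sizes, train_scores, val_scores = learning_curve(
    Ridge(alpha=1.0), X, y,
    train_sizes=np.linspace(0.1, 1.0, 5), cv=5,
    scoring="neg_mean_squared_error")

train_err = -train_scores.mean(axis=1)
val_err = -val_scores.mean(axis=1)
for m, tr, va in zip(sizes, train_err, val_err):
    print(f"m={m:3d}  train error={tr:.3f}  CV error={va:.3f}")
# High variance: train error stays well below CV error (a persistent gap).
# High bias: both errors are high and close together.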
Week 6: Advice for Applying Machine Learning
Question 1.
You train a learning algorithm, and find that it has unacceptably high error on the test set. You plot the learning curve, and obtain the figure below. Is the algorithm suffering from high bias, high variance, or neither?
[Learning-curve figure not reproduced.]
(i) Neither
Question 2.
Suppose you have implemented regularized logistic regression to classify what object is
in an image (i.e., to do object recognition). However, when you test your hypothesis on a
new set of images, you find that it makes unacceptably large errors with its predictions
on the new images. However, your hypothesis performs well (has low error) on the
training set. Which of the following are promising steps to take? Check all that apply.
(ii) WRONG Try evaluating the hypothesis on a cross validation set rather than the
test set.
Question 3.
Suppose you have implemented regularized logistic regression to predict what items cus-
tomers will purchase on a web shopping site. However, when you test your hypothesis
on a new set of customers, you find that it makes unacceptably large errors in its predic-
tions. Furthermore, the hypothesis performs poorly on the training set. Which of the
following might be promising steps to take? Check all that apply.
(iii) WRONG Try evaluating the hypothesis on a cross validation set rather than the
test set.
Question 4.
Which of the following statements are true? Check all that apply.
(i) WRONG Suppose you are training a regularized linear regression model. The recommended way to choose what value of regularization parameter λ to use is to choose the value of λ which gives the lowest test set error.
(ii) CORRECT Suppose you are training a regularized linear regression model. The recommended way to choose what value of regularization parameter λ to use is to choose the value of λ which gives the lowest cross validation error (see the sketch after this list).
(iii) CORRECT The performance of a learning algorithm on the training set will typically be better than its performance on the test set.
(iv) WRONG Suppose you are training a regularized linear regression model. The recommended way to choose what value of regularization parameter λ to use is to choose the value of λ which gives the lowest training set error.
(v) CORRECT A typical split of a dataset into training, validation and test sets might be 60% training set, 20% validation set, and 20% test set.
(vi) WRONG It is okay to use data from the test set to choose the regularization parameter λ, but not the model parameters (θ).
(vii) WRONG Suppose you are training a logistic regression classifier using polynomial features and want to select what degree polynomial (denoted d in the lecture videos) to use. After training the classifier on the entire training set, you decide to use a subset of the training examples as a validation set. This will work just as well as having a validation set that is separate (disjoint) from the training set.
(viii) CORRECT Suppose you are using linear regression to predict housing prices, and your dataset comes sorted in order of increasing sizes of houses. It is then important to randomly shuffle the dataset before splitting it into training, validation and test sets, so that we don’t have all the smallest houses going into the training set, and all the largest houses going into the test set.
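A minimal sketch of the workflow these answers describe (shuffle, split 60/20/20, pick λ by cross-validation error, touch the test set only once), assuming Python/numpy and made-up data; ridge_fit and mse are hypothetical helpers, not course code:

import numpy as np

# Made-up regression data standing in for the model in the question.
rng = np.random.default_rng(1)
m, n = 100, 8
X = rng.normal(size=(m, n))
y = X @ rng.normal(size=n) + rng.normal(scale=0.3, size=m)

# Shuffle first: the data may arrive sorted (e.g. by house size).
perm = rng.permutation(m)
X, y = X[perm], y[perm]

# Typical 60% / 20% / 20% train / validation / test split.
i60, i80 = int(0.6 * m), int(0.8 * m)
Xtr, ytr = X[:i60], y[:i60]
Xcv, ycv = X[i60:i80], y[i60:i80]
Xte, yte = X[i80:], y[i80:]

def ridge_fit(X, y, lam):
    # Regularized normal equation: theta = (X'X + lam*I)^-1 X'y.
    return np.linalg.solve(X.T @ X + lam * np.eye(X.shape[1]), X.T @ y)

def mse(X, y, theta):
    return np.mean((X @ theta - y) ** 2)

# Choose lambda by *cross-validation* error, never training or test error.
lambdas = [0.01, 0.03, 0.1, 0.3, 1.0, 3.0, 10.0]
best_lam = min(lambdas, key=lambda lam: mse(Xcv, ycv, ridge_fit(Xtr, ytr, lam)))
theta = ridge_fit(Xtr, ytr, best_lam)
print("chosen lambda:", best_lam, "  test MSE:", mse(Xte, yte, theta))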
Question 5.
Which of the following statements are true? Check all that apply.
(i) CORRECT A model with more parameters is more prone to overfitting and typi-
cally has higher variance.
(ii) WRONG If the training and test errors are about the same, adding more features
will not help improve the results.
(iii) CORRECT If a learning algorithm is suffering from high variance, adding more
training examples is likely to improve the test error.
(iv) CORRECT If a learning algorithm is suffering from high bias, adding more training examples alone is unlikely to improve the test error significantly.
Week 6: Machine Learning System Design
Question 1.
You are working on a spam classification system using regularized logistic regression.
“Spam” is a positive class (y = 1) and “not spam” is the negative class (y = 0). You
have trained your classifier and there are m = 1000 examples in the cross-validation set.
The chart of predicted class vs. actual class is not reproduced here; only its headers (actual T/F vs. predicted T/F) survived extraction.
Answers: recall = 0.85, accuracy = 0.095.
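The chart’s cell counts did not survive extraction; the counts below are assumed values chosen to be consistent with the stated recall of 0.85 and accuracy of 0.095 on m = 1000 examples. A minimal Python sketch of the metric definitions:

# Assumed confusion-matrix counts (see note above); m = 1000.
tp, fn, fp, tn = 85, 15, 890, 10
accuracy  = (tp + tn) / (tp + fn + fp + tn)   # 0.095, as stated above
precision = tp / (tp + fp)
recall    = tp / (tp + fn)                    # 0.85, as stated above
f1        = 2 * precision * recall / (precision + recall)
print(f"accuracy={accuracy:.3f}  precision={precision:.3f}  recall={recall:.2f}  F1={f1:.3f}")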
Question 2.
Suppose a massive dataset is available for training a learning algorithm. Training on a lot of data is likely to give good performance when two of the following conditions hold true. Which are the two?
(i) WRONG When we are willing to include high order polynomial features of x (such as x₁², x₂², x₁x₂, etc.).
(ii) CORRECT The features x contain sufficient information to predict y accurately. (For example, one way to verify this is if a human expert on the domain can confidently predict y when given only x.)
(iii) CORRECT We train a learning algorithm with a large number of parameters (that is able to learn/represent fairly complex functions).
(iv) WRONG We train a learning algorithm with a small number of parameters (that is thus unlikely to overfit).
Question 3.
Suppose you have trained a logistic regression classifier which is outputting hθ(x). Currently, you predict 1 if hθ(x) ≥ threshold, and predict 0 if hθ(x) < threshold, where currently the threshold is set to 0.5. Suppose you decrease the threshold to 0.1. Which of the following are true? Check all that apply. (A sketch of the effect follows this list.)
(i) WRONG The classifier is likely to have unchanged precision and recall, but lower accuracy.
(ii) WRONG The classifier is likely to have unchanged precision and recall, but higher accuracy.
(iii) WRONG The classifier is likely to have unchanged precision and recall, and thus the same F1 score.
(iv) CORRECT The classifier is likely to now have lower precision.
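A minimal sketch of the threshold effect, assuming Python/numpy with synthetic labels and scores; precision_recall is a hypothetical helper:

import numpy as np

def precision_recall(y_true, scores, threshold):
    # Predict 1 whenever the score is >= threshold.
    pred = (scores >= threshold).astype(int)
    tp = np.sum((pred == 1) & (y_true == 1))
    fp = np.sum((pred == 1) & (y_true == 0))
    fn = np.sum((pred == 0) & (y_true == 1))
    return tp / max(tp + fp, 1), tp / max(tp + fn, 1)

# Toy labels and noisy-but-informative scores in [0, 1].
rng = np.random.default_rng(2)
y_true = rng.integers(0, 2, size=1000)
scores = np.clip(0.7 * y_true + rng.normal(0.15, 0.2, size=1000), 0.0, 1.0)

for t in (0.5, 0.1):
    p, r = precision_recall(y_true, scores, t)
    print(f"threshold={t}: precision={p:.2f}  recall={r:.2f}")
# Lowering the threshold predicts y = 1 more often: recall rises
# (or stays the same) while precision typically falls.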
Question 4.
Suppose you are working on a spam classifier, where spam emails are positive examples (y=1) and non-spam emails are negative examples (y=0). You have a training set of emails in which 99% of the emails are non-spam and the other 1% is spam. Which of the following statements are true? Check all that apply.
• WRONG If you always predict spam (output y=1), your classifier will have a recall
of 0% and precision of 99%.
• CORRECT If you always predict non-spam (output y=0), your classifier will have a recall of 0%.
• CORRECT If you always predict spam (output y=1), your classifier will have a
recall of 100% and precision of 1%.
• CORRECT If you always predict non-spam (output y=0), your classifier will have an accuracy of 99% (verified in the sketch after this list).
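A quick numeric check of these answers, assuming Python/numpy and a synthetic label vector with 1% positives:

import numpy as np

y_true = np.array([1] * 10 + [0] * 990)   # 1% spam, 99% non-spam
pred = np.zeros_like(y_true)              # always predict non-spam (y = 0)
accuracy = np.mean(pred == y_true)        # 0.99: accuracy misleads on skewed classes
recall = np.sum((pred == 1) & (y_true == 1)) / y_true.sum()  # 0.0: no spam caught
print(f"always predict non-spam: accuracy={accuracy:.2f}, recall={recall:.0%}")
# Always predicting spam (y = 1) instead gives recall = 100% but precision = 1%.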
Question 5.
Which of the following statements are true? Check all that apply.
(i) CORRECT Using a very large training set makes it unlikely for the model to overfit the training data.
(ii) WRONG If your model is underfitting the training set, then obtaining more data
is likely to help.
(iii) CORRECT The “error analysis” process of manually examining the examples which your algorithm got wrong can help suggest what are good steps to take (e.g., developing new features) to improve your algorithm’s performance.
(iv) WRONG It is a good idea to spend a lot of time collecting a large amount of data
before building your first version of a learning algorithm.
(v) WRONG After training a logistic regression classifier, you must use 0.5 as your
threshold for predicting whether an example is positive or negative.
Note: A good classifier should have both a high precision and a high recall on the cross validation set.