Machine Learning Week 2 Quiz 1 (Linear Regression with Multiple Variables)
Stanford Coursera
Question 1
Suppose m=4 students have taken some class, and the class had a midterm exam and a
final exam. You have collected a dataset of their scores on the two exams, which is as
follows:
Midterm Exam    (midterm exam)²    Final Exam
89              7921               96
72              5184               74
94              8836               87
69              4761               78
You'd like to use polynomial regression to predict a student's final exam score from their midterm exam score. Concretely, suppose you want to fit a model of the form hθ(x) = θ₀ + θ₁x₁ + θ₂x₂, where x₁ is the midterm score and x₂ is (midterm score)². Further, you plan to use both feature scaling (dividing by the "max − min", or range, of a feature) and mean normalization.
What is the normalized feature x₂⁽⁴⁾? (Hint: midterm = 69, final = 78 is training example 4.) Please round your answer to two decimal places and enter it in the text box below.
Answer:
The mean of x₂ is 6675.5 (= (7921 + 5184 + 8836 + 4761) / 4) and the range is 8836 − 4761 = 4075.
x₂⁽⁴⁾ = (4761 − 6675.5) / 4075 ≈ −0.47
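For reference, the same computation can be reproduced programmatically. Below is a minimal Python/NumPy sketch of the mean normalization and range scaling described above (the course itself uses Octave/MATLAB, so this is purely illustrative):

```python
import numpy as np

# Squared midterm scores (the x2 feature) for the m = 4 training examples
x2 = np.array([7921, 5184, 8836, 4761], dtype=float)

# Mean normalization: subtract the mean, then divide by the range (max - min)
mean = x2.mean()             # 6675.5
rng = x2.max() - x2.min()    # 8836 - 4761 = 4075
x2_norm = (x2 - mean) / rng

print(round(x2_norm[3], 2))  # training example 4 -> -0.47
```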
Question 2
You run gradient descent for 15 iterations with α=0.3 and compute J(θ) after each
iteration. You find that the value of J(θ) decreases quickly then levels off. Based on this,
which of the following conclusions seems most plausible?
Rather than use the current value of α, it'd be more promising to try a larger
value of α (say α=1.0).
Rather than use the current value of α, it'd be more promising to try a smaller
value of α (say α=0.1).
α=0.3 is an effective choice of learning rate.
Answer:
α=0.3 is an effective choice of learning rate.

Explanation: We want gradient descent to converge quickly to the minimum. Since J(θ) decreases quickly and then levels off, the current value of α seems to be good.
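To see this diagnostic in practice, here is a minimal Python/NumPy sketch of batch gradient descent for linear regression that records J(θ) after each iteration; the function name and the toy data are invented for illustration:

```python
import numpy as np

def gradient_descent(X, y, alpha=0.3, iters=15):
    """Batch gradient descent for linear regression, tracking the cost J(theta)."""
    m = X.shape[0]
    theta = np.zeros(X.shape[1])
    J_history = []
    for _ in range(iters):
        error = X @ theta - y
        theta -= (alpha / m) * (X.T @ error)         # simultaneous update of every theta_j
        error = X @ theta - y                        # recompute with the updated theta
        J_history.append((error @ error) / (2 * m))  # J(theta) = (1/2m) * sum of squared errors
    return theta, J_history

# Invented toy data: a J_history that drops quickly and levels off suggests alpha is well chosen.
X = np.c_[np.ones(4), [0.1, 0.4, 0.7, 1.0]]
y = np.array([1.0, 2.0, 3.0, 4.0])
theta, J_history = gradient_descent(X, y)
print(J_history)
```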
Question 3
Suppose you have m=14 training examples with n=3 features (excluding the additional all-ones feature for the intercept term, which you should add). The normal equation is θ = (XᵀX)⁻¹Xᵀy. For the given values of m and n, what are the dimensions of θ, X, and y in this equation?
X is 14×3, y is 14×1, θ is 3×3
X is 14×4, y is 14×4, θ is 4×4
X is 14×4, y is 14×1, θ is 4×1
X is 14×3, y is 14×1, θ is 3×1
Answer:
X is 14×4, y is 14×1, θ is 4×1.

Explanation: X has m rows and n + 1 columns (the +1 because of the x₀ = 1 intercept term), so X is 14×4. y is an m-dimensional column vector, so it is 14×1, and θ is an (n + 1)-dimensional column vector, so it is 4×1.
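These dimensions are easy to sanity-check by running the normal equation on random data of the stated shape, as in this Python/NumPy sketch (the data itself is random and meaningless):

```python
import numpy as np

m, n = 14, 3
X = np.c_[np.ones((m, 1)), np.random.rand(m, n)]  # prepend the x0 = 1 column: X is 14x4
y = np.random.rand(m, 1)                          # y is 14x1

theta = np.linalg.inv(X.T @ X) @ X.T @ y          # normal equation: theta = (X^T X)^-1 X^T y
print(X.shape, y.shape, theta.shape)              # (14, 4) (14, 1) (4, 1)
```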
Question 4
Suppose you have a dataset with m=50 examples and n=200000 features for each example. You want to use multivariate linear regression to fit the parameters θ to your data. Should you prefer gradient descent or the normal equation?
Gradient descent, since (XᵀX)⁻¹ will be very slow to compute in the normal equation.
Gradient descent, since it will always converge to the optimal θ.
The normal equation, since it provides an efficient way to directly find the
solution.
The normal equation, since gradient descent might be unable to find the optimal
θ.
Answer:
Gradient descent, since (XᵀX)⁻¹ will be very slow to compute in the normal equation.

Explanation: With n = 200000 features, you would have to invert a 200001 × 200001 matrix to use the normal equation. Inverting such a large matrix is computationally expensive, so gradient descent is a good choice.
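A rough back-of-the-envelope estimate in Python shows why the inversion is impractical here, assuming a standard O(n³) inversion cost and 8-byte floats:

```python
n = 200000  # number of features

# Inverting the (n+1) x (n+1) matrix X^T X costs on the order of n^3 operations.
print(f"~{(n + 1) ** 3:.1e} operations")              # ~8.0e+15

# Even storing X^T X as 8-byte floats is prohibitive:
print(f"~{8 * (n + 1) ** 2 / 1e9:.0f} GB for X^T X")  # ~320 GB
```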
Question 5
Which of the following are reasons for using feature scaling?
It speeds up solving for θ using the normal equation.
It prevents the matrix XᵀX (used in the normal equation) from being non-invertible (singular/degenerate).
It is necessary to prevent gradient descent from getting stuck in local optima.
It speeds up gradient descent by making it require fewer iterations to get to a
good solution.
Answer:

Statement: It speeds up solving for θ using the normal equation.
False. The magnitude of the feature values is insignificant in terms of computational cost.

Statement: It prevents the matrix XᵀX (used in the normal equation) from being non-invertible (singular/degenerate).
False.

Statement: It is necessary to prevent gradient descent from getting stuck in local optima.
False. The cost function J(θ) for linear regression is convex and has no local optima.

Statement: It speeds up gradient descent by making it require fewer iterations to get to a good solution.
True. Feature scaling speeds up gradient descent by avoiding the many extra iterations that are required when one or more features take on much larger values than the rest.
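To make the last point concrete, here is a small Python/NumPy sketch (the helper feature_scale is a hypothetical name, not course code) that applies the Question 1 scaling column-wise to both features:

```python
import numpy as np

def feature_scale(X):
    """Mean-normalize each column, then divide by its range (max - min)."""
    mean = X.mean(axis=0)
    rng = X.max(axis=0) - X.min(axis=0)
    return (X - mean) / rng

# Midterm scores and their squares from Question 1: two features on very different scales.
X = np.array([[89.0, 7921.0],
              [72.0, 5184.0],
              [94.0, 8836.0],
              [69.0, 4761.0]])
print(feature_scale(X))  # both columns now span a range of 1, centered on 0
```

With both columns on a comparable scale, the contours of J(θ) are less elongated, so gradient descent can take more direct steps toward the minimum.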