ML Coursera Python Assignments
import os  # used for manipulating directory paths
import numpy as np  # scientific and vector computation for python

# Plotting library
from matplotlib import pyplot
from mpl_toolkits.mplot3d import Axes3D  # needed to plot 3-D surfaces

# library written for this exercise providing additional functions for assignment submission, and others
import utils
Debugging
Here are some things to keep in mind throughout this exercise:
Python array indices start from zero, not one (contrary to OCTAVE/MATLAB).
There is an important distinction between Python arrays (called list or tuple) and numpy arrays. You should use
numpy arrays in all your computations. Vector/matrix operations work only with numpy arrays. Python lists do not
support vector operations (you need to use for loops).
If you are seeing many errors at runtime, inspect your matrix operations to make sure that you are adding and
multiplying matrices of compatible dimensions. Printing the dimensions of numpy arrays using the shape property will
help you debug.
By default, numpy interprets math operators as element-wise operators. If you want to do matrix multiplication, you
need to use the dot function in numpy. For example, if A and B are two numpy matrices, then the matrix operation
AB is np.dot(A, B). Note that for 2-dimensional matrices or vectors (1-dimensional), this is also equivalent to A @ B
(requires Python >= 3.5); a short demonstration follows.
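A minimal sketch of the element-wise versus matrix-product distinction, using small illustrative arrays:

A = np.array([[1, 2], [3, 4]])
B = np.array([[5, 6], [7, 8]])

print(A * B)         # element-wise product: [[ 5 12] [21 32]]
print(np.dot(A, B))  # matrix product AB:    [[19 22] [43 50]]
print(A @ B)         # same as np.dot(A, B) for 2-D arrays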
def warmUpExercise():
    """
    Example function in Python which computes the identity matrix.

    Returns
    -------
    A : array_like
        The 5x5 identity matrix.

    Instructions
    ------------
    Return the 5x5 identity matrix.
    """
    # ======== YOUR CODE HERE ======
    A = np.identity(5, dtype=float)  # modify this line
    # ==============================
    return A
The previous cell only defines the function warmUpExercise . We can now run it by executing the following cell to see its
output. You should see output similar to the following:
array([[ 1., 0., 0., 0., 0.],
[ 0., 1., 0., 0., 0.],
[ 0., 0., 1., 0., 0.],
[ 0., 0., 0., 1., 0.],
[ 0., 0., 0., 0., 1.]])
warmUpExercise()
# send the added functions to coursera grader for getting a grade on this part
grader[1] = warmUpExercise
grader.grade()
def plotData(x, y):
    """
    Plots the data points x and y into a new figure.

    Parameters
    ----------
    x : array_like
        Data point values for x-axis.

    y : array_like
        Data point values for y-axis. Note x and y should have the same size.

    Instructions
    ------------
    Plot the training data into a figure using the "figure" and "plot"
    functions. Set the axes labels using the "xlabel" and "ylabel" functions.
    Assume the population and revenue data have been passed in as the x
    and y arguments of this function.

    Hint
    ----
    You can use the 'ro' option with plot to have the markers
    appear as red circles. Furthermore, you can make the markers larger by
    using plot(..., 'ro', ms=10), where `ms` refers to marker size. You
    can also set the marker edge color using the `mec` property.
    """
    fig = pyplot.figure()  # open a new figure

    # ====================== YOUR CODE HERE =======================
    pyplot.plot(x, y, 'ro', ms=10, mec='k')
    pyplot.ylabel('Revenue')
    pyplot.xlabel('Population')
    # =============================================================
Now run the defined function with the loaded data to visualize the data. The end result should look like the following figure:
plotData(X, y)
To quickly learn more about the matplotlib plot function and what arguments you can provide to it, you can type
?pyplot.plot in a cell within the Jupyter notebook. This opens a separate page showing the documentation for the
requested function. You can also search online for plotting documentation.
To set the markers to red circles, we used the option 'ro' within the plot function.
?pyplot.plot
Recall that the parameters of your model are the θj values. These are the values you will adjust to minimize cost J(θ). One
way to do this is to use the batch gradient descent algorithm. In batch gradient descent, each iteration performs the update
$$ \theta_j = \theta_j - \alpha \frac{1}{m} \sum_{i=1}^{m} \left( h_\theta(x^{(i)}) - y^{(i)} \right) x_j^{(i)} $$

simultaneously update $\theta_j$ for all $j$.
With each step of gradient descent, your parameters θj come closer to the optimal values that will achieve the lowest cost
J(θ).
**Implementation Note:** We store each example as a row in the $X$ matrix in Python `numpy`. To take
into account the intercept term ($\theta_0$), we add an additional first column to $X$ and set it to all ones.
This allows us to treat $\theta_0$ as simply another 'feature'.
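With the ones column in place, the simultaneous update above can also be written in a vectorized form, which is what the implementations below compute with a single matrix product (a standard identity, stated here for reference):

$$ \theta := \theta - \frac{\alpha}{m} X^T \left( X\theta - \vec{y} \right) $$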
2.2.2 Implementation
We have already set up the data for linear regression. In the following cell, we add another dimension to our data to
accommodate the θ0 intercept term. Do NOT execute this cell more than once.
# Add a column of ones to X. The numpy function stack joins arrays along a given axis.
# The first axis (axis=0) refers to rows (training examples)
# and second axis (axis=1) refers to columns (features).
X = np.stack([np.ones(m), X], axis=1)
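Since running that cell a second time would try to stack a ones column onto the already-augmented matrix, a defensive variant (a sketch, not part of the assignment) could guard on the dimensionality of X:

# only augment while X is still the raw one-dimensional feature vector
if X.ndim == 1:
    X = np.stack([np.ones(m), X], axis=1)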
def computeCost(X, y, theta):
    """
    Compute cost for linear regression. Computes the cost of using theta as
    the parameter for linear regression to fit the data points in X and y.

    Parameters
    ----------
    X : array_like
        The input dataset of shape (m x n+1), where m is the number of examples,
        and n is the number of features. We assume a vector of ones has already
        been appended to the features so we have n+1 columns.

    y : array_like
        The values of the function at each data point. This is a vector of
        shape (m, ).

    theta : array_like
        The parameters for the regression function. This is a vector of
        shape (n+1, ).

    Returns
    -------
    J : float
        The value of the regression cost function.

    Instructions
    ------------
    Compute the cost of a particular choice of theta.
    You should set J to the cost.
    """
    # initialize some useful values
    m = y.size  # number of training examples

    # ====================== YOUR CODE HERE =====================
    # one possible vectorized implementation of J = (1/2m) sum((h - y)^2)
    J = (1 / (2 * m)) * np.sum(np.square(np.dot(X, theta) - y))
    # ===========================================================
    return J
Once you have completed the function, the next step will run computeCost two times using two different initializations of θ.
You will see the cost printed to the screen.
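A sketch of what those two calls might look like (the θ values here are illustrative, chosen only to show the cost changing):

J = computeCost(X, y, theta=np.array([0.0, 0.0]))
print('With theta = [0, 0], cost computed = {:.2f}'.format(J))

J = computeCost(X, y, theta=np.array([-1.0, 2.0]))
print('With theta = [-1, 2], cost computed = {:.2f}'.format(J))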
You should now submit your solutions by executing the following cell.
grader[2] = computeCost
grader.grade()
A vector in numpy is a one-dimensional array; for example, np.array([1, 2, 3]) is a vector. A matrix in numpy is a two-dimensional array, for example np.array([[1, 2, 3], [4, 5, 6]]). However, np.array([[1, 2, 3]]) is still considered a matrix since it has two dimensions, even though it has a shape of 1x3 (which looks like a vector).

Given the above, the function np.dot, which we will use for all matrix/vector multiplication, has the following properties (demonstrated in the snippet after this list):

It always performs inner products on vectors. If x = np.array([1, 2, 3]), then np.dot(x, x) is a scalar.

For matrix-vector multiplication, if X is an m × n matrix and y is a vector of length m, then the operation np.dot(y, X) considers y as a 1 × m vector. On the other hand, if y is a vector of length n, then the operation np.dot(X, y) considers y as an n × 1 vector.

A vector can be promoted to a matrix using y[None] or y[np.newaxis]. That is, if y = np.array([1, 2, 3]) is a
vector of size 3, then y[None, :] is a matrix of shape 1 × 3. We can use y[:, None] to obtain a shape of 3 × 1.
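A few concrete checks of those properties:

x = np.array([1, 2, 3])
print(np.dot(x, x))                 # 14, the inner product (a scalar)

X = np.array([[1, 2, 3],
              [4, 5, 6]])           # a 2 x 3 matrix
print(np.dot(X, x))                 # [14 32], x treated as a 3 x 1 vector
print(np.dot(np.array([1, 1]), X))  # [5 7 9], a length-2 vector treated as 1 x 2

print(x[None, :].shape)             # (1, 3)
print(x[:, None].shape)             # (3, 1)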
def gradientDescent(X, y, theta, alpha, num_iters):
    """
    Performs gradient descent to learn theta.

    Parameters
    ----------
    X : array_like
        The input dataset of shape (m x n+1).

    y : array_like
        Value at given features. A vector of shape (m, ).

    theta : array_like
        Initial values for the linear regression parameters.
        A vector of shape (n+1, ).

    alpha : float
        The learning rate.

    num_iters : int
        The number of iterations for gradient descent.

    Returns
    -------
    theta : array_like
        The learned linear regression parameters. A vector of shape (n+1, ).

    J_history : list
        A python list for the values of the cost function after each iteration.

    Instructions
    ------------
    Perform a single gradient step on the parameter vector theta.
    """
    # initialize some useful values
    m = y.shape[0]  # number of training examples

    # make a copy of theta, to avoid changing the original array, since numpy
    # arrays are mutable and shared with the caller
    theta = theta.copy()

    J_history = []  # use a python list to save cost in every iteration

    for i in range(num_iters):
        # ==================== YOUR CODE HERE =================================
        h = np.dot(X, theta)  # recompute the hypothesis with the current theta
        theta = theta - (alpha / m) * np.dot(X.T, h - y)
        # =====================================================================
        # save the cost J in every iteration
        J_history.append(computeCost(X, y, theta))

    return theta, J_history
After you are finished, call the implemented gradientDescent function and print the computed θ. We initialize the θ
parameters to 0 and the learning rate α to 0.01. Execute the following cell to check your code.
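A sketch of that cell, assuming the settings just described (the iteration count of 1500 matches the course starter code, but is illustrative here):

theta = np.zeros(2)  # initialize fitting parameters
iterations = 1500
alpha = 0.01

theta, J_history = gradientDescent(X, y, theta, alpha, iterations)
print('theta found by gradient descent: {:.4f}, {:.4f}'.format(*theta))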
We will use your final parameters to plot the linear fit. The results should look like the following figure.
Your final values for θ will also be used to make predictions on profits in areas of 35,000 and 70,000 people.
Note the way that the following lines use matrix multiplication, rather than explicit summation or looping, to
calculate the predictions. This is an example of code vectorization in `numpy`.
Note that the first argument to the `numpy` function `dot` is a python list. `numpy` can internally convert
**valid** python lists to numpy arrays when they are explicitly provided as arguments to `numpy` functions.
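A sketch of those prediction cells (assuming, as in the original dataset, populations in units of 10,000 and profits in units of $10,000):

predict1 = np.dot([1, 3.5], theta)  # population of 35,000
print('For population = 35,000, we predict a profit of {:.2f}'.format(predict1 * 10000))

predict2 = np.dot([1, 7], theta)    # population of 70,000
print('For population = 70,000, we predict a profit of {:.2f}'.format(predict2 * 10000))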
You should now submit your solutions by executing the next cell.
grader[3] = gradientDescent
grader.grade()
The purpose of these graphs is to show you how J(θ) varies with changes in θ0 and θ1 . The cost function J(θ) is bowl-
shaped and has a global minimum. (This is easier to see in the contour plot than in the 3D surface plot). This minimum is the
optimal point for θ0 and θ1 , and each step of gradient descent moves closer to this point.
# surface plot
fig = pyplot.figure(figsize=(12, 5))
ax = fig.add_subplot(121, projection='3d')
ax.plot_surface(theta0_vals, theta1_vals, J_vals, cmap='viridis')
pyplot.xlabel('theta0')
pyplot.ylabel('theta1')
pyplot.title('Surface')
# contour plot
# Plot J_vals as 20 contours spaced logarithmically between 0.01 and 1000
ax = pyplot.subplot(122)
pyplot.contour(theta0_vals, theta1_vals, J_vals, linewidths=2, cmap='viridis', levels=np.logspace(-2, 3, 20))
pyplot.xlabel('theta0')
pyplot.ylabel('theta1')
pyplot.plot(theta[0], theta[1], 'ro', ms=10, lw=2)
pyplot.title('Contour, showing minimum')
pass
Optional Exercises
If you have successfully completed the material above, congratulations! You now understand linear regression and should
be able to start using it on your own datasets.
For the rest of this programming exercise, we have included the following optional exercises. These exercises will help you
gain a deeper understanding of the material, and if you are able to do so, we encourage you to complete them as well. You
can still submit your solutions to these exercises to check if your answers are correct.
We start by loading and displaying some values from this dataset. By looking at the values, note that house sizes are about
1000 times the number of bedrooms. When features differ by orders of magnitude, first performing feature scaling can make
gradient descent converge much more quickly.
# Load data
data = np.loadtxt(os.path.join('Data', 'ex1data2.txt'), delimiter=',')
X = data[:, :2]
y = data[:, 2]
m = y.size
You will do this for all the features and your code should work with datasets of all sizes (any number of features / examples).
Note that each column of the matrix X corresponds to one feature.
**Implementation Note:** When normalizing the features, it is important to store the values used for
normalization - the mean value and the standard deviation used for the computations. After learning the
parameters from the model, we often want to predict the prices of houses we have not seen before. Given a
new x value (living room area and number of bedrooms), we must first normalize x using the mean and
standard deviation that we had previously computed from the training set.
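As a sketch (assuming the mu and sigma returned by the featureNormalize function defined next), normalizing a new example looks like:

x_new = np.array([1650, 3])        # living area and number of bedrooms
x_new_norm = (x_new - mu) / sigma  # normalize with the training-set statistics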
def featureNormalize(X):
    """
    Normalizes the features in X. Returns a normalized version of X where
    the mean value of each feature is 0 and the standard deviation
    is 1. This is often a good preprocessing step to do when working with
    learning algorithms.

    Parameters
    ----------
    X : array_like
        The dataset of shape (m x n).

    Returns
    -------
    X_norm : array_like
        The normalized dataset of shape (m x n).

    mu : array_like
        A vector of shape (n, ) holding the mean of each feature.

    sigma : array_like
        A vector of shape (n, ) holding the standard deviation of each feature.

    Instructions
    ------------
    First, for each feature dimension, compute the mean of the feature
    and subtract it from the dataset, storing the mean value in mu.
    Next, compute the standard deviation of each feature and divide
    each feature by its standard deviation, storing the standard deviation
    in sigma.

    Note that X is a matrix where each column is a feature and each row is
    an example. You need to perform the normalization separately for each feature.

    Hint
    ----
    You might find the 'np.mean' and 'np.std' functions useful.
    """
    # You need to set these values correctly
    X_norm = X.copy()
    mu = np.zeros(X.shape[1])
    sigma = np.zeros(X.shape[1])

    # =========================== YOUR CODE HERE =====================
    mu = np.mean(X, axis=0)    # per-feature mean
    sigma = np.std(X, axis=0)  # per-feature standard deviation
    X_norm = (X - mu) / sigma
    # ================================================================
    return X_norm, mu, sigma
grader[4] = featureNormalize
grader.grade()
After the featureNormalize function is tested, we now add the intercept term to X_norm :
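A sketch of that step (mirroring the np.concatenate call used later for the normal equations, and assuming X_norm and m from above):

X = np.concatenate([np.ones((m, 1)), X_norm], axis=1)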
You should complete the code for the functions computeCostMulti and gradientDescentMulti to implement the cost
function and gradient descent for linear regression with multiple variables. If your code in the previous part (single variable)
already supports multiple variables, you can use it here too. Make sure your code supports any number of features and is
well-vectorized. You can use the shape property of numpy arrays to find out how many features are present in the dataset.
**Implementation Note:** In the multivariate case, the cost function can also be written in the following
vectorized form:
$$ J(\theta) = \frac{1}{2m} \left( X\theta - \vec{y} \right)^T \left( X\theta - \vec{y} \right) $$

where the rows of $X$ are the training examples:

$$ X = \begin{pmatrix} - \left( x^{(1)} \right)^T - \\ - \left( x^{(2)} \right)^T - \\ \vdots \\ - \left( x^{(m)} \right)^T - \end{pmatrix} $$
def computeCostMulti(X, y, theta):
    """
    Compute cost for linear regression with multiple variables.
    Computes the cost of using theta as the parameter for linear
    regression to fit the data points in X and y.

    Parameters
    ----------
    X : array_like
        The dataset of shape (m x n+1).

    y : array_like
        A vector of shape (m, ) for the values at a given data point.

    theta : array_like
        The linear regression parameters. A vector of shape (n+1, ).

    Returns
    -------
    J : float
        The value of the cost function.

    Instructions
    ------------
    Compute the cost of a particular choice of theta. You should set J to the cost.
    """
    # Initialize some useful values
    m = y.shape[0]  # number of training examples

    # ================== YOUR CODE HERE ================================
    # the vectorized form: J = (1/2m) (X theta - y)^T (X theta - y)
    error = np.dot(X, theta) - y
    J = (1 / (2 * m)) * np.dot(error, error)
    # ==================================================================
    return J
grader[5] = computeCostMulti
grader.grade()
def gradientDescentMulti(X, y, theta, alpha, num_iters):
    """
    Performs gradient descent to learn theta.

    Parameters
    ----------
    X : array_like
        The dataset of shape (m x n+1).

    y : array_like
        A vector of shape (m, ) for the values at a given data point.

    theta : array_like
        The linear regression parameters. A vector of shape (n+1, ).

    alpha : float
        The learning rate for gradient descent.

    num_iters : int
        The number of iterations to run gradient descent.

    Returns
    -------
    theta : array_like
        The learned linear regression parameters. A vector of shape (n+1, ).

    J_history : list
        A python list for the values of the cost function after each iteration.

    Instructions
    ------------
    Perform a single gradient step on the parameter vector theta.
    """
    # Initialize some useful values
    m = y.shape[0]  # number of training examples
    # make a copy of theta to avoid changing the original array
    theta = theta.copy()

    J_history = []

    for i in range(num_iters):
        # ======================= YOUR CODE HERE ==========================
        theta = theta - (alpha / m) * np.dot(X.T, np.dot(X, theta) - y)
        # =================================================================
        # save the cost J in every iteration
        J_history.append(computeCostMulti(X, y, theta))

    return theta, J_history
grader[6] = gradientDescentMulti
grader.grade()
If your graph looks very different, especially if your value of J(θ) increases or even blows up, adjust your learning rate and
try again. We recommend trying values of the learning rate α on a log-scale, at multiplicative steps of about 3 times the
previous value (i.e., 0.3, 0.1, 0.03, 0.01 and so on). You may also want to adjust the number of iterations you are running if
that will help you see the overall trend in the curve.
**Implementation Note:** If your learning rate is too large, $J(\theta)$ can diverge and ‘blow up’, resulting in
values which are too large for computer calculations. In these situations, `numpy` will tend to return NaNs. NaN
stands for ‘not a number’ and is often caused by undefined operations that involve −∞ and +∞.
**MATPLOTLIB tip:** To compare how different learning rates affect convergence, it is helpful to plot
$J$ for several learning rates on the same figure. This can be done by making `alpha` a python list, looping
across the values within this list, and calling the plot function in every iteration of the loop (a sketch follows
this note). It is also useful to have a legend to distinguish the different lines within the plot. Search online for
`pyplot.legend` for help on showing legends in `matplotlib`.
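A minimal sketch of that loop (assuming the normalized X and y from above; the alpha values and iteration count are illustrative):

for alpha in [0.3, 0.1, 0.03, 0.01]:
    _, J_history = gradientDescentMulti(X, y, np.zeros(3), alpha, 50)
    pyplot.plot(np.arange(len(J_history)), J_history, lw=2, label='alpha = {}'.format(alpha))

pyplot.xlabel('Number of iterations')
pyplot.ylabel('Cost J')
pyplot.legend()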
Notice the changes in the convergence curves as the learning rate changes. With a small learning rate, you should find that
gradient descent takes a very long time to converge to the optimal value. Conversely, with a large learning rate, gradient
descent might not converge or might even diverge! Using the best learning rate that you found, run gradient
descent until convergence to find the final values of θ. Next, use this value of θ to predict the price of a house with 1650
square feet and 3 bedrooms. You will use this value later to check your implementation of the normal equations. Don't forget to
normalize your features when you make this prediction!
"""
Instructions
------------
We have provided you with the following starter code that runs
gradient descent with a particular learning rate (alpha).
Finally, you should complete the code at the end to predict the price
of a 1650 sq-ft, 3 br house.
Hint
----
At prediction, make sure you do the same feature normalization.
"""
# Choose some alpha value - change this
alpha = 0.1
num_iters = 400
# ===================================================================
You do not need to submit any solutions for this optional (ungraded) part.

The closed-form solution to linear regression is the normal equation:

$$ \theta = \left( X^T X \right)^{-1} X^T \vec{y} $$

Using this formula does not require any feature scaling, and you will get an exact solution in one calculation: there is no "loop
until convergence" as in gradient descent.
First, we will reload the data to ensure that the variables have not been modified. Remember that while you do not need to
scale your features, you still need to add a column of 1's to the X matrix to have an intercept term (θ0). The code in the next
cell will add the column of 1's to X for you.
# Load data
data = np.loadtxt(os.path.join('Data', 'ex1data2.txt'), delimiter=',')
X = data[:, :2]
y = data[:, 2]
m = y.size
X = np.concatenate([np.ones((m, 1)), X], axis=1)
Complete the code for the function normalEqn below to use the formula above to calculate θ.
def normalEqn(X, y):
    """
    Computes the closed-form solution to linear regression using the normal equations.

    Parameters
    ----------
    X : array_like
        The dataset of shape (m x n+1).

    y : array_like
        The value at each data point. A vector of shape (m, ).

    Returns
    -------
    theta : array_like
        Estimated linear regression parameters. A vector of shape (n+1, ).

    Instructions
    ------------
    Complete the code to compute the closed form solution to linear
    regression and put the result in theta.

    Hint
    ----
    Look up the function `np.linalg.pinv` for computing the matrix (pseudo) inverse.
    """
    theta = np.zeros(X.shape[1])

    # ===================== YOUR CODE HERE ============================
    theta = np.dot(np.linalg.pinv(np.dot(X.T, X)), np.dot(X.T, y))
    # =================================================================
    return theta
grader[7] = normalEqn
grader.grade()
Optional (ungraded) exercise: Now, once you have found θ using this method, use it to make a price prediction for a 1650-
square-foot house with 3 bedrooms. You should find that it gives the same predicted price as the value you obtained using
the model fit with gradient descent (in Section 3.2.1).
# ============================ YOUR CODE HERE ================================
# Estimate the price of a 1650 sq-ft, 3 br house using the normal-equation
# theta. No feature normalization is needed with the normal equations.
price = np.dot([1, 1650, 3], theta)
print('Predicted price of a 1650 sq-ft, 3 br house (using the normal equations): ${:.0f}'.format(price))
# ============================================================