Programming Exercise 3: Multi-class Classification and Neural Networks
Introduction
In this exercise, you will implement one-vs-all logistic regression and neural
networks to recognize hand-written digits. Before starting the programming
exercise, we strongly recommend watching the video lectures and completing
the review questions for the associated topics.
To get started with the exercise, you will need to download the starter
code and unzip its contents to the directory where you wish to complete the
exercise. If needed, use the cd command in Octave/MATLAB to change to
this directory before starting this exercise.
You can also find instructions for installing Octave/MATLAB in the Environment Setup Instructions of the course website.
Throughout the exercise, you will be using the scripts ex3.m and ex3_nn.m.
These scripts set up the dataset for the problems and make calls to functions
that you will write. You do not need to modify these scripts. You are only
required to modify functions in other files, by following the instructions in
this assignment.
1 Multi-class Classification
For this exercise, you will use logistic regression and neural networks to recognize handwritten digits (from 0 to 9). Automated handwritten digit recognition is widely used today, from recognizing zip codes (postal codes) on mail envelopes to recognizing amounts written on bank checks. This exercise will show you how the methods you've learned can be used for this classification task.
In the first part of the exercise, you will extend your previous implementation of logistic regression and apply it to one-vs-all classification.
[1] Octave is a free alternative to MATLAB. For the programming exercises, you are free to use either Octave or MATLAB.
1.1 Dataset
You are given a data set in ex3data1.mat that contains 5000 training examples of handwritten digits (a subset of the MNIST handwritten digit dataset, http://yann.lecun.com/exdb/mnist/). The .mat format means that the data has been saved in a native Octave/MATLAB matrix format, instead of a text (ASCII) format like a csv-file. These matrices can be read directly into your program by using the load command. After loading, matrices of the correct dimensions and values will appear in your program's memory. The matrices will already be named, so you do not need to assign names to them.
% Load saved matrices from file
load('ex3data1.mat');
% The matrices X and y will now be in your Octave environment
Each row of the matrix X holds one training example (a 20 pixel by 20 pixel grayscale image unrolled into a 400-dimensional vector), so the full training set is

$$ X = \begin{bmatrix} (x^{(1)})^T \\ (x^{(2)})^T \\ \vdots \\ (x^{(m)})^T \end{bmatrix}. $$
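As a quick sanity check after loading, you can inspect the dimensions; the sizes follow from the 5000 examples of 20 × 20 = 400 pixels each:

size(X)   % ans = 5000 400
size(y)   % ans = 5000 1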
The second part of the training set is a 5000-dimensional vector y that
contains labels for the training set. To make things more compatible with
Octave/MATLAB indexing, where there is no zero index, we have mapped
the digit zero to the value ten. Therefore, a 0 digit is labeled as 10, while
the digits 1 to 9 are labeled as 1 to 9 in their natural order.
1.2 Visualizing the data
You will begin by visualizing a subset of the training set. In Part 1 of ex3.m, the code randomly selects 100 rows from X and passes those rows to the displayData function. This function maps each row to a 20 pixel by 20 pixel grayscale image and displays the images together. We have provided the displayData function, and you are encouraged to examine the code to see how it works. After you run this step, you should see an image like Figure 1.
1.3 Vectorizing Logistic Regression
You will be using multiple one-vs-all logistic regression models to build a multi-class classifier. To make training efficient, it is important that your code is well vectorized; in this section, you will implement a vectorized version of logistic regression that does not employ any loops.

1.3.1 Vectorizing the cost function
Recall that in (unregularized) logistic regression, the cost function is

$$ J(\theta) = \frac{1}{m} \sum_{i=1}^{m} \left[ -y^{(i)} \log(h_\theta(x^{(i)})) - (1 - y^{(i)}) \log(1 - h_\theta(x^{(i)})) \right]. $$

To compute each element in the summation, we have to compute $h_\theta(x^{(i)})$ for every example $i$, where $h_\theta(x^{(i)}) = g(\theta^T x^{(i)})$ and $g(z) = \frac{1}{1 + e^{-z}}$ is the sigmoid function. It turns out that we can compute this quickly for all our examples by using matrix multiplication. Let us define $X$ and $\theta$ as
$$ X = \begin{bmatrix} (x^{(1)})^T \\ (x^{(2)})^T \\ \vdots \\ (x^{(m)})^T \end{bmatrix} \quad \text{and} \quad \theta = \begin{bmatrix} \theta_0 \\ \theta_1 \\ \vdots \\ \theta_n \end{bmatrix}. $$
Then, by computing the matrix product $X\theta$, we have

$$ X\theta = \begin{bmatrix} (x^{(1)})^T \theta \\ (x^{(2)})^T \theta \\ \vdots \\ (x^{(m)})^T \theta \end{bmatrix} = \begin{bmatrix} \theta^T x^{(1)} \\ \theta^T x^{(2)} \\ \vdots \\ \theta^T x^{(m)} \end{bmatrix}. $$
In the last equality, we used the fact that $a^T b = b^T a$ if $a$ and $b$ are vectors. This allows us to compute the products $\theta^T x^{(i)}$ for all our examples $i$ in one line of code.
Your job is to write the unregularized cost function in the file lrCostFunction.m. Your implementation should use the strategy we presented above to calculate $\theta^T x^{(i)}$. You should also use a vectorized approach for the rest of the cost function. A fully vectorized version of lrCostFunction.m should not contain any loops.
(Hint: You might want to use the element-wise multiplication operation (.*) and the sum operation sum when writing this function.)
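As a minimal sketch (not the official solution), the vectorized cost might look like the following, assuming m = length(y) has already been computed and that the sigmoid function from the previous exercise is on the path:

% Hypothesis for all m examples at once: h is an m x 1 vector
h = sigmoid(X * theta);

% Unregularized cost, using element-wise operations and sum (no loops)
J = (1 / m) * sum(-y .* log(h) - (1 - y) .* log(1 - h));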
1.3.2 Vectorizing the gradient
Recall that the gradient of the (unregularized) logistic regression cost is a vector where the $j$th element is defined as

$$ \frac{\partial J}{\partial \theta_j} = \frac{1}{m} \sum_{i=1}^{m} \left( h_\theta(x^{(i)}) - y^{(i)} \right) x_j^{(i)}. $$
To vectorize this operation over the dataset, we start by writing out all the partial derivatives explicitly for all $\theta_j$:

$$ \begin{bmatrix} \frac{\partial J}{\partial \theta_0} \\ \frac{\partial J}{\partial \theta_1} \\ \frac{\partial J}{\partial \theta_2} \\ \vdots \\ \frac{\partial J}{\partial \theta_n} \end{bmatrix} = \frac{1}{m} \begin{bmatrix} \sum_{i=1}^{m} (h_\theta(x^{(i)}) - y^{(i)}) x_0^{(i)} \\ \sum_{i=1}^{m} (h_\theta(x^{(i)}) - y^{(i)}) x_1^{(i)} \\ \sum_{i=1}^{m} (h_\theta(x^{(i)}) - y^{(i)}) x_2^{(i)} \\ \vdots \\ \sum_{i=1}^{m} (h_\theta(x^{(i)}) - y^{(i)}) x_n^{(i)} \end{bmatrix} = \frac{1}{m} \sum_{i=1}^{m} \left( h_\theta(x^{(i)}) - y^{(i)} \right) x^{(i)} = \frac{1}{m} X^T \left( h_\theta(x) - y \right) \qquad (1) $$

where

$$ h_\theta(x) - y = \begin{bmatrix} h_\theta(x^{(1)}) - y^{(1)} \\ h_\theta(x^{(2)}) - y^{(2)} \\ \vdots \\ h_\theta(x^{(m)}) - y^{(m)} \end{bmatrix}. $$
Note that $x^{(i)}$ is a vector, while $(h_\theta(x^{(i)}) - y^{(i)})$ is a scalar (single number). To understand the last step of the derivation, let $\beta_i = (h_\theta(x^{(i)}) - y^{(i)})$ and observe that:
$$ \sum_i \beta_i x^{(i)} = \begin{bmatrix} | & | & & | \\ x^{(1)} & x^{(2)} & \cdots & x^{(m)} \\ | & | & & | \end{bmatrix} \begin{bmatrix} \beta_1 \\ \beta_2 \\ \vdots \\ \beta_m \end{bmatrix} = X^T \beta, $$

where the values $\beta_i = (h_\theta(x^{(i)}) - y^{(i)})$.
The expression above allows us to compute all the partial derivatives
without any loops. If you are comfortable with linear algebra, we encourage
you to work through the matrix multiplications above to convince yourself
that the vectorized version does the same computations. You should now
implement Equation 1 to compute the correct vectorized gradient. Once you
are done, complete the function lrCostFunction.m by implementing the
gradient.
Debugging Tip: Vectorizing code can sometimes be tricky. One common strategy for debugging is to print out the sizes of the matrices you are working with using the size function. For example, given a data matrix X of size 100 × 20 (100 examples, 20 features) and θ, a vector with dimensions 20 × 1, you can observe that Xθ is a valid multiplication operation, while θX is not. Furthermore, if you have a non-vectorized version of your code, you can compare the output of your vectorized code and non-vectorized code to make sure that they produce the same outputs.
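Continuing the sketch above, Equation (1) then turns into a single line (again assuming h = sigmoid(X * theta) has already been computed):

% Vectorized gradient: X' is (n+1) x m and (h - y) is m x 1,
% so grad is an (n+1) x 1 vector, matching theta
grad = (1 / m) * (X' * (h - y));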
1.3.3 Vectorizing regularized logistic regression
After you have implemented vectorization for logistic regression, you will now
add regularization to the cost function. Recall that for regularized logistic
regression, the cost function is defined as
$$ J(\theta) = \frac{1}{m} \sum_{i=1}^{m} \left[ -y^{(i)} \log(h_\theta(x^{(i)})) - (1 - y^{(i)}) \log(1 - h_\theta(x^{(i)})) \right] + \frac{\lambda}{2m} \sum_{j=1}^{n} \theta_j^2. $$
Note that you should not be regularizing $\theta_0$, which is used for the bias term.
Correspondingly, the partial derivative of regularized logistic regression cost for $\theta_j$ is defined as

$$ \frac{\partial J(\theta)}{\partial \theta_0} = \frac{1}{m} \sum_{i=1}^{m} \left( h_\theta(x^{(i)}) - y^{(i)} \right) x_j^{(i)} \qquad \text{for } j = 0 $$

$$ \frac{\partial J(\theta)}{\partial \theta_j} = \left( \frac{1}{m} \sum_{i=1}^{m} \left( h_\theta(x^{(i)}) - y^{(i)} \right) x_j^{(i)} \right) + \frac{\lambda}{m} \theta_j \qquad \text{for } j \geq 1 $$
Octave/MATLAB Tip: When implementing the vectorization for regularized logistic regression, you might often want to only sum and update certain elements of θ. In Octave/MATLAB, you can index into the matrices to access and update only certain elements. For example, A(:, 3:5) = B(:, 1:3) will replace the columns 3 to 5 of A with the columns 1 to 3 from B. One special keyword you can use in indexing is the end keyword. This allows us to select columns (or rows) until the end of the matrix. For example, A(:, 2:end) will only return elements from the 2nd to the last column of A. Thus, you could use this together with the sum and .^ operations to compute the sum of only the elements you are interested in (e.g., sum(z(2:end).^2)). In the starter code, lrCostFunction.m, we have also provided hints on yet another possible method of computing the regularized gradient.
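As one possible sketch (the hints in the starter code may suggest a different route), the regularization terms can be added on top of the unregularized J and grad computed earlier; here lambda is the regularization parameter passed into lrCostFunction.m:

% Add the regularization term to the cost, skipping theta(1):
% Octave/MATLAB indexing starts at 1, so theta(1) is the bias term theta_0
J = J + (lambda / (2 * m)) * sum(theta(2:end) .^ 2);

% Regularize every gradient entry except the one for the bias term
grad(2:end) = grad(2:end) + (lambda / m) * theta(2:end);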
You should now submit your solutions.
1.4 One-vs-all Classification
In this part of the exercise, you will implement one-vs-all classification by completing the code in oneVsAll.m to train one regularized logistic regression classifier for each of the K classes in the dataset.

1.4.1 One-vs-all Prediction
After training your one-vs-all classifier, you can now use it to predict the
digit contained in a given image. For each input, you should compute the
probability that it belongs to each class using the trained logistic regression
classifiers. Your one-vs-all prediction function will pick the class for which the
corresponding logistic regression classifier outputs the highest probability and
return the class label (1, 2,..., or K) as the prediction for the input example.
You should now complete the code in predictOneVsAll.m to use the
one-vs-all classifier to make predictions.
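As a minimal sketch, assuming all_theta is a matrix whose ith row is the trained parameter vector for class i (variable names in your starter code may differ), the prediction could be computed as:

m = size(X, 1);

% Add the intercept term to every example
X = [ones(m, 1) X];

% m x K matrix: probability of each class for each example
probs = sigmoid(X * all_theta');

% For each example (row), pick the class with the highest probability
[~, p] = max(probs, [], 2);

Because the sigmoid is monotonically increasing, taking the max over X * all_theta' directly would give the same predictions.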
Once you are done, ex3.m will call your predictOneVsAll function using the learned value of Θ. You should see that the training set accuracy is about 94.9% (i.e., it classifies 94.9% of the examples in the training set correctly).
You should now submit your solutions.
2 Neural Networks
In the previous part of this exercise, you implemented multi-class logistic regression to recognize handwritten digits. However, logistic regression cannot form more complex hypotheses, as it is only a linear classifier.[3]
In this part of the exercise, you will implement a neural network to recognize handwritten digits using the same training set as before. The neural network will be able to represent complex models that form non-linear hypotheses. For this week, you will be using parameters from a neural network that we have already trained. Your goal is to implement the feedforward propagation algorithm to use our weights for prediction. In next week's exercise, you will write the backpropagation algorithm for learning the neural network parameters.

[3] You could add more features (such as polynomial features) to logistic regression, but that can be very expensive to train.
The provided script, ex3_nn.m, will help you step through this exercise.
2.1 Model representation
The script ex3_nn.m loads a set of network parameters (Theta1, Theta2) from a network that we have already trained into your environment; you will use these weights for prediction in the next section.
2.2 Feedforward Propagation and Prediction
Now you will implement feedforward propagation for the neural network. You will need to complete the code in predict.m to return the neural network's prediction.
You should implement the feedforward computation that computes $h_\theta(x^{(i)})$ for every example $i$ and returns the associated predictions. Similar to the one-vs-all classification strategy, the prediction from the neural network will be the label that has the largest output $(h_\theta(x))_k$.
Implementation Note: The matrix X contains the examples in rows. When you complete the code in predict.m, you will need to add the column of 1's to the matrix. The matrices Theta1 and Theta2 contain the parameters for each unit in rows. Specifically, the first row of Theta1 corresponds to the first hidden unit in the second layer. In Octave/MATLAB, when you compute $z^{(2)} = \Theta^{(1)} a^{(1)}$, be sure that you index (and if necessary, transpose) X correctly so that you get $a^{(l)}$ as a column vector.
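As a rough sketch, the feedforward pass can also be written row-wise over all examples at once (equivalent to the per-example column computation described in the note), assuming a sigmoid function is available:

m = size(X, 1);

% Input layer: add the column of 1's (bias units)
a1 = [ones(m, 1) X];

% Hidden layer: each row of Theta1 holds the weights of one hidden unit
a2 = sigmoid(a1 * Theta1');

% Output layer: add bias units again, then apply Theta2
a2 = [ones(m, 1) a2];
a3 = sigmoid(a2 * Theta2');

% The prediction is the label with the largest output (h_theta(x))_k
[~, p] = max(a3, [], 2);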
Once you are done, ex3_nn.m will call your predict function using the loaded set of parameters for Theta1 and Theta2. You should see that the accuracy is about 97.5%. After that, an interactive sequence will launch, displaying images from the training set one at a time, while the console prints out the predicted label for the displayed image. To stop the image sequence, press Ctrl-C.
You should now submit your solutions.
Submitted File          Points
lrCostFunction.m        30 points
oneVsAll.m              20 points
predictOneVsAll.m       20 points
predict.m               30 points
Total Points            100 points
You are allowed to submit your solutions multiple times, and we will take
only the highest score into consideration.