DL - Assignment 1
Implementation of Neural Network
Submission:
Submit all of your code and results in a single zip file named FirstName_RollNumber_01.zip
• Submit a single zip file containing:
(a) code (b) report (c) saved models (d) Readme.txt
• There should be a Report.pdf detailing your experience and highlighting any interesting results.
Kindly don't explain your code in the report; just explain the results. Your report should include
your comments on the results of all the steps, with images, for example what happened when
you changed the learning rate, etc.
• Readme.txt should explain how to run your code; preferably, it should accept command line
arguments, e.g. the dataset path used for training the model.
• The assignment is only acceptable in .py files. No Jupyter notebooks.
• In the root directory, there should be 2 python files, a report and a folder containing saved
models.
• Root directory should be named as FirstName_RollNumber_01
• Your code files should be named as ‘rollNumber_01_task1.py’.
• Follow all the naming conventions.
• For each convention, there is a 3% penalty if you don’t follow it.
• Email the instructor or TA if there are any questions. You cannot look at or use others’
code; however, you can discuss with each other. Plagiarism will lead to a straight zero, with
additional consequences as well.
• 2% (of obtained marks) deduction per day for late submission.
• The submissions will only be accepted till 27 April midnight.
• DON’T RESUBMIT THE DATASETS PROVIDED IN YOUR SUBMISSION.
Note: For this assignment (and for others in general) you are not allowed to search online for any kind of
implementation. Do not share code or look at others’ code. You should not be in possession of any
implementation related to the assignment other than your own. In case of any confusion, please reach
out to the TAs (email them or visit them during their office hours).
Objectives: In this assignment you will write the code for training neural networks. The goals of this
assignment are laid out in the tasks below.
Report: You have to write a report explaining your implementation logic: how and why you
distributed the data into three parts, i.e. train, validation, and test sets; how you arrived at the
optimized weights (how did you find the learning rate?); which activation function performs best on
the given dataset; and, finally, your own comments.
Task 2: Implement a Neural Network with a single hidden layer, multiple activation functions, and a
cross entropy loss function (10 Points)
Steps for the implementation are below:
1. You are given a class named Neural_Network which takes the sizes of the input, output and hidden
layers as parameters.
a. Input size: size of the input feature vector to our Neural Network. In our case it is 2; pass
it as a parameter.
b. Hidden layer: number of neurons in the hidden layer of the architecture.
c. Output size: size of the output layer, i.e., the number of neurons in the output layer.
d. Randomly initialize W1 and W2, i.e. the weight matrices connecting the input layer to the
hidden layer and the hidden layer to the output layer, respectively.
e. Pass the activation function as an input to the class, e.g. ‘Sigmoid’, ‘Tanh’ or ‘Relu’.
2. Below is the list of functions which you need to implement within the class Neural_Network;
a description of each function is provided in the source code. (A minimal sketch of the class
follows this list.)
a. Feedforward(self, X)
i. X is input feature(s)
ii. Return predicted vector(s)
b. Backpropagation(self, X, Y, y_pred, lr)
i. X is input feature(s)
ii. Y is actual label(s)
iii. y_pred is predicted value(s)
iv. lr is learning rate
c. Sigmoid(self, s)
i. Return sigmoid applied on s value(s)
d. Sigmoid_derivative(self, s)
i. Return derivative of sigmoid, on s
e. tanh(self, s)
i. Return tanh applied on s value(s)
f. tanh_derivative(self, s)
i. Return derivative of tanh, on s
g. relu(self, s)
i. Return relu applied on s value(s)
h. relu_derivative(self, s)
i. Return derivative of relu, on s
i. Crossentropy(self, Y, Y_pred)
i. Y is the actual label(s)
ii. Y_pred is the predicted label(s)
iii. Return error based on cross entropy
j. Train(self, trainX, trainY, epochs=100, learningRate=0.001, plot_err=True,
validationX=None, validationY=None)
i. trainX is the training feature dataset in row format
ii. trainY is the label of the dataset
iii. epochs is the number of times the entire dataset will be passed to the network
for training; default value is 100.
iv. learningRate is the constant used for weight update.
v. plot_err bool variable if you want to plot error on a graph or not
vi. validationX is the validation feature dataset in row format, show validation error
in each epoch
vii. validationY is the validation label of the dataset.
k. Save_model(self,name)
i. Save the model under the name of ‘name’
l. Load_model(self,name)
i. Load the model using ‘name’.
m. Accuracy(self, testX, testY)
i. testX is dataset for testing
ii. testY is the labels of test data
iii. plot accuracy on an image
iv. return accuracy
n. predict(self, testX)
i. testX is the test row feature
ii. return predicted value on testX
o. main()
i. Call all the functions to train and test the network in this function.
ii. Find the accuracy and loss curves of training and validation data and print the
accuracy of the test set and return all these values.
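For orientation, here is a minimal, hedged sketch of how such a class could be organized. It shows only the sigmoid path, assumes samples are stored one per row, assumes one-hot label matrices, and omits biases (the steps above only list W1 and W2); treat it as a starting shape, not the required implementation.

import numpy as np

class Neural_Network:
    def __init__(self, input_size, hidden_size, output_size, activation='Sigmoid'):
        # Randomly initialize the two weight matrices (no biases shown,
        # since the steps above only list W1 and W2).
        self.W1 = np.random.randn(input_size, hidden_size) * 0.01
        self.W2 = np.random.randn(hidden_size, output_size) * 0.01
        self.activation = activation

    def sigmoid(self, s):
        return 1.0 / (1.0 + np.exp(-s))

    def sigmoid_derivative(self, s):
        # Assumes s already holds sigmoid(z), as cached during feedforward.
        return s * (1.0 - s)

    def feedforward(self, X):
        # X: one sample per row, shape (n, input_size).
        self.a1 = self.sigmoid(X @ self.W1)      # hidden activations
        return self.sigmoid(self.a1 @ self.W2)   # output activations

    def crossentropy(self, Y, Y_pred):
        eps = 1e-12  # guard against log(0)
        return -np.mean(np.sum(Y * np.log(Y_pred + eps), axis=1))

    def backpropagation(self, X, Y, y_pred, lr):
        # For sigmoid outputs with cross-entropy, the output-layer delta
        # simplifies to (y_pred - Y).
        delta2 = y_pred - Y
        delta1 = (delta2 @ self.W2.T) * self.sigmoid_derivative(self.a1)
        self.W2 -= lr * (self.a1.T @ delta2)
        self.W1 -= lr * (X.T @ delta1)

    def train(self, trainX, trainY, epochs=100, learningRate=0.001):
        for _ in range(epochs):
            y_pred = self.feedforward(trainX)
            self.backpropagation(trainX, trainY, y_pred, learningRate)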
Task 3: Implement a multi-layer Neural Network for multi-class classification.
In this part, you will create a complete neural network architecture consisting of multiple layers. You
are required to report results with 2 hidden layers on the MNIST dataset. Follow the architecture of
your network as shown in the diagram below.
MNIST Dataset:
The dataset is attached with the assignment in the Task3_Data folder. This dataset contains 60000
training and 10000 test samples. Each sample is a grayscale image of size 28x28. There are 10
classes in which each sample is an image of one of the digits (0 to 9). Please note that in the MNIST
dataset there are 10 categories. If we randomly guess one category, there is a 1/10 probability that
it would be correct. Therefore, you cannot (theoretically) make a classifier that performs worse
than that. If you get less than 10% accuracy in any of your experiments, you can safely assume that
you are doing something fundamentally wrong. If your final results are less than 20% in terms of
accuracy, your solution(s) will not be graded. To load the dataset, we are providing you a function
load_dataset that returns the training and test sets (features and labels):
load_dataset(path_to_dataset)
Note: You can also use any other custom function you want to load the data.
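A hypothetical usage line; the exact return signature of the provided load_dataset is an assumption here, inferred from the variable names (train_set_x, train_set_y, test_set_x, test_set_y) used later in this handout:

# Hypothetical: check the provided function for the real return values.
train_set_x, train_set_y, test_set_x, test_set_y = load_dataset("Task3_Data")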
Points to keep in mind:
You can visualize an individual sample, for example:
import matplotlib.pyplot as plt
index = 25
plt.imshow(train_set_x[index])
plt.show()
To verify that all the data is loaded, print the dimensions of each variable; the shapes should match
the dataset description above (60000 training and 10000 test images of size 28x28).
Vectorization of Samples:
To input samples into the neural network, we need to convert each 2D matrix into a one-
dimensional vector. Reshape your images of shape (num_px, num_px, 1) to a flattened
numpy array of shape (num_px*num_px, 1). After reshaping, your training and test
datasets should be numpy arrays where each row represents a flattened image, i.e., the
data should have dimensions (num_samples, num_px*num_px).
HINT: there is a reshape function in numpy allowing you to reshape the whole matrix with a
single call. Do check that you have vectorized the images correctly.
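A minimal sketch of this step, using a dummy array in place of the loaded images:

import numpy as np

images = np.zeros((60000, 28, 28))          # stand-in for train_set_x
flat = images.reshape(images.shape[0], -1)  # one call flattens every image
print(flat.shape)                           # (60000, 784): one row per image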
Mean subtraction translates our dataset by the mean of the whole data, centering it at zero.
Validation Set:
Divide the training dataset, after flattening, into train and validation sets. You can try different
fractions for this division. You can use the sklearn.model_selection.train_test_split() function for this
task. Try different versions where you also shuffle the data before dividing it into these two sets.
Note: Do not use the final instances for the validation set. The dataset loading code loads the
instances in order, so you might not get any instances of the last class (which, given the ordering,
will probably be class 9) in the training set at all this way. Remember to shuffle the training data
before the train/test split, or use the library's built-in shuffling.
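A sketch of one possible split, with stand-ins for the flattened features and labels; the 90/10 fraction and the random seed are arbitrary choices to experiment with:

import numpy as np
from sklearn.model_selection import train_test_split

flat_x = np.zeros((60000, 784))   # stand-in for the flattened training images
y = np.zeros(60000, dtype=int)    # stand-in for the training labels
# shuffle=True guards against the in-order loading issue described above.
train_x, val_x, train_y, val_y = train_test_split(
    flat_x, y, test_size=0.1, shuffle=True, random_state=0)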
t-SNE: t-Distributed Stochastic Neighbor Embedding (t-SNE) is an unsupervised,
non-linear technique primarily used for data exploration and visualizing
high-dimensional data. In simpler terms, t-SNE gives you a feel or intuition of how
high dimensional data is arranged in space.
You can use the sklearn library to compute t-SNE and visualize data points, as sketched below.
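A minimal sketch using random stand-in data (t-SNE is slow, so a small subset such as 500 points is typical):

import numpy as np
import matplotlib.pyplot as plt
from sklearn.manifold import TSNE

x = np.random.rand(500, 784)          # stand-in for 500 flattened images
y = np.random.randint(0, 10, 500)     # stand-in for their class labels
emb = TSNE(n_components=2).fit_transform(x)  # project to 2-D
plt.scatter(emb[:, 0], emb[:, 1], c=y, cmap='tab10', s=8)
plt.colorbar()
plt.show()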
Once you have loaded the data, you need to implement the several helper functions below.
1. Data preprocessing
data = meanSubtraction(data)
Note that this function must be called before splitting the data into train, test and validation sets.
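A minimal sketch, assuming data has one sample per row; per the note above, apply it before splitting:

import numpy as np

def meanSubtraction(data):
    # Subtract the per-feature mean image computed over all samples (rows).
    return data - data.mean(axis=0)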
2. Initialize Network
net = init_network(no_of_layers, input_dim, neurons_per_layer)
For example if you pass following parameters to this function:
net = init_network(2, 784, [100, 50, 10])
● Use np.random.randn() to initialize the weight matrices, or you can use any other weight
initialization method you learned in class.
It should return you the network architecture with parameters initialized:
Size of net(1).w = 784x100
Size of net(1).b = 100
Size of net(2).w = 100x50
Size of net(2).b = 50
Size of net(3).w = 50x10
Size of net(3).b = 10
You should add the empty arrays in each layer to store the activations and local gradients of each layer;
this will help you in back-propagation.
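One possible sketch matching the sizes above; representing each layer as a dict is an assumption (any structure holding w, b, and slots for activations/local gradients works), and no_of_layers is implied here by the length of neurons_per_layer:

import numpy as np

def init_network(no_of_layers, input_dim, neurons_per_layer):
    # neurons_per_layer lists the hidden layers followed by the output layer.
    net = []
    prev = input_dim
    for n in neurons_per_layer:
        net.append({
            'w': np.random.randn(prev, n) * 0.01,
            'b': np.zeros(n),
            'a': None,           # activations, filled during feed-forward
            'local_grad': None,  # activation derivatives, for back-propagation
        })
        prev = n
    return net

net = init_network(2, 784, [100, 50, 10])
for i, layer in enumerate(net, 1):
    print(f"net({i}).w: {layer['w'].shape}  net({i}).b: {layer['b'].shape}")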
3. Training
net = train_sgd(net, train_set_x, train_set_y, test_set_x,
test_set_y, learning_rate, drop_out, batch_size, training_epochs)
• train_set_x input training data of MxN size, where M is the feature dimension (784 for the
flattened MNIST images)
• batch_size tells how many examples to pick for each iteration e.g. 20
• training_epochs how many training epochs to run.
Please make sure your code is modular. You can divide your training process into the following functions:
➢ feed-forward - this function will forward your input examples through the network and will
also compute local gradients along the way at each layer
➢ back-propagating the loss - multiplying gradients from above into the current local gradients
➢ validate/test - function will test how well your network is doing in terms of loss and
accuracy.
You need to convert the labels to one-hot encoding, because we now have 10 classes with
labels 0, 1, 2, ..., 9. For each training sample you need to generate a vector of length 10
whose entries are all zeros except at the index of its original label, which will be 1. For example:
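A minimal numpy sketch of this conversion (indexing into an identity matrix is one common idiom):

import numpy as np

labels = np.array([3, 0, 9])
one_hot = np.eye(10)[labels]   # label 3 -> [0 0 0 1 0 0 0 0 0 0]
print(one_hot)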
4. Feed Forward step
net = feedForward(net, batch_data, keep_prob)
z_i = W^T x_i + b
a_i = sigmoid(z_i)
Perform these two operations at all the layers and store the value of a in net(layer).a.
At this step, also store the value of the derivative of your activation function in net(layer).local_grad.
The derivative of sigmoid is σ(x)*(1 - σ(x)).
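A hedged sketch of feedForward, reusing the dict-based net from the init_network sketch above; it assumes sigmoid hidden layers, leaves the last layer as raw scores for the softmax in the next step, and accepts but ignores keep_prob (dropout):

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def feedForward(net, batch_data, keep_prob=1.0):
    # batch_data: one example per row. keep_prob is unused in this sketch.
    a = batch_data
    for layer in net[:-1]:                    # hidden layers: sigmoid
        z = a @ layer['w'] + layer['b']
        a = sigmoid(z)
        layer['a'] = a                        # store activations
        layer['local_grad'] = a * (1.0 - a)   # store sigmoid'(z)
    last = net[-1]
    last['a'] = a @ last['w'] + last['b']     # raw scores; softmax comes next
    return net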
5. Backpropagation Step
As the first step here, you will apply the soft-max on the output layer. Now for each input example
you have 10 output probabilities and you also have its label vector of the same length. You have to
implement softmax at the last layer and calculate the cross-entropy loss.
Once you have calculated the loss, you can compute the gradients dw at any layer using the
equations we discussed in class. Please note that you will essentially be running in reverse order.
Store dw and db for each layer.
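A hedged sketch of this step, continuing the conventions of the sketches above (the last layer's 'a' holds raw scores, hidden layers cached their sigmoid derivatives in 'local_grad'):

import numpy as np

def softmax(z):
    z = z - z.max(axis=1, keepdims=True)      # numerical stability
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def backward(net, batch_data, one_hot_y):
    n = batch_data.shape[0]
    probs = softmax(net[-1]['a'])             # softmax on the output scores
    loss = -np.sum(one_hot_y * np.log(probs + 1e-12)) / n
    delta = (probs - one_hot_y) / n           # softmax + cross-entropy gradient
    dw, db = [None] * len(net), [None] * len(net)
    for i in reversed(range(len(net))):       # walk the layers in reverse
        a_prev = net[i - 1]['a'] if i > 0 else batch_data
        dw[i] = a_prev.T @ delta
        db[i] = delta.sum(axis=0)
        if i > 0:                             # propagate through sigmoid'(z)
            delta = (delta @ net[i]['w'].T) * net[i - 1]['local_grad']
    return loss, dw, db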
6. Update Step
[net] = sgd (net, dw, db, learning_rate)
Again, here you will run a for loop iterating over the layers from first to last and update the weights
using the update rule.
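A matching sketch of the update, assuming the dw and db lists returned by the backward sketch above:

def sgd(net, dw, db, learning_rate):
    # Plain gradient-descent step on every layer's weights and biases.
    for layer, dwi, dbi in zip(net, dw, db):
        layer['w'] -= learning_rate * dwi
        layer['b'] -= learning_rate * dbi
    return net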
Note: You are not restricted to implementing the assignment exactly as explained above;
you can break down or merge the several functions, but you are required to implement it in a
modular way.
Report
For this assignment, and all other assignments and projects, you must write a report. In the report
you will describe the critical decisions you made, important things you learned, and why you wrote
your algorithm a particular way. You are required to report the accuracy you achieved.
For each experiment, you are required to provide analysis of various hyperparameters.
1. Plot loss and accuracy curves for both training and testing data with mean image subtraction
and without mean image subtraction. Report the difference in their accuracy and loss
curves.
2. Visualize data points using t-SNE technique
• Having a shallow network of 2 hidden layers, take the outputs of the 1st and 2nd hidden
layers, plot t-SNE for each, and place those images in your report. If you have more layers,
plot t-SNE after each layer for your best-accuracy model.
• Add more hidden layers to your network and report how this affects the t-SNE
plots (you can take a small subset of, e.g., 500 images for comparison).
3. Show and analyse the confusion matrix of the 10-class classification problem (see the sketch
after this list).
4. Plot loss and accuracy curves for different configurations of the architecture.
5. Report the accuracy by changing the number of neurons in the hidden layers.
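For report item 3, a minimal sketch with stand-in labels and predictions (your own predict output would replace them):

import numpy as np
import matplotlib.pyplot as plt
from sklearn.metrics import confusion_matrix

y_true = np.random.randint(0, 10, 1000)   # stand-in for the actual labels
y_pred = np.random.randint(0, 10, 1000)   # stand-in for your model's predictions
cm = confusion_matrix(y_true, y_pred)
plt.imshow(cm, cmap='Blues')              # rows: true class, columns: predicted
plt.xlabel('Predicted class')
plt.ylabel('True class')
plt.colorbar()
plt.show()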
Marks Division:
1. The marks division is as below:
a. Working code [50 points]
i. Task 2
1. Feedforward [10 points]
2. FeedBackward [10 points]
3. Activations [10 points]
4. Loss [05 points]
ii. Task 3
1. Feedforward [10 points]
2. FeedBackward [10 points]
3. Activations [10 points]
4. Loss [05 points]
5. Generic [10 points]
6. Mean Subtraction [05 points]
7. t-SNE plots [15 points]
b. Report [40 points]
i. Task 1
1. Computation Graph [10 points]
2. Mathematical Derivation [10 points]
ii. Task 2
1. Loss and accuracy curves [10 points]
2. Test Accuracy [05 points]
3. Analysis (in different experiments) [10 points]
iii. Task 3
1. Loss and accuracy curves with and without mean subtraction
[10 points]
2. Test Accuracy [05 points]
3. Confusion Matrix for Training, Validation and Test set [10 points]
4. t-SNE plot [10 points]
5. Analysis (in different experiments) [10 points]
Conclusion [10 points]