
Deep Learning for Mechanics (APL 745)

Homework – 4 (Optional)
Handed out: 03-04-2022 Submission due: NA

Instructions:

• We highly encourage handwritten homework for the derivations and other theoretical parts. Compile
a single report containing solutions, derivations, figures, etc.
• A zipped folder should be turned in on Sakai with the following naming scheme:
HW4_Enrl_No.zip
• Collaboration is encouraged; however, all submitted reports, programs, figures, etc. must be
an individual student's own write-up. Direct copying will be considered cheating.
• Discussion of the methods used and the results obtained for the programming assignment is
essential. Homework problems that simply provide computer output with no technical
discussion, algorithms, etc. will receive no credit.

Problem 1:
In this problem, we will develop, step by step, a recurrent neural network (RNN) with fully
connected layers to classify words. The RNN reads a word as a sequence of characters, outputting
a prediction and a hidden state at each step and feeding the previous hidden state into the next step.
We take the final step's prediction as the output, i.e., the class the word belongs to.

Load the following packages for this problem:


• PyTorch is a deep learning package (other deep learning packages may also be used).
• NumPy is the fundamental package for scientific computing with Python.
• Matplotlib is a popular library for plotting graphs in Python.
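As a minimal sketch, loading these packages might look as follows (the headless Matplotlib backend is an assumption, useful when figures are saved to files rather than displayed):

```python
import torch                     # deep learning framework (others work too)
import torch.nn as nn
import numpy as np               # fundamental scientific-computing package
import matplotlib
matplotlib.use("Agg")            # headless backend: save figures instead of showing them
import matplotlib.pyplot as plt
```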

Step 1: In this problem, we consider a few thousand surnames from 18 languages of origin and
predict which language a name comes from based on its spelling. The data folder contains one
language.txt file per language: Greek, French, Czech, Dutch, Polish, Scottish, Chinese, English,
Italian, Portuguese, Japanese, German, Russian, Korean, Arabic, Vietnamese, Spanish, and Irish.
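One way to load this data is sketched below. The directory layout (`data/<Language>.txt`, one surname per line) is an assumption; adjust the path to match the provided data folder. Accented characters are normalized to plain ASCII so every name maps onto a fixed character vocabulary, and each name becomes a sequence of one-hot character vectors:

```python
import glob, os, string, unicodedata
import torch

all_letters = string.ascii_letters + " .,;'"
n_letters = len(all_letters)          # 57 possible characters

def unicode_to_ascii(s):
    # Strip accents, e.g. 'Slusàrski' -> 'Slusarski'
    return "".join(c for c in unicodedata.normalize("NFD", s)
                   if unicodedata.category(c) != "Mn" and c in all_letters)

def load_names(data_dir="data"):
    # Assumed layout: data/<Language>.txt, one surname per line
    category_lines, categories = {}, []
    for path in glob.glob(os.path.join(data_dir, "*.txt")):
        lang = os.path.splitext(os.path.basename(path))[0]
        categories.append(lang)
        with open(path, encoding="utf-8") as f:
            category_lines[lang] = [unicode_to_ascii(line.strip()) for line in f]
    return category_lines, categories

def name_to_tensor(name):
    # Shape (len(name), 1, n_letters): a sequence of one-hot character vectors
    t = torch.zeros(len(name), 1, n_letters)
    for i, ch in enumerate(name):
        t[i][0][all_letters.find(ch)] = 1
    return t
```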

Step 2: Define the following model structures in separate Python files:
• Model configuration 1: build two linear layers that operate on the input and hidden
state, with 128 hidden units in both layers.
• Model configuration 2: build two linear layers that operate on the input and hidden
state, with 256 hidden units in both layers.
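A sketch of one such model is below, following the common char-RNN design in which both linear layers act on the concatenated (input, hidden) vector. The `tanh` nonlinearity on the hidden state is a design choice, not prescribed by the assignment:

```python
import torch
import torch.nn as nn

class CharRNN(nn.Module):
    """Two linear layers operating on the concatenated (input, hidden) vector."""
    def __init__(self, input_size, hidden_size, output_size):
        super().__init__()
        self.hidden_size = hidden_size
        self.i2h = nn.Linear(input_size + hidden_size, hidden_size)  # next hidden state
        self.i2o = nn.Linear(input_size + hidden_size, output_size)  # per-step prediction
        self.softmax = nn.LogSoftmax(dim=1)

    def forward(self, x, hidden):
        combined = torch.cat((x, hidden), dim=1)
        hidden = torch.tanh(self.i2h(combined))
        output = self.softmax(self.i2o(combined))
        return output, hidden

    def init_hidden(self):
        return torch.zeros(1, self.hidden_size)

# Model configuration 1 (128 hidden units) and 2 (256), with 18 output classes
n_letters, n_categories = 57, 18
rnn128 = CharRNN(n_letters, 128, n_categories)
rnn256 = CharRNN(n_letters, 256, n_categories)
```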

Step 3: Train the RNN.

Step 4: Test the RNN model using the test data, making sure that gradient calculation is
disabled during this inference stage.
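Disabling gradient calculation at inference time is done in PyTorch with `torch.no_grad()`, as sketched below (a plain `nn.Linear` stands in for the trained RNN just to keep the example self-contained):

```python
import torch
import torch.nn as nn

model = nn.Linear(4, 2)           # stand-in for the trained RNN model

def predict(model, x):
    model.eval()                  # inference mode (affects dropout/batch-norm if present)
    with torch.no_grad():         # no autograd graph is built inside this block
        return model(x)

y = predict(model, torch.randn(1, 4))
```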
Write computer code to answer the following questions:
(A) Use a suitable loss function for training the RNN model.
(B) Train all of the above model configurations with the following learning-rate and weight-decay
training configurations:
• Training configuration 1: 0.005
• Training configuration 2: 0.001
(C) Plot the following:
• Epoch vs. loss.
• A confusion matrix, indicating for every actual language (rows) which
language the network guesses (columns).
(D) Finally, predict the language for the following names:
'Geoffrey', 'Hinton', 'Yann', 'LeCun', 'Yoshua', 'Bengio'
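The questions above can be sketched as follows. Interpreting the two training-configuration values as learning rates is an assumption (the sheet lists one number per configuration); `NLLLoss` is one suitable choice of loss for part (A) since it pairs with a log-softmax output. The dummy one-hot sequence and target class stand in for real name data:

```python
import torch
import torch.nn as nn

class CharRNN(nn.Module):
    def __init__(self, input_size, hidden_size, output_size):
        super().__init__()
        self.hidden_size = hidden_size
        self.i2h = nn.Linear(input_size + hidden_size, hidden_size)
        self.i2o = nn.Linear(input_size + hidden_size, output_size)

    def forward(self, x, hidden):
        combined = torch.cat((x, hidden), dim=1)
        output = nn.functional.log_softmax(self.i2o(combined), dim=1)
        return output, torch.tanh(self.i2h(combined))

criterion = nn.NLLLoss()               # (A) pairs with the log-softmax output

def train_step(rnn, optimizer, seq, target):
    hidden = torch.zeros(1, rnn.hidden_size)
    for i in range(seq.size(0)):       # feed the name one character at a time
        output, hidden = rnn(seq[i], hidden)
    loss = criterion(output, target)   # loss on the final-step prediction
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()

# (B) the two training configurations, interpreted here as learning rates
for lr in (0.005, 0.001):
    rnn = CharRNN(57, 128, 18)
    opt = torch.optim.SGD(rnn.parameters(), lr=lr)
    seq = torch.zeros(5, 1, 57)        # dummy 5-character one-hot name
    seq[:, 0, 0] = 1
    losses = [train_step(rnn, opt, seq, torch.tensor([3])) for _ in range(20)]
```

The recorded `losses` list gives the data for the epoch-vs-loss plot in (C); the confusion matrix and the predictions in (D) reuse the inference routine from Step 4 over the test names.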

Problem 2:
Table 1 depicts two matrices. The first (the 5 × 5 one) represents an image; the second
(the 3 × 3 one) represents a convolution kernel. (Consider the bias term to be zero.)
a. How many values will be generated if we forward-propagate the image through the given
convolution kernel?
b. Calculate these values.
c. Suppose the gradient backpropagated from the layers above this layer is a 3 × 3 matrix of all
1s. Write down the value of the gradient (with respect to the input) backpropagated out of this layer.
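Since Table 1 is not reproduced here, the sketch below uses a hypothetical image and kernel just to illustrate the mechanics: (a) a valid (no-padding, stride-1) convolution of a 5 × 5 input with a 3 × 3 kernel yields a 3 × 3 output, i.e. 9 values; (c) the gradient with respect to the input is obtained by scattering each upstream-gradient entry back through the kernel window it came from (equivalently, a "full" convolution of the upstream gradient with the 180°-rotated kernel):

```python
import numpy as np

img = np.arange(25, dtype=float).reshape(5, 5)   # hypothetical stand-in for Table 1
ker = np.array([[1., 0., 1.],
                [0., 1., 0.],
                [1., 0., 1.]])                   # hypothetical 3x3 kernel, bias = 0

def conv2d_valid(x, k):
    """Cross-correlation with no padding, stride 1 (the usual 'conv' layer)."""
    kh, kw = k.shape
    H, W = x.shape[0] - kh + 1, x.shape[1] - kw + 1
    out = np.zeros((H, W))
    for i in range(H):
        for j in range(W):
            out[i, j] = np.sum(x[i:i + kh, j:j + kw] * k)
    return out

out = conv2d_valid(img, ker)   # (a) 5x5 input, 3x3 kernel -> 3x3 output = 9 values

# (c) gradient w.r.t. the input: each upstream-gradient entry g_out[i, j] is
# scattered back over the 3x3 input window that produced output (i, j),
# weighted by the kernel.
g_out = np.ones((3, 3))        # upstream gradient of all 1s
g_in = np.zeros_like(img)
for i in range(3):
    for j in range(3):
        g_in[i:i + 3, j:j + 3] += g_out[i, j] * ker
```

With an all-ones upstream gradient, each input pixel's gradient is simply the sum of the kernel weights that ever overlap it, so interior pixels accumulate more terms than corner pixels.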

Problem 3:
Sentence classification is a common problem in NLP, and there are many ways to solve it. At its
simplest, you could use a machine learning algorithm such as logistic regression, or anything else you
can think of, such as a multilayer perceptron (MLP). You could even use ConvNets or
RNNs to produce a good sentence vector, which you can feed into a linear classification layer. The goal of this
question is to make sure that you have a clear picture in your mind of all these possible techniques.
Let's say you have a corpus of 50K words. For the simplicity of this question, assume you have a
trained word-embedding matrix of size [50K × 300] available, which can give you a word vector
of size [1 × 300]. Consider one sentence of 10 words for the classification task and describe your
approach for the techniques below.
a. Design a one-layer ConvNet that first maps the sentence to a vector of length 5 (with the
help of convolution and pooling), then feeds this vector to a fully connected layer with softmax
to get the probability values for 3 possible classes.
b. Clearly mention the sizes of your input, kernel, and outputs at each step (until you get the final [3 × 1]
output vector from the softmax).
c. Describe the effect of a small filter size vs. a large filter size during convolution. What
would be your approach to selecting filter sizes for a classification task?
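One possible answer to (a) and (b) is sketched below; the filter width of 3 is an assumed choice (part (c) asks you to justify such choices yourself). Five filters, each spanning the full 300-dimensional embedding, produce five feature maps of length 10 − 3 + 1 = 8; max-over-time pooling collapses each map to a scalar, giving the length-5 sentence vector:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SentenceCNN(nn.Module):
    """One conv layer + max-over-time pooling -> length-5 vector -> softmax over 3 classes."""
    def __init__(self, emb_dim=300, n_filters=5, filter_width=3, n_classes=3):
        super().__init__()
        # Each filter spans the full embedding dimension: effective kernel 3 x 300
        self.conv = nn.Conv1d(emb_dim, n_filters, kernel_size=filter_width)
        self.fc = nn.Linear(n_filters, n_classes)

    def forward(self, sent):                 # sent: (1, 10, 300) word vectors
        x = sent.transpose(1, 2)             # -> (1, 300, 10) for Conv1d
        x = torch.relu(self.conv(x))         # -> (1, 5, 8): 10 - 3 + 1 = 8 positions
        x = F.max_pool1d(x, x.size(2))       # max over time -> (1, 5, 1)
        x = x.squeeze(2)                     # -> (1, 5): the length-5 sentence vector
        return F.softmax(self.fc(x), dim=1)  # -> (1, 3) class probabilities

sent = torch.randn(1, 10, 300)               # 10 words x 300-dim embeddings
probs = SentenceCNN()(sent)
```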
d. How can a simple RNN that is trained for language modeling be used to get the sentence
vector?
e. Design a simple RNN that first maps the sentence to a vector of length 50 (with the help of
its recurrent hidden states), then feeds this vector to a fully connected layer with softmax to get
the probability values for 4 possible classes.
f. Clearly mention the sizes of all the RNN components, such as your input vector, hidden-layer
weight matrix, hidden state vectors, cell state vector, and output layers (the RNN component
sizes will be the same at each time step).
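One possible answer to (e) and (f) is sketched below, using the final hidden state (length 50) as the sentence vector. For (f), the input-to-hidden weight matrix is 50 × 300, the hidden-to-hidden matrix is 50 × 50, each hidden state vector is 50 × 1, and the output layer is 4 × 50; note that a vanilla RNN has no cell state (that component belongs to an LSTM), at every time step:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SentenceRNN(nn.Module):
    """Plain RNN: the last hidden state (length 50) is the sentence vector."""
    def __init__(self, emb_dim=300, hidden=50, n_classes=4):
        super().__init__()
        # Internally: W_xh is 50x300, W_hh is 50x50, hidden state 50x1 per step
        self.rnn = nn.RNN(emb_dim, hidden, batch_first=True)
        self.fc = nn.Linear(hidden, n_classes)          # output layer: 4x50

    def forward(self, sent):                            # sent: (1, 10, 300)
        _, h_n = self.rnn(sent)                         # h_n: (1, 1, 50), final hidden state
        return F.softmax(self.fc(h_n.squeeze(0)), dim=1)  # (1, 4) class probabilities

probs = SentenceRNN()(torch.randn(1, 10, 300))
```

For (d), the same idea applies to a pretrained language-model RNN: run the sentence through it and read off the final hidden state as a fixed-length sentence vector, with no extra training of the recurrent weights required.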
