Assignment 3

The document outlines Assignment #3 for CS 224d, focusing on Recursive Neural Networks (RNNs) for sentiment analysis. It includes instructions for problem-solving, coding deliverables, and emphasizes individual work while adhering to the Honor Code. Key tasks involve deriving equations, implementing a model, and achieving a minimum accuracy of 75% on sentiment classification.

CS 224d: Assignment #3

Updated Thursday 12th May, 2016 at 11:30pm

Due date: 5/21 11:59 PM PST (You are allowed to use three (3) late days maximum for this
assignment)

This handout consists of several homework problems, as well as instructions on the “deliverables”
associated with the coding portions of this assignment.

These questions require thought, but do not require long answers. Please be as concise as possible.

We encourage students to discuss in groups for assignments. However, each student must finish the
problem set and programming assignment individually, and must turn in her/his assignment. We ask
that you abide by the university Honor Code and that of the Computer Science department, and make
sure that all of your submitted work is done by yourself.

Please review any additional instructions posted on the assignment page at
http://cs224d.stanford.edu/assignments.html. When you are ready to submit, please follow the
instructions on the course website.

1 RNNs (Recursive Neural Networks)

Welcome to SAIL (Stanford Artificial Intelligence Lab): Congrats! You have just been given a
Research Assistantship in SAIL. Your task is to discover the power of Recursive Neural Networks (RNNs).
So you plan an experiment to show their effectiveness on Positive/Negative Sentiment Analysis. In this part,
you will derive the forward and backpropagation equations, implement them, and test the results.

Our RNN has one ReLU layer and one softmax layer, and uses Cross Entropy loss as its cost function.
We follow the parse tree given from the leaf nodes up to the top of the tree and evaluate the cost at each
node. During backprop, we follow the exact opposite path. Figure 1 shows an example of such an RNN
applied to a simple sentence, "I love this assignment". These equations are sufficient to explain our model:
\[ CE(y, \hat{y}) = -\sum_i y_i \log(\hat{y}_i) \]

where $y$ is the label represented as a one-hot row vector, and $\hat{y}$ is a row vector containing the predicted
probabilities for all classes. In our case, $y \in \mathbb{R}^{1 \times 5}$ and $\hat{y} \in \mathbb{R}^{1 \times 5}$ to represent our 5 sentiment classes: Really
Negative, Negative, Neutral, Positive, and Really Positive. Furthermore,
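\[ h^{(1)} = \max\left( \left[ h^{(1)}_{Left}, h^{(1)}_{Right} \right] W^{(1)} + b^{(1)}, 0 \right) \]

\[ \hat{y} = \mathrm{softmax}\left( h^{(1)} U + b^{(s)} \right) \]

where $h^{(1)}_{Left}$ is the vector representation of the left subtree (possibly a word vector) and $h^{(1)}_{Right}$ is that of the
right subtree. For clarity, $L \in \mathbb{R}^{|V| \times d}$, $W^{(1)} \in \mathbb{R}^{2d \times d}$, $b^{(1)} \in \mathbb{R}^{1 \times d}$, $U \in \mathbb{R}^{d \times 5}$, $b^{(s)} \in \mathbb{R}^{1 \times 5}$.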
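
The forward equations above can be sketched in NumPy as a recursion over a binary parse tree. This is only an illustration under stated assumptions: the tuple encoding of the tree, the bracketing of the sentence, the node labels, and the choice to evaluate the cost only at internal nodes are all hypothetical and are not taken from Figure 1 or the starter code.

```python
import numpy as np

def softmax(z):
    # Subtract the row max for numerical stability; z has shape (1, k).
    e = np.exp(z - z.max(axis=1, keepdims=True))
    return e / e.sum(axis=1, keepdims=True)

def cross_entropy(y, y_hat):
    # CE(y, y_hat) = -sum_i y_i log(y_hat_i), with y a one-hot row vector.
    return float(-np.sum(y * np.log(y_hat)))

def forward(node, params):
    """Return (h, total_cost) for a subtree.

    A node is either a word index (leaf) or a tuple (left, right, label).
    This encoding is hypothetical, chosen only for the sketch.
    """
    L, W1, b1, U, bs = params
    if isinstance(node, int):
        # Leaf: look up the word vector as a (1, d) row.
        # Simplification: no cost is evaluated at leaves here.
        return L[node:node + 1], 0.0
    left, right, label = node
    h_left, cost_left = forward(left, params)
    h_right, cost_right = forward(right, params)
    # h = max([h_left, h_right] W1 + b1, 0)
    h = np.maximum(np.hstack([h_left, h_right]) @ W1 + b1, 0.0)
    y_hat = softmax(h @ U + bs)
    y = np.eye(5)[label:label + 1]  # one-hot row vector of shape (1, 5)
    return h, cost_left + cost_right + cross_entropy(y, y_hat)

rng = np.random.default_rng(0)
V, d = 4, 3  # toy vocabulary: I, love, this, assignment
params = (
    rng.normal(scale=0.1, size=(V, d)),      # L
    rng.normal(scale=0.1, size=(2 * d, d)),  # W1
    np.zeros((1, d)),                        # b1
    rng.normal(scale=0.1, size=(d, 5)),      # U
    np.zeros((1, 5)),                        # bs
)
# Assumed bracketing (I (love (this assignment))) with made-up labels.
tree = (0, (1, (2, 3, 2), 3), 4)
h_root, total_cost = forward(tree, params)
print(h_root.shape, total_cost)
```

With small random weights every prediction is close to uniform, so the total cost over the three internal nodes should sit near 3 log 5.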

(a) (20 points) Follow the example parse tree in Figure 1 in which we are given a parse tree and truth labels
$y$ for each node. Starting with Node 1, then to Node 2, finishing with Node 3, write the update rules for
$W^{(1)}$, $b^{(1)}$, $U$, $b^{(s)}$, and $L$ after the evaluation of $\hat{y}$ against our truth, $y$. This means that at each node, we
evaluate:

Figure 1: RNN (Recursive Neural Network) example

\[ \delta_3 = \hat{y} - y \]

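as our first error vector and we backpropagate that error through the network, aggregating the gradient at
each node for:

\[ \frac{\partial J}{\partial U}, \quad \frac{\partial J}{\partial b^{(s)}}, \quad \frac{\partial J}{\partial W^{(1)}}, \quad \frac{\partial J}{\partial b^{(1)}}, \quad \frac{\partial J}{\partial L_i} \]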
Points will be deducted if you do not express the derivative of activation functions (ReLU) in terms of
their function values (as with Assignment 1 and 2) or do not express the gradients by using an “error
vector” ($\delta_i$) propagated back to each layer. Tip on notation: $\delta_{below}$ and $\delta_{above}$ should be used for error
that is being sent down to the next node, or came from an above node. This will help you think about
the problem in the right way. Note you should not be updating gradients for $L_i$ in Node 1. But error
should be leaving Node 1 for sure!
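
As a sanity check on the first error vector, the sketch below compares ŷ − y against a finite-difference gradient of the cross-entropy cost with respect to the pre-softmax scores. It also shows one way to write the ReLU derivative using only the function value. The variable names and the toy inputs are assumptions made for illustration; this is not the full derivation requested above.

```python
import numpy as np

def softmax(z):
    # Subtract the row max for numerical stability; z has shape (1, k).
    e = np.exp(z - z.max(axis=1, keepdims=True))
    return e / e.sum(axis=1, keepdims=True)

def cost(z, y):
    # Cross-entropy of softmax(z) against a one-hot row vector y.
    return float(-np.sum(y * np.log(softmax(z))))

rng = np.random.default_rng(1)
z = rng.normal(size=(1, 5))   # pre-softmax scores, standing in for h U + b_s
y = np.eye(5)[3:4]            # one-hot label for class index 3

delta_analytic = softmax(z) - y   # the error vector y_hat - y

# Central finite differences, one coordinate at a time.
eps = 1e-6
delta_numeric = np.zeros_like(z)
for i in range(z.shape[1]):
    step = np.zeros_like(z)
    step[0, i] = eps
    delta_numeric[0, i] = (cost(z + step, y) - cost(z - step, y)) / (2 * eps)

max_diff = float(np.abs(delta_analytic - delta_numeric).max())
print(max_diff)

# ReLU derivative expressed through the function value h = max(a, 0):
a = np.array([[-1.5, 0.0, 2.0]])
h = np.maximum(a, 0.0)
relu_grad = (h > 0).astype(float)   # 1 where h > 0, else 0
print(relu_grad)
```

The same finite-difference loop can be pointed at any parameter to check a hand-derived gradient before trusting it in training.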

(b) (80 points) Implementation time! We have simplified the problem to reduce training time by binarizing
the sentiment labels. This means that all the sentences are either positive or negative. The internal
nodes, however, can be positive, negative, or neutral. While training, the cost function includes predictions
over all nodes that have a sentiment associated with them and ignores the neutral nodes. While testing,
we are only interested in the performance at the full sentence level. This is all provided for you in the
starter code.
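
The training and testing rules described above can be sketched as follows. This is not the starter code: the `NEUTRAL` marker, the function names, the array layout, and averaging the loss over labeled nodes (rather than summing) are assumptions made for illustration.

```python
import numpy as np

NEUTRAL = -1  # hypothetical marker for nodes without a sentiment label

def masked_cross_entropy(probs, labels):
    """Mean cross-entropy over labeled nodes only.

    probs:  (n_nodes, 2) predicted probabilities for [negative, positive]
    labels: (n_nodes,) entries in {0, 1, NEUTRAL}
    """
    labels = np.asarray(labels)
    keep = labels != NEUTRAL          # drop neutral nodes from the cost
    if not keep.any():
        return 0.0
    picked = probs[keep, labels[keep]]   # probability of the true class
    return float(-np.log(picked).mean())

def root_accuracy(root_probs, root_labels):
    # Test-time metric: compare only the prediction at each sentence root.
    return float((root_probs.argmax(axis=1) == np.asarray(root_labels)).mean())

# Three nodes of one tree; the middle node is neutral and is ignored.
probs = np.array([[0.9, 0.1], [0.5, 0.5], [0.2, 0.8]])
labels = [0, NEUTRAL, 1]
loss = masked_cross_entropy(probs, labels)

# Root predictions for four sentences; three of the four are correct.
root_probs = np.array([[0.3, 0.7], [0.6, 0.4], [0.1, 0.9], [0.8, 0.2]])
root_labels = [1, 0, 0, 0]
acc = root_accuracy(root_probs, root_labels)
print(loss, acc)
```

Masking before the logarithm keeps neutral nodes from contributing either cost or gradient, while the accuracy function looks only at sentence-level predictions.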

• (a) Download, unzip, and have a look at the code base.
• (b) From the command line, run ./setup.sh to download the labeled parse tree dataset.
• (c) Begin implementing the model in rnn.py by filling out the outline.
• (d) Run the model and show us the resulting loss plot, final training accuracy, and final validation
accuracy. You should be able to achieve an accuracy of 75%.
• (e) (10 points extra credit) Tune the parameters of the model to improve performance and report
your new loss plot, final training accuracy, and final validation accuracy. Explain why the changes you
made helped achieve the improvement.
• (f) Ensure that your code is using the best model config and the trained weights are present in the
weights folder. Run ./prepare_submission.sh and submit the resulting zip file.
