UCS664 EST 23
Note: Attempt any six questions. Assume missing data, if any, suitably
Q.2 i. A sequential model has been designed to train a Named Entity Recognizer (NER). The model consists of an embedding layer followed by a GRU layer (with 100 neurons) and a time-distributed output layer. The model has been trained on the standard CoNLL-2003 dataset containing 30,000 sentences with 17 distinct NER tags. The model has been trained on the 36,000 most frequent words with a maximum sequence length of 110 words. The embedding layer generates embeddings of 300 dimensions.
a) Compute the total number of parameters in the embedding layer, the GRU layer, and the output layer. (4)
b) If the model has been trained using the Adam optimization technique with a learning rate of 2e-5 and a batch size of 128, how many iterations of Adam are required to complete one epoch? (1) CO4 L4
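For reference, a minimal sketch of the described model, assuming a TensorFlow/Keras implementation (the framework is not named in the question); model.summary() reports the per-layer counts asked for in part (a), and the last line gives the batches per epoch for part (b). Note that Keras's default GRU (reset_after=True) uses two bias vectors per gate, so its count can differ slightly from the textbook GRU formula.

import math
import tensorflow as tf

VOCAB, SEQ_LEN, EMB_DIM, GRU_UNITS, NUM_TAGS = 36_000, 110, 300, 100, 17

model = tf.keras.Sequential([
    tf.keras.Input(shape=(SEQ_LEN,)),
    tf.keras.layers.Embedding(input_dim=VOCAB, output_dim=EMB_DIM),
    tf.keras.layers.GRU(GRU_UNITS, return_sequences=True),
    tf.keras.layers.TimeDistributed(
        tf.keras.layers.Dense(NUM_TAGS, activation="softmax")),
])
model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate=2e-5),
              loss="categorical_crossentropy")
model.summary()                    # per-layer parameter counts (part a)
print(math.ceil(30_000 / 128))     # Adam iterations per epoch over 30,000 sentences (part b)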
ii. Write an expression to generate the activation (c^<t>) at any t-th time-stamp in a GRU network. Also derive the gradient of the loss function at the t-th time-stamp w.r.t. W_uc and W_rx (where symbols have their usual meanings). Consider binary cross-entropy as the loss function. (5) CO3 L2
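For reference, one common GRU formulation consistent with the c^<t>, W_uc and W_rx symbols used above (the exact split of the weight matrices is an assumption), written in LaTeX:

\Gamma_u = \sigma\big(W_{uc}\, c^{\langle t-1 \rangle} + W_{ux}\, x^{\langle t \rangle} + b_u\big)
\Gamma_r = \sigma\big(W_{rc}\, c^{\langle t-1 \rangle} + W_{rx}\, x^{\langle t \rangle} + b_r\big)
\tilde{c}^{\langle t \rangle} = \tanh\big(W_{cc}\,(\Gamma_r \odot c^{\langle t-1 \rangle}) + W_{cx}\, x^{\langle t \rangle} + b_c\big)
c^{\langle t \rangle} = \Gamma_u \odot \tilde{c}^{\langle t \rangle} + (1 - \Gamma_u) \odot c^{\langle t-1 \rangle}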
Q.3 i. Consider the following unique words present in a corpus:
ABLE, APE, BEATABLE, CAP, CHILDREN, CHIEF, CHILDLESS, CHILL,
CHILDLIKE, CHILDISH, CODE, FIXABLE, READ, READABLE, READING,
READS, RED, ROPE, RIPE
Using the given vocabulary and the entropy-based letter successor variety method, find the stem of the word CHILDREN. (4) CO2 L3
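A minimal sketch, in Python, of how the successor-letter entropy of each prefix of CHILDREN could be tabulated from the vocabulary above; the rule for reading off the stem boundary (e.g. a peak or threshold in the entropy profile) follows whatever convention the course defines.

from collections import Counter
from math import log2

vocab = ["ABLE", "APE", "BEATABLE", "CAP", "CHILDREN", "CHIEF", "CHILDLESS", "CHILL",
         "CHILDLIKE", "CHILDISH", "CODE", "FIXABLE", "READ", "READABLE", "READING",
         "READS", "RED", "ROPE", "RIPE"]

def successor_entropy(prefix, words):
    # Distribution of the letter that follows `prefix` among words sharing it.
    nxt = Counter(w[len(prefix)] for w in words
                  if w.startswith(prefix) and len(w) > len(prefix))
    total = sum(nxt.values())
    if total == 0:
        return 0.0
    return -sum((c / total) * log2(c / total) for c in nxt.values())

for i in range(1, len("CHILDREN") + 1):
    prefix = "CHILDREN"[:i]
    print(prefix, round(successor_entropy(prefix, vocab), 3))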
ii. Explain how the Word2Vec Skip-gram model can be modeled using a feed-forward network. (2) CO1 L1
iii. What is the major limitation of the Skip-gram model? How does Skip-gram with Negative Sampling (SGNS) handle it? Write an expression for the cost
function of SGNS and derive the gradient of the cost function w.r.t. the word vector of the central word (v_c). (4) CO1 L2
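For reference, one standard form of the SGNS objective for a centre word c and an observed context word o, with K negative samples (the notation may differ from the course's), written in LaTeX:

J = -\log \sigma\big(u_o^{\top} v_c\big) - \sum_{k=1}^{K} \log \sigma\big(-u_k^{\top} v_c\big)

\frac{\partial J}{\partial v_c} = \big(\sigma(u_o^{\top} v_c) - 1\big)\, u_o + \sum_{k=1}^{K} \sigma\big(u_k^{\top} v_c\big)\, u_k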
Q.4 i. A BERT-small model with 8 encoder blocks, each with 6 self-attention heads, has been trained for a sequence classification task with three output classes. The input to the model is a sequence of maximum length 100, with each token represented by a 512-dimensional embedding. The feed-forward layer used in each BERT block has 128 neurons in its first hidden layer. Compute the total number of trainable parameters of the model. (5) CO4 L4
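As an organizational aid only, the sketch below tallies the self-attention and feed-forward parameters per encoder block from the figures given in the question; whether LayerNorm, token/position embeddings, or a pooler are also counted depends on the assumptions stated in the answer, so they are left out here and flagged in comments.

d, heads, blocks = 512, 6, 8              # hidden size, attention heads, encoder blocks
d_ff, num_classes = 128, 3

attn = 4 * (d * d + d)                    # Q, K, V and output projections with biases
ffn = (d * d_ff + d_ff) + (d_ff * d + d)  # d -> d_ff -> d feed-forward with biases
per_block = attn + ffn                    # LayerNorm parameters omitted (assumption)

head = d * num_classes + num_classes      # classification head on the [CLS] vector
# Token embeddings need the vocabulary size (not given); position embeddings
# would add 100 * d if the chosen convention counts them.
print(per_block, blocks * per_block + head)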
ii. Explain how the BERT model is fine-tuned for intent classification, slot filling, and extractive question answering tasks. Explain the input layer, output layer, and loss function used when fine-tuning the model for each of these tasks. (5) CO4 L3
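If ready-made task heads are acceptable as an illustration, the Hugging Face transformers library mirrors the three set-ups directly (the checkpoint name and slot-label count below are placeholders, not part of the question):

from transformers import (BertForSequenceClassification,  # intent: [CLS] -> linear -> softmax, cross-entropy loss
                          BertForTokenClassification,     # slot filling: per-token linear -> softmax, cross-entropy loss
                          BertForQuestionAnswering)       # extractive QA: start/end span logits, cross-entropy over positions

intent = BertForSequenceClassification.from_pretrained("bert-base-uncased", num_labels=3)
slots = BertForTokenClassification.from_pretrained("bert-base-uncased", num_labels=10)
qa = BertForQuestionAnswering.from_pretrained("bert-base-uncased")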
Q.5 i. A single-layer LSTM model has been designed with a single neuron in the hidden layer. The weight and bias matrices for the forget gate, output gate, input gate, and candidate update are given below (where symbols have their usual meanings). For example, W_fx indicates the weight matrix of the forget gate that takes the x-type input.
W_fx = [0.7 0.45], W_fa = [0.1], b_f = [0.15]; W_ix = [0.95 0.8], W_ia = [0.8], b_i = [0.65]
W_ox = [0.6 0.4], W_oa = [0.25], b_o = [0.1]; W_cx = [0.45 0.25], W_ca = [0.15], b_c = [0.2]
Also consider the input at the second time-stamp and the previous long- and short-term cell contents as follows:
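The numeric inputs for the second time-stamp are not reproduced in this copy of the paper. Purely as a reference for the computation, the sketch below evaluates one LSTM step with the weight values above; the input x_t and the previous cell contents are hypothetical placeholders, not the exam's data.

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Weights and biases from the question (single hidden unit, two-dimensional input).
W_fx, W_fa, b_f = np.array([0.7, 0.45]), 0.1, 0.15
W_ix, W_ia, b_i = np.array([0.95, 0.8]), 0.8, 0.65
W_ox, W_oa, b_o = np.array([0.6, 0.4]), 0.25, 0.1
W_cx, W_ca, b_c = np.array([0.45, 0.25]), 0.15, 0.2

# Hypothetical placeholders -- NOT the exam's second-time-stamp values.
x_t, a_prev, c_prev = np.array([1.0, 2.0]), 0.0, 0.0

f_t = sigmoid(W_fx @ x_t + W_fa * a_prev + b_f)     # forget gate
i_t = sigmoid(W_ix @ x_t + W_ia * a_prev + b_i)     # input gate
o_t = sigmoid(W_ox @ x_t + W_oa * a_prev + b_o)     # output gate
c_hat = np.tanh(W_cx @ x_t + W_ca * a_prev + b_c)   # candidate update
c_t = f_t * c_prev + i_t * c_hat                    # new long-term cell content
a_t = o_t * np.tanh(c_t)                            # new short-term cell content
print(c_t, a_t)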
Q.6 i. What are positional encodings? Why are they required in transformer
models? (2) CO4 L1
ii. A transformer-based machine translation model is trained to translate French sentences to English. Generate the positional encodings for the input French sentence ‘cette image est cliqué par moi’. Use dimensionality (d) = 4 and scaling factor (n) = 100. (4) CO4 L3
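A minimal sketch, assuming the standard sinusoidal scheme PE(pos, 2i) = sin(pos / n^{2i/d}) and PE(pos, 2i+1) = cos(pos / n^{2i/d}), for the six tokens of the given sentence with d = 4 and n = 100:

import numpy as np

def positional_encoding(seq_len, d, n):
    pe = np.zeros((seq_len, d))
    for pos in range(seq_len):
        for i in range(d // 2):
            angle = pos / (n ** (2 * i / d))
            pe[pos, 2 * i] = np.sin(angle)      # even dimensions use sine
            pe[pos, 2 * i + 1] = np.cos(angle)  # odd dimensions use cosine
    return pe

# 'cette image est cliqué par moi' -> 6 tokens, d = 4, n = 100
print(np.round(positional_encoding(6, 4, 100), 4))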
iii. The output of the machine translation model trained in part (ii) is ‘the picture the picture by me’. Compute the BLEU score for this translation, given the following reference translations:
Reference Translation-1: this picture is clicked by me
Reference Translation-2: this picture was clicked by me (4) CO4 L3
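A minimal sketch of the clipped n-gram precisions and brevity penalty for this candidate against the two references; it assumes the standard BLEU-4 geometric mean without smoothing, whereas the course may restrict the computation to fewer n-gram orders.

from collections import Counter
from math import exp, log

def ngrams(tokens, n):
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

def modified_precision(cand, refs, n):
    cand_counts = Counter(ngrams(cand, n))
    max_ref = Counter()
    for ref in refs:
        for g, c in Counter(ngrams(ref, n)).items():
            max_ref[g] = max(max_ref[g], c)
    clipped = sum(min(c, max_ref[g]) for g, c in cand_counts.items())
    return clipped / max(1, sum(cand_counts.values()))

cand = "the picture the picture by me".split()
refs = [r.split() for r in ["this picture is clicked by me",
                            "this picture was clicked by me"]]

ps = [modified_precision(cand, refs, n) for n in range(1, 5)]
ref_len = 6                                         # both references have six tokens
bp = 1.0 if len(cand) >= ref_len else exp(1 - ref_len / len(cand))
bleu = bp * exp(sum(log(p) for p in ps) / len(ps)) if all(p > 0 for p in ps) else 0.0
print([round(p, 3) for p in ps], round(bleu, 4))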
Q.7 i. Why can deep neural networks not handle sequential data? (2) CO1 L2
ii. What do you mean by language modeling? How are Recurrent Neural Networks (RNNs) used for language modeling? (2) CO2 L1
iii. What do you mean by the perplexity of a language model? How is it computed? (2) CO2 L1
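For reference, one standard definition of the perplexity of a language model on a held-out sequence w_1, …, w_N, written in LaTeX:

PP(W) = P(w_1, \dots, w_N)^{-\frac{1}{N}} = \exp\left(-\frac{1}{N} \sum_{i=1}^{N} \log P\big(w_i \mid w_1, \dots, w_{i-1}\big)\right)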
iv. What is the use of a dropout layer in neural networks? How is inverted dropout implemented? (2) CO1 L2
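A minimal sketch of inverted dropout in Python (framework-free, for illustration only): activations are masked and rescaled by the keep probability during training so that no scaling is needed at test time.

import numpy as np

def inverted_dropout(a, keep_prob, training=True):
    if not training:
        return a                                 # test time: use activations as-is
    mask = np.random.rand(*a.shape) < keep_prob  # keep each unit with probability keep_prob
    return a * mask / keep_prob                  # rescale so the expected activation is unchanged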
v. What do you mean by the vanishing gradient problem in RNNs? How is it handled? (2) CO2 L2