
NLP Endsem 2016

Time: 3 hrs Total Marks: 70 Be Precise


======================================================
Important: Answer all questions from PART A, and any 3 out of 4
questions from Part B.
======================================================
Part A (40 marks)
(Answer all questions from this part)
1. Constructing a bilingual dictionary (say English to Hindi) is a non-trivial task.
How can machine learning help? Be specific in identifying the algorithm used and
explaining how parameters are estimated. [3]
2. Identify the specific problem that the Viterbi algorithm solves in the context of
HMMs. Explain the central intuition behind the algorithm using an example. [3]
3. What is CBOW in the context of Word2Vec? Why is it useful? [2]
4. Which of polysemy or synonymy is LSA better at handling, and why? Given a
rectangular matrix of size 2 x 3, how would you compute the SVD of this matrix by
hand? [2+2]
5. You are given a set of sentences from an unknown language that has never
been studied to date. How can you use Expectation Maximization to arrive at the
correct parse of these sentences? [3]
6. Are there situations where the parameters learnt from a corpus for a PCFG
succeed in correctly parsing some sentences, but fail on others? If
yes, explain with an example, and suggest a fix. If no, justify (your justification
must be accompanied by a proof sketch). [3]
7. We can use distributional models of similarity (KL divergence or its symmetrised
version) to estimate document and term relatedness from a corpus. What
advantage does a method like Latent Semantic Analysis have over this approach?
[2]
8. Bottom-up filtering is used to improve the efficiency of top-down parsers. Is this
true? If yes, how? If no, correct the statement and justify. [2]
9. Apart from decision trees, identify a rule induction technique that addresses a
classical problem in NLP. Discuss briefly the central idea behind the approach.
[3]
10. How is the success of HMM parameter learning related to an important property
of KL divergence? [2]
11. What limitation of Laplace smoothing does Good-Turing smoothing overcome?
Where can Good-Turing smoothing fail? Suggest a repair to overcome this
shortcoming. [2+1.5+1.5]
12. Can you think of any NLP task where knowledge of recall can reduce uncertainty
about precision (or vice versa)? Explain. [2]
13. Briefly explain the connection between branching factor and perplexity with an
example. [3]
14. What are Hearst patterns and what are they used for? Explain briefly with two
examples. Identify a limitation of Hearst patterns. [2+2]
15. There are only two words, A and B, in a corpus, and two (hidden) topics that can
generate these words. We have ten documents, each containing 20 tokens of types A
and B. Suggest an approach to estimate (a) the probabilities with which each
topic generates A and B, and (b) the degree to which each document belongs to the
two topics. State clearly all assumptions you make. Are there any
criteria that the document collection should ideally satisfy? [6]
16. What are rhetorical relations and in which context are they useful? Explain with
two examples. [3]

(Please Turn Over)


Part B (30 marks)
(Answer any three questions from this part)

1. (a) Use dynamic programming to compute the edit distance between the words
WRONG and WINGS, assuming that the costs of insertion and deletion are 2 and
the cost of substitution is 1. [6]

(b) Assume that there are only two senses of the word bank in WordNet (one
pertaining to the financial institution, and the other to the “river bank”). In a given
piece of raw text, the distributional neighbours of bank are {account, deposit,
river}. Given that each of these neighbours can have multiple senses as well, how
would you go about assigning dominance-based ranks to the two senses of bank?
Show the steps in detail. [4]
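
(A minimal Python sketch for 1(a), not part of the original paper: the standard
dynamic program with the stated costs of 2 for insertion/deletion and 1 for
substitution, assuming that a match costs 0.)

def edit_distance(src, tgt, ins=2, dele=2, sub=1):
    # dp[i][j] = minimum cost of transforming src[:i] into tgt[:j]
    n, m = len(src), len(tgt)
    dp = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        dp[i][0] = i * dele                          # delete all of src[:i]
    for j in range(1, m + 1):
        dp[0][j] = j * ins                           # insert all of tgt[:j]
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost_sub = 0 if src[i - 1] == tgt[j - 1] else sub
            dp[i][j] = min(dp[i - 1][j] + dele,          # deletion
                           dp[i][j - 1] + ins,           # insertion
                           dp[i - 1][j - 1] + cost_sub)  # substitution / match
    return dp[n][m]

print(edit_distance("WRONG", "WINGS"))               # 4 under these costs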

2. Consider a Machine Translation parallel corpus having three sentence pairs. The
first sentence pair is “go there fast”/“jaldi udhar jaao”. The second sentence pair is
“go there”/“udhar jaao”. The third sentence pair is “go”/“jaao”. (a) Show how the
first few iterations of EM are useful in learning word alignments from this corpus.
Make clear any simplifying assumptions on top of IBM Model 3. (b) How is extra
knowledge “getting generated” in successive iterations of EM? [8+2]
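
(A rough Python sketch for 2(a), not the expected exam answer: EM for lexical
translation probabilities simplified all the way down to IBM Model 1, i.e. no
fertility, distortion or NULL word, run on the three sentence pairs above.)

from collections import defaultdict

corpus = [("go there fast".split(), "jaldi udhar jaao".split()),
          ("go there".split(),      "udhar jaao".split()),
          ("go".split(),            "jaao".split())]

e_vocab = {e for e_sent, _ in corpus for e in e_sent}
f_vocab = {f for _, f_sent in corpus for f in f_sent}

t = {(f, e): 1.0 / len(f_vocab) for f in f_vocab for e in e_vocab}  # uniform start

for iteration in range(3):                       # "first few iterations" of EM
    count = defaultdict(float)                   # expected (f, e) co-occurrence counts
    total = defaultdict(float)                   # expected counts per source word e
    for e_sent, f_sent in corpus:
        for f in f_sent:
            z = sum(t[(f, e)] for e in e_sent)   # E-step normaliser for this f
            for e in e_sent:
                p = t[(f, e)] / z                # posterior that f aligns to e
                count[(f, e)] += p
                total[e] += p
    for (f, e) in t:                             # M-step: re-estimate t(f|e)
        if total[e] > 0:
            t[(f, e)] = count[(f, e)] / total[e]

print(sorted(t.items(), key=lambda kv: -kv[1])[:5])  # strongest translation pairs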

3. What limitations of the basic parsing techniques does the CYK parser address? Are
there assumptions on the grammar rules that CYK can deal with? If yes, what are
they? Given the grammar below and the input string w = (()(())), show the
steps in chart parsing using CYK. Alongside your charts showing each step,
mention clearly the rule(s) that is(are) used (if any) to advance to this step from
the previous one. [1.5 + 1.5 + 7]
S → S S
S → ( S1
S1 → S )
S → ( )
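
(For orientation only, not part of the original paper: a minimal CYK recogniser
sketch in Python. CYK expects Chomsky Normal Form, so the rules above are
rewritten here with assumed preterminals L → ( and R → ), giving
S → S S | L S1 | L R and S1 → S R.)

unary  = {'(': {'L'}, ')': {'R'}}                    # preterminal rules
binary = {('S', 'S'): {'S'}, ('L', 'S1'): {'S'},
          ('L', 'R'): {'S'}, ('S', 'R'): {'S1'}}     # A -> B C rules

def cyk(w, start='S'):
    n = len(w)
    # chart[i][j] = set of nonterminals deriving w[i..j] inclusive
    chart = [[set() for _ in range(n)] for _ in range(n)]
    for i, ch in enumerate(w):
        chart[i][i] = set(unary.get(ch, set()))
    for span in range(2, n + 1):                     # increasing span length
        for i in range(n - span + 1):
            j = i + span - 1
            for k in range(i, j):                    # split point
                for b in chart[i][k]:
                    for c in chart[k + 1][j]:
                        chart[i][j] |= binary.get((b, c), set())
    return start in chart[0][n - 1]

print(cyk("(()(()))"))                               # True: w is derivable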

4. A PCFG is based on the following rules:


a. S → A B
b. B → D A
c. B → D A C
d. A → A C
e. A → a
f. A → b c
g. A → b d e
h. C → f g h
i. D → i
The corpus has the following two sentences, the first occurring 15 times and the
second 30 times:
1. a i b c f g h
2. b c i b d e
(a) Are the sentences accepted by the grammar? If both of them are, which of
the two sentences is/are ambiguous? Show all possible parse trees of the
sentence(s).
(b) Make an APPROPRIATE initial choice of the rule probabilities. Show the
first three steps of the EM algorithm for estimating the parameters of this
PCFG. [3+7]
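
(For reference, and not part of the original paper: the M-step of EM for a PCFG
re-estimates each rule probability from expected rule counts gathered under the
current parameters, which in standard notation is)

\hat{P}(A \to \beta) = \frac{E[\mathrm{count}(A \to \beta)]}{\sum_{\gamma} E[\mathrm{count}(A \to \gamma)]}

where the expectations are taken over the parses of the frequency-weighted corpus
(here, 15 copies of sentence 1 and 30 copies of sentence 2) under the current rule
probabilities.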

== The End ==
