Natural Language Processing
Assignment 3
TYPE OF QUESTION: MCQ
Number of questions: 10; Total marks: 10 x 1 = 10
Question 1: Consider the following Wikipedia text on Extractive Summarization:
Here, content is extracted from the original data, but the extracted content is not modified in any
way. Examples of extracted content include key-phrases that can be used to tag or index a text
document, or key sentences including headings that collectively comprise an abstract, and
representative images or video segments, as stated above. For text, extraction is analogous to
the process of skimming, where the summary (if available), headings and subheadings, figures,
the first and last paragraphs of a section, and optionally the first and last sentences in a
paragraph are read before one chooses to read the entire document in detail. Other examples
of extraction that include key sequences of text in terms of clinical relevance…
Which of the following is correct? (Consider lowercase characters; ignore punctuation)
a. Pcontinuation(and) > Pcontinuation(in)
b. Pcontinuation(and) < Pcontinuation(in)
c. Pcontinuation(and) = Pcontinuation(in)
d. Data insufficient
Answer: c
Solution: Pcontinuation(w) is proportional to the number of unique bigram types ending in w. In the passage, “and” is preceded by four distinct words ({abstract, headings, first, section}) and “in” is also preceded by four distinct words ({modified, sentences, document, text}). “first and” appears twice but is counted only once, since we count only unique bigram types that end with “and”. Hence the two continuation probabilities are equal.
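The counts can be verified with a short script (a minimal sketch; tokenization by lowercasing and stripping punctuation is assumed, as the question instructs):

import re

# Passage from Question 1.
text = ("Here, content is extracted from the original data, but the extracted content is not "
        "modified in any way. Examples of extracted content include key-phrases that can be used "
        "to tag or index a text document, or key sentences including headings that collectively "
        "comprise an abstract, and representative images or video segments, as stated above. For "
        "text, extraction is analogous to the process of skimming, where the summary (if "
        "available), headings and subheadings, figures, the first and last paragraphs of a "
        "section, and optionally the first and last sentences in a paragraph are read before one "
        "chooses to read the entire document in detail. Other examples of extraction that include "
        "key sequences of text in terms of clinical relevance")

# Lowercase and drop punctuation, as the question instructs.
tokens = re.findall(r"[a-z]+", text.lower())

def continuation_count(word):
    # Number of distinct words preceding `word`, i.e. unique bigram types ending in `word`.
    return len({prev for prev, cur in zip(tokens, tokens[1:]) if cur == word})

print(continuation_count("and"), continuation_count("in"))   # 4 4 -> equal, option (c)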
Question 2: Suppose you are reading an article on Sentiment Analysis. Till now, you have
read the words “sentiment”, “aspect”, and “opinion” - 5 times each, “triplet”, and
“extraction” - thrice each, “pointers”, “encoder”, “decoder”, and “network” - once each.
What are the Maximum Likelihood Estimate (MLE) and Good Turing Estimate
probabilities of reading “architecture” as the next word?
a. 4/25, 4/25
b. 0/25, 4/25
c. 4/25, 4/26
d. 0/25, 4/26
Answer: b
Solution: P*GT(architecture) = N1 / N, where N1 = 4 is the number of word types read exactly once ({pointers, encoder, decoder, network}) and N = 25 is the total number of word tokens read, giving 4/25. The MLE probability of the unseen word “architecture” is 0/25.
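A minimal sketch of the two estimates, with the word counts copied from the question:

from fractions import Fraction

counts = {"sentiment": 5, "aspect": 5, "opinion": 5,
          "triplet": 3, "extraction": 3,
          "pointers": 1, "encoder": 1, "decoder": 1, "network": 1}

N = sum(counts.values())                        # total tokens read = 25
N1 = sum(1 for c in counts.values() if c == 1)  # word types seen exactly once = 4

p_mle = Fraction(0, N)       # "architecture" has never been seen
p_gt = Fraction(N1, N)       # Good-Turing mass reserved for unseen events
print(p_mle, p_gt)           # prints: 0 4/25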
Question 3. In a corpus, suppose there are 5 words, a, b, c, d and e. You are provided
with the following counts:
trigram  count    bigram  count    unigram  count
caa      0        aa      0        a        10
cab      3        ab      4        b        12
cac      2        ac      3        c        8
cad      0        ad      0        d        4
cae      3        ae      3        e        16
Use a Linear Interpolation based language model to estimate the trigram probabilities
Pinterpolation(wn|wn-1wn-2), where wn = {a,b,c,d,e}, wn-1 = a, and wn-2 = c. Consider the weights
corresponding to the bi-gram and unigram language models as 0.2 and 0.1 respectively.
a. 0.002, 0.3665, 0.251, 0.008, 0.3545
b. 0.02, 0.3665, 0.251, 0.008, 0.3545
c. 0.02, 0.3665, 0.251, 0.08, 0.3545
d. 0.02, 0.3545, 0.251, 0.008, 0.3665
Answer: b
Solution: Use the Linear Interpolation formula as explained in Lecture 12. The weights for the tri-gram, bi-gram and unigram language models are 0.7, 0.2 and 0.1 respectively. For example, Pinterpolation(b|ca) = 0.7 * (3/8) + 0.2 * (4/10) + 0.1 * (12/50) = 0.3665.
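The option values can be reproduced with a short script (counts from the table above; weights 0.7/0.2/0.1 as in the solution):

tri = {"a": 0, "b": 3, "c": 2, "d": 0, "e": 3}   # counts of "ca" + w
bi  = {"a": 0, "b": 4, "c": 3, "d": 0, "e": 3}   # counts of "a" + w
uni = {"a": 10, "b": 12, "c": 8, "d": 4, "e": 16}

tri_total = sum(tri.values())   # 8
bi_total  = sum(bi.values())    # 10
uni_total = sum(uni.values())   # 50

l3, l2, l1 = 0.7, 0.2, 0.1      # trigram, bigram, unigram weights
for w in "abcde":
    p = l3 * tri[w] / tri_total + l2 * bi[w] / bi_total + l1 * uni[w] / uni_total
    print(w, round(p, 4))
# 0.02, 0.3665, 0.251, 0.008, 0.3545 -> option (b)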
Question 4. Consider the same corpus as given in Question 3. Use the recursive
definition of backoff smoothing to obtain the probability distribution Pbackoff(wn|wn-1wn-2),
where wn = {a,b,c,d,e}, wn-1 = a, and wn-2 = c. Assume that .
a. 0.1663, 0.3125, 0.1875, 0.0212, 0.3125
b. 0.1663, 0.3125, 0.3125, 0.0212, 0.1875
c. 0.1875, 0.3125, 0.1663, 0.0212, 0.3125
d. 0.0212, 0.3125, 0.1875, 0.1663, 0.3125
Answer: a
Solution: Follow the explanation given in Lecture 12. λa = 75/62.
Question 5. Consider the same corpus as given in Question 3. Calculate the Kneser-Ney
smoothed probability PKN(b|a), correct to two decimal places. Consider d = 0.75
a. 0.55
b. 0.37
c. 0.36
d. 0.38
Answer: d
Solution: Follow the explanation given in Lecture 12. Pcontinuation(b) = 1/4, since exactly one of the four bigram types with count > 0 ({ca, ab, ac, ae}) ends in b. For calculating λa, |{w : c(a,w) > 0}| = 3 ({ab, ac, ae}), so λa = (0.75/10) * 3 = 0.225. Hence PKN(b|a) = (4 - 0.75)/10 + 0.225 * (1/4) = 0.325 + 0.05625 = 0.38125 ≈ 0.38.
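A small sketch of the Kneser-Ney computation, with the counts from Question 3 and d = 0.75:

d = 0.75
c_ab = 4                 # count of the bigram "ab"
c_a  = 10                # total count of bigrams starting with "a" (= unigram count of "a")
seen_after_a = 3         # |{w : c(a, w) > 0}| = {ab, ac, ae}
p_cont_b = 1 / 4         # bigram types ending in "b" / bigram types with count > 0 ({ca, ab, ac, ae})

lam_a = (d / c_a) * seen_after_a                  # 0.225
p_kn = max(c_ab - d, 0) / c_a + lam_a * p_cont_b  # 0.325 + 0.05625
print(round(p_kn, 2))                             # 0.38 -> option (d)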
Question 6: Match the following words with the type of morphemes they contain:
● w1: upbringing, w2: regularity, w3: readers, w4: walked
● (i) Only derivational, (ii) Only inflectional, (iii) Both derivational and inflectional
a. w1-(ii), w2-(i), w3-(iii), w4-(ii)
b. w1-(ii), w2-(i), w3-(ii), w4-(ii)
c. w1-(iii), w2-(i), w3-(iii), w4-(ii)
d. w1-(iii), w2-(i), w3-(ii), w4-(ii)
Answer: c
Solution: upbringing = up (derivational prefix) + bring (Root word) + ing (inflectional suffix).
regularity = regular (Root word) + ity (derivational suffix)
readers = read (Root word) + er (derivational suffix) + s (inflectional suffix).
walked = walk (Root word) + ed (inflectional suffix).
Question 7: Identify the UPenn Treebank part-of-speech tag for the word “fast” in each of
the following sentences:
● Muslims fast during Ramadan.
● She spoke so fast that I could not follow her.
● There are several fast food centres around the city.
● He was fast asleep when I reached home.
a. VERB-VBP, ADJ-JJ, NOUN-NN, ADJ-JJ
b. VERB-VBP, ADJ-JJ, ADJ-JJ, ADV-RB
c. VERB-VBP, ADV-RB, ADJ-JJ, ADV-RB
d. VERB-VBP, ADJ-JJ, NOUN-NN, ADV-RB
Answer: c
Solution: See the Penn Treebank tag descriptions at https://spacy.io/usage/linguistic-features
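For reference, the tags can be inspected with spaCy; the en_core_web_sm model used below is an assumption, and tagger output can vary slightly across model versions:

import spacy

nlp = spacy.load("en_core_web_sm")  # assumes the small English model is installed
sentences = [
    "Muslims fast during Ramadan.",
    "She spoke so fast that I could not follow her.",
    "There are several fast food centres around the city.",
    "He was fast asleep when I reached home.",
]
for sent in sentences:
    tok = [t for t in nlp(sent) if t.text.lower() == "fast"][0]
    print(sent, "->", tok.pos_, tok.tag_)   # expected for "fast": VBP, RB, JJ, RB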
Question 8: Consider the following State Transition diagram with three states: Sunny,
Foggy and Rainy. The state-transition probabilities are mentioned.
Assume that the weather yesterday was “Foggy”, and today it is “Foggy” again. What is
the probability that it will be “Sunny” the day after tomorrow?
NOTE: Consider a First-Order Markov model.
a. 0.08
b. 0.16
c. 0.32
d. 0.64
Answer: c
Solution: According to the Markov chain assumption, tomorrow’s weather depends only on today’s weather; yesterday’s weather has no relevance.
P(d3=”Sunny”) = P(d2=”Foggy”|d1=”Foggy”) * P(d3=”Sunny”|d2=”Foggy”) +
P(d2=”Sunny”|d1=”Foggy”) * P(d3=”Sunny”|d2=”Sunny”) +
P(d2=”Rainy”|d1=”Foggy”) * P(d3=”Sunny”|d2=”Rainy”)
= 0.5 * 0.2 + 0.2 * 0.8 + 0.3 * 0.2 = 0.32, using the transition probabilities from the diagram (the same values appear in the Question 9 solution).
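A quick numerical check (the transition probabilities below are the ones used in the Question 9 solution and are assumed to match the diagram):

# Transition probabilities P(next | current); outer key is the current state.
trans = {
    "S": {"S": 0.80, "F": 0.15, "R": 0.05},
    "F": {"S": 0.20, "F": 0.50, "R": 0.30},
    "R": {"S": 0.20, "F": 0.20, "R": 0.60},
}

# Today is Foggy; sum over tomorrow's weather, then require Sunny the day after.
p = sum(trans["F"][mid] * trans[mid]["S"] for mid in "SFR")
print(p)   # 0.32 -> option (c)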
Question 9: Suppose you are locked in a room and you want to find out how the weather
is outside, ONLY based on how you feel. Assume a First-Order Hidden Markov Model
with three states {Sunny, Foggy, Rainy} with their transition probabilities as shown in
Question 8. You either feel Happy or Grumpy, thereby defining your set of observables.
The (emission) probabilities of how you feel given the weather outside are shown below.
Happy (H) Grumpy (G)
Sunny (S) 0.7 0.3
Foggy (F) 0.5 0.5
Rainy (R) 0.4 0.6
Suppose you feel Happy on Day 1, and Grumpy on Day 2. Let the initial probabilities (or
likelihood) of the weather being Sunny, Foggy, and Rainy on Day 1 be 0.5, 0.3, and 0.2
respectively. What is the probability of observing the sequence of feelings, i.e.
{Happy, Grumpy}, given the model?
a. 0.23565
b. 0.23585
c. 0.33565
d. 0.33585
Answer: b
Solution:
Day 1 (d1): compute the joint (forward) probability of the observation so far and the day’s weather.
P(e1 = H, d1 = S) = P(d1 = S) * P(H|S) = 0.5 * 0.7 = 0.35
P(e1 = H, d1 = F) = P(d1 = F) * P(H|F) = 0.3 * 0.5 = 0.15
P(e1 = H, d1 = R) = P(d1 = R) * P(H|R) = 0.2 * 0.4 = 0.08
Day 2 (d2):
P(e1 = H, e2 = G, d2 = S) =
P(e1 = H, d1 = S) * P(d2 = S | d1 = S) * P(G|S) +
P(e1 = H, d1 = F) * P(d2 = S | d1 = F) * P(G|S) +
P(e1 = H, d1 = R) * P(d2 = S | d1 = R) * P(G|S)
= 0.35 * 0.8 * 0.3 + 0.15 * 0.2 * 0.3 + 0.08 * 0.2 * 0.3 = 0.0978
P(e1 = H, e2 = G, d2 = F) =
P(e1 = H, d1 = S) * P(d2 = F | d1 = S) * P(G|F) +
P(e1 = H, d1 = F) * P(d2 = F | d1 = F) * P(G|F) +
P(e1 = H, d1 = R) * P(d2 = F | d1 = R) * P(G|F)
= 0.35 * 0.15 * 0.5 + 0.15 * 0.5 * 0.5 + 0.08 * 0.2 * 0.5 = 0.07175
P(e1 = H, e2 = G, d2 = R) =
P(e1 = H, d1 = S) * P(d2 = R | d1 = S) * P(G|R) +
P(e1 = H, d1 = F) * P(d2 = R | d1 = F) * P(G|R) +
P(e1 = H, d1 = R) * P(d2 = R | d1 = R) * P(G|R)
= 0.35 * 0.05 * 0.6 + 0.15 * 0.3 * 0.6 + 0.08 * 0.6 * 0.6 = 0.0663
Therefore, P(Happy, Grumpy) = 0.0978 + 0.07175 + 0.0663 = 0.23585
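The same total falls out of a small forward-algorithm sketch, reusing the transition probabilities above and the emission and initial probabilities from the question:

states = ["S", "F", "R"]
init = {"S": 0.5, "F": 0.3, "R": 0.2}
trans = {"S": {"S": 0.80, "F": 0.15, "R": 0.05},
         "F": {"S": 0.20, "F": 0.50, "R": 0.30},
         "R": {"S": 0.20, "F": 0.20, "R": 0.60}}
emit = {"S": {"H": 0.7, "G": 0.3},
        "F": {"H": 0.5, "G": 0.5},
        "R": {"H": 0.4, "G": 0.6}}

obs = ["H", "G"]
# Forward pass: alpha[s] = P(observations so far, current day's weather = s)
alpha = {s: init[s] * emit[s][obs[0]] for s in states}
for o in obs[1:]:
    alpha = {s: sum(alpha[prev] * trans[prev][s] for prev in states) * emit[s][o]
             for s in states}
print(sum(alpha.values()))   # 0.23585 -> option (b)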
Question 10. Suppose you feel Grumpy on Day 1, Happy on Day 2, and Happy again on
Day 3. Considering the same model as defined in Question 9, what is the probability of all
three days being Sunny?
a. 0.04704
b. 0.10976
c. 0.02016
d. 0.07526
Answer: a
Solution:
P(d1 = S, d2 = S, d3 = S, e1 = G, e2 = H, e3 = H) =
P(d1 = S) * P(G | S) * P(d2 = S | d1 = S) * P(H | S) * P(d3 = S | d2 = S) * P(H | S)
= 0.5 * 0.3 * 0.8 * 0.7 * 0.8 * 0.7 = 0.04704
This is the joint probability of the all-Sunny state sequence together with the observed sequence {Grumpy, Happy, Happy}.
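A one-path check of the arithmetic, reusing the Question 9 model (Sunny-to-Sunny transition 0.8, Sunny emissions 0.7/0.3, initial Sunny probability 0.5):

init_sunny = 0.5
p_stay_sunny = 0.8                    # P(Sunny | Sunny)
emit_sunny = {"H": 0.7, "G": 0.3}     # emission probabilities for the Sunny state

obs = ["G", "H", "H"]
p = init_sunny * emit_sunny[obs[0]]
for o in obs[1:]:
    p *= p_stay_sunny * emit_sunny[o]   # stay Sunny, then emit the observation
print(p)                                # 0.04704 -> option (a)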
************END*******