0% found this document useful (0 votes)

1K views4 pages

NLP Assignment-1 Solution

This document contains a 10 question multiple choice quiz on natural language processing topics. The questions cover concepts like Zipf's law, type-token ratio, lemmatization vs stemming, and Heap's law. Example solutions and explanations are provided for each question.

Uploaded by

geetha megharaj

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

1K views4 pages

NLP Assignment-1 Solution

Uploaded by

geetha megharaj

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

Natural Language Processing

Assignment- 1
TYPE OF QUESTION: MCQ
Number of questions: 10 Total mark: 10 X 1 = 10

____________________________________________________________________________

Question 1: In a corpus, you found that the word with rank 4th has a frequency of 500.
What can be the best guess for the rank of a word with frequency 250?

1. 2
2. 4
3. 8
4. 6

Answer: 3

Solution:
frequency * rank =k [by Zipfs law]
500*4 = 250*r
r=8

____________________________________________________________________________

Question 2: In the sentence, “In Mumbai I took my hat off. But I can’t put it back on.”, total
number of word tokens and word types are:
1. 14, 13
2. 13, 14
3. 15, 14
4. 14, 15
Answer: a) 14, 13.
Solution: Here, the word “I” is repeated two times so type count is
one less than token count.

____________________________________________________________________________

Question 3: Let the rank of two words, w1 and w2, in a corpus be 400 and 100,
respectively. Let m1 and m2 represent the number of meanings of w1 and w2
respectively. The ratio m1 : m2 would tentatively be
1. 1:4
2. 4:1
3. 1:2
4. 2:1
Answer: 3

Solution:
m1/m2 = sqrt(rank2)/sqrt(rank1) = sqrt(100)/sqrt(400) = 1:2

____________________________________________________________________________

Question 4: What is the valid range of type-token ratio of any text corpus?

1. TTR∈ (0,1] (excluding zero)

2. TTR∈ [0,1]
3. TTR∈ [−1,1]
4. TTR∈ [0,+∞] (any non-negative number)

Answer: 1.
Solution: Number of unique words or type ≤ Total number of tokens in text, and both are greater
than 1

____________________________________________________________________________

Question 5: If first corpus has 𝑇𝑇𝑅1 = 0.075 and second corpus has 𝑇𝑇𝑅2 = 0.15, where
𝑇𝑇𝑅1 and 𝑇𝑇𝑅2 represents type/token ratio in first and second corpus respectively, then

1. First corpus has more tendency to use different words.

2. Second corpus has more tendency to use different words.
3. Both a and b
4. None of these

Answer: b
Solution: Second corpus has more tendency to use different words. If TTR scores are higher
then there is more tendency to use different words.

____________________________________________________________________________

Question 6: Which of the following is/are true for the English Language?
1. Lemmatization works only on inflectional morphemes and Stemming works only on
derivational morphemes.
2. The outputs of lemmatization and stemming for the same word might differ.
3. Output of lemmatization are always real words
4. Output of stemming are always real words

Answer: 2, 3
Solution: Stemming usually refers to a crude heuristic process that chops off the ends of words
in the hope of achieving this goal correctly most of the time, and often includes the removal of
derivational affixes. Lemmatization usually refers to doing things properly with the use of a
vocabulary and morphological analysis of words, normally aiming to remove inflectional endings
only and to return the base or dictionary form of a word, which is known as the lemma .

____________________________________________________________________________

Question 7: An advantage of Porter stemmer over a full morphological parser?

1. The stemmer is better justified from a theoretical point of view
2. The output of a stemmer is always a valid word
3. The stemmer does not require a detailed lexicon to implement
4. None of the above

Answer: 3
Solution: The Porter stemming algorithm is a process for removing suffixes from words in
English. The Porter stemming algorithm was made on the assumption that we don’t have a stem
dictionary (lexicon) and that the purpose of the task is to improve Information Retrieval
performance. Stemming algorithms are typically rule-based. You can view them as a heuristic
process that sort-of lops off the ends of words.

____________________________________________________________________________

Question 8: Which of the following are instances of stemming? (as per Porter Stemmer)

1. are -> be
2. plays -> play
3. saw -> s
4. university -> univers
Answer: 2,4
Solution: Stemming cannot convert are->be as it can only convert or chop off word suffixes.
Also Porter Stemmer wouldn’t chop off if the final outcome is of length 1 as in saw -> s.
____________________________________________________________________________

Question 9: What is natural language processing good for?

1. Summarize blocks of text
2. Automatically generate keywords
3. Identifying the type of entity extracted
4. All of the above

Answer: 4

Solution:
For all the above-mentioned task, NLP can be used
____________________________________________________________________________

Question 10: What is the size of unique words in a document where total number of
words = 12000. K = 3.71 Beta = 0.69?

1. 2421
2. 3367
3. 5123
4. 1529

Answer: 1

Solution: 3.71 x 12000^0.69 = 2421 unique words. Heap’s Law

____________________________________________________________________________
************END*******

NLP Sem Questions and Answers
No ratings yet
NLP Sem Questions and Answers
72 pages
Deep Learning - Question Papers
50% (2)
Deep Learning - Question Papers
7 pages
Natural Language Processing Important Questions Answers
100% (1)
Natural Language Processing Important Questions Answers
31 pages
NLP Unit 1 Notes
100% (1)
NLP Unit 1 Notes
19 pages
IML-IITKGP - Assignment 1 Solution
No ratings yet
IML-IITKGP - Assignment 1 Solution
7 pages
NLP Assignment-2 Solution
100% (3)
NLP Assignment-2 Solution
5 pages
NLP Assignment-10 Solution
0% (1)
NLP Assignment-10 Solution
4 pages
ML MCQs
55% (11)
ML MCQs
17 pages
NLP Assignment-4 Solution
100% (1)
NLP Assignment-4 Solution
5 pages
NLP Question Paper Solution
No ratings yet
NLP Question Paper Solution
27 pages
NLP Assignment-7 Solution
No ratings yet
NLP Assignment-7 Solution
5 pages
NLP Assignment-3 Solution
100% (1)
NLP Assignment-3 Solution
6 pages
1.deep Learning Assignment1 Solutions 1
100% (3)
1.deep Learning Assignment1 Solutions 1
12 pages
NLP Assignment-9 Solution
100% (1)
NLP Assignment-9 Solution
4 pages
Module - 2 Notes-BCS303
100% (1)
Module - 2 Notes-BCS303
38 pages
RL Unit 1
100% (1)
RL Unit 1
26 pages
Vtu NLP Questions
100% (1)
Vtu NLP Questions
5 pages
NLP - (Natural Language Processing Lab Manual)
No ratings yet
NLP - (Natural Language Processing Lab Manual)
12 pages
Machine Learning Question Paper Solved ML
No ratings yet
Machine Learning Question Paper Solved ML
55 pages
Unit 1 Introduction of Machine Learning Notes
No ratings yet
Unit 1 Introduction of Machine Learning Notes
57 pages
DEEP LEARNING IIT Kharagpur Assignment - 4 - 2024
100% (2)
DEEP LEARNING IIT Kharagpur Assignment - 4 - 2024
7 pages
Heuristic Search: Dr.M. Nagaratna Professor, Dept - of CSE Jntuceh
No ratings yet
Heuristic Search: Dr.M. Nagaratna Professor, Dept - of CSE Jntuceh
54 pages
DEEP LEARNING (Previous Question Papers)
No ratings yet
DEEP LEARNING (Previous Question Papers)
3 pages
Unit I Notes Machine Learning Techniques 1
No ratings yet
Unit I Notes Machine Learning Techniques 1
21 pages
MCQ
100% (1)
MCQ
9 pages
Deep Learning R18 Jntuh Lab Manual
0% (1)
Deep Learning R18 Jntuh Lab Manual
21 pages
Deep Learning MCQ Previous Year MCQ
100% (1)
Deep Learning MCQ Previous Year MCQ
11 pages
Deep Learning-Question Bank-Module-Wise
67% (3)
Deep Learning-Question Bank-Module-Wise
5 pages
Deep Learning Question Paper
100% (1)
Deep Learning Question Paper
3 pages
Artificial Intelligence - Knowledge Representation and Reasoning - Unit 8 - Week 5
100% (1)
Artificial Intelligence - Knowledge Representation and Reasoning - Unit 8 - Week 5
5 pages
Deep Learning KCS078
0% (1)
Deep Learning KCS078
2 pages
101905CS502H - Neural Networks and Deep Learning - Model Question Paper
100% (1)
101905CS502H - Neural Networks and Deep Learning - Model Question Paper
4 pages
Question Bank Ann
50% (2)
Question Bank Ann
2 pages
ML Question Papers
100% (1)
ML Question Papers
7 pages
MCQ Unit Wise ML (ROE083) Que Bank With Ans.
100% (4)
MCQ Unit Wise ML (ROE083) Que Bank With Ans.
22 pages
NLP Assignment-1 Solution
No ratings yet
NLP Assignment-1 Solution
4 pages
AI MCQ QUESTION 100 MCQ
No ratings yet
AI MCQ QUESTION 100 MCQ
13 pages
Question Bank
No ratings yet
Question Bank
14 pages
Assignment 11: Introduction To Machine Learning Prof. B. Ravindran
100% (2)
Assignment 11: Introduction To Machine Learning Prof. B. Ravindran
3 pages
CS2351 - Artificial Intelligence-2 Marks
100% (1)
CS2351 - Artificial Intelligence-2 Marks
16 pages
Question Bank Beel801 PDF
100% (1)
Question Bank Beel801 PDF
10 pages
Introduction To Machine Learning - Unit 3 - Week 1
No ratings yet
Introduction To Machine Learning - Unit 3 - Week 1
3 pages
NLP Assignment-8 Solution
No ratings yet
NLP Assignment-8 Solution
5 pages
Assignment Week 4-Deep-Learning PDF
100% (1)
Assignment Week 4-Deep-Learning PDF
7 pages
MCQ
No ratings yet
MCQ
4 pages
ML Set 1 QB Question Paper
No ratings yet
ML Set 1 QB Question Paper
4 pages
Machine Learning, ML Ass 7
No ratings yet
Machine Learning, ML Ass 7
7 pages
MCQ Question
No ratings yet
MCQ Question
5 pages
Artificial Intelligence - Knowledge Representation and Reasoning - Unit 6 - Week 3
No ratings yet
Artificial Intelligence - Knowledge Representation and Reasoning - Unit 6 - Week 3
5 pages
Assignment 6 (COPY)
No ratings yet
Assignment 6 (COPY)
6 pages
Natural Language Processing - Unit 10 - Week 8
No ratings yet
Natural Language Processing - Unit 10 - Week 8
6 pages
NLP Assignment-11 Solution
No ratings yet
NLP Assignment-11 Solution
5 pages
Ilovepdf Merged
No ratings yet
Ilovepdf Merged
63 pages
NPTEL NLP 2025 Assignment 1
No ratings yet
NPTEL NLP 2025 Assignment 1
5 pages
IT8601-Computational Intelligence PDF
No ratings yet
IT8601-Computational Intelligence PDF
12 pages
1-NLP - Lab Manual
No ratings yet
1-NLP - Lab Manual
15 pages
NLP Assignment-6 Solution
No ratings yet
NLP Assignment-6 Solution
5 pages
Assignment 8
No ratings yet
Assignment 8
5 pages
Week3 Assignment
No ratings yet
Week3 Assignment
6 pages
Assignment 10: Introduction To Machine Learning Prof. B. Ravindran
100% (1)
Assignment 10: Introduction To Machine Learning Prof. B. Ravindran
4 pages
Artificial Intelligence - Knowledge Representation and Reasoning - Unit 4 - Week 1
No ratings yet
Artificial Intelligence - Knowledge Representation and Reasoning - Unit 4 - Week 1
4 pages
Question Bank Module-1 Questions. Introduction and Concept Learning
No ratings yet
Question Bank Module-1 Questions. Introduction and Concept Learning
6 pages
NLP Qa
No ratings yet
NLP Qa
10 pages
AI-important Questions
No ratings yet
AI-important Questions
2 pages
Data Science and Emerging Technologies - Yap Bee Wah, Dhiya Al-Jumeily Obe, Michael W - Berry - 2024 - Springer - 9789819702923 - Anna's Archive
No ratings yet
Data Science and Emerging Technologies - Yap Bee Wah, Dhiya Al-Jumeily Obe, Michael W - Berry - 2024 - Springer - 9789819702923 - Anna's Archive
574 pages
IR Module
No ratings yet
IR Module
80 pages
Assignment 5 (COPY)
No ratings yet
Assignment 5 (COPY)
5 pages
Module - 4 - BCS303-OS
No ratings yet
Module - 4 - BCS303-OS
39 pages
Module - 4 - BCS303-OS
No ratings yet
Module - 4 - BCS303-OS
39 pages
NLP-Lab Manual - Ashwini - Kachare
No ratings yet
NLP-Lab Manual - Ashwini - Kachare
41 pages
Unit2 A
No ratings yet
Unit2 A
22 pages
NLP Exp 3
No ratings yet
NLP Exp 3
24 pages
NLP Practicals All
No ratings yet
NLP Practicals All
57 pages
Lec 5
No ratings yet
Lec 5
17 pages
Dms Mod3
No ratings yet
Dms Mod3
5 pages
Assignment 1
No ratings yet
Assignment 1
4 pages
Module 5-Geetha Megharaj
No ratings yet
Module 5-Geetha Megharaj
70 pages
Answers 111111111111111111111111111
No ratings yet
Answers 111111111111111111111111111
21 pages
2 NLP Pipeline
No ratings yet
2 NLP Pipeline
57 pages
Ai&Ml Bai601 NLP Lab Manual
No ratings yet
Ai&Ml Bai601 NLP Lab Manual
48 pages
Lec 4
No ratings yet
Lec 4
22 pages
NLP m1
No ratings yet
NLP m1
148 pages
Turing Machine 1
No ratings yet
Turing Machine 1
18 pages
Lec 2
No ratings yet
Lec 2
13 pages
GM-3 2BCS303
No ratings yet
GM-3 2BCS303
48 pages
Text Preprocessing
No ratings yet
Text Preprocessing
59 pages
Module 3-1-GM
No ratings yet
Module 3-1-GM
46 pages
Term Vocabulary and Postings List
No ratings yet
Term Vocabulary and Postings List
64 pages
Lec 3
No ratings yet
Lec 3
19 pages
NLP 3-6
No ratings yet
NLP 3-6
20 pages
Module 1-2
No ratings yet
Module 1-2
19 pages
NLP Notes
No ratings yet
NLP Notes
16 pages
Lec 1
No ratings yet
Lec 1
14 pages
NLTK Cheatsheet
No ratings yet
NLTK Cheatsheet
27 pages
GM-2.3 Module3
No ratings yet
GM-2.3 Module3
17 pages
Beginners Practical Guide To NLP
No ratings yet
Beginners Practical Guide To NLP
18 pages
6 The Term Vocabulary & Posting List
No ratings yet
6 The Term Vocabulary & Posting List
19 pages
Sentiment Prediction in Hindi and English Language
No ratings yet
Sentiment Prediction in Hindi and English Language
25 pages
Viva Questions
No ratings yet
Viva Questions
6 pages
Screenshot 2024-11-29 at 8.35.21 AM
No ratings yet
Screenshot 2024-11-29 at 8.35.21 AM
40 pages
NLP Class10 PDF
No ratings yet
NLP Class10 PDF
9 pages
3.word Level Analysis-Tokenization Stemming
No ratings yet
3.word Level Analysis-Tokenization Stemming
8 pages
Lemmas and Lemmatization
No ratings yet
Lemmas and Lemmatization
5 pages
Automata Theory and Computability (18CS54) : 5 Semester
No ratings yet
Automata Theory and Computability (18CS54) : 5 Semester
38 pages
Automata Theory and Computability (17CS54) : 5 Semester
No ratings yet
Automata Theory and Computability (17CS54) : 5 Semester
34 pages
Automata Theory and Computability (17CS54) : 5 Semester
No ratings yet
Automata Theory and Computability (17CS54) : 5 Semester
27 pages
NLP Pre-Processing
No ratings yet
NLP Pre-Processing
6 pages
Natural Language Processing
No ratings yet
Natural Language Processing
14 pages
El Kah-Anoual-Publications-17-08-2022-11-08-19-34
No ratings yet
El Kah-Anoual-Publications-17-08-2022-11-08-19-34
10 pages

NLP Assignment-1 Solution

Uploaded by

NLP Assignment-1 Solution

Uploaded by

Natural Language Processing

1. TTR∈ (0,1] (excluding zero)

1. First corpus has more tendency to use different words.

Question 7: An advantage of Porter stemmer over a full morphological parser?

Question 9: What is natural language processing good for?

Solution: 3.71 x 12000^0.69 = 2421 unique words. Heap’s Law

You might also like