NLP QB
2 What is the minimum edit distance between two words? Calculate the minimum edit distance between the words “small” and “smell” using the dynamic programming algorithm. Assume the costs for insertion, deletion, and substitution are 1, 1, and 2, respectively. (12 marks, Section-I)
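A minimal dynamic-programming sketch of minimum edit distance using the question's costs (insertion 1, deletion 1, substitution 2); the function name and structure are illustrative, not prescribed:

    def min_edit_distance(src, tgt, ins=1, dele=1, sub=2):
        # dp[i][j] = cheapest way to turn src[:i] into tgt[:j]
        m, n = len(src), len(tgt)
        dp = [[0] * (n + 1) for _ in range(m + 1)]
        for i in range(1, m + 1):
            dp[i][0] = i * dele          # delete all of src[:i]
        for j in range(1, n + 1):
            dp[0][j] = j * ins           # insert all of tgt[:j]
        for i in range(1, m + 1):
            for j in range(1, n + 1):
                cost = 0 if src[i - 1] == tgt[j - 1] else sub
                dp[i][j] = min(dp[i - 1][j] + dele,       # deletion
                               dp[i][j - 1] + ins,        # insertion
                               dp[i - 1][j - 1] + cost)   # substitution or match
        return dp[m][n]

    print(min_edit_distance("small", "smell"))  # 2: one substitution (a -> e) at cost 2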
3 What is the difference between non-word and real-word spelling correction? What is perplexity? Estimate the perplexity of the following corpus under a unigram language model: “the man is a thief but the man is a good man” (12 marks, Section-I)
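A small sketch of unigram perplexity on the corpus from the question; it estimates the unigram probabilities by maximum likelihood from the same text, which is one common classroom convention:

    import math
    from collections import Counter

    tokens = "the man is a thief but the man is a good man".split()
    counts = Counter(tokens)
    N = len(tokens)

    # Unigram MLE: P(w) = count(w) / N; perplexity = exp(-1/N * sum_i log P(w_i))
    log_prob = sum(math.log(counts[w] / N) for w in tokens)
    print(round(math.exp(-log_prob / N), 3))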
5 What is morphology in NLP? What are morphemes? What are bound and free morphemes? Explain with examples. What is stemming, and how is it different from lemmatization? (12 marks, Section-I)
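A quick contrast of stemming and lemmatization using NLTK (assuming the WordNet data has been downloaded); the word list is just an illustration:

    from nltk.stem import PorterStemmer, WordNetLemmatizer
    # Requires the WordNet data: nltk.download('wordnet') on first use.

    stemmer = PorterStemmer()
    lemmatizer = WordNetLemmatizer()

    for word in ["studies", "running", "flies"]:
        # Stemming chops suffixes by rule; lemmatization maps to a dictionary form.
        print(word, stemmer.stem(word), lemmatizer.lemmatize(word, pos="v"))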
7 Explain the Noisy Channel Model for spelling correction and N-gram language models. (12 marks, Section-I)
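A toy sketch of the noisy channel intuition for spelling correction, choosing the candidate w that maximizes P(x|w)·P(w); the typo, candidate set, and probabilities below are all made-up placeholders:

    # Noisy channel: best correction w* = argmax_w P(x | w) * P(w).
    observed = "acress"
    candidates = {
        # word: (channel model P(observed | word), language model P(word))
        "actress": (1e-4, 3e-5),
        "across":  (1e-5, 2e-4),
        "acres":   (2e-5, 1e-5),
    }
    best = max(candidates, key=lambda w: candidates[w][0] * candidates[w][1])
    print(best)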
9 Apply different smoothing techniques to a language model and analyze their impact on performance. (12 marks, Section-I)
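A minimal sketch of add-one (Laplace) smoothing for bigram probabilities, one of the techniques the question likely covers; the corpus and variable names are illustrative:

    from collections import Counter

    tokens = "the man is a thief but the man is a good man".split()
    V = len(set(tokens))                 # vocabulary size
    unigrams = Counter(tokens)
    bigrams = Counter(zip(tokens, tokens[1:]))

    def p_laplace(w1, w2):
        # Add-one smoothing: (c(w1, w2) + 1) / (c(w1) + V)
        return (bigrams[(w1, w2)] + 1) / (unigrams[w1] + V)

    print(p_laplace("the", "man"))    # seen bigram
    print(p_laplace("man", "thief"))  # unseen bigram still gets nonzero mass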
13 What is POS tagging? Find the POS tags for the phrase “the light book” using the Viterbi algorithm in a Hidden Markov tagging model with the following information. (12 marks, Section-II)
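The question's transition and emission tables are not reproduced here, so the sketch below runs Viterbi on made-up probabilities for “the light book” with tags D/A/N; every number is a hypothetical placeholder:

    # Viterbi decoding for a tiny HMM; all probabilities below are made up.
    words = ["the", "light", "book"]
    tags = ["D", "A", "N"]
    start = {"D": 0.8, "A": 0.1, "N": 0.1}
    trans = {"D": {"D": 0.05, "A": 0.45, "N": 0.5},
             "A": {"D": 0.05, "A": 0.15, "N": 0.8},
             "N": {"D": 0.3, "A": 0.2, "N": 0.5}}
    emit = {"D": {"the": 0.9, "light": 0.0, "book": 0.0},
            "A": {"the": 0.0, "light": 0.6, "book": 0.1},
            "N": {"the": 0.0, "light": 0.3, "book": 0.7}}

    # v[i][t] = (probability of the best path ending in tag t at word i, backpointer)
    v = [{t: (start[t] * emit[t][words[0]], None) for t in tags}]
    for i in range(1, len(words)):
        v.append({})
        for t in tags:
            prev = max(tags, key=lambda p: v[i - 1][p][0] * trans[p][t])
            v[i][t] = (v[i - 1][prev][0] * trans[prev][t] * emit[t][words[i]], prev)

    # Backtrace from the most probable final tag.
    best = max(tags, key=lambda t: v[-1][t][0])
    path = [best]
    for i in range(len(words) - 1, 0, -1):
        best = v[i][best][1]
        path.append(best)
    print(list(zip(words, reversed(path))))  # D, A, N under these toy numbers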
14 What is the difference between real words and non-words? What is an FSA, and how can inflections in words be represented using an FSA? Explain with an example. (12 marks, Section-II)
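A tiny FSA sketch that accepts a base verb plus the inflections -s and -ed (walk, walks, walked); the states and alphabet are a simplified illustration, since real morphological FSAs factor the stem lexicon and affixes into separate sub-automata:

    transitions = {
        ("q0", "w"): "q1", ("q1", "a"): "q2", ("q2", "l"): "q3", ("q3", "k"): "q4",
        ("q4", "s"): "q5",                     # walk + -s
        ("q4", "e"): "q6", ("q6", "d"): "q5",  # walk + -ed
    }
    accepting = {"q4", "q5"}  # bare stem or inflected form

    def accepts(word):
        state = "q0"
        for ch in word:
            state = transitions.get((state, ch))
            if state is None:
                return False
        return state in accepting

    for w in ["walk", "walks", "walked", "walkd"]:
        print(w, accepts(w))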
15 What are the problems of the Hidden Markov Model in predicting POS tags for a given sentence or phrase? Explain how the Baum-Welch algorithm learns the parameters: the transition matrix, the observation matrix, and the initial state distribution. (12 marks, Section-II)
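A compact sketch of one Baum-Welch (forward-backward) iteration for a two-state HMM; the observation sequence and initial parameters are arbitrary placeholders:

    import numpy as np

    obs = [0, 1, 0, 0, 1]            # observation indices (toy data)
    pi = np.array([0.6, 0.4])        # initial state distribution
    A = np.array([[0.7, 0.3],        # transition matrix A[i][j] = P(j | i)
                  [0.4, 0.6]])
    B = np.array([[0.8, 0.2],        # observation matrix B[i][k] = P(symbol k | state i)
                  [0.3, 0.7]])
    T, N = len(obs), len(pi)

    # E-step: forward (alpha) and backward (beta) probabilities.
    alpha = np.zeros((T, N))
    alpha[0] = pi * B[:, obs[0]]
    for t in range(1, T):
        alpha[t] = (alpha[t - 1] @ A) * B[:, obs[t]]
    beta = np.zeros((T, N))
    beta[-1] = 1.0
    for t in range(T - 2, -1, -1):
        beta[t] = A @ (B[:, obs[t + 1]] * beta[t + 1])

    likelihood = alpha[-1].sum()
    gamma = alpha * beta / likelihood        # P(state at t | observations)
    xi = np.zeros((T - 1, N, N))             # P(state i at t, state j at t+1 | obs)
    for t in range(T - 1):
        xi[t] = alpha[t][:, None] * A * B[:, obs[t + 1]] * beta[t + 1] / likelihood

    # M-step: re-estimate pi, A, B from expected counts.
    pi_new = gamma[0]
    A_new = xi.sum(axis=0) / gamma[:-1].sum(axis=0)[:, None]
    B_new = np.zeros_like(B)
    for k in range(B.shape[1]):
        B_new[:, k] = gamma[np.array(obs) == k].sum(axis=0) / gamma.sum(axis=0)
    print(pi_new, A_new, B_new, sep="\n")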
16 What is smoothing in a language model? What are the advantages of smoothing? Find the Good-Turing smoothed counts for the following sentence:
“he is he is good man” (12 marks, Section-II)
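A worked sketch of simple Good-Turing counts for the question's sentence, where Nc is the number of word types seen c times and c* = (c+1)·N(c+1)/Nc:

    from collections import Counter

    tokens = "he is he is good man".split()
    counts = Counter(tokens)        # he: 2, is: 2, good: 1, man: 1
    Nc = Counter(counts.values())   # N1 = 2, N2 = 2

    def good_turing(c):
        # Adjusted count c* = (c + 1) * N(c+1) / Nc; real implementations
        # smooth the Nc values so that N(c+1) = 0 does not zero out c*.
        return (c + 1) * Nc.get(c + 1, 0) / Nc[c]

    for word, c in sorted(counts.items()):
        print(word, c, "->", good_turing(c))
    # Words seen once get c* = 2 * N2 / N1 = 2 * 2 / 2 = 2.0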
18 Establish why the maximum entropy model is better than the Hidden Markov Model. How is POS tagging achieved in a maximum entropy model? What is beam search? Explain in detail. (12 marks, Section-II)
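A minimal beam search sketch over per-word tag scores, in the style used by MEMM taggers; the log-scores here are fabricated for illustration:

    words = ["the", "light", "book"]
    scores = [{"D": -0.1, "A": -3.0, "N": -3.0},   # hypothetical per-position
              {"D": -4.0, "A": -0.7, "N": -1.2},   # log-scores for each tag
              {"D": -4.0, "A": -2.3, "N": -0.4}]

    beam_width = 2
    beam = [([], 0.0)]  # (partial tag sequence, cumulative log-score)
    for pos in range(len(words)):
        candidates = [(seq + [tag], lp + s)
                      for seq, lp in beam
                      for tag, s in scores[pos].items()]
        # Keep only the top-k hypotheses at each step.
        beam = sorted(candidates, key=lambda x: x[1], reverse=True)[:beam_width]

    print(beam[0])  # best tag sequence kept by the beam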
19 How is uniformity maintained in a maximum entropy model? Write down the maximum entropy model principles. (12 marks, Section-II)
20 Consider the maximum entropy model for POS tagging, where you want to estimate P(tag|word). In a hypothetical setting, assume that tag can take the values D, N, and V (short forms for Determiner, Noun, and Verb). The variable word could be any member of a set V of possible words, where V contains the words a, man, and sleeps, as well as additional words. The distribution should give the following probabilities:
P(D|a) = 0.9
P(N|man) = 0.9
P(V|sleeps) = 0.9
P(D|word) = 0.6 for any word other than a, man, or sleeps
P(N|word) = 0.3 for any word other than a, man, or sleeps
P(V|word) = 0.1 for any word other than a, man, or sleeps
It is assumed that all other probabilities, not defined above, can take any values such that Σ_tag P(tag|word) = 1 is satisfied for any word in V.
a. Define the features of your maximum entropy model that can model this distribution. Mark your features as f1, f2, and so on. Each feature should have the same format as explained in class.
b. For each feature fi, assume a weight λi. Now write expressions for the following probabilities in terms of your model parameters:
P(D|cat)
P(N|laughs)
P(D|man)
c. What values do the parameters in your model take to give the distribution described above (i.e., P(D|a) = 0.9, and so on)? (12 marks, Section-II)
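Without giving away the exercise, a sketch of the log-linear form the answer plugs into, P(tag|word) = exp(Σi λi·fi(word, tag)) / Z(word); the feature and weight below are placeholders, not the solution:

    import math

    def maxent_prob(word, tag, features, weights, tags=("D", "N", "V")):
        # P(tag | word) = exp(sum_i lambda_i * f_i(word, tag)) / Z(word)
        def score(t):
            return math.exp(sum(lam * f(word, t) for f, lam in zip(features, weights)))
        return score(tag) / sum(score(t) for t in tags)

    # Placeholder feature, not the exercise's answer: fires for (word=a, tag=D).
    f1 = lambda word, tag: 1.0 if word == "a" and tag == "D" else 0.0
    print(maxent_prob("a", "D", [f1], [2.0]))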
21 What is syntax? What is parsing? What is the difference between a derivation and a parse tree? What is constituency? Write down the different forms of constituency with examples. What is the significance of the “head” of a constituent? Explain. (12 marks, Section-III)
22 What is the difference between top-down and bottom-up parsing? Apply the CYK algorithm to parse the sentence “a pilot likes flying planes” with the given grammar. (12 marks, Section-III)
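The question's grammar is not reproduced here, so the sketch below runs CYK recognition on “a pilot likes flying planes” with a hypothetical CNF grammar whose rules are invented for illustration:

    from itertools import product

    grammar = {                    # binary CNF rules: (B, C) -> A means A -> B C
        ("NP", "VP"): "S",
        ("Det", "N"): "NP",
        ("V", "NP"): "VP",
        ("Adj", "N"): "NP",
    }
    lexicon = {"a": {"Det"}, "pilot": {"N"}, "likes": {"V"},
               "flying": {"Adj"}, "planes": {"N"}}

    words = "a pilot likes flying planes".split()
    n = len(words)
    # table[i][j] = set of nonterminals deriving words[i..j]
    table = [[set() for _ in range(n)] for _ in range(n)]
    for i, w in enumerate(words):
        table[i][i] = set(lexicon[w])

    for span in range(2, n + 1):
        for i in range(n - span + 1):
            j = i + span - 1
            for k in range(i, j):              # split point
                for B, C in product(table[i][k], table[k + 1][j]):
                    if (B, C) in grammar:
                        table[i][j].add(grammar[(B, C)])

    print("S" in table[0][n - 1])  # True iff the sentence is derivable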
23 What is inside-outside probability? Apply the CYK algorithm to parse the sentence “a pilot likes flying planes” with the given probabilistic context-free grammar to find the most probable parse tree. Find the inside probabilities for each word of the following sentence:
“Astronomers saw stars with ears” (12 marks, Section-III)
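A sketch of the inside-probability computation over CYK spans for that sentence, using a toy PCFG loosely in the style of the classic textbook example; all rule probabilities here are illustrative placeholders, not the question's grammar:

    from collections import defaultdict

    rules = {   # (B, C) -> list of (A, prob) for binary rules A -> B C
        ("NP", "VP"): [("S", 1.0)],
        ("V", "NP"): [("VP", 0.7)],
        ("VP", "PP"): [("VP", 0.3)],
        ("P", "NP"): [("PP", 1.0)],
        ("NP", "PP"): [("NP", 0.4)],
    }
    lexicon = { # word -> list of (A, prob) for unary rules A -> word
        "astronomers": [("NP", 0.1)],
        "saw": [("V", 1.0), ("NP", 0.04)],
        "stars": [("NP", 0.18)],
        "with": [("P", 1.0)],
        "ears": [("NP", 0.18)],
    }

    words = "astronomers saw stars with ears".split()
    n = len(words)
    # inside[(i, j)][A] = total probability that A derives words[i..j]
    inside = defaultdict(lambda: defaultdict(float))
    for i, w in enumerate(words):
        for A, p in lexicon[w]:
            inside[(i, i)][A] += p

    for span in range(2, n + 1):
        for i in range(n - span + 1):
            j = i + span - 1
            for k in range(i, j):
                for B, pb in inside[(i, k)].items():
                    for C, pc in inside[(k + 1, j)].items():
                        for A, pr in rules.get((B, C), []):
                            inside[(i, j)][A] += pr * pb * pc

    print(dict(inside[(0, n - 1)]))  # inside probability of S over the whole sentence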
32 What is a word space? Write down the steps to create a word space, explain them with an example, and show how it can be used to measure word similarities. (12 marks, Section-IV)
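A minimal word-space sketch: build a co-occurrence matrix from a tiny corpus and compare words by cosine similarity; the corpus and window size are arbitrary:

    import math
    from collections import defaultdict

    corpus = ["the cat sat on the mat",
              "the dog sat on the rug",
              "a cat chased a dog"]
    window = 2

    # Step 1: count how often each word co-occurs with its neighbours.
    cooc = defaultdict(lambda: defaultdict(int))
    for sent in corpus:
        toks = sent.split()
        for i, w in enumerate(toks):
            for j in range(max(0, i - window), min(len(toks), i + window + 1)):
                if j != i:
                    cooc[w][toks[j]] += 1

    # Step 2: compare context vectors with cosine similarity.
    def cosine(u, v):
        dot = sum(u[k] * v.get(k, 0) for k in u)
        nu = math.sqrt(sum(x * x for x in u.values()))
        nv = math.sqrt(sum(x * x for x in v.values()))
        return dot / (nu * nv)

    print(cosine(cooc["cat"], cooc["dog"]))  # similar contexts -> higher score
    print(cosine(cooc["cat"], cooc["on"]))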
33 How can weights be measured based on context? Derive the formulation for weight measurement. What is the difference between attributional and relational similarity? (12 marks, Section-IV)
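One standard context-based weighting is pointwise mutual information, PMI(w, c) = log[P(w, c) / (P(w)·P(c))]; a sketch over raw co-occurrence counts, where the count data is a made-up placeholder:

    import math

    count = {("cat", "pet"): 20, ("cat", "the"): 40,
             ("dog", "pet"): 5, ("dog", "the"): 55}
    total = sum(count.values())
    w_tot, c_tot = {}, {}
    for (w, c), n in count.items():
        w_tot[w] = w_tot.get(w, 0) + n
        c_tot[c] = c_tot.get(c, 0) + n

    def pmi(w, c):
        # PMI(w, c) = log( P(w, c) / (P(w) * P(c)) )
        return math.log((count[(w, c)] / total) /
                        ((w_tot[w] / total) * (c_tot[c] / total)))

    print(pmi("cat", "pet"))  # informative context weighted above frequent "the"
    print(pmi("cat", "the"))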
34 What is one-hot encoding? How can words be represented using one-hot encoding? Explain with an example. What are the limitations of one-hot encoding? Explain with an example. (12 marks, Section-IV)
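A quick one-hot sketch over a small vocabulary; note that every pair of distinct one-hot vectors is orthogonal, which is exactly the limitation the question asks about:

    vocab = ["cat", "dog", "mat", "sat"]
    index = {w: i for i, w in enumerate(vocab)}

    def one_hot(word):
        vec = [0] * len(vocab)
        vec[index[word]] = 1
        return vec

    print(one_hot("cat"))  # [1, 0, 0, 0]
    print(one_hot("dog"))  # [0, 1, 0, 0]
    # The dot product of any two distinct one-hot vectors is 0, so
    # "cat" looks exactly as unrelated to "dog" as it does to "mat".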
35 What is CBOW? How is CBOW used to embed words? Explain with an example. What is the difference between skip-gram and CBOW? (12 marks, Section-IV)
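A brief CBOW sketch using gensim's Word2Vec, where sg=0 selects CBOW and sg=1 would train skip-gram; the toy corpus and hyperparameters are arbitrary:

    from gensim.models import Word2Vec

    sentences = [["the", "cat", "sat", "on", "the", "mat"],
                 ["the", "dog", "sat", "on", "the", "rug"],
                 ["a", "cat", "chased", "a", "dog"]]

    # sg=0 -> CBOW: predict the centre word from its context window;
    # sg=1 -> skip-gram: predict the context from the centre word.
    model = Word2Vec(sentences, vector_size=20, window=2, min_count=1, sg=0, epochs=50)
    print(model.wv.most_similar("cat", topn=2))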
42 What are the main stages of text summarization? How can salient words be identified? How can sentences be weighted? (12 marks, Section-V)
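A minimal extractive sketch: score words by frequency as a simple notion of salience, then weight sentences by the scores of the words they contain; the document and stopword list are illustrative:

    from collections import Counter

    document = ["the model improves accuracy on the benchmark",
                "we describe the model architecture",
                "weather was pleasant during the conference"]
    stopwords = {"the", "on", "we", "was", "during"}

    # Salient words: frequent content words across the document.
    word_freq = Counter(w for s in document for w in s.split() if w not in stopwords)

    # Sentence weight: sum of its words' salience scores.
    def weight(sentence):
        return sum(word_freq[w] for w in sentence.split() if w not in stopwords)

    ranked = sorted(document, key=weight, reverse=True)
    print(ranked[0])  # the top-weighted sentence would head an extractive summary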
43 How can sentences be simplified? Explain with an example. How can summarization systems be evaluated? What is ROUGE, and how is it used for system evaluation? (12 marks, Section-V)
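A hand-rolled ROUGE-1 recall sketch (unigram overlap between a system summary and a reference); real evaluations use the official ROUGE toolkit, this just shows the idea:

    from collections import Counter

    def rouge1_recall(system, reference):
        sys_counts = Counter(system.split())
        ref_counts = Counter(reference.split())
        # Clipped unigram matches divided by the reference length.
        overlap = sum(min(c, sys_counts[w]) for w, c in ref_counts.items())
        return overlap / sum(ref_counts.values())

    reference = "the cat sat on the mat"
    system = "the cat lay on the mat"
    print(rouge1_recall(system, reference))  # 5 of 6 reference unigrams matched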
44 What is text classification? What kinds of problems can be solved using text classification? How can text classification problems be solved? (12 marks, Section-V)
45 Discuss the different types of text classification tasks, including binary, multi-class, and hierarchical classification. (12 marks, Section-V)
46 Discuss the evaluation of text classifiers using metrics like accuracy, precision, recall, and F1-score. (12 marks, Section-V)
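A quick look at the four metrics on toy predictions using scikit-learn; the labels are fabricated:

    from sklearn.metrics import accuracy_score, precision_score, recall_score, f1_score

    y_true = [1, 0, 1, 1, 0, 1, 0, 0]   # hypothetical gold labels
    y_pred = [1, 0, 1, 0, 0, 1, 1, 0]   # hypothetical classifier output

    print("accuracy :", accuracy_score(y_true, y_pred))
    print("precision:", precision_score(y_true, y_pred))
    print("recall   :", recall_score(y_true, y_pred))
    print("f1       :", f1_score(y_true, y_pred))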
49 Describe the application of machine learning algorithms like Naive Bayes, support vector machines (SVMs), and random forests in text classification. (12 marks, Section-V)
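A compact scikit-learn sketch comparing Naive Bayes, a linear SVM, and a random forest on a tiny bag-of-words task; the six training texts and their labels are invented placeholders:

    from sklearn.feature_extraction.text import CountVectorizer
    from sklearn.naive_bayes import MultinomialNB
    from sklearn.svm import LinearSVC
    from sklearn.ensemble import RandomForestClassifier
    from sklearn.pipeline import make_pipeline

    texts = ["great movie loved it", "terrible film waste of time",
             "wonderful acting superb plot", "boring and awful",
             "enjoyed every minute", "worst movie ever"]
    labels = [1, 0, 1, 0, 1, 0]   # 1 = positive, 0 = negative (toy data)

    for clf in (MultinomialNB(), LinearSVC(), RandomForestClassifier()):
        model = make_pipeline(CountVectorizer(), clf)
        model.fit(texts, labels)
        print(type(clf).__name__, model.predict(["loved the plot", "awful film"]))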