
Previous Year Question Paper (NLP)

2023-24

Q. A corpus contains 4 documents, in which the word ‘diet’ appears once in Document 1.
Identify the term by which we can categorize the word ‘diet’.
(a) Stop word
(b) Rare word
(c) Frequent word
(d) Removable word

Ans: (b) Rare word

Q. Identify the given chatbot type:

It learns from its environment and experience, and it builds on its capabilities based on the knowledge gained. These bots can collaborate with humans, working alongside them and learning from their behaviour.

Ans: Smart bot

Q. Which feature of NLP helps in understanding the emotions of people from their feedback?
(a) Virtual Assistants
(b) Sentiment Analysis
(c) Text classification
(d) Automatic Summarization

Ans: (b) Sentiment Analysis

Q. What do you mean by the syntax of a language?
(a) Meaning of a sentence
(b) Grammatical structure of a sentence
(c) Semantics of a sentence
(d) Synonym of a sentence

Ans: (b) Grammatical structure of a sentence

Q. Which algorithm results in two things, a vocabulary of words and the frequency of those words in the corpus?
(a) Sentence segmentation
(b) Tokenisation
(c) Bag of words
(d) Text normalisation

Ans: (c) Bag of words

Q. Identify any two stop words which should not be removed from the given sentence, and why:
Get help and support whether you're shopping now or need help with a past purchase. Contact us
at [email protected] or on our website www.pwershel.com

Ans: Stop words in the given sentence which should not be removed are:
@, . (full stop), _ (underscore), 123 (numbers). These tokens are generally considered stop words, but in the above sentence they are part of an email ID and a website address. Removing them would leave the email ID and website address invalid, so these tokens should not be removed from this sentence.
Other stop words in the sentence are: or, and, a, at, on.
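
A tiny illustration of the point (my own sketch, using a made-up address since the one in the question is partly masked): stripping these characters as if they were ordinary stop tokens corrupts the email ID and URL.

```python
import re

text = "Contact us at help_desk@example.com or on our website www.example.com"

# Naively removing '@', '.' and '_' as if they were stop tokens:
stripped = re.sub(r"[@._]", " ", text)
print(stripped)  # 'help desk example com' - the email ID and URL are no longer valid
```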

Q. We, human beings, can read, write and understand many languages. But computers can understand only machine language. Do you think we might face any challenges if we try to teach computers how to understand and interact in human languages? Explain.

Ans: Yes, we might face challenges if we try to teach computers how to understand and interact in human languages. The possible difficulties are:

1. Arrangement of the words and meaning - the computer has to identify the different parts of speech. It may also be extremely difficult for a computer to understand the meaning behind the language we use.

2. Multiple meanings of a word - the same word can be used in a number of different ways, and its meaning changes completely according to the context of the statement.

3. Perfect syntax, no meaning - sometimes a statement can have a perfectly correct syntax but not mean anything. For example, take a look at this statement:

Chickens feed extravagantly while the moon drinks tea.

This statement is grammatically correct, but does it make any sense? In human language, a perfect balance of syntax and semantics is important for better understanding.

2022-23
Q. What is the full form of TF-IDF?
Ans: Term Frequency - Inverse Document Frequency
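
For reference, the usual classroom formulation of the score (the notation here is mine) for a word $w$ in a document $d$, over a corpus of $N$ documents:

$$\mathrm{TFIDF}(w, d) = \mathrm{TF}(w, d) \times \log\frac{N}{\mathrm{DF}(w)}$$

where $\mathrm{TF}(w, d)$ is the number of times $w$ occurs in $d$ and $\mathrm{DF}(w)$ is the number of documents containing $w$. A word that appears in every document gets $\log(N/N) = 0$, which is why common stop words carry almost no value.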

Q. A corpus contains 12 documents. How many document vectors will be there for that corpus?
a. 12
b. 1
c. 24
d. 1/12

Ans: a. 12 (there is one document vector per document)

Q. Identify the type of chatbot with the information given below:

These bots work on pre-programmed instructions inside the application/machine and are generally easy to develop. They are deployed in the customer care sections of various companies. Their job is to answer the basic queries that they are coded for and to connect users to human executives once they are unable to handle the conversation.

Ans: Script bot

Q. What will be the results of conversion of the term ‘happily’ in the processes of stemming and lemmatization? Which process takes longer to execute?

Word: happily
Stemming: happi
Lemmatization: happy

Lemmatization takes longer to execute than stemming, because it looks the word up in a dictionary and returns a meaningful root word, whereas stemming simply chops off the affixes.

Q. What do we get from the “bag of words” algorithm?

Bag of words gives us two things:
1. A vocabulary of words for the corpus
2. The frequency of these words (the number of times each word has occurred in the whole corpus)
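
As an illustration, scikit-learn's CountVectorizer produces exactly these two outputs (a minimal sketch, assuming scikit-learn is installed; the sample documents are borrowed from the text-normalisation question below):

```python
from sklearn.feature_extraction.text import CountVectorizer

docs = [
    "Akash and Ajay are best friends.",
    "Akash likes to play football but Ajay prefers to play online games.",
]

vectorizer = CountVectorizer()
matrix = vectorizer.fit_transform(docs)

print(vectorizer.get_feature_names_out())  # 1. the vocabulary of the corpus
print(matrix.toarray())                    # 2. frequency of each word per document
```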

Q. Samiksha, a student of class X, was exploring the Natural Language Processing domain. She got stuck while performing text normalisation. Help her normalise the text of the segmented sentences given below: (4 marks)
Document 1: Akash and Ajay are best friends.
Document 2: Akash likes to play football but Ajay prefers to play online games.

1. Tokenisation
Akash, and, Ajay, are, best, friends
Akash, likes, to, play, football, but, Ajay, prefers, to, play, online, games
2. Removal of stop words
Akash, Ajay, best, friends
Akash, likes, play, football, Ajay, prefers, play, online, games
3. Converting text to a common case
akash, ajay, best, friends
akash, likes, play, football, ajay, prefers, play, online, games
4. Stemming/Lemmatisation (the full token lists are shown here, not only the stemmed/lemmatised words)
akash, ajay, best, friend
akash, like, play, football, ajay, prefer, play, online, game
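
A minimal sketch of the same four steps in Python, using NLTK as an example toolkit (my assumption; it needs the 'punkt', 'stopwords' and 'wordnet' data packages downloaded first):

```python
from nltk import word_tokenize
from nltk.corpus import stopwords
from nltk.stem import WordNetLemmatizer

lemmatizer = WordNetLemmatizer()
stop_words = set(stopwords.words("english"))

def normalise(document):
    tokens = word_tokenize(document)                              # 1. tokenisation
    tokens = [t for t in tokens if t.isalpha()]                   #    drop punctuation
    tokens = [t for t in tokens if t.lower() not in stop_words]   # 2. stop-word removal
    tokens = [t.lower() for t in tokens]                          # 3. common case
    return [lemmatizer.lemmatize(t) for t in tokens]              # 4. lemmatisation

# Note: without part-of-speech tags the WordNet lemmatizer treats every token
# as a noun, so verb forms such as 'prefers' may pass through unchanged.
print(normalise("Akash and Ajay are best friends."))
print(normalise("Akash likes to play football but Ajay prefers to play online games."))
```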

2021-22
Q. What will be the output of the word “studies” if we do the following:
a. Lemmatization
b. Stemming
Ans: The output of the word after lemmatization will be study.
The output of the word after stemming will be studi (or stud, depending on the stemmer).
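
A quick way to verify this (a sketch assuming NLTK with the 'wordnet' data package installed):

```python
from nltk.stem import PorterStemmer, WordNetLemmatizer

print(PorterStemmer().stem("studies"))           # 'studi' - stemming just strips affixes
print(WordNetLemmatizer().lemmatize("studies"))  # 'study' - lemmatization returns a dictionary word
```

Because the lemmatizer consults a dictionary (WordNet) instead of applying suffix rules, it is slower than the stemmer but always yields a meaningful word.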
Q. How many tokens are there in the passage given below?
Traffic Jams have become a common part of our lives nowadays. Living in an urban area means you
have to face traffic each and every time you get out on the road. Mostly, school students opt for buses to
go to school.
Ans: There are 46 tokens in the given passage (punctuation marks count as tokens).
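
One way to check the count programmatically (a sketch using NLTK's word tokenizer, which treats punctuation marks as separate tokens, matching the count above):

```python
from nltk import word_tokenize  # needs the 'punkt' tokenizer data downloaded

text = ("Traffic Jams have become a common part of our lives nowadays. "
        "Living in an urban area means you have to face traffic each and every "
        "time you get out on the road. Mostly, school students opt for buses to "
        "go to school.")

print(len(word_tokenize(text)))  # 46 = 42 words + 3 full stops + 1 comma
```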

Q. What is a corpus?
Ans: The whole textual data from all the documents taken together is known as a corpus.

Q. Identify any 2 stop words in the given sentence:

Pollution is the introduction of contaminants into the natural environment that cause adverse
change. The three types of pollution are air pollution, water pollution and land pollution.
Ans: Stop words in the given sentence are: is, the, of, that, into, are, and (any two of these).

Q. “Automatic summarization is used in NLP applications”. Is the given statement correct? Justify your
answer with an example.
Ans: Yes, the given statement is correct. Automatic summarization is relevant not only for
summarizing the meaning of documents and information, but also for understanding the
emotional meaning within the information, such as when collecting data from social media.
Automatic summarization is especially relevant when used to provide an overview of a news item
or blog post while avoiding redundancy from multiple sources and maximizing the diversity of
the content obtained.

Q. Write any two applications of TF-IDF. (2 marks)

Ans: (any two of the following)
1. Document classification - helps in classifying the type and genre of a document.
2. Topic modelling - helps in predicting the topic for a corpus.
3. Information retrieval system - helps to extract the important information out of a corpus.
4. Stop word filtering - helps in removing the unnecessary words from a text body.
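
As a concrete illustration of the first application, scikit-learn exposes TF-IDF as a drop-in vectorizer whose output can feed a classifier (a minimal sketch, assuming scikit-learn is installed; the documents are made up for the example):

```python
from sklearn.feature_extraction.text import TfidfVectorizer

docs = [
    "pollution is the introduction of contaminants into the environment",
    "air pollution and water pollution are two types of pollution",
]

vectorizer = TfidfVectorizer()
scores = vectorizer.fit_transform(docs)  # one TF-IDF vector per document

# Words occurring in every document (e.g. 'pollution', 'of') get a low IDF,
# so their scores are damped relative to rarer, more informative words.
print(vectorizer.get_feature_names_out())
print(scores.toarray().round(2))
```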

Q. Write down the steps to implement the bag of words algorithm. (2 marks) (If asked for 4 marks with a detailed explanation, it also needs examples.)
Ans: The steps to implement the bag of words algorithm are as follows (a short sketch in Python appears after the list):
1. Text Normalisation: Collect data and pre-process it
2. Create Dictionary: Make a list of all the unique words occurring in the corpus. (Vocabulary)
3. Create document vectors: For each document in the corpus, find out how many times the word
from the unique list of words has occurred.
4. Create document vectors for all the documents.
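
A minimal sketch of steps 2-4 in plain Python (my own illustration; the documents are the normalised token lists from the Samiksha question above):

```python
# Step 1 is assumed done: the documents are already normalised token lists.
doc1 = ["akash", "ajay", "best", "friend"]
doc2 = ["akash", "like", "play", "football", "ajay", "prefer", "play", "online", "game"]

# Step 2: create the dictionary - the unique words (vocabulary) of the corpus.
vocabulary = sorted(set(doc1) | set(doc2))

# Steps 3-4: create one document vector per document - the count of every
# vocabulary word in that document.
for doc in (doc1, doc2):
    print([doc.count(word) for word in vocabulary])
```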

Q. Explain, from the given graph, how the value and occurrence of a word are related in a corpus.
Ans: As shown in the graph, the occurrence and value of a word are inversely proportional. The
words which occur most frequently (like stop words) have negligible value. As the occurrence of
words drops, their value rises. These words are termed rare or valuable words: they occur the
least but add the most value to the corpus.
