Module 1

The document consists of a comprehensive list of questions and tasks related to Natural Language Processing (NLP) and its various components, including Named Entity Recognition (NER), tokenization, morphology, and regular expressions. It covers practical applications, theoretical concepts, and technical exercises, emphasizing the importance of NLP in real-life scenarios. Each question is assigned a specific mark value, indicating its complexity and depth of understanding required.

Uploaded by

rayobose51

1. Mention two practical applications of NER. CO1 BL1 2 Marks

2. With examples explain the different types of NER attributes. CO1 BL2 10 Marks

3. What do you understand about Natural language processing? CO1 BL1 2 Marks

4. What are stop words? CO1 BL1 2 Marks

5. List any two real life applications of NLP. CO1 BL1 2 Marks

6. Explain the difference between precision and recall in information retrieval. CO1 BL2
5 Marks

7. What is NLTK? CO1 BL1 2 Marks

8. What is Multi Word Tokenization? CO1 BL1 2 Marks

9. What are stems? CO1 BL1 2 Marks

10. What are affixes? CO1 BL1 2 Marks

11. What is lexicon? CO1 BL1 2 Marks

12. Why is Multi word tokenization preferred over Single word tokenization? CO1 BL1
2 Marks

13. What is sentence segmentation? CO1 BL1 2 Marks

14. Why is sentence segmentation important? CO1 BL1 2 Marks

15. What is morphology in NLP? CO1 BL1 2 Marks

16. List the different types of morphology available. CO1 BL1 2 Marks

17. What is the difference between NLP and NLU? CO1 BL1 2 Marks

18. Give some popular examples of Corpus. CO1 BL1 2 Marks

19. State the difference between word and sentence tokenization. CO1 BL1 2 Marks

20. What are the phases of problem-solving in NLP? CO1 BL1 5 Marks

21. Explain the process of word tokenization with example. CO1 BL1 5 Marks

22. How does Named Entity Recognizer work? CO1 BL1 5 Marks

23. What are the benefits of eliminating stop words? Give some examples where stop word
elimination may be harmful. CO1 BL3 5 Marks

24. What do you mean by RegEx? Explain with example. CO1 BL1 5 Marks

25. Explain Dependency Parsing in NLP? CO1 BL1 5 Marks

26. Write a regular expression to represent a set of all strings over {a, b} of even length. CO1
BL3 5 Marks

27. Write a regular expression to represent a set of all strings over {a, b} of length 4 starting with
an a. CO1 BL3 5 Marks

28. Write a regular expression to represent a set of all strings over {a, b} containing at least one
a. CO1 BL3 5 Marks
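
Answers to questions 26 to 28 can be sanity-checked by brute force. The sketch below is one possible check, not an official answer key: the three patterns are candidate answers written in Python `re` syntax (an assumption, since the questions do not fix a notation), and each is compared against a plain-Python description of the target language on every string over {a, b} up to length 6.

```python
import re
from itertools import product

# Candidate answers (assumptions, not an official key), paired with a
# plain-Python predicate describing the language each should match.
CANDIDATES = {
    "even_length": (r"(?:[ab][ab])*", lambda s: len(s) % 2 == 0),
    "length4_starts_a": (r"a[ab]{3}", lambda s: len(s) == 4 and s[0] == "a"),
    "at_least_one_a": (r"[ab]*a[ab]*", lambda s: "a" in s),
}

def all_strings(alphabet="ab", max_len=6):
    """Yield every string over the alphabet with length 0..max_len."""
    for n in range(max_len + 1):
        for chars in product(alphabet, repeat=n):
            yield "".join(chars)

def mismatches(pattern, predicate, max_len=6):
    """Return the strings on which the regex and the predicate disagree."""
    compiled = re.compile(pattern)
    return [s for s in all_strings(max_len=max_len)
            if bool(compiled.fullmatch(s)) != predicate(s)]

for name, (pattern, predicate) in CANDIDATES.items():
    # An empty list means no disagreement was found up to the length bound.
    print(name, pattern, mismatches(pattern, predicate))
```

An empty mismatch list is evidence, not proof, because only strings up to the chosen length are tested.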

29. Compare and contrast NLTK and spaCy, highlighting their differences. CO1 BL2 5 Marks

30. What is a Bag of Words? Explain with examples. CO1 BL2 5 Marks

31. Differentiate regular grammar and regular expression. CO1 BL3 5 Marks

32. Describe the word and sentence tokenization steps with the help of an example. CO1 BL2
10 Marks

33. How can the common challenges faced in morphological analysis in natural language
processing be overcome? CO1 BL3 10 Marks

34. Derive Minimum Edit Distance Algorithm and compute the minimum edit distance between
the words “MAM” and “MADAM”. CO1 BL4 10 Marks
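
A hand-computed edit distance grid can be checked against a small dynamic-programming sketch. The function below is a minimal illustration assuming the standard formulation with unit costs for insertion, deletion, and substitution; the function name and cost parameters are illustrative choices, and some textbooks use a substitution cost of 2 instead, which can be passed as `sub_cost=2`.

```python
def min_edit_distance(source, target, ins_cost=1, del_cost=1, sub_cost=1):
    """Minimum cost of turning source into target (unit costs by default)."""
    n, m = len(source), len(target)
    # dist[i][j] holds the cost of turning source[:i] into target[:j].
    dist = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        dist[i][0] = dist[i - 1][0] + del_cost   # delete everything
    for j in range(1, m + 1):
        dist[0][j] = dist[0][j - 1] + ins_cost   # insert everything
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            same = source[i - 1] == target[j - 1]
            dist[i][j] = min(
                dist[i - 1][j] + del_cost,                        # deletion
                dist[i][j - 1] + ins_cost,                        # insertion
                dist[i - 1][j - 1] + (0 if same else sub_cost),   # match or substitution
            )
    return dist[n][m]

print(min_edit_distance("MAM", "MADAM"))  # 2: insert D and A
```

The same function applies to the other word pairs in this list, provided the cost settings match those stated in the question.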

35. Discuss the problem-solving approaches of any two real-life applications of Information
Extraction and NER in Natural Language Processing. CO1 BL1 10 Marks

36. How would you solve an application of NLP? Justify with an example. CO4 BL5 10 Marks

37. What are corpora? Define the steps of creating a corpus for a specific task. CO1 BL2 10 Marks

38. What is Information Extraction? CO1 BL1 5 Marks

39. State the different applications of Sentiment analysis and Opinion mining with examples.
Write down the variations as well. CO1 BL3 10 Marks

40. State a few applications of Information Retrieval. CO4 BL5 5 Marks

41. What is text normalization? CO3 BL3 10 Marks

42. Do you think there are any differences between tokenization and normalization? Justify your
answer with examples. CO4 BL5 10 Marks

43. What makes part-of-speech (POS) tagging crucial in NLP, in your opinion? Give an example
to back up your response. CO4 BL4 5 Marks

44. Criticize the shortcomings of the fundamental Top-Down Parser. CO1 BL3 5 Marks

45. Do you believe there are any distinctions between prediction and classification? Illustrate
with an example. CO1 BL1 5 Marks

46. Explain the connection between word tokenization and phrase tokenization using examples.
How do both tokenization methods contribute to the development of NLP applications? CO1 BL3
10 Marks

47. “Natural Language Processing (NLP) has many real-life applications across various
industries.”- List any two real-life applications of Natural Language Processing. CO1 BL1 5
Marks

48. Find all strings of length 5 or less in the regular set represented by the following regular
expressions:
(a) (ab + a)*(aa + b)

(b) (ab + ba)*a CO1 BL4 5 Marks
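
A listing produced by hand for question 48 can be cross-checked by enumeration. The sketch below assumes the textbook `+` denotes union and rewrites it as `|` for Python's `re` module; `strings_in_language` is a hypothetical helper name. It tests every string over {a, b} of length 0 to 5 against each expression.

```python
import re
from itertools import product

def strings_in_language(pattern, alphabet="ab", max_len=5):
    """List all strings up to max_len that the pattern matches in full."""
    compiled = re.compile(pattern)
    found = []
    for n in range(max_len + 1):
        for chars in product(alphabet, repeat=n):
            s = "".join(chars)
            if compiled.fullmatch(s):
                found.append(s)
    return found

# Textbook union "+" rewritten as "|" (an assumption about the notation).
part_a = strings_in_language(r"(?:ab|a)*(?:aa|b)")
part_b = strings_in_language(r"(?:ab|ba)*a")
print(part_a)
print(part_b)
```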

49. Write regular expressions for the following languages.

1. the set of all alphabetic strings;

2. the set of all lower case alphabetic strings ending in a b;

3. the set of all strings from the alphabet a,b such that each a is immediately preceded by and
immediately followed by a b; CO1 BL4 10 Marks

50. Explain Rule based POS tagging CO1 BL2 5 Marks

51. Differentiate regular grammar and regular expression CO1 BL3 5 Marks

52. What is NLTK? CO1 BL2 2 Marks

53. What is Multi Word Tokenization? CO1 BL2 2 Marks

54. What is sentence segmentation? CO1 BL2 2 Marks

55. What is morphology in NLP? CO1 BL2 2 Marks

56. Give some popular examples of Corpus. CO1 BL2 2 Marks

57. What do you mean by word tokenization? CO1 BL2 2 Marks

58. Find the minimum edit distance between the two strings ELEPHANT and RELEVANT. CO3
BL5 10 Marks

59. If str1 = "SUNDAY" and str2 = "SATURDAY" is given, calculate the minimum edit distance
between the two strings. CO1 BL5 10 Marks

60. List the different types of morphology available. CO4 BL2 5 Marks

61. What is Stemming? CO1 BL1 2 Marks

62. What is Corpus in NLP? CO1 BL1 2 Marks

63. State with example the difference between stemming and lemmatization. CO4 BL4
5 Marks

64. Write down the different stages of NLP pipeline. CO1 BL4 10 Marks

65. What is your understanding about Chatbot in the context of NLP? CO3 BL3 10
Marks

66. Write a short note on text pre-processing in the context of NLP. Discuss outliers and how to
handle them. CO3 BL2 10 Marks

67. Explain with an example the challenges with sentence tokenization. CO3 BL3 5 Marks

68. Explain some of the common NLP tasks. CO1 BL2 5 Marks

69. What do you mean by text extraction and cleanup? Discuss with examples. CO3 BL2
10 Marks

70. What is word sense ambiguity in NLP? Explain with examples. CO3 BL1 5 Marks

71. Write a short note on Bag of Words (BOW). CO1 BL3 10 Marks

72. Explain homonymy with an example. CO1 BL3 2 Marks

73. Define WordNet. CO1 BL1 2 Marks

74. Consider a document containing 100 words wherein the word apple appears 5 times and
assume we have 10 million documents and the word apple appears in one thousandth of these.
Then, calculate the term frequency and inverse document frequency? CO4 BL5 10 Marks
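
The arithmetic in question 74 can be laid out as a short calculation. This is a sketch under two stated assumptions: "one thousandth of these" is read as the fraction 1/1000 of the 10 million documents (so 10,000 documents contain the word), and IDF uses a base-10 logarithm of the plain ratio N / df. Other conventions, such as the natural logarithm or smoothed variants, give different numbers.

```python
import math

term_count = 5              # occurrences of "apple" in the document
doc_length = 100            # total words in the document
total_docs = 10_000_000     # size of the collection
# Assumption: the word appears in 1/1000 of all documents.
docs_with_term = total_docs // 1000

tf = term_count / doc_length                     # term frequency
idf = math.log10(total_docs / docs_with_term)    # inverse document frequency, base 10
tf_idf = tf * idf

print(tf, idf, tf_idf)
```

Under these assumptions the term frequency is 0.05, the inverse document frequency is 3, and their product is 0.15.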

75. Explain the relationship between Singular Value Decomposition, Matrix Completion and
Matrix Factorization? CO1 BL3 5 Marks

76. Give two examples that illustrate the significance of regular expressions in NLP. CO1 BL1
5 Marks

77. Why is multiword tokenization preferable over single word tokenization in NLP? Give
examples. CO1 BL1 5 Marks

78. Differentiate between formal language and natural language. CO3 BL1 10 Marks

79. Explain lexicon, lexeme and the different types of relations that hold between lexemes. CO1
BL1 10 Marks

80. State the advantages of bottom-up chart parser compared to top-down parsing. CO1 BL1
10 Marks

81. Marks

82. Describe the Skip-gram model and its intuition in word embeddings. CO1 BL2 10
Marks

83. Explain the concept of Term Frequency-Inverse Document Frequency (TF-IDF) based ranking
in information retrieval. CO1 BL2 10 Marks

84. Tokenize and tag the following sentence: CO1 BL1 2 Marks

85. What different pronunciations and parts-of-speech are involved? CO1 BL1 2
Marks

86. Compute the edit distance (using insertion cost 1, deletion cost 1, substitution cost 1) of
“intention” and “execution”. Show your work using the edit distance grid. CO1 BL4 10
Marks

87. What is the purpose of constructing corpora in Natural Language Processing (NLP) research?
CO1 BL2 5 Marks

88. What role do regular expressions play in searching and manipulating text data? CO1 BL3
5 Marks

89. Explain the purpose of WordNet in Natural Language Processing (NLP). CO1 BL4 10
Marks

90. What is Pragmatic Ambiguity in NLP? CO1 BL4 10 Marks

91. Describe the class of strings matched by the following regular expressions: a. [a-zA-Z]+ b. [A-Z][a-z]*
CO1 BL4 10 Marks

92. Extract all email addresses from the following: “Contact us at [email protected] or
[email protected].” CO1 BL4 10 Marks

93. This regex is intended to match one or more uppercase letters followed by zero or more
digits. [A-Z] + [0-9]* However, it has a problem. What is it, and how can it be fixed?
CO1 BL4 10 Marks
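
One reading of question 93 is that the literal spaces are the problem: without a verbose flag, a regex engine treats each space as a character to match, so the `+` applies to the first space rather than to `[A-Z]`. The sketch below uses Python's `re` module (an assumption, since the question names no engine) to compare the pattern as written with a version that has the spaces removed.

```python
import re

broken = re.compile(r"[A-Z] + [0-9]*")   # pattern as written in the question
fixed = re.compile(r"[A-Z]+[0-9]*")      # same pattern with the spaces removed

# Print whether each pattern matches the whole string.
for text in ["ABC123", "X", "A  1"]:
    print(repr(text), bool(broken.fullmatch(text)), bool(fixed.fullmatch(text)))
```

The pattern as written accepts only one uppercase letter, then at least two spaces, then optional digits, which is why it rejects "ABC123" but accepts "A  1".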

94. Write a regex to find all dates in a text. The date formats should include:

DD-MM-YYYY

MM-DD-YYYY

YYYY-MM-DD CO1 BL4 10 Marks
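
A possible starting point for question 94 is sketched below. It checks only the digit layout, which is an assumption about how strict the answer should be: DD-MM-YYYY and MM-DD-YYYY share the same shape, so a pattern alone cannot tell them apart, and values such as month 13 are not rejected. The sample sentence is made up for illustration.

```python
import re

# Two digit layouts cover the three formats: NN-NN-NNNN and NNNN-NN-NN.
DATE_PATTERN = re.compile(r"\b(?:\d{2}-\d{2}-\d{4}|\d{4}-\d{2}-\d{2})\b")

# Hypothetical sample text, not taken from the question.
sample = "Met on 25-12-2023, again on 12-25-2023, and finally on 2024-01-05."
dates = DATE_PATTERN.findall(sample)
print(dates)
```

Validating day and month ranges is usually easier after matching, for example by passing each match to `datetime.strptime` with the expected format.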

95. Compute the minimum edit distance between the words MAMA and MADAAM. CO1 BL5
10 Marks

96. Evaluate the minimum edit distance in transforming the word ‘kitten’ to ‘sitting’ using
insertion, deletion, and substitution cost as 1. CO1 BL5 10 Marks

