Module 1 Lecture 1
CSE 6002: Natural Language Processing Techniques
Recap from the Previous Lecture
• Course Handout Discussion
• Project Groups
What is NLP?
● Branch of Artificial Intelligence
Stages of Processing
● Phonetics and phonology
● Morphology
● Lexical Analysis
● Syntactic Analysis
● Semantic Analysis
● Pragmatics and Discourse
Challenges Associated with Phonetics / Speech
● Homophones - Words that sound the same or similar.
○ After Mahatma Gandhi was killed by Godse, India was in mourning. That
did not stop some kids from playing in a park one evening. Someone
asked them, “Why are you playing? It is mourning time now.” One of the
kids replied, “Sir, it is not morning time, it is evening, and we have
just finished our homework!”
● Word boundary - Where to split the words in speech
○ I got a plate.
○ I got up late.
● Disfluency - Fillers such as “ah” and “um”
Morphology
● Word formation from root words and morphemes
○ E.g., number (teacher + s = teachers), gender (lion + ess = lioness),
tense / aspect (listen + ing = listening), etc.
● First step in NLP - extract the morphemes of the given word
● Languages rich in morphology - Dravidian languages (e.g., Kannada,
Tamil, Telugu)
○ Example (Kannada): Maadidhanu - Maadu (root verb) + past tense + masculine singular
● Languages poor in morphology - English
○ Example: Did - Do (root verb) + past tense
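The suffix examples above (teacher + s, lion + ess, listen + ing) can be sketched as a toy rule-based analyzer. This is only an illustrative sketch: the rule table `SUFFIX_RULES`, the root list `ROOTS`, and the function name `analyze` are hypothetical and cover just the words on this slide. An irregular form such as "did" is not handled by suffix stripping and would need a lookup table.

```python
# Toy suffix rules: (suffix, grammatical feature). Hypothetical, for illustration only.
SUFFIX_RULES = [
    ("ess", "feminine"),
    ("ing", "progressive"),
    ("s", "plural"),
]

# Tiny root lexicon, so a suffix is stripped only when a known root remains.
ROOTS = {"teacher", "lion", "listen"}


def analyze(word):
    """Return (root, feature), or (word, None) if no rule applies."""
    word = word.lower()
    if word in ROOTS:
        return word, None
    for suffix, feature in SUFFIX_RULES:
        if word.endswith(suffix):
            root = word[: -len(suffix)]
            if root in ROOTS:
                return root, feature
    return word, None


for w in ["teachers", "lioness", "listening", "did"]:
    print(w, "->", analyze(w))
```

Checking the stripped root against a lexicon keeps the analyzer from splitting words that merely end in "s" or "ing".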
Lexical Analysis
● A word can have several meanings.
● A meaning can be expressed by several words.
Example:
● Where there’s a will…
● There are many relatives
(“will” read as a legal testament rather than as determination)
Lexical Disambiguation
● Part of Speech disambiguation.
○ Love: is it a verb (“I love to eat sushi”) or a noun (“God’s love is so wonderful”)?
● Sense disambiguation.
○ Bank: “I went to the bank on the river to buy fish.” vs. “I went to the bank on
the river to withdraw Rs. 1000.”
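Sense disambiguation for the "bank" example can be sketched by counting overlaps between the sentence and cue words for each sense (in the spirit of Lesk-style overlap methods). Everything here is an assumption for illustration: the sense labels, the hand-picked cue words in `SENSE_CUES`, and the function names. Note that "river" is deliberately not a cue, because it appears in both example sentences and so cannot separate the senses.

```python
# Hand-picked cue words per sense (hypothetical, not taken from any real lexicon).
SENSE_CUES = {
    "river_bank": {"fish", "water", "boat", "shore"},
    "financial_bank": {"withdraw", "deposit", "money", "rs", "loan"},
}


def tokenize(sentence):
    """Lowercase the sentence and strip trailing punctuation from each word."""
    return {w.strip(".,").lower() for w in sentence.split()}


def disambiguate(sentence):
    """Pick the sense whose cue words overlap most with the sentence."""
    context = tokenize(sentence)
    return max(SENSE_CUES, key=lambda sense: len(context & SENSE_CUES[sense]))


print(disambiguate("I went to the bank on the river to buy fish."))
print(disambiguate("I went to the bank on the river to withdraw Rs. 1000"))
```

With no overlapping cue at all, `max` simply returns the first sense, so a real system would need a fallback such as the most frequent sense.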
Syntactic Analysis
• Consider the sentence “I like mangoes”. Its parse tree:
S
  NP
    N: I
  VP
    V: like
    NP
      N: mangoes
Syntactic Analysis
● S -> NP VP
● NP -> N
● VP -> V NP
● N -> Noun (mangoes) / Pronoun (I)
● V -> Verb (like)
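The grammar above can be run as a small top-down parser. This is a minimal sketch, assuming the last rule is read as V -> Verb (like) so that VP -> V NP can be expanded; the names `GRAMMAR`, `LEXICON`, `parse`, `parse_sequence`, and `parse_sentence` are hypothetical. It works for this grammar because the grammar has no left-recursive rules.

```python
# Phrase-structure rules from the slide.
GRAMMAR = {
    "S": [["NP", "VP"]],
    "NP": [["N"]],
    "VP": [["V", "NP"]],
}

# Lexical rules: which words each preterminal can produce.
LEXICON = {
    "N": {"i", "mangoes"},
    "V": {"like"},
}


def parse(symbol, tokens, pos):
    """Yield (tree, next_pos) for every way `symbol` can start at tokens[pos]."""
    if symbol in LEXICON:
        if pos < len(tokens) and tokens[pos].lower() in LEXICON[symbol]:
            yield (symbol, tokens[pos]), pos + 1
        return
    for rhs in GRAMMAR[symbol]:
        for children, end in parse_sequence(rhs, tokens, pos):
            yield (symbol, *children), end


def parse_sequence(symbols, tokens, pos):
    """Yield (list_of_trees, next_pos) for a sequence of symbols."""
    if not symbols:
        yield [], pos
        return
    for tree, mid in parse(symbols[0], tokens, pos):
        for rest, end in parse_sequence(symbols[1:], tokens, mid):
            yield [tree] + rest, end


def parse_sentence(sentence):
    """Return all parse trees that cover the whole sentence."""
    tokens = sentence.split()
    return [tree for tree, end in parse("S", tokens, 0) if end == len(tokens)]


print(parse_sentence("I like mangoes"))
```

Trees are nested tuples of the form `(label, child, ...)`. A sentence the grammar cannot derive, such as "like I mangoes", yields an empty list.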
Ambiguity in Parsing
● Natural Language Ambiguity:
I saw a boy with a telescope.
(Who has the telescope?)
● Design Ambiguity:
“I saw a boy with a telescope which I dropped.” vs. “I saw a boy with a
telescope which he dropped.”
(Would a probabilistic parser generate the same parse tree for both?)
Semantic Analysis
● Semantic Analysis involves assigning semantic roles to entities in the
text.
Example: John gave the book to Mary.
Agent: John, Recipient: Mary, Object / Theme: the book, etc.
● Semantic ambiguity:
Example: Visiting people involves a lot of work.
(Is it the act of visiting people, or people who are visiting?)
Pragmatics and Discourse
● Study of contexts in which language is used.
○ Example: Coreference Resolution.
● Very hard problem. It requires successful (or at least satisfactory)
solutions to the earlier stages of processing.
● Disambiguation clues need not be present within the same
sentence, but can be present anywhere in the text!
History of NLP
The Imitation Game
ELIZA
Georgetown Experiment
ALPAC Report
Syntactic Structures and Conceptual Dependency Theory
From Rules to Data
● Starting in the 1980s, NLP moved from rule-based systems to statistical
systems, driven by the growing availability of data.
● With data, we can use probability theory to build reasonably robust
systems for language modeling, machine translation, etc.
○ Example: Which one is correct in each pair and why?
■ I saw an elephant. vs. I saw an equipment.
■ An European war is currently going on. vs. A European war is currently going on.
■ Tell me something. vs. Say me something.
● A statistical model can make each of these choices by comparing the
probabilities it assigns to the alternatives, as estimated from data.
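The "Tell me something" vs. "Say me something" pair can be sketched with a bigram language model using add-one smoothing. This is a toy illustration under stated assumptions: the five-sentence `CORPUS` is invented for this example, and the names `tokens` and `log_prob` are hypothetical. A real model would be estimated from a large corpus.

```python
import math
from collections import Counter

# Invented toy corpus (an assumption for illustration, not real data).
CORPUS = [
    "tell me something",
    "tell me a story",
    "tell me the truth",
    "say something nice",
    "say it again",
]


def tokens(sentence):
    """Lowercase, split on whitespace, and prepend a start-of-sentence marker."""
    return ["<s>"] + sentence.lower().split()


bigrams = Counter()   # counts of (previous word, word) pairs
contexts = Counter()  # counts of each word appearing as the previous word
vocab = set()
for sentence in CORPUS:
    toks = tokens(sentence)
    vocab.update(toks)
    for w1, w2 in zip(toks, toks[1:]):
        bigrams[(w1, w2)] += 1
        contexts[w1] += 1


def log_prob(sentence):
    """Log-probability of a sentence under the add-one smoothed bigram model."""
    toks = tokens(sentence)
    total = 0.0
    for w1, w2 in zip(toks, toks[1:]):
        p = (bigrams[(w1, w2)] + 1) / (contexts[w1] + len(vocab))
        total += math.log(p)
    return total


for s in ["tell me something", "say me something"]:
    print(s, log_prob(s))
```

"Tell me something" scores higher because the bigram "tell me" occurs in the corpus, while "say me" never does and receives only the smoothed probability.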
Example of Machine Translation
Research Activities in NLP
Organizations for Research in NLP
Publication Fora in NLP - Journals
Publication Fora in NLP - Conferences
Publication Fora in NLP - Workshops
Research Areas in NLP (from ACL 2022 CFP)
● Dialogue and Interactive Systems
● Discourse and Pragmatics
● Ethics and NLP
● Generation
● Information Retrieval and Text Mining
● Interpretability and Analysis of Models for NLP
● Linguistic Theories, Cognitive Modeling, and Psycholinguistics
● Machine Translation and Multilinguality
● Multimodality
● NLP Applications
● Phonology, Morphology, and Word Segmentation
● Question Answering
● Resources and Evaluation
● Semantics
● Sentiment Analysis, Stylistic Analysis, and Argument Mining
● Speech
● Summarization
● Tagging, Chunking and Parsing
Where to find research in NLP