CM321 NLP Syllabus
Course Objectives:
The main objectives of this course are to:
1. Understand the representations and layers of NLP.
2. Represent sentences as parse trees and describe NER and its challenges.
3. Use and create sentiment lexicons and Q&A systems.
4. Describe LLMs and extractive summarization across the three generations of NLP.
Course Outcomes:
After successful completion of the course, the students will be able to:
CO 1: Explain the probabilistic formulation of POS tagging and Viterbi decoding.
CO 2: Execute the steps of top-down parsing, bottom-up parsing, and NER on example sentences.
CO 3: Analyze approaches to semantic analysis and question answering for natural language sentences.
CO 4: Distinguish between extractive and abstractive summarization, and describe LLMs.
Course Content:
UNIT I (CO1)
Introduction: Language and Linguistics, Ambiguity and Layers of NLP, Grammar, Probability,
and Data, Generations of NLP.
Representation and NLP: Ambiguity and Representations, Generation 1: Belongingness via
Grammars, Generation 2: Discrete Representational Semantics, Generation 3: Dense
Representations.
Shallow Parsing: Part-of-Speech Tagging, Statistical POS Tagging, Neural POS Tagging,
Chunking.
UNIT II (CO2)
Deep Parsing: Linguistics of Parsing, Algorithmics of Parsing, Constituency Parsing: Rule
Based, Statistical Parsing, Dependency Parsing, Neural Parsing.
Named Entity Recognition: Problem Formulation, Ambiguity in Named Entity Recognition,
Datasets, First Generation: Rule-Based Approaches, Second Generation: Probabilistic
Models, Third Generation: Sentence Representations and Position-Wise Labelling.
Natural Language Inference: Ambiguity in NLI, Problem Formulation, Datasets, First
Generation: Logical Reasoning, Second Generation: Alignment, Third Generation: Neural
Approaches.
UNIT IV (CO4)
Conversational AI: Problem Definition, Ambiguity Resolution in Conversational AI,
Rule-Based Approaches to Conversational AI, Statistical Approaches, Neural Approaches.
Summarization: Ambiguity in Text Summarization, Problem Definitions, Early Work,
Summarization Using Machine Learning.
Large Language Models: Background, Ambiguity Resolution, Generative LLMs, Usage of
LLMs.
Learning Resources:
Text Book: