Natural Language processing-Regular-HO
Natural Language processing-Regular-HO
COURSE HANDOUT
Course Objectives
No Course Objective
CO1 To learn the fundamental concepts and techniques of natural language processing (NLP)
CO2 To learn computational properties of natural languages and the commonly used algorithms
for processing linguistic information
Text Book(s)
T1 Speech and Language processing: An introduction to Natural Language Processing,
Computational Linguistics and speech Recognition by Daniel Jurafsky and James H.
Martin[2nd edition]
1. Introduction
3. Text classifications
4.1.1 N-Grams
4.1.2 Evaluating Language Models
4.1.3 Generalization and Zeros
4.1.4 Unknown Words
4.1.5 Smoothing
5. Lexical Analysis
5.1Lexical semantics
5.2Vector semantics
5.3 Words and vectors
5.4 Cosine for measuring similarity
5.5 TF-IDF : Weighing terms in the vector
5.6 Application of the tf-idf vector model
5.7 Word2vec
5.8 Visualizing Embedding
5.9 Semantic properties of embedding
5.10Bias and Embedding
5.11 Evaluating Vector Models
6. Word Disambiguation
7. Grammar
7.1 Introduction to Markov models and Hidden Markov models
7.2 Part-of-Speech Tagging
7.2.1 The Information Sources in Tagging
7.2.2 Markov Model Taggers
7.2.3 The probabilistic model
7.2.4 The Viterbi algorithm
7.2.5 Variations
7.3 Hidden Markov Model Taggers
7.3.1 Applying HMMs to POS tagging
7.3.2 The effect of initialization on HMM training
7.4 Transformation-Based Learning of Tags
7.4.1 Transformations
7.4.2 The learning algorithm
7.4.3 Relation to other models
7.4.4 Automata
Learning Outcomes:
No Learning Outcomes
LO1 Should have a good understanding of the field of natural language processing.
LO3 Should also understand the how natural language processing is used in Machine
translation and Information extraction.
Course Contents
11-12 Neural Networks and Neural Language Models: Chapter7 T1[3rd edition]
The XOR problem , Feed-Forward Neural Networks
Training Neural Nets, Neural Language Models
Word Disambiguation
Supervised Disambiguation :Bayesian classification
,An information-theoretic approach
Dictionary-Based Disambiguation : Thesaurus-based
disambiguation , Disambiguation based on translations
in a second-language corpus ,One sense per discourse,
one sense per collocation
Unsupervised Disambiguation
Evaluation :Pseudo words, Upper and lower bounds
on performance
Dependency Parsing:
Dependency Relations, Dependency Formalisms, Chapter13 T1[3rd edition]
Dependency Treebanks, Transition-Based Dependency
Parsing, Graph-Based Dependency Parsing ,Evaluation
24-25 Statistical Machine translation : Introduction Chapter 17,18 R2
Approaches, Language models, Word alignment
Translation models :
IBM models, Phrase Based systems, Syntax based
systems, Direct translation models
Example :Chinese Machine translation
31-32 Summary.
Evaluation Scheme
Evaluation Name Type Weight Duration Day, Date, Session,
Component (Quiz, Lab, Project, (Open book, Time
Midterm exam, End Closed book,
semester exam, etc) Online, etc.)
Syllabus for Mid-Semester Test (Closed Book): Topics in Weeks 1-8 (1-18 Hours)
Syllabus for Comprehensive Exam (Open Book): All topics given in plan of study
Evaluation Guidelines:
1. EC-1 consists of either two Assignments or three Quizzes. Announcements regarding the
same will be made in a timely manner.
2. For Closed Book tests: No books or reference material of any kind will be permitted.
Laptops/Mobiles of any kind are not allowed. Exchange of any material is not allowed.
3. For Open Book exams: Use of prescribed and reference text books, in original (not
photocopies) is permitted. Class notes/slides as reference material in filed or bound form is
permitted. However, loose sheets of paper will not be allowed. Use of calculators is permitted
in all exams. Laptops/Mobiles of any kind are not allowed. Exchange of any material is not
allowed.
4. If a student is unable to appear for the Regular Test/Exam due to genuine exigencies, the
student should follow the procedure to apply for the Make-Up Test/Exam. The genuineness of
the reason for absence in the Regular Exam shall be assessed prior to giving permission to
appear for the Make-up Exam. Make-Up Test/Exam will be conducted only at selected exam
centres on the dates to be announced later.
It shall be the responsibility of the individual student to be regular in maintaining the self-study
schedule as given in the course handout, attend the lectures, and take all the prescribed evaluation
components such as Assignment/Quiz, Mid-Semester Test and Comprehensive Exam according to the
evaluation scheme provided in the handout.
</DIV>