Nlpa (PG Aids)
Nlpa (PG Aids)
Course Outcome:
After completion of the Course, Students will be able to:
No Course Outcomes RBT Level*
Understand fundamental mathematical models and algorithms in the field
01 UN
of natural language processing.
02 Apply lexical, syntax and semantic analysis methods on textual data. AP
Apply natural language processing on real world problems of speech
03 AP
recognition, automated question answering, text classification.
04 Analyze the lexical, syntax and semantic analysis methods. AN
Evaluate the performance of natural language processing methods and
05 EV
compare it.
*RM: Remember, UN: Understand, AP: Apply, AN: Analyze, EL: Evaluate, CR: Create
Course Content:
No of % of
Unit Course Content
Hours Weightage
1. Introduction 03 10%
Concept and Use of Natural Language Processing, Ambiguity and
uncertainty in language, The Turing test, Models and Algorithms
2. Regular Expressions, Automata and Finite-State Transducers 14 30%
Basic Regular Expression Patterns, Disjunction, Grouping, and
Precedence, Advanced Operators, Regular Expression Substitution,
Using FSA to Recognize Sheeptalk, Formal Languages,
Nondeterministic FSAs, Using an NFSA to accept strings, Recognition
as Search, Relating Deterministic and Non-deterministic Automata,
Regular Languages and FSAs, Survey of (Mostly) English Morphology,
Inflectional Morphology, Derivational Morphology, The Lexicon and
Morphotactics, Morphological Parsing with Finite-State Transducers,
Orthographic Rules and Finite-State Transducers, Human
Morphological Processing, N-gram Language Models, Words, Corpora,
Text Normalization, Minimum Edit Distance, N-Grams, Evaluating
Language Models, Generalization and Zeros, Smoothing, Kneser-Ney
Smoothing
3. Word Classes and Part-of-Speech Tagging 07 20%
English Word Classes, Tagsets for English, Part of Speech Tagging,
Rule-based Part-of-speech Tagging, Stochastic Part-of-speech Tagging,
(a) Books:
1. Speech and Language Processing: An Introduction to Natural Language Processing,
Computational Linguistics and Speech Recognition by Daniel Jurafsky & James H. Martin
Pearson.
2. Speech and Language Processing by Daniel Jurafsky and James H. Martin Second edition,
Prentice Hall.
3. Natural Language Understanding by Allen James Second edition, Benjamin/Cumming.
4. Statistical Language Learning by Charniack Eugene MIT Press.
5. Foundations of Statistical Natural Language Processing by Manning Christopher and
Heinrich Schutze MIT Press.
The practical work will be carried out based on the content covered during the academic sessions.
Suggested Project List: The subject teacher has to assign the relevant project work to the students in
individual/team.
Suggested Activities for Students: The subject teacher has to assign the outcome based activities to
the students in individual/team.
*********