0% found this document useful (0 votes)
24 views4 pages

Nlpa (PG Aids)

The document outlines the syllabus for a postgraduate course on Natural Language Processing & Applications at Gujarat Technological University for the academic year 2024-25. It includes course objectives, outcomes, teaching schemes, detailed course content, and suggested learning resources. The course aims to equip students with the skills to develop various natural language processing modules and apply them to real-world problems.

Uploaded by

aphimani
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
24 views4 pages

Nlpa (PG Aids)

The document outlines the syllabus for a postgraduate course on Natural Language Processing & Applications at Gujarat Technological University for the academic year 2024-25. It includes course objectives, outcomes, teaching schemes, detailed course content, and suggested learning resources. The course aims to equip students with the skills to develop various natural language processing modules and apply them to real-world problems.

Uploaded by

aphimani
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 4

GUJARAT TECHNOLOGICAL UNIVERSITY

Program Name: Engineering


Level: PG
Branch: Artificial Intelligence and Data Science
Subject Code: ME02095061
Course/Subject Name: Natural Language Processing & Applications

WEF Academic Year 2024-25


Semester 2
Category of the Course Professional Elective Course

Data Structures and Programming


Prerequisite
Rationale
There is an increasing demand of making the computing
systems to be learning the natural languages. It should also able
to process the different parts of natural languages and respond
to the user in again natural language. The objective of this
course is prepare the learners for developing the different
modules of natural language processing like lexical analysis,
error checking, part-of-speech tagging, syntax checking etc.

Course Outcome:
After completion of the Course, Students will be able to:
No Course Outcomes RBT Level*
Understand fundamental mathematical models and algorithms in the field
01 UN
of natural language processing.
02 Apply lexical, syntax and semantic analysis methods on textual data. AP
Apply natural language processing on real world problems of speech
03 AP
recognition, automated question answering, text classification.
04 Analyze the lexical, syntax and semantic analysis methods. AN
Evaluate the performance of natural language processing methods and
05 EV
compare it.

*RM: Remember, UN: Understand, AP: Apply, AN: Analyze, EL: Evaluate, CR: Create

w.e.f. 2024-25 https://syllabus.gtu.ac.in/ Page 1 of 4


GUJARAT TECHNOLOGICAL UNIVERSITY
Program Name: Engineering
Level: PG
Branch: Artificial Intelligence and Data Science
Subject Code: ME02095061
Course/Subject Name: Natural Language Processing & Applications

Teaching and Examination Scheme:


Total
Teaching Scheme
Credits Assessment Pattern and Marks
(in hours)
(L+T+PR/2) Total
Theory Tutorial/Practical Marks
L T PR C PA/CA PA/CA
ESE (E) ESE (V)
(M) (I)
03 00 02 04 70 30 20 30 150

Course Content:
No of % of
Unit Course Content
Hours Weightage
1. Introduction 03 10%
Concept and Use of Natural Language Processing, Ambiguity and
uncertainty in language, The Turing test, Models and Algorithms
2. Regular Expressions, Automata and Finite-State Transducers 14 30%
Basic Regular Expression Patterns, Disjunction, Grouping, and
Precedence, Advanced Operators, Regular Expression Substitution,
Using FSA to Recognize Sheeptalk, Formal Languages,
Nondeterministic FSAs, Using an NFSA to accept strings, Recognition
as Search, Relating Deterministic and Non-deterministic Automata,
Regular Languages and FSAs, Survey of (Mostly) English Morphology,
Inflectional Morphology, Derivational Morphology, The Lexicon and
Morphotactics, Morphological Parsing with Finite-State Transducers,
Orthographic Rules and Finite-State Transducers, Human
Morphological Processing, N-gram Language Models, Words, Corpora,
Text Normalization, Minimum Edit Distance, N-Grams, Evaluating
Language Models, Generalization and Zeros, Smoothing, Kneser-Ney
Smoothing
3. Word Classes and Part-of-Speech Tagging 07 20%
English Word Classes, Tagsets for English, Part of Speech Tagging,
Rule-based Part-of-speech Tagging, Stochastic Part-of-speech Tagging,

w.e.f. 2024-25 https://syllabus.gtu.ac.in/ Page 2 of 4


GUJARAT TECHNOLOGICAL UNIVERSITY
Program Name: Engineering
Level: PG
Branch: Artificial Intelligence and Data Science
Subject Code: ME02095061
Course/Subject Name: Natural Language Processing & Applications

The Actual Algorithm for HMM tagging, Transformation-Based


Tagging, How TBL rules are applied, How TBL Rules are Learned,
Multiple tags and multiple words, Unknown words, Class-based N-
grams
4. Context-Free Grammars, Semantics and Pragmatics 15 30%
Constituency, Context-Free Rules and Trees, Sentence-Level
Constructions, The Noun Phrase, The Verb Phrase and
Subcategorization, Auxiliaries, Spoken Language Syntax, Grammar
Equivalence & Normal Form, Finite State & Context-Free Grammars,
Grammars & Human Processing, Parsing as Search, Top-Down and
Bottom-Up Parsing with comparison, A Basic Top-down Parser, Adding
Bottom-up Filtering, Problems with the Basic Top-down Parser, Left
Recursion, Ambiguity, Syntax-Driven Semantic Analysis, Attachments
for a Fragment of English, Idioms and Compositionality, Robust
Semantic Analysis, WordNet: A Database of Lexical Relations,
Reference Resolution, Text Coherence, Discourse Structure,
Psycholinguistic Studies of Reference and Coherence
5. Recent Trends in the Domain of Natural Language Processing 06 10%
TOTAL 45 100

Suggested Specification Table with Marks (Theory):


Distribution of Theory Marks (in %)
R Level U Level A Level N Level E Level C Level
10 20 20 20 20 10

w.e.f. 2024-25 https://syllabus.gtu.ac.in/ Page 3 of 4


GUJARAT TECHNOLOGICAL UNIVERSITY
Program Name: Engineering
Level: PG
Branch: Artificial Intelligence and Data Science
Subject Code: ME02095061
Course/Subject Name: Natural Language Processing & Applications

Where R: Remember; U: Understanding; A: Application, N: Analyze and E: Evaluate C: Create (as


per Revised Bloom’s Taxonomy)

Reference/Suggested Learning Resources:

(a) Books:
1. Speech and Language Processing: An Introduction to Natural Language Processing,
Computational Linguistics and Speech Recognition by Daniel Jurafsky & James H. Martin
Pearson.
2. Speech and Language Processing by Daniel Jurafsky and James H. Martin Second edition,
Prentice Hall.
3. Natural Language Understanding by Allen James Second edition, Benjamin/Cumming.
4. Statistical Language Learning by Charniack Eugene MIT Press.
5. Foundations of Statistical Natural Language Processing by Manning Christopher and
Heinrich Schutze MIT Press.

(b) Open source software and website

 Course-related online MOOCs on NPTEL/SWAYAM platform.


 Recently Published papers/articles in reputed journals.

Suggested Course Practical List:

 The practical work will be carried out based on the content covered during the academic sessions.

List of Laboratory/Learning Resources Required: Programming development environment (open


source is encouraged) related to the course content.

Suggested Project List: The subject teacher has to assign the relevant project work to the students in
individual/team.
Suggested Activities for Students: The subject teacher has to assign the outcome based activities to
the students in individual/team.
*********

w.e.f. 2024-25 https://syllabus.gtu.ac.in/ Page 4 of 4

You might also like