Bai601 NLP
Bai601 NLP
1
Information Retrieval: Design Features of Information Retrieval Systems, Information Retrieval
Models - Classical, Non-classical, Alternative Models of Information Retrieval - Custer model, Fuzzy
model, LSTM model, Major Issues in Information Retrieval.
Lexical Resources: WordNet, FrameNet, Stemmers, Parts-of-Speech Tagger, Research Corpora.
Textbook 1: Ch. 9, Ch. 12.
MODULE-5
Machine Translation: Language Divergences and Typology, Machine Translation using Encoder-
Decoder, Details of the Encoder-Decoder Model, Translating in Low-Resource Situations, MT
Evaluation, Bias and Ethical Issues.
Textbook 2: Ch. 13.
CIE for the theory component of the IPCC (maximum marks 50)
● IPCC means practical portion integrated with the theory of the course.
● CIE marks for the theory component are 25 marks and that for the practical component is 25
marks.
● 25 marks for the theory component are split into 15 marks for two Internal Assessment Tests (Two
Tests, each of 15 Marks with 01-hour duration, are to be conducted) and 10 marks for other
assessment methods mentioned in 22OB4.2. The first test at the end of 40-50% coverage of the
syllabus and the second test after covering 85-90% of the syllabus.
● Scaled-down marks of the sum of two tests and other assessment methods will be CIE marks for the
theory component of IPCC (that is for 25 marks).
● The student has to secure 40% of 25 marks to qualify in the CIE of the theory component of IPCC.
3
● On completion of every experiment/program in the laboratory, the students shall be evaluated
including viva-voce and marks shall be awarded on the same day.
● The CIE marks awarded in the case of the Practical component shall be based on the continuous
evaluation of the laboratory report. Each experiment report can be evaluated for 10 marks. Marks of
all experiments’ write-ups are added and scaled down to 15 marks.
● The laboratory test (duration 02/03 hours) after completion of all the experiments shall be
conducted for 50 marks and scaled down to 10 marks.
● Scaled-down marks of write-up evaluations and tests added will be CIE marks for the laboratory
component of IPCC for 25 marks.
● The student has to secure 40% of 25 marks to qualify in the CIE of the practical component of the IPCC.
The theory portion of the IPCC shall be for both CIE and SEE, whereas the practical portion will
have a CIE component only. Questions mentioned in the SEE paper may include questions from the
practical component.
Suggested Learning Resources:
Textbook:
1. Tanveer Siddiqui, U.S. Tiwary, “Natural Language Processing and Information Retrieval”,
Oxford University Press.
2. Daniel Jurafsky, James H. Martin, “Speech and Language Processing, An Introduction to
Natural Language Processing, Computational Linguistics, and Speech Recognition”, Pearson
Education, 2023.
Reference Books:
1. Akshay Kulkarni, Adarsha Shivananda, “Natural Language Processing Recipes - Unlocking
Text Data with Machine Learning and Deep Learning using Python”, Apress, 2019.
2. T V Geetha, “Understanding Natural Language Processing – Machine Learning and Deep
Learning Perspectives”, Pearson, 2024.
3. Gerald J. Kowalski and Mark.T. Maybury, “Information Storage and Retrieval systems”,
Kluwer Academic Publishers.
Web links and Video Lectures (e-Resources):
1. https://www.youtube.com/watch?v=M7SWr5xObkA
2. https://youtu.be/02QWRAhGc7g
3. https://www.youtube.com/watch?v=CMrHM8a3hqw
4. https://onlinecourses.nptel.ac.in/noc23_cs45/preview
5. https://archive.nptel.ac.in/courses/106/106/106106211/
4
Activity Based Learning (Suggested Activities in Class)/ Practical Based learning