NLP Syllabus
NLP Syllabus
CO1 :: describe the key concepts, applications, and challenges of Natural Language Processing
(NLP).
CO3 :: experiment and fine-tune deep learning models (RNN, LSTM, BERT, GPT) using
TensorFlow/Keras to perform tasks like sentiment analysis, text classification, and text
generation.
CO4 :: analyze different word representation methods and parsing techniques to analyze
sentence structures and semantic meaning.
Unit I
Introduction to Natural Language Processing : Definition, applications and use cases of NLP,
Steps in NLP, Finding structure of Words, sentences, syntax, and semantics, Phonetics, morphology,
pragmatics, phonology, Natural Language processing with NLTK
Unit II
Language Models and Text Processing : Text Preprocessing with Python: Reading & Writing files,
string manipulation, Regular Expression, Text Cleaning, Tokenization using NLTK, Text Analysis
Libraries of Text Processing, Native Python, Textblob, Pandas, Stop-word removal, stemming, and
lemmatization., Word Representations: Bag of Words (BoW), TF-IDF using sklearn
Unit III
Syntax and Parsing Techniques : Grammar Formalisms: Context-Free Grammar (CFG), Chomsky
Normal Form, Parsing Techniques: Implementing parsers in Python using libraries like nltk,
Applications of Parsing: Sentence structure analysis, Semantic role labeling
Unit IV
Semantic Analysis : Lexical Semantics in Python, Discourse Semantics in Python, Semantic
Similarity: Ontology and Knowledge Representation, Semantic Web libraries in Python (e.g., RDFLib),
Implementing sentiment analysis using Python libraries (TextBlob, VADER)
Unit V
Advanced NLP Techniques : Machine Learning in NLP: Supervised and unsupervised methods using
sklearn, Deep Learning in NLP: Recurrent Neural Networks (RNNs), Long Short-Term Memory (LSTM)
using TensorFlow/Keras, Transformers and Transformers library, Pre-trained Models: Fine-tuning
BERT, GPT System Overview, Text Generation: Basics of Natural Language Generation (NLG) with
Python
Unit VI
Prompt Engineering : Core Principles, Types of prompts, Comman Challenges & Solutions, Advance
Techniques: Zero-Shot Learning, Few-Shot Learning, COT (Change of Thought), Reinforcement
through iteration
• Perform sentiment analysis on a dataset (e.g., movie reviews) using TextBlob and VADER. Compare
the results of both libraries.
• Use Python's rdflib library to create and query a simple ontology. Demonstrate basic RDF and
SPARQL operations.
• Implement a Recurrent Neural Network (RNN) or LSTM to classify text sequences (e.g., classify
movie reviews as positive or negative) using TensorFlow/Keras.
• Use Transformers to fine-tune a BERT or GPT model for a text classification task (e.g., spam email
detection).
Text Books:
1. UNDERSTANDING NATURAL LANGUAGE PROCESSING by T V GEETHA, PEARSON
References:
1. MACHINE LEARNING by TOM M. MITCHELL, Mc Graw Hill Education