Natural Language Processing (NLP) Roadmap
Natural Language Processing (NLP) Roadmap
What is NLP?
NLP Applications
Challenges in NLP
Linguistics Overview (Syntax, Semantics, Pragmatics)
Components of NLP Pipeline
Tokenization
o Word Tokenization
o Sentence Tokenization
Text Cleaning
o Removing Punctuation, Numbers, Special Characters
o Lowercasing
o Removing Stopwords
Stemming
Lemmatization
Part-of-Speech (POS) Tagging
Named Entity Recognition (NER)
Sentence Segmentation
Stop Words Removal
Text Normalization (Unicode, ASCII)
Information Retrieval-based QA
Machine Learning-based QA
Extractive QA (e.g., BERT-based models)
Generative QA (e.g., T5, GPT)
Extractive Summarization
o TF-IDF, TextRank
Abstractive Summarization
o Seq2Seq Models
o Transformer-based Models (BERT, T5)
Evaluation Metrics (ROUGE)
1.11 Dialogue Systems and Chatbots
Rule-based Chatbots
Retrieval-based Chatbots
Generative-based Chatbots
Conversational AI Models (GPT, BERT, T5)
Intent Detection and Slot Filling
Audio Preprocessing
Feature Extraction (MFCC, Mel Spectrogram)
Acoustic Models (HMM, DNN)
End-to-End Speech Recognition (Deep Speech, Wav2Vec)
Introduction to Segmentation
Pixel-wise Classification
U-Net Architecture
Fully Convolutional Networks (FCN)
DeepLabV3
Mask R-CNN
Panoptic Segmentation
Spatio-Temporal Features
3D CNN (3D Convolutional Networks)
RNN/LSTM for Temporal Sequence Modeling
I3D (Inflated 3D ConvNet)
Two-Stream Networks (Optical Flow + RGB)