Introduction to Natural Language Processing
INSTRUCTOR: DR. GULSHAN SALEEM
COURSE CODE: CS AL4253
Research Profile

1. Google Scholar Profile

2. ResearchGate Profile
3. MPVIR Group (https://sites.google.com/view/mpvir)
4. Projects:
o Leaf classification
o Disease classification
o Anomaly detection
o Object Detection and Tracking
o Security Surveillance

Learning Objectives
•By the end of this lecture, students will:
•Understand what Natural Language Processing (NLP) is.
•Learn about the history of NLP.
•Discover modern-day applications of NLP (chatbots, translation,
etc.).
What is NLP
Natural Language Processing (NLP) is a field at the intersection of
computer science, artificial intelligence, and linguistics.
It enables computers to understand, interpret, and generate human
language.
Goal: To bridge the gap between human communication and
computer understanding.
Why is NLP Important?
•Automates the processing and understanding of large volumes of
natural language data.
•Improves human-computer interaction through voice assistants,
search engines, and chatbots.
•Used in diverse industries: healthcare, finance, customer service, etc.
Brief History of NLP
1950s: Alan Turing’s "Turing Test" for machine intelligence.
1960s: Early work on machine translation (MT) and rule-based
systems.
1980s: Statistical models began to emerge (n-grams, hidden Markov
models).
1990s: Rise of machine learning in NLP.
2010s: Deep learning and neural networks revolutionized NLP (e.g.,
Word2Vec, BERT, GPT).
Communication with Machines
Conversational Agents
Core Components of NLP
•Text Preprocessing: Tokenization, lemmatization, stemming.
•Syntax and Parsing: POS tagging, dependency parsing.
•Semantics: Word meaning, embeddings.
•Applications: Text classification, sentiment analysis, machine
translation.
NLP vs. AI vs. ML

•Artificial Intelligence (AI): General field focused on making machines
"intelligent."
•Machine Learning (ML): Subset of AI focused on learning from data.
•NLP: Subfield of AI focused on language understanding and
generation.
Modern-Day Applications of NLP
•Chatbots and Virtual Assistants (e.g., Siri, Alexa): Enable human-
computer conversations.
•Machine Translation (e.g., Google Translate): Converts text from one
language to another.
•Sentiment Analysis: Identifies the opinion or emotion expressed in
text such as social media posts and reviews.
•Text Summarization: Summarizes large documents automatically.
•Speech Recognition (e.g., Speech-to-Text): Converts spoken words
into text.
Chatbots Example
•Definition: Chatbots simulate human conversation using text or
voice.
•Applications: Customer service, healthcare, educational tools.
•Example: Conversational AI like ChatGPT or Alexa.
Machine Translation Example
•Definition: Automatically translates text or speech from one
language to another.
•Applications: Breaking language barriers, real-time translation
services.
•Example: Google Translate and DeepL.
Machine Translation
Key Challenges in NLP
•Ambiguity: Words or sentences can have multiple meanings.
•Context: Understanding the context is difficult for machines (e.g.,
sarcasm, idioms).
•Data and Ethics: Bias in language models, lack of labeled data, and
privacy concerns.
Why Is NLP Hard?
NLP Datasets
•IMDb Reviews: A dataset for sentiment analysis containing movie reviews labeled as
positive or negative.
•20 Newsgroups: A collection of approximately 20,000 newsgroup documents, useful for
text classification and topic modeling.
•SMS Spam Collection: A set of SMS messages labeled as spam or not spam, ideal for
binary classification tasks.
•Sentiment140: A dataset of 1.6 million tweets labeled for sentiment (positive or negative
in the training set; neutral tweets appear only in the small test set), widely used for
sentiment analysis.
•Common Crawl: A massive web archive that can be used for various NLP tasks like
language modeling or text generation.
•Wikipedia Dump: A raw dump of Wikipedia articles, great for unsupervised learning
tasks or building language models.
•Quora Question Pairs: A dataset of questions from Quora, labeled as duplicate or not,
useful for semantic similarity and paraphrase detection.
•Amazon Reviews: A collection of product reviews across various categories, useful for
sentiment analysis and recommendation systems.
•Enron Email Dataset: A dataset of emails from the Enron corporation, useful for tasks
like text classification and named entity recognition.
•TREC Question Classification: A dataset of questions categorized into various
classes, great for question classification tasks.
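A binary classification task such as spam detection can be sketched with a small bag-of-words Naive Bayes classifier. This is a minimal illustration only: the four training messages are invented here and are not taken from the SMS Spam Collection, and the names TRAIN, train, and predict are hypothetical.

```python
import math
from collections import Counter, defaultdict

# Invented toy messages for illustration (not from any real dataset).
TRAIN = [
    ("win a free prize now", "spam"),
    ("free cash offer win", "spam"),
    ("are we meeting for lunch", "ham"),
    ("see you at the meeting", "ham"),
]

def train(examples):
    """Count words per label and documents per label."""
    word_counts = defaultdict(Counter)
    doc_counts = Counter()
    for text, label in examples:
        doc_counts[label] += 1
        word_counts[label].update(text.split())
    vocab = set()
    for counts in word_counts.values():
        vocab.update(counts)
    return word_counts, doc_counts, vocab

def predict(text, model):
    """Return the label with the highest log-probability score."""
    word_counts, doc_counts, vocab = model
    total_docs = sum(doc_counts.values())
    scores = {}
    for label in doc_counts:
        # Log prior: fraction of training messages with this label.
        score = math.log(doc_counts[label] / total_docs)
        total_words = sum(word_counts[label].values())
        for word in text.split():
            if word not in vocab:
                continue  # words never seen in training are ignored
            # Laplace (add-one) smoothing avoids zero probabilities.
            score += math.log(
                (word_counts[label][word] + 1) / (total_words + len(vocab))
            )
        scores[label] = score
    return max(scores, key=scores.get)

model = train(TRAIN)
print(predict("free prize", model))     # spam
print(predict("lunch meeting", model))  # ham
```

With a real dataset, the same idea would be trained on thousands of labeled messages and evaluated on held-out data.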
Sentence Segmentation Example
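A minimal sketch of sentence segmentation, assuming that a sentence ends at ".", "!" or "?" followed by whitespace and a capital letter. The function name split_sentences and the sample text are illustrative only.

```python
import re

def split_sentences(text):
    """Split text into sentences with a simple punctuation rule."""
    # The lookbehind keeps the end punctuation attached to its sentence;
    # the lookahead requires the next sentence to start with a capital letter.
    parts = re.split(r"(?<=[.!?])\s+(?=[A-Z])", text.strip())
    return [part for part in parts if part]

text = "NLP is fun. It has many applications! Do you agree?"
for sentence in split_sentences(text):
    print(sentence)
```

The rule is deliberately naive: an abbreviation such as "Dr. Smith" is wrongly split after "Dr.", which is one reason practical segmenters rely on abbreviation lists or trained models.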
Word Tokenization Example
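A minimal sketch of word tokenization using a regular expression. It assumes that words are runs of letters or digits (optionally with an apostrophe, as in contractions) and that every punctuation mark is its own token. The function name tokenize is illustrative.

```python
import re

def tokenize(text):
    """Split text into word and punctuation tokens."""
    # \w+(?:'\w+)? keeps contractions such as "isn't" together;
    # [^\w\s] emits each punctuation mark as a separate token.
    return re.findall(r"\w+(?:'\w+)?|[^\w\s]", text)

print(tokenize("NLP isn't easy, but it's fun!"))
# ['NLP', "isn't", 'easy', ',', 'but', "it's", 'fun', '!']
```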
Part of Speech
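Part-of-speech tagging assigns a grammatical category to each token. The sketch below is a toy tagger built from a small hand-written lexicon plus suffix rules. The lexicon, the rules, and the function name pos_tag are all assumptions made for illustration; the tag names are borrowed from the Universal POS label set. Practical taggers are trained on annotated corpora.

```python
# Hypothetical mini-lexicon for illustration only.
LEXICON = {
    "the": "DET", "a": "DET", "an": "DET",
    "is": "AUX", "are": "AUX",
    "on": "ADP", "in": "ADP", "of": "ADP",
    "cat": "NOUN", "mat": "NOUN",
}

def pos_tag(tokens):
    """Tag each token by lexicon lookup, then suffix rules, then a default."""
    tagged = []
    for token in tokens:
        word = token.lower()
        if word in LEXICON:
            tag = LEXICON[word]
        elif word.endswith("ly"):
            tag = "ADV"       # many adverbs end in -ly
        elif word.endswith(("ing", "ed")):
            tag = "VERB"      # common verb endings
        else:
            tag = "NOUN"      # default guess for unknown words
        tagged.append((token, tag))
    return tagged

print(pos_tag("The cat is sleeping quietly on a mat".split()))
```

Suffix rules break easily: this sketch tags "bed" as a verb because it ends in "ed", which shows why context-aware models are needed.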
Lemmatization
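Lemmatization maps a word to its dictionary form, while stemming only strips suffixes. The sketch below contrasts the two using a tiny hand-written table of irregular forms and a few suffix rules. The table, the rules, and the function names lemmatize and stem are assumptions for illustration; practical lemmatizers use a full vocabulary and morphological analysis.

```python
# Hypothetical table of irregular forms for illustration only.
IRREGULAR = {
    "is": "be", "are": "be", "was": "be", "were": "be",
    "went": "go", "mice": "mouse", "children": "child",
}

def lemmatize(word):
    """Return a dictionary form using the table, then simple plural rules."""
    w = word.lower()
    if w in IRREGULAR:
        return IRREGULAR[w]
    if w.endswith("ies") and len(w) > 4:
        return w[:-3] + "y"          # studies -> study
    if w.endswith("s") and not w.endswith("ss") and len(w) > 3:
        return w[:-1]                # cats -> cat
    return w

def stem(word):
    """Strip the first matching suffix; the result need not be a real word."""
    w = word.lower()
    for suffix in ("ing", "ed", "es", "s"):
        if w.endswith(suffix) and len(w) - len(suffix) >= 3:
            return w[: -len(suffix)]
    return w

for word in ["studies", "were", "mice", "cats"]:
    print(word, lemmatize(word), stem(word))
```

For "studies" the lemmatizer returns "study" while the stemmer returns "studi", which illustrates the difference between the two techniques.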
Named Entity Recognition
•People’s names.
•Company names.
•Geographical locations.
•Product names.
•Date and time.
•Amount of money.
•Events.
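Some of the entity types above follow regular surface patterns. The sketch below finds two of them, amounts of money and dates, with regular expressions. It assumes dollar amounts and dates written as "Month D, YYYY"; the patterns, the function name find_entities, and the sample sentence are illustrative. Names of people, companies, and places do not follow such patterns and usually require trained models or lists of known names.

```python
import re

MONTHS = ("January|February|March|April|May|June|July|"
          "August|September|October|November|December")

# Assumed surface patterns for two entity types.
PATTERNS = {
    "MONEY": re.compile(r"\$\d+(?:,\d{3})*(?:\.\d+)?"),
    "DATE": re.compile(r"\b(?:" + MONTHS + r") \d{1,2}, \d{4}\b"),
}

def find_entities(text):
    """Return (entity_text, label) pairs in order of appearance."""
    found = []
    for label, pattern in PATTERNS.items():
        for match in pattern.finditer(text):
            found.append((match.start(), match.group(), label))
    found.sort()  # order by position in the text
    return [(entity, label) for _, entity, label in found]

print(find_entities("The laptop cost $1,299.99 on March 3, 2021."))
# [('$1,299.99', 'MONEY'), ('March 3, 2021', 'DATE')]
```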
Coreference Resolution
San Pedro is a town on the southern part of the island of Ambergris Caye in
the Belize District of the nation of Belize, in Central America. According to
2015 mid-year estimates, the town has a population of about 16,444. It is
the second-largest town in the Belize District and largest in the Belize Rural
South constituency.
Here, a human reader knows that the ‘It’ opening the third sentence stands
for San Pedro. A computer cannot easily tell that the two tokens refer to the
same entity, because it processes each sentence separately. Pronouns are
very frequent in English text, so working out what each one refers to is a
recurring difficulty.
Conclusion
•NLP is at the heart of modern applications like chatbots, machine
translation, and text analysis.
•The field has evolved from rule-based systems to machine learning
and deep learning approaches.
Text Book
Daniel Jurafsky and James H. Martin. 2008. Speech and Language Processing: An Introduction
to Natural Language Processing, Computational Linguistics and Speech Recognition. Prentice
Hall 2nd/3rd Edition
http://www.cs.colorado.edu/~martin/slp.html
Great Overview of the Field, explanations of
techniques, algorithms, etc.
Christopher D. Manning and Hinrich Schütze. 1999. Foundations of Statistical Natural
Language Processing. MIT Press
Natural Language Processing with Python
By Steven Bird, Ewan Klein, and Edward Loper
http://www.nltk.org/book
Downloadable open source programs to try out various
Useful Readings
Look at projects at Stanford:
http://web.stanford.edu/class/cs224n/

Other useful links

http://aclweb.org/
http://www.cs.vassar.edu/sigann/
Programming Language
Why Python is better-suited:
easy to learn, clean syntax, powerful features
becoming increasingly popular in CompLinguistics!
Extensive tutorials, CompLing support, toolkits, data, etc.
References
Chowdhary, K., & Chowdhary, K. R. (2020). Natural language
processing. Fundamentals of artificial intelligence, 603-649.
Daniel Jurafsky and James H. Martin. 2018. Speech and Language
Processing: An Introduction to Natural Language Processing. Third
Edition. Prentice Hall
Thank You 
