Module 1.1

Lecture Notes for Natural Language Processing. Topics: a brief history of natural language processing; language challenges; applications; classical vs. statistical vs. deep learning-based approaches; basic concepts in linguistic data structure (morphology, syntax, semantics, pragmatics); tokenized text and pattern matching; recognizing names; stemming; tagging.


Brief History of NLP

o 1950s: Alan Turing proposed the Turing Test to evaluate a machine's ability to exhibit intelligent behavior equivalent to, or indistinguishable from, that of a human.
o 1960s: Development of early NLP systems such as ELIZA, a computer program by Joseph Weizenbaum that simulated conversation.
o 1970s-1980s: Introduction of rule-based systems like SHRDLU, drawing on formal grammar theory such as the Chomsky hierarchy.
o 1990s: Statistical approaches began to dominate NLP, utilizing probabilistic models to handle large corpora of text data.
o 2000s: The rise of machine learning techniques led to significant advancements in NLP, such as statistical machine translation and large-scale text classification.
o 2010s-Present: Development of powerful deep learning models like Word2Vec, GloVe, BERT, and GPT, which have revolutionized NLP tasks such as language translation, sentiment analysis, and text generation.

Language Challenges in NLP

o Ambiguity: Words and sentences can have multiple meanings.
 Example: "The farmer went to the bank." (Is "bank" referring to the side of a river or a financial institution?)
o Context: Understanding the context is crucial for accurate interpretation.
 Example: "He banked the plane" vs. "He went to the bank."
o Sarcasm and Irony: Detecting sarcasm and irony can be challenging.
 Example: "Oh, great! Another homework assignment."
o Diverse Syntax and Grammar: Different languages have different syntax and grammar rules.
 Example: English follows Subject-Verb-Object (SVO) order ("She eats an apple"), while Japanese follows Subject-Object-Verb (SOV) order ("Kanojo wa ringo o taberu", literally "She an apple eats").
o Idioms and Phrases: Recognizing and interpreting idiomatic expressions.
 Example: "Kick the bucket" meaning "to die."

Applications of NLP

o Machine Translation: Translating text from one language to another.
 Example: Google Translate translating "Hello, world!" into Spanish as "¡Hola, mundo!"
o Sentiment Analysis: Determining the sentiment (positive, negative, neutral) of a text.
 Example: Analyzing product reviews to determine customer satisfaction.
o Chatbots: Automated systems that interact with users via text or speech.
 Example: Customer support chatbots like those used by banks or online retailers.
o Information Retrieval: Extracting relevant information from large datasets.
 Example: Search engines like Google retrieving relevant web pages based on user queries.
o Speech Recognition: Converting spoken language into text.
 Example: Voice assistants like Siri, Alexa, and Google Assistant.

Classical vs. Statistical vs. Deep Learning-based NLP

 Classical NLP:
o Rule-based Approaches: Utilize hand-crafted rules to process language.
 Example: Parsing sentences using grammar rules.
o Manual Feature Engineering: Involves defining specific linguistic features for analysis.
 Example: Identifying parts of speech (POS) using predefined rules.

 Statistical NLP:
o Probabilistic Models: Use statistical methods to model and predict language patterns.
 Example: Hidden Markov Models (HMMs) for POS tagging.
o Large Amounts of Data: Relies on extensive corpora to learn patterns.
 Example: Using n-grams to predict the next word in a sentence.
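The n-gram idea above can be sketched in plain Python: count which word follows which, then predict the most frequent follower. The tiny corpus below is invented purely for illustration.

```python
from collections import Counter, defaultdict

def train_bigrams(corpus):
    """Count word pairs so we can predict the most likely next word."""
    counts = defaultdict(Counter)
    for sentence in corpus:
        words = sentence.lower().split()
        for prev, nxt in zip(words, words[1:]):
            counts[prev][nxt] += 1
    return counts

def predict_next(counts, word):
    """Return the most frequent word seen after `word`, or None if unseen."""
    followers = counts.get(word.lower())
    return followers.most_common(1)[0][0] if followers else None

# toy corpus (made up for this sketch)
corpus = [
    "the cat sat on the mat",
    "the cat ate the fish",
    "the dog sat on the rug",
]
model = train_bigrams(corpus)
print(predict_next(model, "the"))  # "cat" follows "the" most often here
```

Real statistical language models use far larger corpora, longer n-grams, and smoothing for unseen word pairs, but the counting principle is the same.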

 Deep Learning-based NLP:
o Neural Networks: Employ deep neural networks to learn from raw text data.
 Example: Recurrent Neural Networks (RNNs) for sequence prediction.
o End-to-End Learning: Models can learn to perform tasks directly from data without explicit feature engineering.
 Example: Transformers like BERT and GPT for various NLP tasks.
Basic Concepts in Linguistic Data Structure

 Morphology:
o Study of word structure and formation.
o Example: Analyzing the root, prefix, and suffix of words like "unhappiness" (un- + happy + -ness).

 Syntax:
o Rules that govern sentence structure.
o Example: English follows Subject-Verb-Object (SVO) order: "She (S) loves (V) music (O)."

 Semantics:
o Meaning of words and sentences.
o Example: Understanding that "bark" can refer to the sound a dog makes or the outer covering of a tree.

 Pragmatics:
o Contextual use of language.
o Example: Interpreting "Can you pass the salt?" as a request rather than a question about ability.
Tokenized Text and Pattern Matching

o Tokenization: Splitting text into individual tokens (words or sentences).
o Example:
 Input Text: "Natural Language Processing is fascinating."
 Tokenized Text: ['Natural', 'Language', 'Processing', 'is', 'fascinating', '.']
 Explanation: The sentence is divided into individual words and punctuation marks.
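A minimal word tokenizer can be sketched with Python's standard re module (libraries such as NLTK and spaCy provide more robust tokenizers that handle contractions, abbreviations, and so on):

```python
import re

def tokenize(text):
    """Split text into word tokens and single punctuation marks."""
    # \w+ matches runs of letters/digits; [^\w\s] matches one punctuation char
    return re.findall(r"\w+|[^\w\s]", text)

print(tokenize("Natural Language Processing is fascinating."))
# ['Natural', 'Language', 'Processing', 'is', 'fascinating', '.']
```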

o Pattern Matching: Identifying patterns within tokenized text using regular expressions.
o Example:
 Input Text: "The quick brown fox jumps over the lazy dog."
 Pattern: Words with exactly 4 letters.
 Matched Words: ['over', 'lazy']
 Explanation: The pattern identifies words that are exactly four letters long within the sentence ("quick" has five letters, so it does not match).
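The four-letter-word pattern corresponds directly to a regular expression, where \b marks a word boundary:

```python
import re

text = "The quick brown fox jumps over the lazy dog."
# \b...\b ensures we match whole words; \w{4} requires exactly four word characters
four_letter_words = re.findall(r"\b\w{4}\b", text)
print(four_letter_words)  # ['over', 'lazy']
```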

Recognizing Names

o Named Entity Recognition (NER): Identifies proper nouns and classifies them as people, organizations, etc.
o Example:
 Input Text: "Barack Obama was the 44th President of the United States."
 Recognized Entities:
 'Barack Obama' as PERSON
 '44th President' as TITLE
 'United States' as GPE (Geopolitical Entity)
 Explanation: The NER system identifies and categorizes names and titles within the text.
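Production NER systems (for example, spaCy's trained pipelines) use statistical models to find and classify entities. As a toy illustration of the candidate-finding step only, a capitalization heuristic can pick out runs of capitalized words; note that it cannot classify entities and mishandles many cases (sentence-initial words, lowercase names, and here it splits "44th President" and picks up the bare word "President"):

```python
import re

def candidate_names(text):
    """Crude sketch: runs of capitalized words are treated as name candidates.
    Real NER uses trained models, not this heuristic."""
    return re.findall(r"\b[A-Z][a-z]+(?:\s+[A-Z][a-z]+)*", text)

print(candidate_names("Barack Obama was the 44th President of the United States."))
# ['Barack Obama', 'President', 'United States']
```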

Stemming and Lemmatization

 Stemming:
o Reduces words to their base form by removing prefixes or suffixes.
o Example:
 Input Words: ['running', 'jumps', 'easily', 'fairly']
 Stemmed Words: ['run', 'jump', 'easili', 'fairli']
 Explanation: The words are reduced to their root forms, which may not always be meaningful.
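The stems above are what the Porter stemmer (available as nltk.stem.PorterStemmer) produces. Its core idea, stripping suffixes by rule, can be sketched in a few lines of plain Python; this toy version has only a handful of rules, so its outputs differ slightly from Porter's:

```python
def simple_stem(word):
    """Toy suffix-stripping stemmer (a sketch, not the Porter algorithm)."""
    for suffix in ("ing", "ly", "es", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            word = word[: -len(suffix)]
            break
    # undo consonant doubling left by -ing removal, e.g. "runn" -> "run"
    if len(word) > 2 and word[-1] == word[-2] and word[-1] not in "aeiou":
        word = word[:-1]
    return word

print([simple_stem(w) for w in ["running", "jumps", "easily", "fairly"]])
# ['run', 'jump', 'easi', 'fair']
```

Like real stemmers, it can produce non-words ("easi"): stemming trades linguistic correctness for speed and simplicity.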

 Lemmatization:
o Reduces words to their meaningful base form using vocabulary and morphological analysis.
o Example:
 Input Words: ['running', 'jumps', 'easily', 'fairly']
 Lemmatized Words: ['run', 'jump', 'easy', 'fair']
 Explanation: The words are reduced to their base or dictionary forms, ensuring they remain meaningful.
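Because lemmatization relies on a vocabulary (NLTK's WordNetLemmatizer, for instance, consults WordNet), its simplest form is a dictionary lookup. The tiny table below is hand-made for this one example; a real lemmatizer combines a full dictionary with morphological rules:

```python
# tiny hand-made lemma table (illustration only; a real lemmatizer
# uses a full dictionary such as WordNet plus morphological analysis)
LEMMAS = {
    "running": "run",
    "jumps": "jump",
    "easily": "easy",
    "fairly": "fair",
}

def lemmatize(word):
    """Look the word up; fall back to the word itself if unknown."""
    return LEMMAS.get(word.lower(), word)

print([lemmatize(w) for w in ["running", "jumps", "easily", "fairly"]])
# ['run', 'jump', 'easy', 'fair']
```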

Tagging Parts of Speech

o POS Tagging: Assigns part-of-speech tags to each word in a sentence.
o Example:
 Input Text: "The quick brown fox jumps over the lazy dog."
 POS Tags: [('The', 'DT'), ('quick', 'JJ'), ('brown', 'JJ'), ('fox', 'NN'), ('jumps', 'VBZ'), ('over', 'IN'), ('the', 'DT'), ('lazy', 'JJ'), ('dog', 'NN')]
 Explanation: Each word is tagged with its corresponding part of speech, such as determiner (DT), adjective (JJ), noun (NN), verb (VBZ), and preposition (IN).

Constituent Structure

o Constituent Structure Analysis: Breaks down sentences into their sub-parts (constituents).
o Example:
 Input Text: "The quick brown fox jumped over the lazy dog."
 Constituent Structure:
  Sentence (S)
   Noun Phrase (NP): "The quick brown fox"
    Determiner (DT): "The"
    Adjectives (JJ): "quick", "brown"
    Noun (NN): "fox"
   Verb Phrase (VP): "jumped over the lazy dog"
    Verb (VBD): "jumped"
    Prepositional Phrase (PP): "over the lazy dog"
     Preposition (IN): "over"
     Noun Phrase (NP): "the lazy dog"
      Determiner (DT): "the"
      Adjective (JJ): "lazy"
      Noun (NN): "dog"
 Explanation: The sentence is parsed into a hierarchical structure, showing the relationships between words and phrases.
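The parse tree above maps naturally onto nested tuples in Python (NLTK's nltk.Tree class offers the same idea with parsing and drawing support). Reading the leaves left to right recovers the original sentence:

```python
# (label, child, child, ...) nested tuples; leaves are (tag, word) pairs
TREE = ("S",
        ("NP", ("DT", "The"), ("JJ", "quick"), ("JJ", "brown"), ("NN", "fox")),
        ("VP", ("VBD", "jumped"),
               ("PP", ("IN", "over"),
                      ("NP", ("DT", "the"), ("JJ", "lazy"), ("NN", "dog")))))

def leaves(node):
    """Collect the words at the fringe of the tree, left to right."""
    if len(node) == 2 and isinstance(node[1], str):  # (tag, word) leaf
        return [node[1]]
    words = []
    for child in node[1:]:  # node[0] is the label
        words.extend(leaves(child))
    return words

print(" ".join(leaves(TREE)))
# The quick brown fox jumped over the lazy dog
```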
