Natural Language Processing (NLP) With Deep NLP: From Zero To Hero
Natural Language Processing (NLP) With Deep NLP: From Zero To Hero
Python
Basic Concept of Machine
Learning and Deep Learning
Natural language processing
Natural language processing
ial is a subfield of linguistics, computer
Natural language processingor(NLP)
ut
i nT
science, information aengineering, and artificial intelligence concerned
u s s l
with the interactions
d H between computers and human
ri a (natural)
ha ut o
languages,Fain particular how to program computers i n T to process and
s s a
analyze large amounts of natural languageddata.
H u
h a
Fa
Challenges in natural i a l
language processing frequently involve
or
Tut
i n
speech recognition, anatural language understanding, and natural
u ss
dH
language generation.
ha
Fa
For further assistance, code and slide https://fahadhussaincs.blogspot.com/
YouTube Channel: https://www.youtube.com/channel/UCapJpINJKHzflWwCQ8Kse2g/playlists
Types of NLP
i a l
t o r
Tu
s a in
us a l
H i
h a d
Tu tor
Fa in
s s a
d Hu
h a
Fa
i a l
t o r
Tu
s a in
us
d H
h a
Fa
For further assistance, code and slide https://fahadhussaincs.blogspot.com/
YouTube Channel: https://www.youtube.com/channel/UCapJpINJKHzflWwCQ8Kse2g/playlists
How NLP, DNLP and DL involves in!!!
ri al
Tu to
s a in
us
d H
ah a l
F r i a
ut o
s a in T
u s
d H
a ha
F
i al
ut or
sa in T
u s
d H
a h a
For further assistance, code and slide https://fahadhussaincs.blogspot.com/
F
YouTube Channel: https://www.youtube.com/channel/UCapJpINJKHzflWwCQ8Kse2g/playlists
How NLP, DNLP and DL involves in!!!
i a l
t o r
Tu
s a in
us a l
H i
h a d
Tu tor
Fa in
s s a
d Hu
h a
Fa
i a l
t o r
Tu
s a in
us
d H
h a
Fa
For further assistance, code and slide https://fahadhussaincs.blogspot.com/
YouTube Channel: https://www.youtube.com/channel/UCapJpINJKHzflWwCQ8Kse2g/playlists
Applications
i a l
t o r
Tu
s a in
us a l
H i
h a d
Tu tor
Fa in
s s a
d Hu
h a
Fa
i a l
t o r
Tu
s a in
us
d H
h a
Fa
For further assistance, code and slide https://fahadhussaincs.blogspot.com/
YouTube Channel: https://www.youtube.com/channel/UCapJpINJKHzflWwCQ8Kse2g/playlists
Used by
i a l
t o r
Tu
s a in
us a l
H i
h a d
Tu tor
Fa in
s s a
d Hu
h a
Fa
i a l
t o r
Tu
s a in
us
d H
h a
Fa
For further assistance, code and slide https://fahadhussaincs.blogspot.com/
YouTube Channel: https://www.youtube.com/channel/UCapJpINJKHzflWwCQ8Kse2g/playlists
Thanks
Do Subscribe, like and share
next we will discuss about
Phonology – This science helps to deal with patterns present in the sound and speeches related
to the sound as a physical entity.
Morphology – This science deals with the structure of the words and the systematic relations
between them.
Semantics – This science deals with the literal meaning of the words, phrases as well as
sentences.
Data Processes,
Tokenization?
For further assistance, code and slide https://fahadhussaincs.blogspot.com/
YouTube Channel: https://www.youtube.com/channel/UCapJpINJKHzflWwCQ8Kse2g/playlists
Tokenization
Tokenization is the processorof ial replacing sensitive data with
Tut
unique identification symbols
a i n that retain all the essential
s s l
Hudata without compromising its security.
information about the
d ri a
o
a u t
Fah a in T
s s
d Hu
h a
Fa
i a l
t o r
Tu
s a in
us
d H
h a
Fa
For further assistance, code and slide https://fahadhussaincs.blogspot.com/
YouTube Channel: https://www.youtube.com/channel/UCapJpINJKHzflWwCQ8Kse2g/playlists
Tokenization
There are many library / framework
l for NLP problem solution
o ria
ut
1. Natural Language sToolkit
s a (NLTK) in T
Hu i a l
2. TextBlob a d to r
a h Tu
3. CoreNLP F a i n
uss
4. Gensim d H
h a
5. spaCy Fa
6. polyglot i a l
t o r
Tu
7. scikit–learn a i n
us s
8. Pattern d H
a h a
SoFlets' move to COLAB for practical work...
For further assistance, code and slide https://fahadhussaincs.blogspot.com/
YouTube Channel: https://www.youtube.com/channel/UCapJpINJKHzflWwCQ8Kse2g/playlists
Bag of words
The bag-of-words model is a simplifying representation
used in natural language processing and information
i a l
retrieval (IR). In this, a text (suchtoras a sentence or a
Tu
document) is represented sas a in the bag (multiset) of its
words, disregarding grammar u s and even word order but
d H
h a
Fa
keeping multiplicity.
i a l
u t or
in T
ss a
Hu
h a d
Fa
Deep NLP
Document Frequency: This downscales words that appear a lot across documents.
Inverse Document Frequency (IDF): is a weight indicating how commonly a word is used. The more
frequent its usage across documents, the lower its score. The lower the score, the less important the
word becomes.
For example, the word the appears in almost all English texts and would thus have a very low IDF score
as it carries very little “topic” information. In contrast, if you take the word coffee, while it is common, it’s
not used as widely as the word the. Thus, coffee would have a higher IDF score than the.
TF-IDF: is a numerical statistic that is intended to reflect how important a word is to a document in a
collection or corpus.
fair
men
women
words IDF
men
women
fair
f1 f2 f3
men women fair
Sentence 1
Sentence 2
Sentence 3
https://keras.io/
• Tokenizer API
Logistic regression is used when the response variable is categorical in nature. For
instance, yes/no, true/false, red/green/blue, 1st/2nd/3rd/4th, etc.
Linear regression is used when your response variable is continuous. For instance, weight,
height, number of hours, etc.
Y = mX + C
g(x) = 1 / (1 + e^-x)
For further assistance, code and slide https://fahadhussaincs.blogspot.com/
YouTube Channel: https://www.youtube.com/channel/UCapJpINJKHzflWwCQ8Kse2g/playlists