
NLP – Natural Language Processing
• I. Machine learning in Natural Language Processing
• II. Word2Vec
• III. Transformers
Part 1 Machine learning in Natural Language Processing
Several Options
Bag of words with randomly assigned numbers (each word is mapped to an arbitrary index):

1 3 8 7 2 4 15 10 0 789 92 34 47 71 79

TF-IDF
Explanation:
TF (term frequency): the ratio of the number of times the word appears in a document to the total number of words in that document.
IDF (inverse document frequency): lowers the weight of words that appear in many documents.
[Figure: two bar charts over 16 vocabulary words – raw frequency vs. TF-IDF weight]
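As a quick illustration (not from the slides), a minimal sketch computing TF-IDF weights with scikit-learn's TfidfVectorizer, using the two example sentences from the bag-of-words part of this deck:

# A hedged TF-IDF sketch using scikit-learn (not the exact code from the slides).
from sklearn.feature_extraction.text import TfidfVectorizer

docs = [
    "The blue car have two red doors",
    "On the blue doors there are poster of red car",
]

vectorizer = TfidfVectorizer()          # TF-IDF weighting instead of raw counts
X = vectorizer.fit_transform(docs)      # sparse matrix: documents x vocabulary

print(vectorizer.get_feature_names_out())
print(X.toarray().round(2))             # words shared by both documents get lower weights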
NLP problem: how to rate a review?
Two machine-learning algorithms:
• Naive Bayes
• Decision tree
Naive Bayes
Naive Bayes (example from NLP)
• Rate reviews:
• “it’s beautiful !”
[Figure: example reviews with their ratings]
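A hedged sketch of the review-rating idea with scikit-learn's Multinomial Naive Bayes; the extra reviews and star labels below are invented for illustration:

# A minimal Naive Bayes review-rating sketch (scikit-learn); labels are hypothetical.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline

reviews = ["it's beautiful !", "I hated it", "really great movie", "terrible, waste of time"]
ratings = [5, 1, 5, 1]                      # hypothetical star ratings used as class labels

model = make_pipeline(CountVectorizer(), MultinomialNB())
model.fit(reviews, ratings)                 # learns P(word | rating) from word counts
print(model.predict(["beautiful movie"]))   # expected to predict the 5-star class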
Reminder – Confusion Matrix – Accuracy

Rows: actual Positive / Negative; columns: predicted Positive / Negative (cells: TP, FN, FP, TN)

Accuracy = (TP + TN) / (TP + FN + FP + TN)


Reminder – Accuracy (continued)
Decision tree
[Figure: decision tree on a weather example – the outlook splits into rainy / overcast / sunny, with further yes / no splits leading to “play” or “not play” leaves]
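A hedged sketch of a decision tree on a toy weather dataset like the one in the figure above; the numeric encoding of the features is an assumption for illustration (scikit-learn):

# A minimal decision-tree sketch on a toy weather dataset (feature encoding invented).
from sklearn.tree import DecisionTreeClassifier, export_text

# outlook: 0 = sunny, 1 = overcast, 2 = rainy ; windy: 0 = no, 1 = yes
X = [[0, 0], [0, 1], [1, 0], [1, 1], [2, 0], [2, 1]]
y = ["play", "not play", "play", "play", "play", "not play"]

tree = DecisionTreeClassifier().fit(X, y)
print(export_text(tree, feature_names=["outlook", "windy"]))  # shows the learned splits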
NLP – Natural Language Processing
• I. Machine learning in Natural Language Processing
• II. Word2Vec
• III. Transformers
NLP – Natural Language Processing
• Bag-of-words
• Word2Vec
• I. The Goal
• II. Implementation
• Reminder :
• Linear Regression
• Cost Function
• Neural Network
• Training of the Network

• Skip-Gram (SG) / Continuous Bag Of Word (CBOW)


Bag of Words
Example:
• 1) The blue car have two red doors
• 2) On the blue doors, there are poster of red car
Vocabulary : ( 12 words)
[“the“, “blue“, “car“ , “have“, “two“, “red“, “doors“ , “on“ , “there“, “are“, “poster“, “of“ ]

One Hot Encoding :


“the“ => [1,0,0,0,0,0,0,0,0,0,0,0] , “blue“=> [0,1,0,0,0,0,0,0,0,0,0,0]

“two“=> [0,0,0,0,1,0,0,0,0,0,0,0] , “red“=> [0,0,0,0,0,1,0,0,0,0,0,0] ,

“poster“=> [0,0,0,0,0,0,0,0,0,0,1,0] , “of“=> [0,0,0,0,0,0,0,0,0,0,0,1]
Bag of Words

Vocabulary : ( 12 words)
[“the“, “blue“, “car“ , “have“, “two“, “red“, “doors“ , “on“ , “there“, “are“,
“poster“, “of“ ]
• 1) The blue car have two red doors => [1,1,1,1,1,1,1,0,0,0,0,0]
• 2) On the blue doors, there are poster of red car => [1,1,1,0,0,1,1,1,1,1,1,1]
Bag of Words
• [1,1,1,1,1,1,1,0,0,0,0,0]
• [1,1,1,0,0,1,1,1,1,1,1,1]
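A small sketch (plain Python) that reproduces the two binary bag-of-words vectors above from the fixed 12-word vocabulary:

# Minimal bag-of-words sketch: presence/absence vectors over a fixed vocabulary.
import re

vocab = ["the", "blue", "car", "have", "two", "red", "doors",
         "on", "there", "are", "poster", "of"]

def bow_vector(sentence):
    words = set(re.findall(r"[a-z]+", sentence.lower()))   # presence only, not counts
    return [1 if w in words else 0 for w in vocab]

print(bow_vector("The blue car have two red doors"))
# -> [1, 1, 1, 1, 1, 1, 1, 0, 0, 0, 0, 0]
print(bow_vector("On the blue doors, there are poster of red car"))
# -> [1, 1, 1, 0, 0, 1, 1, 1, 1, 1, 1, 1]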
Bag of Words
• Weaknesses:
- word order is lost: “John likes Mary” ~= “Mary likes John” (possible fix: n-grams?)
- the meaning of the underlying words is ignored: “King” != “Queen”, with no notion of how similar they are


NLP – Natural Language Processing
• Bag-of-words
• Word2Vec
• I. The Goal
• II. Implementation
• Reminder :
• Linear Regression
• Cost Function
• Neural Network
• Training of the Network

• Skip-Gram (SG) / Continuous Bag Of Word (CBOW)


Word2Vec (Word To Vector)- The Goal
Similarity
Word2Vec (Word To Vector)- The Goal
BOW (bag of words):
Apple    [1, 0, 0]
Mango    [0, 1, 0]
Elephant [0, 0, 1]

Word2Vec-style embedding (is_fruit, is_animal, is_eatable):
Apple    [0.9,  0.01, 1]
Mango    [0.85, 0.02, 1]
Elephant [0.1,  0.9,  1]
NLP – Natural Language Processing
• Bag-of-words
• Word2Vec
• I. The Goal
• II. Implementation
• Reminder :
• Linear Regression
• Cost Function
• Neural Network
• Training of the Network

• Skip-Gram (SG) / Continuous Bag Of Word (CBOW)


Linear Regression
[Figure: dependent variable Y plotted against independent variable X, with a fitted line]

Linear model: Y = slope · X + intercept (bias)
NLP – Natural Language Processing
• Bag-of-words
• Word2Vec
• I. The Goal
• II. Implementation
• Reminder :
• Linear Regression
• Cost Function
• Neural Network
• Training of the Network

• Skip-Gram (SG) / Continuous Bag Of Word (CBOW)


Error Function
[Figure: errors between the data points and the fitted line, along X]
Error Function – SSE

Sum of Squared Errors (SSE) = ½ Σ (Actual House Price – Predicted House Price)²
                            = ½ Σ (Y – Y_pred)²
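A hedged sketch (numpy, toy data) of fitting the linear model by minimising the SSE cost above with gradient descent:

# Fitting y = w*x + b by gradient descent on the SSE cost (toy data, invented values).
import numpy as np

X = np.array([1.0, 2.0, 3.0, 4.0])         # e.g. house size
Y = np.array([2.0, 4.1, 6.0, 8.2])         # e.g. house price

w, b, lr = 0.0, 0.0, 0.01                  # slope, intercept, learning rate
for _ in range(5000):
    Y_pred = w * X + b
    sse = 0.5 * np.sum((Y - Y_pred) ** 2)  # the cost from the slide
    w -= lr * np.sum((Y_pred - Y) * X)     # dSSE/dw
    b -= lr * np.sum(Y_pred - Y)           # dSSE/db
print(round(w, 2), round(b, 2), round(sse, 3))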
NLP – Natural Language Processing
• Bag-of-words
• Word2Vec
• I. The Goal
• II. Implementation
• Reminder :
• Linear Regression
• Cost Function
• Neural Network
• Training of the Network

• Skip-Gram (SG) / Continuous Bag Of Word (CBOW)


Neural Network
Solution: a neural network
• We had a single neuron and an input
• We add one more neuron (or more) to the output layer
• We add (at least) one hidden layer
• The result is a neural network

• In this network all the neurons are connected to one another (we will see other structures later)
NLP – Natural Language Processing
• Bag-of-words
• Word2Vec
• I. The Goal
• II. Implementation
• Reminder :
• Linear Regression
• Cost Function
• Neural Network
• Training of the Network

• Skip-Gram (SG) / Continuous Bag Of Word (CBOW)


Forward Propagation

https://www.youtube.com/watch?v=lGLto9Xd7bU
Backpropagation

https://www.youtube.com/watch?v=GJXKOrqZauk
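As a complement to the videos, a hedged numpy sketch of one forward pass and one backpropagation update for a tiny fully connected network like the one described above (layer sizes and data are invented):

# One forward pass + one gradient-descent update for a one-hidden-layer network.
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

rng = np.random.default_rng(0)
x = rng.random(3)                          # one input example, 3 features
t = np.array([0.0, 1.0])                   # target output, 2 classes
W1, b1 = rng.random((4, 3)), np.zeros(4)   # hidden layer: 4 neurons
W2, b2 = rng.random((2, 4)), np.zeros(2)   # output layer: 2 neurons
lr = 0.1

# forward propagation
h = sigmoid(W1 @ x + b1)
y = sigmoid(W2 @ h + b2)
loss = 0.5 * np.sum((y - t) ** 2)          # SSE, as in the earlier slides

# backpropagation (chain rule, layer by layer)
delta2 = (y - t) * y * (1 - y)             # gradient at the output layer
delta1 = (W2.T @ delta2) * h * (1 - h)     # gradient at the hidden layer
W2 -= lr * np.outer(delta2, h); b2 -= lr * delta2
W1 -= lr * np.outer(delta1, x); b1 -= lr * delta1
print(round(loss, 4))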
NLP – Natural Language Processing
• Bag-of-words
• Word2Vec
• I. The Goal
• II. Implementation
• Reminder :
• Linear Regression
• Cost Function
• Neural Network
• Training of the Network

• Continuous Bag Of Word (CBOW) / Skip-Gram (SG)


Continuous Bag of Words (CBOW) / Skip-Gram (SG)
Continuous Bag of Words (CBOW)
• Example: “Pineapples are spiky and yellow”
• Input: the context words; Output: the target word
• Works best with BIG data
Skip-Gram (SG)
• Example: “Pineapples are spiky and yellow”
• Input: the target word; Output: the context words
• Works well even with SMALL data

Skip-Gram (SG) – Input: target word, Output: context words
• “Pineapples are spiky and yellow” – for a given target word, the surrounding words form its context
Skip-Gram (SG)
Similarity
Example: “natural language processing and machine learning is fun and exciting”

Is_fruit is_animal is_eatable


Apple [ 0.9 , 0.01 , 1 ]
Mango [ 0.85 , 0.02 , 1 ]
Elephant [ 0.1 , 0.9 , 1 ]
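A hedged sketch of training skip-gram embeddings with the gensim library (hyperparameters and toy sentences chosen for illustration, not taken from the slides):

# Training Word2Vec skip-gram embeddings on two toy sentences with gensim.
from gensim.models import Word2Vec

sentences = [
    ["natural", "language", "processing", "is", "fun", "and", "exciting"],
    ["pineapples", "are", "spiky", "and", "yellow"],
]

model = Word2Vec(sentences, vector_size=50, window=2, sg=1, min_count=1)  # sg=1 -> skip-gram
print(model.wv["fun"][:5])                   # first values of the learned 50-dim vector
print(model.wv.most_similar("fun", topn=3))  # nearest words by cosine similarity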
NLP – Natural Language Processing
• I. Machine learning in Natural Language Processing
• II. Word2Vec
• III. Transformers
Part 3

Transformers
Is it possible to be more efficient than Word2Vec?
• Word2Vec embeddings do not take into account the word position.
• The Transformer model explicitly takes as input the position (index) of each word in the sentence.
• This gives embeddings that allow us to have multiple (more than one) vector (numeric) representations for the same word.

Example: “John likes Mary” ~= “Mary likes John”

Transformers
• Transformers structure
• The encoder:
• Input Embedding
• Positional Encoding
• Multi-Head Attention
• Add & Norm
• Feed Forward

• The Decoder
Transformers
• Transformers structure
• The encoder:
• Input Embedding
• Positional Encoding
• Multi-Head Attention
• Add & Norm
• Feed Forward

• The Decoder
Transformers structure
Encoder
Transformers
• Transformers structure
• The encoder:
• Input Embedding
• Positional Encoding
• Multi-Head Attention
• Add & Norm
• Feed Forward

• The Decoder
The encoder
Transformers
• Transformers structure
• The encoder:
• Input Embedding
• Positional Encoding
• Multi-Head Attention
• Add & Norm
• Feed Forward

• The Decoder
Step 1: Input Embedding
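A hedged sketch of step 1: each token id is looked up in a learned embedding table (here a random numpy matrix stands in for the learned weights; vocabulary and sizes are invented):

# Input embedding as a lookup table: one d_model-dimensional vector per word.
import numpy as np

vocab = {"john": 0, "likes": 1, "mary": 2}
d_model = 8                                    # the original Transformer uses 512
rng = np.random.default_rng(0)
embedding_table = rng.standard_normal((len(vocab), d_model))  # learned during training

sentence = ["john", "likes", "mary"]
token_ids = [vocab[w] for w in sentence]
embedded = embedding_table[token_ids]          # shape (3, d_model): one vector per word
print(embedded.shape)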
Transformers
• Transformers structure
• The encoder:
• Input Embedding
• Positional Encoding
• Multi-Head Attention
• Add & Norm
• Feed Forward

• The Decoder
Step 2: Positional Encoding

For the positional encoding (PE), for each position pos and each dimension i of the d_model = 512 word-embedding vector:

PE(pos, 2i)   = sin(pos / 10000^(2i / d_model))
PE(pos, 2i+1) = cos(pos / 10000^(2i / d_model))
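A small numpy sketch of the sinusoidal encoding above, with a reduced d_model for readability:

# Sinusoidal positional encoding: sin on even dimensions, cos on odd dimensions.
import numpy as np

def positional_encoding(max_len, d_model):
    pe = np.zeros((max_len, d_model))
    pos = np.arange(max_len)[:, None]              # positions 0 .. max_len-1
    i = np.arange(0, d_model, 2)                   # even dimension indices (= 2i)
    angle = pos / np.power(10000, i / d_model)     # pos / 10000^(2i / d_model)
    pe[:, 0::2] = np.sin(angle)
    pe[:, 1::2] = np.cos(angle)
    return pe

print(positional_encoding(max_len=4, d_model=8).round(2))   # one row per word position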
Transformers
• Transformers structure
• The encoder:
• Input Embedding
• Positional Encoding
• Multi-Head Attention
• Add & Norm
• Feed Forward
• The Decoder
Step 3: Multi-Head Attention
• Three matrices (queries, keys, values), initialized randomly; their optimal values are computed (learned) during training.
• A word is usually related to itself more than to the other words.
Attention Matrix
[Figure: attention weights for an example – does “come back” refer to the dog? to the food?]
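A hedged numpy sketch of single-head scaled dot-product attention, Attention(Q, K, V) = softmax(QKᵀ / √d_k) · V; the Q, K, V matrices below are random stand-ins for the three learned matrices mentioned above:

# Scaled dot-product attention for a 4-word sentence (single head, random Q, K, V).
import numpy as np

def softmax(x):
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

rng = np.random.default_rng(0)
n_words, d_k = 4, 8                        # 4 words in the sentence, key dimension 8
Q = rng.standard_normal((n_words, d_k))    # queries
K = rng.standard_normal((n_words, d_k))    # keys
V = rng.standard_normal((n_words, d_k))    # values

scores = Q @ K.T / np.sqrt(d_k)            # how much each word attends to every other word
weights = softmax(scores)                  # the attention matrix (rows sum to 1)
output = weights @ V                       # weighted mix of the value vectors
print(weights.round(2))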
Transformers
• Transformers structure
• The encoder:
• Input Embedding
• Positional Encoding
• Multi-Head Attention
• Add & Norm
• Feed Forward
• The Decoder
Step 4: Add & Norm
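A minimal sketch of the Add & Norm step: a residual connection followed by layer normalisation (numpy, without the learned scale and shift parameters; the vectors below are invented):

# Add & Norm: add the sub-layer output to its input, then normalise each vector.
import numpy as np

def layer_norm(x, eps=1e-6):
    mean = x.mean(axis=-1, keepdims=True)
    std = x.std(axis=-1, keepdims=True)
    return (x - mean) / (std + eps)

x = np.array([[0.5, -1.0, 2.0, 0.1]])             # sub-layer input (e.g. embeddings + PE)
sublayer_out = np.array([[0.2, 0.3, -0.5, 1.0]])  # e.g. multi-head attention output
print(layer_norm(x + sublayer_out))               # residual connection, then normalisation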
Transformers
• Transformers structure
• The encoder:
• Input Embedding
• Positional Encoding
• Multi-Head Attention
• Add & Norm
• Feed Forward
• The Decoder
Step 5: Feed Forward
• Two neural (fully connected) layers
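A sketch of the position-wise feed-forward block: two linear layers with a ReLU in between (numpy, toy sizes; the original paper uses d_model = 512 and d_ff = 2048):

# Position-wise feed-forward: expand to d_ff with ReLU, then project back to d_model.
import numpy as np

rng = np.random.default_rng(0)
d_model, d_ff = 8, 32                      # reduced sizes for readability
W1, b1 = rng.standard_normal((d_model, d_ff)), np.zeros(d_ff)
W2, b2 = rng.standard_normal((d_ff, d_model)), np.zeros(d_model)

x = rng.standard_normal((4, d_model))      # 4 word positions
hidden = np.maximum(0, x @ W1 + b1)        # first layer + ReLU
out = hidden @ W2 + b2                     # second layer, back to d_model
print(out.shape)                           # (4, 8): same shape as the input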
Transformers
• Transformers structure
• The encoder:
• Input Embedding
• Positional Encoding
• Multi-Head Attention
• Add & Norm
• Feed Forward
• The Decoder
The Decoder

The original Transformer was trained on a 4.5-million-sentence-pair English–German dataset and a 36-million-sentence English–French dataset.
