08 Natural Language Processing in TensorFlow

The document discusses natural language processing (NLP) and provides examples of common NLP problems and modeling approaches. It outlines steps for modeling text data, including tokenization, embedding, and using recurrent neural networks (RNNs). It proposes experiments using models like LSTM, GRU, CNNs, and TensorFlow Hub pretrained feature extractors on a text classification task and evaluating results with metrics like accuracy and precision. The document appears to be instructional material for an NLP modeling workshop that will cover preprocessing text, building models, and evaluating performance.


Natural Language Processing (NLP) with TensorFlow

Where can you get help?
“If in doubt, run the code”

• Follow along with the code


• Try it for yourself
• Press SHIFT + CMD + SPACE to read the docstring
• Search for it
• Try again
• Ask (don’t forget the Discord chat!) (yes, including the “dumb” questions)
“What is an NLP problem?”

Example NLP problems and NLU… (natural language understanding)
“What tags should this article have?”

Machine learning
Representation learning
Artificial intelligence

(multiple label options per sample)

Source: http://karpathy.github.io/2015/05/21/rnn-effectiveness/

Classification, Text Generation, Machine Translation, Voice Assistants

(these are also referred to as sequence problems)
Other sequence problems

Source: http://karpathy.github.io/2015/05/21/rnn-effectiveness/
Other sequence problems
Image captioning

Input: (image)
Output: “A sledgehammer leaning up against a tire”

Source: http://karpathy.github.io/2015/05/21/rnn-effectiveness/
Other sequence problems
Sentiment analysis

Input: (text)
Output: Positive 👍

Source: http://karpathy.github.io/2015/05/21/rnn-effectiveness/
Other sequence problems
Time series forecasting

Input: (historical price data)
Output: Price at next timestamp (e.g. $59,678)

Sources: https://www.coindesk.com/price/bitcoin, http://karpathy.github.io/2015/05/21/rnn-effectiveness/
Other sequence problems
Machine Translation

Input → Output

Source: http://karpathy.github.io/2015/05/21/rnn-effectiveness/
What we’re going to cover
(broadly)
• Downloading and preparing a text dataset

• How to prepare text data for modelling (tokenization and embedding)

• Setting up multiple modelling experiments with recurrent neural networks (RNNs)

• Building a text feature extraction model using TensorFlow Hub

• Finding the most wrong prediction examples

• Using a model we’ve built to make predictions on text from the wild

👩‍🍳 👩‍🔬
(we’ll be cooking up lots of code!)

How:
NLP inputs and outputs

“Is this Tweet about a disaster or not?”

Input: Tweet text, with actual outputs Disaster 🌪 or Not Disaster 👌

Numerical encoding (Tokenization + Embedding)
(often already exists, if not, you can build one):

[[0.22, 0.98, 0.02…],
 [0.09, 0.55, 0.87…],
 [0.53, 0.81, 0.79…],
 …]

Predicted output (comes from looking at lots of these):

[[0.97, 0.03],
 [0.81, 0.19],
 …]
Input and output shapes
(for a text classification example)
We’re going to be building RNNs/CNNs/feature extractors to do this part!

Input: text (gets represented as a tensor/embedding)
Shape = [batch_size, embedding_size]
Shape = [None, 512] or Shape = [32, 512]
(32 is a very common batch size)

Output: 👌 🌪 [0.99, 0.01] (prediction probabilities)
Shape = [2]

These will vary depending on the problem you’re working on/what embedding style you use.
Steps in modelling with TensorFlow

1. Turn all data into numbers (neural networks can’t handle text/natural language)
2. Make sure all of your tensors are the right shape (pad sequences which don’t fit) — a short sketch of both steps follows below
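For instance, both steps can be handled with tf.keras’s TextVectorization layer, which tokenizes text and pads every sequence to a fixed length. This is a minimal sketch; the vocabulary size, sequence length, and example sentences are assumptions, not the course’s exact settings.

```python
import tensorflow as tf

# Hypothetical example sentences (stand-ins for the real text dataset)
sentences = ["I love TensorFlow", "Natural language processing is fun"]

# Step 1: turn text into numbers (tokenization)
# Step 2: output_sequence_length pads/truncates so every tensor has the same shape
text_vectorizer = tf.keras.layers.TextVectorization(
    max_tokens=10000,           # cap the vocabulary size
    output_mode="int",          # map each token to an integer
    output_sequence_length=8    # pad/truncate every sequence to length 8
)
text_vectorizer.adapt(sentences)   # build the vocabulary from the text

print(text_vectorizer(sentences))  # shape (2, 8), shorter sequences padded with 0
```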
“What is a recurrent neural
network (RNN)?”
Architecture of an RNN (typical)*
(what we’re working towards building)

(example output: Not Disaster 👌)

*Note: there is an almost unlimited number of ways you could stack together a recurrent neural network; this slide demonstrates only one.
Let’s code!
Tokenization vs Embedding
“I love TensorFlow” → 0 1 2 (I = 0, love = 1, TensorFlow = 2)

Tokenization — straight mapping from token to number (can be modelled but quickly gets too big)

One-hot encoding:
[[1, 0, 0],
 [0, 1, 0],
 [0, 0, 1],
 …]

Embedding — richer representation of relationships between tokens (can limit size + can be learned)

Embedding:
[[0.492, 0.005, 0.019],
 [0.060, 0.233, 0.899],
 [0.741, 0.983, 0.567],
 …]
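To make the contrast concrete, here is a small sketch (the vocabulary size of 10,000 and embedding size of 128 are arbitrary assumptions) showing a one-hot encoding next to a learnable embedding layer:

```python
import tensorflow as tf

tokens = tf.constant([0, 1, 2])  # "I love TensorFlow" after tokenization

# One-hot encoding: each token becomes a vector as wide as the vocabulary
print(tf.one_hot(tokens, depth=3))  # [[1, 0, 0], [0, 1, 0], [0, 0, 1]]

# Embedding: each token becomes a dense vector whose values are learned during training
embedding = tf.keras.layers.Embedding(input_dim=10000,  # vocabulary size
                                      output_dim=128)   # embedding size
print(embedding(tokens).shape)  # (3, 128)
```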
Experiments we’re running
Experiment Number | Model
0 | Naive Bayes with TF-IDF encoder (baseline)
1 | Feed-forward neural network (dense model)
2 | LSTM (RNN)
3 | GRU (RNN)
4 | Bidirectional-LSTM (RNN)
5 | 1D Convolutional Neural Network
6 | TensorFlow Hub Pretrained Feature Extractor
7 | TensorFlow Hub Pretrained Feature Extractor (10% of data)
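The baseline (experiment 0) needs only a few lines of scikit-learn. The sentences and labels below are made-up placeholders standing in for the disaster Tweets data:

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import Pipeline

# Made-up placeholder data (1 = disaster, 0 = not disaster)
train_sentences = ["a huge fire is spreading near the city",
                   "this new song is fire, love it"]
train_labels = [1, 0]

model_0 = Pipeline([
    ("tfidf", TfidfVectorizer()),   # turn text into TF-IDF features
    ("clf", MultinomialNB()),       # classify with Multinomial Naive Bayes
])
model_0.fit(train_sentences, train_labels)
print(model_0.predict(["flood warning issued for the river valley"]))
```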


Classification evaluation methods (some common)


Key: tp = True Positive, tn = True Negative, fp = False Positive, fn = False Negative

Metric Name | Formula | Code | When to use

Accuracy | Accuracy = (tp + tn) / (tp + tn + fp + fn) | tf.keras.metrics.Accuracy() or sklearn.metrics.accuracy_score() | Default metric for classification problems. Not the best for imbalanced classes.

Precision | Precision = tp / (tp + fp) | tf.keras.metrics.Precision() or sklearn.metrics.precision_score() | Higher precision leads to fewer false positives.

Recall | Recall = tp / (tp + fn) | tf.keras.metrics.Recall() or sklearn.metrics.recall_score() | Higher recall leads to fewer false negatives.

F1-score | F1-score = 2 · (precision · recall) / (precision + recall) | sklearn.metrics.f1_score() | Combination of precision and recall, usually a good overall metric for a classification model.

Confusion matrix | NA | Custom function or sklearn.metrics.confusion_matrix() | When comparing predictions to truth labels to see where the model gets confused. Can be hard to use with large numbers of classes.
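A small helper like the one below (a sketch, not taken from the slides) computes the four scalar metrics in one go; the y_true/y_pred values are made up:

```python
from sklearn.metrics import accuracy_score, precision_score, recall_score, f1_score

def calculate_results(y_true, y_pred):
    """Return accuracy, precision, recall and F1-score for binary predictions."""
    return {
        "accuracy": accuracy_score(y_true, y_pred),
        "precision": precision_score(y_true, y_pred),
        "recall": recall_score(y_true, y_pred),
        "f1": f1_score(y_true, y_pred),
    }

# Made-up ground truth labels and model predictions
print(calculate_results(y_true=[1, 0, 1, 1, 0], y_pred=[1, 0, 0, 1, 0]))
```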
Architecture of an RNN
(coloured block edition)

Standard RNN
Types of RNN cells

Name | When to use | Learn more | Code

LSTM (long short-term memory) | Default RNN layer for sequence problems. | Understanding LSTM Networks by Chris Olah | tf.keras.layers.LSTM

GRU (gated recurrent unit) | Performs very similarly to an LSTM (could be used as a default). | Illustrated Guide to LSTM’s and GRU’s by Michael Phi | tf.keras.layers.GRU

Bidirectional LSTM (goes forwards and backwards on a sequence) | Good for sequences which may benefit from passing forwards and backwards (e.g. translation or longer passages of text). | Same as above | tf.keras.layers.Bidirectional
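As a rough sketch of how these cells drop into a Keras model (layer sizes, sequence length, and vocabulary size here are assumptions rather than the course’s exact values):

```python
import tensorflow as tf

# Assumed text-to-number front-end (remember to adapt the vectorizer on training text first)
text_vectorizer = tf.keras.layers.TextVectorization(max_tokens=10000,
                                                    output_sequence_length=15)
embedding = tf.keras.layers.Embedding(input_dim=10000, output_dim=128)

inputs = tf.keras.layers.Input(shape=(1,), dtype=tf.string)
x = embedding(text_vectorizer(inputs))

# Swap in whichever RNN cell you're experimenting with:
x = tf.keras.layers.LSTM(64)(x)                                   # LSTM
# x = tf.keras.layers.GRU(64)(x)                                  # GRU
# x = tf.keras.layers.Bidirectional(tf.keras.layers.LSTM(64))(x)  # Bidirectional LSTM

outputs = tf.keras.layers.Dense(1, activation="sigmoid")(x)  # disaster / not disaster
model = tf.keras.Model(inputs, outputs)
model.compile(loss="binary_crossentropy", optimizer="adam", metrics=["accuracy"])
```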
Architecture of a Sequence Conv1D Model
(coloured block edition)

Conv1D Sequence model
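A comparable sketch of the 1D convolutional version (filter count and kernel size are assumptions); the convolution slides across the sequence dimension of the token embeddings:

```python
import tensorflow as tf

# Same assumed text-to-number front-end as in the RNN sketch above
text_vectorizer = tf.keras.layers.TextVectorization(max_tokens=10000,
                                                    output_sequence_length=15)
embedding = tf.keras.layers.Embedding(input_dim=10000, output_dim=128)

inputs = tf.keras.layers.Input(shape=(1,), dtype=tf.string)
x = embedding(text_vectorizer(inputs))
x = tf.keras.layers.Conv1D(filters=64, kernel_size=5, padding="same",
                           activation="relu")(x)   # 1D filters over the token embeddings
x = tf.keras.layers.GlobalMaxPooling1D()(x)        # collapse the sequence dimension
outputs = tf.keras.layers.Dense(1, activation="sigmoid")(x)
model_conv1d = tf.keras.Model(inputs, outputs)
```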


Model we’re building (USE* feature extractor)
*USE = Universal Sentence Encoder
Source: https://tfhub.dev/google/universal-sentence-encoder/4

Encoder (encodes sequences into a numerical representation)
Decoder (decodes sequences into the desired output)
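A sketch of the TensorFlow Hub feature-extractor model built from the USE linked above (the trainable flag and the dense layer size are assumptions):

```python
import tensorflow as tf
import tensorflow_hub as hub

# The pretrained Universal Sentence Encoder maps whole sentences to 512-dim embeddings
sentence_encoder_layer = hub.KerasLayer(
    "https://tfhub.dev/google/universal-sentence-encoder/4",
    input_shape=[],       # each example is a single string
    dtype=tf.string,
    trainable=False,      # keep the pretrained weights frozen
)

model_use = tf.keras.Sequential([
    sentence_encoder_layer,                          # text -> 512-dim embedding
    tf.keras.layers.Dense(64, activation="relu"),
    tf.keras.layers.Dense(1, activation="sigmoid"),  # disaster / not disaster
])
model_use.compile(loss="binary_crossentropy", optimizer="adam", metrics=["accuracy"])
```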
Ideal speed/performance trade-off

Ideal position for speed/performance: high performance + high speed
Improving a model (from a model’s perspective)

Smaller model ↔ Larger model

Common ways to improve a deep model:

• Adding layers
• Increasing the number of hidden units
• Changing the activation functions
• Changing the optimization function
• Changing the learning rate
• Fitting on more data
• Fitting for longer

(Because you can alter each of these, they’re hyperparameters — a small sketch of tweaking a couple of them follows below.)
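A tiny illustrative sketch of where a couple of these knobs live in Keras (the layer sizes, activation, optimizer, and learning rate are placeholder assumptions, not recommended settings):

```python
import tensorflow as tf

# "Larger model" = more layers / more hidden units; "smaller model" = fewer of both
model = tf.keras.Sequential([
    tf.keras.layers.Dense(64, activation="relu"),    # change units/activation here
    tf.keras.layers.Dense(1, activation="sigmoid"),
])

model.compile(
    loss="binary_crossentropy",
    optimizer=tf.keras.optimizers.Adam(learning_rate=1e-4),  # changed learning rate
    metrics=["accuracy"],
)
# Fitting on more data / fitting for longer happen via model.fit(..., epochs=...)
```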


What is overfitting?
Overfitting — when a model over-learns patterns in a particular dataset and isn’t able to generalise to unseen data.

For example, a student who studies the course materials too hard and then isn’t able to perform well on the final exam. Or tries to put their knowledge into practice at the workplace and finds what they learned has nothing to do with the real world.

Underfitting | Balanced (goldilocks zone) | Overfitting
Improving a model (from a data perspective)

Method to improve a model (reduce overfitting) | What does it do?

More data | Gives a model more of a chance to learn patterns between samples (e.g. if a model is performing poorly on images of pizza, show it more images of pizza).

Data augmentation (usually for images) | Increases the diversity of your training dataset without collecting more data (e.g. take your photos of pizza and randomly rotate them 30°). Increased diversity forces a model to learn more generalisable patterns.

Better data | Not all data samples are created equally. Removing poor samples from or adding better samples to your dataset can improve your model’s performance.

Use transfer learning | Take a model’s pre-learned patterns from one problem and tweak them to suit your own problem. For example, take a model trained on pictures of cars to recognise pictures of trucks.
The machine learning explorer’s
motto
“Visualize, visualize, visualize”
Data
Model
Training
Predictions

It’s a good idea to visualize these as often as possible.
The machine learning practitioner’s
motto

“Experiment, experiment, experiment”

👩‍🍳 👩‍🔬
(try lots of things and see what tastes good)
