Text classification with Transformer

Author: Apoorv Nandan
Date created: 2020/05/10
Last modified: 2024/01/18
Description: Implement a Transformer block as a Keras layer and use it for text classification.

ⓘ This example uses Keras 3

View in Colab • GitHub source

Setup
import keras
from keras import ops
from keras import layers
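
Optional aside (not part of the original setup): Keras 3 is multi-backend, and the backend can be chosen with the KERAS_BACKEND environment variable, which must be set before keras is first imported.

import os

# Run this before the first `import keras` in a fresh session.
os.environ["KERAS_BACKEND"] = "tensorflow"  # or "jax" / "torch"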

Implement a Transformer block as a layer

class TransformerBlock(layers.Layer):
    def __init__(self, embed_dim, num_heads, ff_dim, rate=0.1):
        super().__init__()
        self.att = layers.MultiHeadAttention(num_heads=num_heads, key_dim=embed_dim)
        self.ffn = keras.Sequential(
            [layers.Dense(ff_dim, activation="relu"), layers.Dense(embed_dim),]
        )
        self.layernorm1 = layers.LayerNormalization(epsilon=1e-6)
        self.layernorm2 = layers.LayerNormalization(epsilon=1e-6)
        self.dropout1 = layers.Dropout(rate)
        self.dropout2 = layers.Dropout(rate)

    def call(self, inputs):
        attn_output = self.att(inputs, inputs)
        attn_output = self.dropout1(attn_output)
        out1 = self.layernorm1(inputs + attn_output)
        ffn_output = self.ffn(out1)
        ffn_output = self.dropout2(ffn_output)
        return self.layernorm2(out1 + ffn_output)
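
As a quick sanity check (illustrative sizes, not part of the original example), the block maps a (batch, sequence_length, embed_dim) input to an output of the same shape:

block = TransformerBlock(embed_dim=32, num_heads=2, ff_dim=32)
dummy = ops.ones((4, 10, 32))  # (batch, sequence_length, embed_dim)
print(block(dummy).shape)  # -> (4, 10, 32)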

Implement embedding layer

Two separate embedding layers: one for the tokens and one for the token positions (indices).

class TokenAndPositionEmbedding(layers.Layer):
    def __init__(self, maxlen, vocab_size, embed_dim):
        super().__init__()
        self.token_emb = layers.Embedding(input_dim=vocab_size, output_dim=embed_dim)
        self.pos_emb = layers.Embedding(input_dim=maxlen, output_dim=embed_dim)

    def call(self, x):
        maxlen = ops.shape(x)[-1]
        positions = ops.arange(start=0, stop=maxlen, step=1)
        positions = self.pos_emb(positions)
        x = self.token_emb(x)
        return x + positions
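
A quick illustrative check (made-up sizes, not from the original example): the positional embeddings have shape (maxlen, embed_dim) and broadcast across the batch when added to the token embeddings.

emb = TokenAndPositionEmbedding(maxlen=10, vocab_size=100, embed_dim=32)
token_ids = ops.ones((4, 10), dtype="int32")  # a batch of 4 sequences of token IDs
print(emb(token_ids).shape)  # -> (4, 10, 32)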
Download and prepare dataset

vocab_size = 20000  # Only consider the top 20k words
maxlen = 200  # Only consider the first 200 words of each movie review

(x_train, y_train), (x_val, y_val) = keras.datasets.imdb.load_data(num_words=vocab_size)
print(len(x_train), "Training sequences")
print(len(x_val), "Validation sequences")
x_train = keras.utils.pad_sequences(x_train, maxlen=maxlen)
x_val = keras.utils.pad_sequences(x_val, maxlen=maxlen)

Downloading data from https://storage.googleapis.com/tensorflow/tf-keras-datasets/imdb.npz
17465344/17464789 [==============================] - 0s 0us/step
25000 Training sequences
25000 Validation sequences
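
As an optional aside (not in the original example), the integer sequences can be decoded back to words with keras.datasets.imdb.get_word_index(); indices 0-2 are reserved for padding/start/unknown tokens, so the word index is offset by 3.

word_index = keras.datasets.imdb.get_word_index()
inverted = {i + 3: w for w, i in word_index.items()}  # indices 0-2 are reserved tokens
decoded = " ".join(inverted.get(i, "?") for i in x_train[0] if i > 2)
print(decoded[:200])  # first part of the first (padded) training review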

Create classifier model using transformer layer

Transformer layer outputs one vector for each time step of our input sequence. Here, we take the mean across all time steps and use a feed forward network on top of it to classify text.

embed_dim = 32  # Embedding size for each token
num_heads = 2  # Number of attention heads
ff_dim = 32  # Hidden layer size in feed forward network inside transformer

inputs = layers.Input(shape=(maxlen,))
embedding_layer = TokenAndPositionEmbedding(maxlen, vocab_size, embed_dim)
x = embedding_layer(inputs)
transformer_block = TransformerBlock(embed_dim, num_heads, ff_dim)
x = transformer_block(x)
x = layers.GlobalAveragePooling1D()(x)
x = layers.Dropout(0.1)(x)
x = layers.Dense(20, activation="relu")(x)
x = layers.Dropout(0.1)(x)
outputs = layers.Dense(2, activation="softmax")(x)

model = keras.Model(inputs=inputs, outputs=outputs)
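
To sanity-check the wiring before training, it can help to print a summary of the layers and parameter counts (the exact formatting depends on your Keras version):

# Inspect the architecture and parameter counts before training.
model.summary()

Note on the head: this example uses Dense(2) with softmax together with sparse_categorical_crossentropy; an equivalent binary formulation would be Dense(1) with a sigmoid and binary_crossentropy.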

Train and Evaluate

model.compile(optimizer="adam", loss="sparse_categorical_crossentropy", metrics=["accuracy"])
history = model.fit(
    x_train, y_train, batch_size=32, epochs=2, validation_data=(x_val, y_val)
)

Epoch 1/2
782/782 [==============================] - 15s 18ms/step - loss: 0.5112 - accuracy: 0.7070 - val_loss: 0.3598 - val_accuracy: 0.8444
Epoch 2/2
782/782 [==============================] - 13s 17ms/step - loss: 0.1942 - accuracy: 0.9297 - val_loss: 0.2977 - val_accuracy: 0.8745
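
As an optional follow-up (not part of the original example), the trained model can be evaluated and used for inference directly; x_val and y_val are the padded validation arrays prepared above, and the IMDB labels are 0 for negative and 1 for positive.

# Re-check validation metrics explicitly.
loss, acc = model.evaluate(x_val, y_val, verbose=0)
print(f"Validation accuracy: {acc:.4f}")

# Predict on a few held-out reviews; the softmax output has shape (n, 2).
probs = model.predict(x_val[:5], verbose=0)
print(probs.argmax(axis=-1), y_val[:5])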