Transformers For NLP

The document discusses the Transformer architecture for natural language processing. It explains key components of the Transformer like the encoder, embeddings, attention heads, layer normalization, and masking. It also provides a link to an interactive demo of Transformers and concludes with the hope that the reader found it informative.


Transformers for NLP

Dr. Kisor K. Sahu


IITBBS
Transformer architecture — Encoder: a stack of N = 6 identical layers.

Transformer architecture — Embeddings: the original paper uses embedding dimension d = 512; the slide compares the embedding weights at the start of training and at the end of training.
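The embedding step above is just a learned lookup table mapping token ids to d = 512 vectors. A minimal NumPy sketch (the vocabulary size and token ids here are illustrative, not from the slides):

```python
import numpy as np

rng = np.random.default_rng(0)

vocab_size, d_model = 10_000, 512            # d = 512 as in the original paper
embedding = rng.normal(size=(vocab_size, d_model))  # weights learned during training

token_ids = np.array([17, 42, 7])            # illustrative ids for a 3-token input
x = embedding[token_ids]                     # each token becomes a 512-dim vector

print(x.shape)  # (3, 512)
```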

Embedding at a glance
Transformer architecture — Attention: the query is produced by a linear projection, without an activation function. In the video-search analogy, the search phrase is the query and the video content is the value.
Transformer architecture — After training: the final self-attention filter. The original paper uses 8 attention heads.
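The self-attention filter above can be sketched as scaled dot-product attention. This minimal NumPy version shows a single head of size d_k = 64 (i.e., 512 / 8, since the original paper splits d = 512 across 8 heads); the sequence length is illustrative:

```python
import numpy as np

def softmax(z, axis=-1):
    z = z - z.max(axis=axis, keepdims=True)  # subtract max for numerical stability
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

def attention(Q, K, V):
    # scaled dot-product attention: softmax(Q K^T / sqrt(d_k)) V
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)
    return softmax(scores) @ V

rng = np.random.default_rng(0)
n, d_k = 4, 64                               # d_k = 512 / 8 heads
Q, K, V = (rng.normal(size=(n, d_k)) for _ in range(3))
out = attention(Q, K, V)
print(out.shape)  # (4, 64)
```

In multi-head attention, this computation is repeated 8 times with separate learned projections and the results are concatenated.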
Post-normalization

Layer normalization standardizes the neuron activations along the feature axis. A small value ε is added to the denominator to avoid dividing by zero.
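The layer normalization described above, with the small ε in the denominator, can be sketched as:

```python
import numpy as np

def layer_norm(x, eps=1e-5):
    # standardize each example along the feature axis (last axis)
    mean = x.mean(axis=-1, keepdims=True)
    var = x.var(axis=-1, keepdims=True)
    return (x - mean) / np.sqrt(var + eps)   # eps avoids dividing by zero

x = np.array([[1.0, 2.0, 3.0, 4.0]])
y = layer_norm(x)
print(y.mean(), y.std())                     # approximately 0 and 1
```

In the full Transformer layer, the normalized output is additionally scaled and shifted by learned per-feature parameters (gain and bias), omitted here for brevity.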
Transformer architecture — How to implement masking
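A common way to implement the masking above is to set the attention scores of disallowed (future) positions to a large negative number before the softmax, so their weights become effectively zero. A minimal causal-mask sketch in NumPy:

```python
import numpy as np

def softmax(z, axis=-1):
    z = z - z.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

n = 4
scores = np.zeros((n, n))                          # placeholder attention scores
mask = np.triu(np.ones((n, n), dtype=bool), k=1)   # True above the diagonal (future positions)
scores[mask] = -1e9                                # masked scores -> ~0 after softmax
weights = softmax(scores)
print(np.round(weights, 2))
```

Each row i attends only to positions 0..i: row 0 puts all weight on position 0, row 1 splits it 0.5/0.5, and so on.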
FUN WITH TRANSFORMERS
Link: https://transformer.huggingface.co/doc/distil-gpt2

The end
Hope you had a good time with Transformers.
Thank you!
