Answer Analysis
Official account: NLP从入门到放弃
Author: DASOU
1. Why does the Transformer use multi-head attention?
2/3. Walk through the Transformer Encoder module.
4. Relative position representations (RPR) as an alternative positional encoding.
5. BN -- Batch Normalization: what it does, its advantages and its drawbacks.
6. Why NLP uses layer-norm instead of BatchNorm.
- Why BN is a poor fit for NLP
- What layer-norm does instead
- Why BN works for CNNs but not for NLP
7. The Decoder module.
8. Transformer
9. Transformer
10. Why do Q and K use different weight matrices in the Transformer?
11. Why does the Transformer compute attention with a dot product rather than with addition?
13. Briefly explain BatchNorm and its pros and cons.
14. Describe the feed-forward network in the Transformer (activation function, trade-offs).
15. How do the Encoder and Decoder interact? (related: attention in seq2seq)
16. How does the Decoder's multi-head self-attention differ from the Encoder's? (why it needs a sequence mask)
17. Where does the Transformer's parallelism show up? Can the Decoder be parallelized?
18. Briefly describe the wordpiece model and byte pair encoding.
19. How are the learning rate and Dropout set when training a Transformer, and where is Dropout applied?
2/3. The Transformer Encoder module

The Encoder is a stack of N = 6 identical layers. Unlike an RNN, which consumes the sequence token by token, self-attention lets every position attend to the whole sequence at once. Each layer contains:
- Multi-head self-attention: the input is projected into n_heads heads, each head working in hidden_size/n_heads dimensions; attention is computed inside every head and the heads are then concatenated back to hidden_size.
- Add & Norm: each sub-layer is wrapped in a residual connection followed by layer normalization (the normalization used in NLP, see the layer-norm question below).
- A position-wise feed-forward network: a Linear layer, a ReLU, and another Linear layer, applied to every position independently.
The output of one encoder layer is the input of the next, and the output of the last encoder layer is what the decoder later attends to.
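A rough numpy sketch of the head split described above, with hypothetical sizes (hidden_size = 512, n_heads = 8, so each head works in 512/8 = 64 dimensions); it only shows the shapes, not a full implementation:

import numpy as np

# Hypothetical sizes: hidden_size = 512, n_heads = 8, head_dim = 512 / 8 = 64.
seq_len, hidden_size, n_heads = 10, 512, 8
head_dim = hidden_size // n_heads
rng = np.random.default_rng(0)

x = rng.normal(size=(seq_len, hidden_size))
Wq = rng.normal(size=(hidden_size, hidden_size))
Wk = rng.normal(size=(hidden_size, hidden_size))
Wv = rng.normal(size=(hidden_size, hidden_size))

def split_heads(t):
    # (seq_len, hidden_size) -> (n_heads, seq_len, head_dim)
    return t.reshape(seq_len, n_heads, head_dim).transpose(1, 0, 2)

Q, K, V = split_heads(x @ Wq), split_heads(x @ Wk), split_heads(x @ Wv)

# Scaled dot-product attention inside each head, then concatenate the heads back.
scores = Q @ K.transpose(0, 2, 1) / np.sqrt(head_dim)        # (n_heads, seq_len, seq_len)
weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
weights = weights / weights.sum(axis=-1, keepdims=True)
heads = weights @ V                                           # (n_heads, seq_len, head_dim)
out = heads.transpose(1, 0, 2).reshape(seq_len, hidden_size)  # back to (seq_len, hidden_size)
print(out.shape)  # (10, 512)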
4. Relative position representations (RPR)

Besides the sinusoidal absolute positional encoding, attention can use relative position representations (RPR): instead of telling the model where each token sits in the sentence, RPR tells it, inside the attention computation, how far apart the query token and the key token are.

Take a 4-token sentence such as "I / love / eating / apples". Relative to "eating", the word "love" sits at relative position -1 and "apples" at relative position +1. The relative distance is clipped to a maximum k; with k = 4 the possible values run from -4 to 4, i.e. 9 distinct relative positions in total, and each of them gets its own learned embedding.

RPR enters the attention computation in two places:
- RPR on K: when computing the Q/K attention scores, the relative-position embedding of each key is added to K.
- RPR on V: when computing the weighted sum, a separate relative-position embedding is added to V.
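A minimal numpy sketch of this idea, under assumed sizes (4 tokens, 8 hidden dimensions, clip distance k = 4, so 2k + 1 = 9 relative-position embeddings); the variable names are made up for illustration:

import numpy as np

seq_len, d, k = 4, 8, 4
rng = np.random.default_rng(0)

Q = rng.normal(size=(seq_len, d))
K = rng.normal(size=(seq_len, d))
V = rng.normal(size=(seq_len, d))

# One learned embedding per clipped relative distance in [-k, k]: 2k + 1 = 9 of them.
rel_emb_K = rng.normal(size=(2 * k + 1, d))
rel_emb_V = rng.normal(size=(2 * k + 1, d))

# Relative distance j - i for every (query i, key j) pair, clipped to [-k, k].
idx = np.arange(seq_len)
rel = np.clip(idx[None, :] - idx[:, None], -k, k) + k          # shifted to [0, 2k]

# Scores: Q·K plus Q·(relative-position embedding attached to the key).
scores = (Q @ K.T + np.einsum('id,ijd->ij', Q, rel_emb_K[rel])) / np.sqrt(d)
weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
weights = weights / weights.sum(axis=-1, keepdims=True)

# Output: weighted sum of V plus the weighted value-side relative embeddings.
out = weights @ V + np.einsum('ij,ijd->id', weights, rel_emb_V[rel])
print(out.shape)  # (4, 8)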
5. BN -- Batch Normalization

What BN does: take an MLP and a batch of 10 samples, each with 5 features. BN normalizes each of the 5 features separately, using the mean and variance of that feature computed across the 10 samples of the batch, followed by a learned scale and shift.

Advantages of BN:
- It keeps the inputs of every layer in a stable range, which speeds up convergence and allows larger learning rates.
- It keeps activations away from the saturated regions of functions such as sigmoid, which eases vanishing gradients.
- The batch statistics add a little noise, which acts as a mild regularizer.

Drawbacks of BN:
- It depends on the batch: with a small batch_size the estimated mean and variance are noisy, and BN can hurt instead of help.
- It fits RNNs and variable-length text poorly. Take a batch of batch_size = 10 sentences in which 9 have length 5 and 1 has length 20: for positions 6 to 20 the "batch" statistics are computed from a single sentence (effectively a batch of 1), which is meaningless.
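A small numeric sketch of what BN computes in the MLP example above (10 samples, 5 features); the numbers are made up:

import numpy as np

# BN in an MLP: a batch of 10 samples with 5 features each.
x = np.random.default_rng(0).normal(loc=3.0, scale=2.0, size=(10, 5))

# Statistics are taken per feature, ACROSS the 10 samples of the batch.
mean = x.mean(axis=0)                 # shape (5,)
var = x.var(axis=0)                   # shape (5,)
x_bn = (x - mean) / np.sqrt(var + 1e-5)
print(x_bn.mean(axis=0).round(3), x_bn.std(axis=0).round(3))  # ~0 and ~1 per feature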
6. Why does NLP use layer-norm instead of BatchNorm?

Why BN is a poor fit for NLP: with RNNs and text, the "feature" that BN normalizes becomes the i-th position of every sentence in the batch. In an MLP the i-th feature of different samples really is the same quantity, so averaging it over batch_size samples makes sense. In a batch of sentences it does not: for two sentences such as "I / love / eating / apples" and "he / likes / to / eat / pears", BN would normalize "I" together with "he", "love" together with "likes", and so on, i.e. it lumps together words from different sentences that merely happen to share a position. Such batch statistics carry little meaning, which is why BN performs poorly on NLP tasks.
What layer-norm does instead: it normalizes within a single sample rather than across the batch. For an image tensor with dimensions N (batch) and C/H/W, BN computes its statistics over N for each channel, while layer-norm computes them over C/H/W for each sample. For text, layer-norm normalizes over the hidden dimensions of each token on its own: in a sentence like "I / love / eating / apples", every word is normalized using only its own vector. The computation never touches N, the batch size, so it does not care how many sentences are in the batch or how long the other sentences are. One way to state the insight: BN normalizes the same feature across different samples, while layer-norm normalizes the different features of one sample.
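For contrast with the BN sketch above, a layer-norm sketch on an NLP-shaped batch (sizes are made up): the statistics live entirely inside each token's vector, so the batch dimension never enters the computation:

import numpy as np

# Layer-norm on a (batch, seq_len, hidden) tensor: stats per token, over hidden dims.
x = np.random.default_rng(0).normal(size=(2, 4, 8))

mean = x.mean(axis=-1, keepdims=True)   # one mean per token
var = x.var(axis=-1, keepdims=True)     # one variance per token
x_ln = (x - mean) / np.sqrt(var + 1e-5)
print(x_ln.shape)                       # (2, 4, 8); each token normalized on its own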
- Why BN works for CNNs but not for NLP

In a CNN, every position of a given channel of the feature map is produced by the same convolution kernel, so all of those positions, across the whole batch, can be treated as observations of one and the same feature; normalizing them together is reasonable, and BN works well. In NLP there is no such shared feature: the words sitting at the same position of different sentences are unrelated, so batch statistics over them are not meaningful and BN gives poor results.
7. The Decoder module

How do the Encoder and Decoder interact? The idea is the same as attention in an RNN-based seq2seq model: at every decoding step the decoder attends over the encoder states and summarizes them into a context vector. In the Transformer this happens in the encoder-decoder attention sub-layer of each decoder block: Q comes from the decoder side (the output of the masked self-attention, after its Add&Norm), while K/V come from the Encoder output. Note that although the Encoder and the Decoder each stack N = 6 layers, every decoder layer takes its K/V from the output of the last encoder layer, not from the encoder layer with the same index.

Why does the decoder self-attention need a sequence mask? During training the decoder is fed the ground truth target shifted right (teacher forcing), e.g. the 4-token target "I / love / eating / apples" is processed at all positions in parallel. Without a mask, the position that should predict "eating" could already attend to "apples", i.e. it would see the future ground truth (label leakage). At inference time those future tokens do not exist yet, so an unmasked model would face a gap between how it is trained and how it predicts. The sequence mask therefore blocks every position from attending to the positions after it, by pushing their attention scores to a very large negative value before the softmax.
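A compact sketch of the two attention patterns in the decoder (hypothetical sizes; the helper is simplified to a single head with no projections):

import numpy as np

d, src_len, tgt_len = 8, 6, 4
rng = np.random.default_rng(0)

def attention(Q, K, V, mask=None):
    # Scaled dot-product attention; masked positions get a very large negative score.
    scores = Q @ K.T / np.sqrt(Q.shape[-1])
    if mask is not None:
        scores = np.where(mask, scores, -1e9)
    w = np.exp(scores - scores.max(axis=-1, keepdims=True))
    w = w / w.sum(axis=-1, keepdims=True)
    return w @ V

enc_out = rng.normal(size=(src_len, d))   # output of the last encoder layer
dec_in = rng.normal(size=(tgt_len, d))    # decoder-side representations

# 1) Masked (sequence-masked) self-attention: position i only sees positions <= i.
seq_mask = np.tril(np.ones((tgt_len, tgt_len), dtype=bool))
self_out = attention(dec_in, dec_in, dec_in, mask=seq_mask)

# 2) Encoder-decoder attention: Q from the decoder, K/V from the encoder output.
cross_out = attention(self_out, enc_out, enc_out)
print(self_out.shape, cross_out.shape)    # (4, 8) (4, 8)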
8. Where is the Transformer parallel?

The parallelism comes from self-attention: inside an Encoder layer, attention over the whole sequence is a single matrix computation, so all positions are processed at once instead of step by step as in an RNN. The 6 encoder layers are still applied one after another (encoder to encoder is sequential), but within each layer there is no recurrence along the sequence. The Decoder can be parallelized during training thanks to teacher forcing and the sequence mask, but at inference time it still generates tokens one at a time, much like an RNN.
9. Transformer
10. Q and K in the Transformer, and why the attention scores are scaled

Why do Q and K use different weight matrices instead of one shared projection? If the transformer generated K and Q from the same matrix, each token's query and key would be identical vectors, the dot product of a token with itself would dominate the scores, and after the softmax the attention would mostly collapse onto the token's own position. Separate projections for Q/K/V put queries and keys in different spaces and make the attention far more expressive. (A related discussion: https://www.zhihu.com/question/319339652)

Why divide the attention scores by sqrt(dk)? Treat the components of q and k as independent random variables with mean 0 and variance 1. Their dot product then has mean 0 and variance dk (the key dimension), so for large dk the scores become large in magnitude and push the softmax into its saturated region, where gradients almost vanish. Dividing by sqrt(dk) brings the variance back to 1. A quick numerical check with dk = 3:
import numpy as np

# 1000 pairs of 3-dimensional vectors with zero mean and unit variance.
arr1 = np.random.normal(size=(3, 1000))
arr2 = np.random.normal(size=(3, 1000))
result = np.dot(arr1.T, arr2)   # every entry is a dot product of two 3-dim vectors
arr_var = np.var(result)
print(arr_var)                  # result: about 2.9, i.e. close to dk = 3
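Continuing the same script, dividing the scores by sqrt(dk) brings the variance back to roughly 1:

scaled_var = np.var(result / np.sqrt(3))
print(scaled_var)               # about 1.0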
13. How is padding masked when computing attention scores?

Padding positions are masked out by adding a very large negative number (e.g. -1000) to their attention scores before the softmax, so that after the softmax their weights are effectively 0. The mask is built from the real length of every sequence in the batch (its shape involves batch_size and the padded sequence length) and is applied to the key positions of every attention head.
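A small sketch of such a padding mask, using the -1000 trick from the text and hypothetical sizes (a batch of 2 sequences padded to length 5, with real lengths 5 and 3):

import numpy as np

seq_len = 5
lengths = np.array([5, 3])                 # real length of each sequence in the batch
rng = np.random.default_rng(0)

scores = rng.normal(size=(2, seq_len, seq_len))                  # raw Q·K^T scores
key_is_pad = np.arange(seq_len)[None, :] >= lengths[:, None]     # (batch, seq_len)

# Add a large negative number to every padded key position before the softmax,
# so its attention weight ends up effectively zero.
scores = scores + np.where(key_is_pad[:, None, :], -1000.0, 0.0)
weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
weights = weights / weights.sum(axis=-1, keepdims=True)
print(weights[1, 0])   # the last two entries (padded keys) are ~0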
Transformer-based models such as GPT rely on subword tokenization (the wordpiece model or byte pair encoding) for their inputs.