
AI VIETNAM

All-in-One Course

NLP Project

Neural Machine Translation

AI VIET NAM
Nguyen Quoc Thai

1
Year 2023
Outline
Ø Introduction
Ø NMT using Transformer
Ø NMT using Pre-trained LMs

2
Introduction
! Translate a sentence w(s) in a source language (input) to a sentence w(t) in the
target language (output)

3
Introduction
! Translate a sentence w(s) in a source language (input) to a sentence w(t) in the
target language (output)

Automatic Speech Recognition (ASR): translation of spoken language into text
Natural Language Understanding (NLU): a computer's ability to understand language
Natural Language Generation (NLG): generation of natural language by a computer

q Syntax
q Semantics
q Phonology
q Pragmatics
q Morphology

4
Introduction
! Translate a sentence w(s) in a source language (input) to a sentence w(t) in the
target language (output)

Ø Can be formulated as an optimization problem:


! (") = argmax 𝜃( 𝑤 (%) , 𝑤 (&) )
𝑤
$(")
Where 𝜃 is a scoring function over source and target sentences
Ø Requires two components:
q Learning algorithm to compute parameters of 𝜃
! (")
q Decoding algorithm for computing the best translation 𝑤

5
Introduction

[Figure: timeline of machine translation approaches, from 1950 to 2015]


6
Introduction
! Evaluating translation quality

Ø Human judgement
q Given: machine translation output
q Given: source / reference translation
q Task: assess the quality of machine translation output
Ø Different translations of “A Vinay le gusta Python” (Spanish for “Vinay likes Python”)

7
Introduction
! Evaluating translation quality

Ø Two main criteria:


q Adequacy: Translation w(t) should adequately reflect the linguistic content of w(s)
q Fluency: Translation w(t) should be fluent text in the target language

Ø Different translations of “A Vinay le gusta Python”

8
Introduction
! Evaluating translation quality

Ø Two main criteria:


q Adequacy: Translation w(t) should adequately reflect the linguistic content of w(s)
q Fluency: Translation w(t) should be fluent text in the target language

Ø Adequacy and fluency scales:

Score   Adequacy         Fluency
5       All meaning      Flawless English
4       Most meaning     Good English
3       Much meaning     Non-native English
2       Little meaning   Disfluent English
1       None             Incomprehensible

9
Introduction
! Evaluation Metrics

Ø Manual evaluation is most accurate, but expensive


Ø Automated evaluation metrics:
q Compare system hypothesis with reference translations
q BLEU Score (BiLingual Evaluation Understudy): Modified n-gram Precision
q SacreBLEU Score (A Call for Clarity in Reporting BLEU Scores)

10
Introduction
! Evaluation Metrics

Precision and Recall of words


System A A officials responsibility of airport safety
Reference A officials are responsible for airport security

Ø Precision = correct / output-length = 3/6 = 50%
Ø Recall = correct / reference-length = 3/7 ≈ 43%
Ø F-measure = (P × R) / ((P + R) / 2) = (0.5 × 0.43) / ((0.5 + 0.43) / 2) ≈ 46%
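As a concrete check of these formulas, here is a minimal Python sketch (the function name word_overlap_scores is ours, not from the slides) that reproduces the System A numbers:

from collections import Counter

def word_overlap_scores(hypothesis, reference):
    hyp, ref = hypothesis.split(), reference.split()
    # "Correct" words: hypothesis words that also occur in the reference,
    # clipped by how often they occur there.
    ref_counts = Counter(ref)
    correct = sum(min(c, ref_counts[w]) for w, c in Counter(hyp).items())
    precision = correct / len(hyp)   # correct / output-length
    recall = correct / len(ref)      # correct / reference-length
    f_measure = (precision * recall) / ((precision + recall) / 2)
    return precision, recall, f_measure

p, r, f = word_overlap_scores(
    "A officials responsibility of airport safety",
    "A officials are responsible for airport security")
print(p, r, f)  # ~0.50, ~0.43, ~0.46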

11
Introduction
! Evaluation Metrics

Precision and Recall of words


v Flaw: no penalty for reordering
System A A officials responsibility of airport safety
Reference A officials are responsible for airport security
System B airport security A officials are responsible

Metric      System A   System B

Precision   50%        100%
Recall      43%        86%
F-measure   46%        92.5%

12
Introduction
! Evaluation Metrics

BLEU
v N-gram overlap between machine translation output and reference translation
v Compute precision for n-grams of size 1 to 4
v Add brevity penalty (for too short translations)
BLEU = min(1, output-length / reference-length) × (∏_{n=1}^{4} precision_n)^(1/4)
v Typically computed over the entire corpus, not single sentences
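The formula above can be turned into a short sentence-level sketch. This is an illustrative implementation, not the official BLEU script: real BLEU is computed over a whole corpus, aggregating clipped counts before taking the geometric mean.

import math
from collections import Counter

def ngrams(tokens, n):
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

def sentence_bleu(hypothesis, reference, max_n=4):
    hyp, ref = hypothesis.split(), reference.split()
    precisions = []
    for n in range(1, max_n + 1):
        hyp_counts, ref_counts = Counter(ngrams(hyp, n)), Counter(ngrams(ref, n))
        # Modified n-gram precision: clip each n-gram count by the reference.
        overlap = sum(min(c, ref_counts[g]) for g, c in hyp_counts.items())
        total = len(hyp) - n + 1
        if total <= 0 or overlap == 0:
            return 0.0  # the geometric mean is zero if any precision is zero
        precisions.append(overlap / total)
    brevity = min(1.0, len(hyp) / len(ref))  # penalty for too-short output
    return brevity * math.exp(sum(math.log(p) for p in precisions) / max_n)

print(sentence_bleu("airport security A officials are responsible",
                    "A officials are responsible for airport security"))  # ~0.52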

13
Introduction
! Evaluation Metrics

BLEU 1-gram
System A A officials responsibility of airport safety
Reference A officials are responsible for airport security
System B airport security A officials are responsible
Metric               System A   System B
Precision (1-gram)   3/6        6/6
Precision (2-gram)
Precision (3-gram)
Precision (4-gram)
Brevity penalty
BLEU
14
Introduction
! Evaluation Metrics

BLEU
System A A officials responsibility of airport safety
Reference A officials are responsible for airport security
System B airport security A officials are responsible

Metric               System A   System B

Precision (1-gram)   3/6        6/6
Precision (2-gram)   1/5        4/5
Precision (3-gram)   0/4        2/4
Precision (4-gram)   0/3        1/3
Brevity penalty      6/7        6/7
BLEU                 0          0.52
15
Introduction
! Evaluation Metrics

BLEU
log BLEU = min(1 − r/c, 0) + Σ_{n=1}^{N} w_n · log p_n

r: reference length, c: output (candidate) length
n: n-gram order (n = 1, 2, 3, 4)
w_n: weight for each n-gram order; uniform weights w_n = 1/N (here N = 4)
p_n: modified n-gram precision
SacreBLEU (A Call for Clarity in Reporting BLEU)
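In practice, scores are usually reported with the sacrebleu package (pip install sacrebleu), which fixes tokenization and reference handling so numbers are comparable across papers. A minimal usage sketch with our running example:

import sacrebleu

hypotheses = ["airport security A officials are responsible"]
references = [["A officials are responsible for airport security"]]  # one reference stream
bleu = sacrebleu.corpus_bleu(hypotheses, references)
print(bleu.score)  # corpus-level BLEU on a 0-100 scale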

16
Introduction
! Evaluation Metrics

17
Outline
Ø Introduction
Ø NMT using Transformer
Ø NMT using Pre-trained LMs

18
NMT using Transformer
! Sequence to Sequence

v A single neural network is used to translate from source to target


v Architecture: Encoder-Decoder
v Encoder: Convert source sentence (input) into a vector/matrix (State)
v Decoder: Convert encoding into a sentence in target language (output)

Input → Encoder → State → Decoder → Output

Thought Vector: the state that captures all the information of the input sentence
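A minimal PyTorch sketch of this encoder-decoder setup follows; the vocabulary sizes, model dimension, and use of nn.Transformer are illustrative assumptions, not the exact model from these slides.

import torch
import torch.nn as nn

src_vocab, tgt_vocab, d_model = 8000, 8000, 512   # illustrative sizes
src_emb = nn.Embedding(src_vocab, d_model)
tgt_emb = nn.Embedding(tgt_vocab, d_model)
transformer = nn.Transformer(d_model=d_model, batch_first=True)
generator = nn.Linear(d_model, tgt_vocab)  # decoder states -> token logits

src = torch.randint(0, src_vocab, (1, 5))  # source token ids (batch of 1)
tgt = torch.randint(0, tgt_vocab, (1, 6))  # decoder input token ids
causal_mask = nn.Transformer.generate_square_subsequent_mask(tgt.size(1))
states = transformer(src_emb(src), tgt_emb(tgt), tgt_mask=causal_mask)
logits = generator(states)  # (1, 6, tgt_vocab): one next-token distribution per position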

19
NMT using Transformer
! Transformer Model

20
NMT using Transformer
! Training
[Figure: training with teacher forcing]
Source (Vietnamese subword tokens for “Tôi đi làm”) → ENCODER → DECODER
Decoder input: <start> I go to work
Prediction:    I go _earn work <end>
Target:        I go to work <end>
Loss: computed between the prediction and the target at each position
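Continuing the sketch above (same transformer, generator, and logits), training with teacher forcing feeds the gold prefix to the decoder and scores each predicted token against the shifted target:

criterion = nn.CrossEntropyLoss()
# Gold next tokens, i.e. "I go to work <end>" as ids; random here for illustration.
target = torch.randint(0, tgt_vocab, (1, 6))
loss = criterion(logits.reshape(-1, tgt_vocab), target.reshape(-1))
loss.backward()  # gradients flow into encoder, decoder, and embeddings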
21
NMT using Transformer
! Training

How to choose “Best candidate”


Input Sequence (Source) → ENCODER → DECODER → Output Sequence (Target)

22


NMT using Transformer
! Greedy Decoding
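Greedy decoding keeps only the single most probable token at each step and feeds it back as the next decoder input. A sketch reusing the model objects from the earlier encoder-decoder example (start_id and end_id are assumed special-token ids):

def greedy_decode(src_ids, start_id, end_id, max_len=50):
    memory = transformer.encoder(src_emb(src_ids))  # encode the source once
    out_ids = torch.tensor([[start_id]])
    for _ in range(max_len):
        mask = nn.Transformer.generate_square_subsequent_mask(out_ids.size(1))
        dec = transformer.decoder(tgt_emb(out_ids), memory, tgt_mask=mask)
        next_id = generator(dec[:, -1]).argmax(dim=-1, keepdim=True)  # best token only
        out_ids = torch.cat([out_ids, next_id], dim=1)
        if next_id.item() == end_id:
            break
    return out_ids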

23
Outline
Ø Introduction
Ø NMT using Transformer
Ø NMT using Pre-trained LMs

24
NMT using Pre-trained LMs
! Pre-trained LMs

25
NMT using Pre-trained LMs
! Pre-trained LMs

26
NMT using Pre-trained LMs
! Pre-trained LMs

27
NMT using Pre-trained LMs
! Pre-trained LMs: BERT

v BERT: An encoder-only model


v Maps an input sequence to a contextualized sequence: f_θ^BERT : X_{1:n} → X̄_{1:n}
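A sketch of this mapping with the Hugging Face transformers library; the checkpoint bert-base-uncased is an illustrative choice:

from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained("bert-base-uncased")
inputs = tokenizer("A officials are responsible", return_tensors="pt")
states = model(**inputs).last_hidden_state  # (1, n, 768): one contextual vector per token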

28
NMT using Pre-trained LMs
! Pre-trained LMs: BERT

29
NMT using Pre-trained LMs
! Pre-trained LMs: GPT2

v GPT2: A decoder-only model that uses uni-directional (causal) self-attention

v Maps an input sequence to a “next word” logit vector sequence:
  f_θ^GPT2 : X_{0:m−1} → L_{1:m}
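The same mapping in code, again a sketch using transformers with the small gpt2 checkpoint:

from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")
inputs = tokenizer("I go to", return_tensors="pt")
logits = model(**inputs).logits  # (1, m, vocab): position i scores word i+1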

30
NMT using Pre-trained LMs
! Pre-trained LMs: GPT2

31
NMT using Pre-trained LMs
! Encoder-Decoder with BERT and GPT2
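One way to build such a model is transformers' EncoderDecoderModel, which can warm-start from pre-trained checkpoints; the checkpoints below are illustrative, and the newly added cross-attention weights start out randomly initialized, so the combined model must be fine-tuned:

from transformers import EncoderDecoderModel

model = EncoderDecoderModel.from_encoder_decoder_pretrained(
    "bert-base-uncased", "gpt2")  # BERT encoder + GPT2 decoder with fresh cross-attention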

32
NMT using Pre-trained LMs
! BERT for Encoder

33
NMT using Pre-trained LMs
! BERT for Decoder
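Reusing BERT as a decoder requires switching it to causal self-attention and adding cross-attention over the encoder outputs. A sketch with transformers (the flags shown are real config options; the checkpoint is illustrative):

from transformers import BertConfig, BertLMHeadModel

config = BertConfig.from_pretrained(
    "bert-base-uncased", is_decoder=True, add_cross_attention=True)
decoder = BertLMHeadModel.from_pretrained("bert-base-uncased", config=config)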

34
NMT using Pre-trained LMs
! GPT2 for Decoder

35
NMT using Pre-trained LMs
! Experiment

v Dataset: IWSLT’15 English-Vietnamese
  Training: 133,317 sentence pairs, Validation: 1,553, Test: 1,269

Experiment   Model                                   SacreBLEU   1/2/3/4-gram precisions
#1           Standard Transformer (Greedy Search)    24.66       55.9/30.3/18.5/11.8
#2           BERT-to-BERT (Greedy Search)            25.41       53.8/31.8/19.8/12.3
#3           BERT-to-GPT2 (Greedy Search)            23.56       49.1/28.5/18.4/12.0

36
Thanks!
Any questions?

37
