"Good evening, everyone. We are Group 9, and I am excited to present our project on 'Static Idiom Integrated Machine Translation,' which focuses on improving the accuracy of translations between English and Vietnamese.

I. Introduction
- Accurately translating between English and Vietnamese presents significant challenges due to substantial linguistic differences. These include differing grammar structures, word orders, and especially idiomatic expressions, which often have no direct translations and require contextual understanding to translate correctly.
- Moreover, deploying robust models on mobile or embedded devices that run only on a CPU poses a further problem because of limited computational resources.
- To address these challenges, we have implemented two primary approaches:
1. Utilizing a T5-en-vi model, pre-trained on a large corpus of bilingual texts, which we enhance specifically for idiomatic cases.
2. Speeding up model execution through quantization techniques.

II. Basic Architectures
Before moving to the details of our approaches, I will give a brief overview of the basic model structures used in our project.

1. Seq2Seq
A seq2seq model is composed of an encoder and a decoder, typically implemented as RNNs.
- The encoder processes the input sequence and captures its essential information, which is stored as the hidden state of the network and, in a model with an attention mechanism, a context vector. The context vector is the weighted sum of the input hidden states and is generated for every time step of the output sequence.
- The decoder takes the context vector and hidden states from the encoder and generates the final output sequence. At each step, it considers the previously generated elements, the context vector, and the input sequence information to predict the next element of the output sequence.
- The attention mechanism enables the model to selectively focus on different parts of the input sequence during decoding.
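The context-vector computation just described can be sketched in a few lines of NumPy. This is only an illustration using simple dot-product scoring; the actual alignment model varies (Bahdanau-style additive attention is another common choice), and the function and variable names here are hypothetical:

```python
import numpy as np

def softmax(x):
    e = np.exp(x - np.max(x))  # subtract max for numerical stability
    return e / e.sum()

def attention_step(decoder_state, encoder_states):
    """One decoder step of dot-product attention.

    decoder_state:  (d,)   current decoder hidden state
    encoder_states: (T, d) encoder hidden states, one per input position
    Returns the context vector (the attention-weighted sum of the
    encoder hidden states) and the attention weights themselves.
    """
    scores = encoder_states @ decoder_state   # (T,) alignment scores
    weights = softmax(scores)                 # attention distribution over inputs
    context = weights @ encoder_states        # (d,) weighted sum = context vector
    return context, weights

# toy example: 4 input positions, hidden size 3
enc = np.random.randn(4, 3)
dec = np.random.randn(3)
ctx, w = attention_step(dec, enc)
```

The weights form a probability distribution over input positions, which is what lets the decoder "focus" on the most relevant parts of the source sentence at each step.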
At each decoder step, an alignment model computes the attention scores from the current decoder state and all of the encoder hidden vectors.

2. T5 model
T5, or Text-to-Text Transfer Transformer, is a Transformer-based architecture that uses a text-to-text approach.

Encoder
The input tokens are first converted into vectors by the input embedding layer. Positional encoding is added to these embeddings, which then pass through multiple encoder layers, each consisting of a multi-head attention mechanism, followed by a residual connection with layer normalization, and a feed-forward network with another residual connection and normalization.

Decoder
Output tokens are similarly converted into vectors through the output embedding layer, with positional encoding added. These embeddings pass through several decoder layers, each starting with masked multi-head attention, followed by a residual connection with layer normalization, then multi-head attention over the encoder's output with another residual connection and normalization, and finally a feed-forward network with a last residual connection and normalization.

Output Processing
The final output from the decoder is projected through a linear layer into the desired output dimension. A softmax function then converts these projections into probability distributions over the output vocabulary.

This architecture's unique strength is its flexible text-to-text format: it can handle a variety of tasks by converting them all into a text-to-text format.
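The output-processing step (linear projection followed by softmax over the vocabulary) can be sketched in plain NumPy. The dimensions and names below are toy values for illustration, not T5's real configuration:

```python
import numpy as np

def output_layer(decoder_out, W, b):
    """Project a decoder hidden vector to vocabulary probabilities.

    decoder_out: (d,)    final decoder hidden vector for one position
    W:           (V, d)  linear projection weights (V = vocab size)
    b:           (V,)    bias
    """
    logits = W @ decoder_out + b           # linear layer: hidden -> vocab logits
    e = np.exp(logits - logits.max())      # numerically stable softmax
    probs = e / e.sum()
    return probs

rng = np.random.default_rng(0)
d, V = 8, 10                               # toy hidden size and vocab size
probs = output_layer(rng.standard_normal(d),
                     rng.standard_normal((V, d)),
                     rng.standard_normal(V))
next_token = int(np.argmax(probs))         # greedy decoding picks the most probable token
```

In practice the decoder emits one such distribution per output position, and a decoding strategy (greedy, beam search, sampling) selects the next token from it.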
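Our second approach from the introduction, quantization, rests on a simple idea: store weights at lower precision to cut memory and speed up CPU inference. The sketch below shows symmetric per-tensor int8 quantization of a single weight matrix; it is a minimal illustration of the concept only, not the exact technique used in our pipeline, and all names are hypothetical:

```python
import numpy as np

def quantize_int8(W):
    """Symmetric per-tensor int8 quantization of a weight matrix."""
    scale = np.abs(W).max() / 127.0                          # one scale for the whole tensor
    q = np.clip(np.round(W / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    """Recover an approximate float32 matrix from int8 weights."""
    return q.astype(np.float32) * scale

W = np.random.randn(64, 64).astype(np.float32)
q, scale = quantize_int8(W)
W_hat = dequantize(q, scale)
max_err = np.abs(W - W_hat).max()   # rounding error is bounded by scale / 2
```

This is a 4x reduction in weight storage (int8 vs. float32) at the cost of a small, bounded rounding error; production frameworks apply the same idea per layer with calibrated scales.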