0% found this document useful (0 votes)

56 views5 pages

Proceedings of International Ethical Hacking Conference 2018

Bilingual Machine Translation: English to Bengali

Uploaded by

Multi Vac

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

56 views5 pages

Proceedings of International Ethical Hacking Conference 2018

Bilingual Machine Translation: English to Bengali

Uploaded by

Multi Vac

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

Advances in Intelligent Systems and Computing 811

Mohuya Chakraborty
Satyajit Chakrabarti
Valentina Emilia Balas · J. K. Mandal
Editors

Proceedings of
International
Ethical Hacking
Conference 2018
eHaCON 2018, Kolkata, India
Bilingual Machine Translation: English
to Bengali

Sauvik Bal, Supriyo Mahanta, Lopa Mandal and Ranjan Parekh

Abstract The present work proposes a methodology of machine translation system

which takes English sentences as input and produces appropriate Bengali sentences
as output using natural language processing (NLP) techniques. It first uses a parse tree
for syntactic analysis of the sentence structure and then applies semantic analysis for
extracting the meaning of the words. An inverse function is then provided to fit that
into the Bengali syntax. A dictionary as a separate file is used for mapping between the
English words and their Bengali counterparts. The novelty of the present work lies in
the fact that it combines both a syntax-based and a meaning-based analysis to arrive at
the proper translation. The effectiveness of the algorithm has been demonstrated with
examples of different English sentence conversions with several rules, and the results
have been compared with that of the Google translator to show the improvements
achieved.

Keywords POS tagging · Machine translation · Parse tree · Rule-based system

S. Bal (B) · S. Mahanta (B)

University of Engineering & Management, Jaipur, India
e-mail: [email protected]
S. Mahanta
e-mail: [email protected]
L. Mandal
Institute of Engineering & Management, Kolkata, India
e-mail: [email protected]
R. Parekh
Jadavpur University, Kolkata, India
e-mail: [email protected]

© Springer Nature Singapore Pte Ltd. 2019 247

M. Chakraborty et al. (eds.), Proceedings of International Ethical Hacking
Conference 2018, Advances in Intelligent Systems and Computing 811,
https://doi.org/10.1007/978-981-13-1544-2_21

[email protected]
248 S. Bal et al.

1 Introduction

Language translation is one of the important applications in the present scenario as

today’s world is considered to be a global village. If a person has to move from
one location to another and is not aware of the regional language of that location,
it would be very difficult for him/her to communicate. Not only it is relevant in a
global scenario where multiple languages come into consideration, but also in a local
setting where two or more neighboring countries might share the same language with
similar/dissimilar dialects. For example, in India and Bangladesh, many people use
Bengali as their mother tongue though with different dialects. All these make machine
translation to be an important area of research. The present work aims to translate
a worldwide used language viz. English into a regional language viz. Bengali. The
main challenge of language translation is that often a simple mapping between words
does not produce expected results. Restructuring of the sentences as well as analysis
of the inherent meaning is also necessary for correct outputs. In the existing process,
there are so many sentences where the translation does not give meaningful output
due to problem of proper analysis of sentences, lack of resources etc. The present
work proposed a novel approach where English to Bengali language conversion is
done based on some grammatical rules. The proposed work is based on the version
of the language used by the Bengali people of West Bengal, India.

2 Literature Survey

A good translator system should contain all words and their corresponding trans-
lated words. The main problem of this kind of system is limited available vocabu-
lary. Fuzzy-If-Then-Rule is one of the frequently used methodologies for machine
translation [1]. In the process of translation from one language to another, there are
some challenges like, lack of resources, different tools, pronunciation dictionary,
different language modeling, dialog modeling, content summarization etc. [2]. More
research is required to increase the accuracy rate when translation is done in case of
low resource languages and in the cases where the volume of target language vocab-
ulary is limited [3]. Another approach of language translation is based on the tense
where English sentences can be used as input. This kind of system uses context free
grammars for analyzing the syntactical structure of the input which helps to translate
the sentence and verify the accuracy of the output [4]. Machine translation may be
achieved by deep learning-based neural network (DNN). Memory-augmented neu-
ral network is introduced with this mechanism where the memory structure does not
consist of any phrase [5]. Another method of machine translation is to retrieve by
audio analysis and feature extraction. This kind of process can solve the ambiguity
problem in sentence translation to improve the output [6]. Another approach is used

[email protected]
Bilingual Machine Translation: English to Bengali 249

for translation where values from the New Testament were used as training values. If
the proper resources are not available and the machine is not properly trained, accu-
racy rate will be decreased [7]. Example-based machine translation is found to be
another methodology, used in this case. The problem of this methodology is limited
knowledge base. It makes the system inefficient for translation where low-resourced
language is used [8]. Machine translation is also important for question–answering
sessions. The main problem for this type of system is word ambiguity. By using the
matrix factorization, this can be improved. If there are dynamic question–answering
sessions, large vocabulary and proper learning would be required for accuracy [9].
For speech to test conversion, machine translation is also important. If the speech
is in different language, it is important to have the proper resources for translation.
This kind of system extracts the meaning of input sentence. So, proper decision-
making algorithm and proper training is needed [10, 11]. Deep learning is one of
the important concepts for natural language processing. For language translation, it
is important to choose the right decision. Based on the past experience, training can
be done and system can take the proper decision by using the concept of deep learn-
ing [12]. Sometimes language translation efficiency is reduced when phrase-based
translation is required for long sentences. Sequence of the words in inputted language
may differ with output language. So, rule-based system is required to improve the
translation quality [13]. If there is any complex sentence, tree-based method can be
applied for simplification. So, the splitting and decision making should be proper for
accurate language translation [14]. If there is any sentence with complicated struc-
ture, the parse tree may not be created properly. So, it is very important to generate
parse tree, so that the translation can be done efficiently [15]. At the time of machine
translation, it is very important to detect sub phrase as well as clause detection. If
there is any error in clause detection, the translation may not be done properly [16,
17]. Parsing-based sentence simplification is one of the methods where keywords
can be extracted. This process follows dependency-based parsing technique [18].
The study of related works shows that, due to the lack of resources, tools, vocab-
ularies, it is not always possible to translate the English sentence into regional lan-
guage by using the existing methodologies. If the translation is not properly done,
the meaning of the translated statement may not be appropriate. The main reason of
this problem is improper analysis of the sentences. Generally, in existing systems,
some general rules are applied that fails to do the proper conversion in some cases,
e.g., if the first letter of some name is given in small letter, the output of existing
system drastically changes. This is one of the major drawbacks of existing system.
Parts of Speech (POS) tagging does not work properly in these cases. It shows that
priority should be given to make the translation system intelligent enough to analyze
of the sentences properly.

[email protected]
250 S. Bal et al.

3 Methodology

The present work proposes a novel methodology of English to Bengali text trans-
lation. Here, an English text or sentence or a paragraph is used as input and the
system generates its appropriate Bengali meaning. So, first of all, the English text
is taken as input to the system. Then, the sentence is broken into words and then
by using the Parts of Speech (POS) Tagger, it retrieves the Parts of Speech of each
word. Then, the words are clustered into three groups, i.e.—Subject, Verb, Object,
and some other required parts (e.g., WH-words, exclamatory expression etc.). After
that, the parse tree is generated for English text and converted into the parse tree of
Bengali language by using different Bengali grammatical rules [19]. Here, a separate
file is used as database where the English word and the respective Bengali meanings
are stored. After judging the syntactical structure of the sentence, the appropriate
Bengali words are selected and used. Finally, the output of the system is generated
in Bengali language. The proposed system is shown with the help of a bock diagram
in Fig. 1.
In the present work, the types of sentences taken as input are shown in Fig. 2.
Here, two examples of assertive and interrogative sentences are taken and demon-
strated how they actually work.
Assertive Sentence: First recognize the pattern of the input sentence.
In English, the pattern is: Sub + Verb + Obj
e.g., “I am going to school.”

I (sub) am going (verb) to school (obj)

Now, as per the Bengali grammar [19], reconstruct as the pattern: Sub + Obj +
Verb

I (sub) school to (obj) am going (verb)

Fetch corresponding Bengali words.

Interrogative Sentence: First recognize the pattern of the sentence.
e.g., What is the capital of India?
So, the pattern in English is: “wh” word + obj + Sub

What (sub) is the capital (obj) of India (Sub)

Now the pattern as per the Bengali language is reconstructed.

So, pattern in Bengali is: sub + obj + “wh” word

[email protected]

Komoiboros Inggoris-Kadazandusun
43% (7)
Komoiboros Inggoris-Kadazandusun
140 pages
Unit 2B.: That'S Me in The Picture!
No ratings yet
Unit 2B.: That'S Me in The Picture!
15 pages
ICIIS2007 Transliteration
No ratings yet
ICIIS2007 Transliteration
6 pages
PHD Thesis Machine Translation
100% (3)
PHD Thesis Machine Translation
7 pages
Q1 Week1
No ratings yet
Q1 Week1
91 pages
Modul Bahasa Inggris
No ratings yet
Modul Bahasa Inggris
59 pages
Narration For BCS English PDF
100% (1)
Narration For BCS English PDF
10 pages
Marathi To English Sentence Translator For Simple
No ratings yet
Marathi To English Sentence Translator For Simple
5 pages
Unit 8 Stopping by The Woods, Class IX
No ratings yet
Unit 8 Stopping by The Woods, Class IX
14 pages
Machine Learning in Translation Corpora Processing
No ratings yet
Machine Learning in Translation Corpora Processing
281 pages
On Application of Natural Language Processing in Machine Translation
No ratings yet
On Application of Natural Language Processing in Machine Translation
5 pages
As 25
No ratings yet
As 25
324 pages
Final Research Paper
100% (1)
Final Research Paper
5 pages
EAPP Lecture
No ratings yet
EAPP Lecture
7 pages
Natural Language Processing
No ratings yet
Natural Language Processing
12 pages
A Hybrid Approach Using Phrases and Rules For Hindi To English Machine Translation
100% (1)
A Hybrid Approach Using Phrases and Rules For Hindi To English Machine Translation
17 pages
NLP Module 6
No ratings yet
NLP Module 6
183 pages
Vygotsky S-C Theory Moodle
No ratings yet
Vygotsky S-C Theory Moodle
7 pages
Amdework Asefa Belay
No ratings yet
Amdework Asefa Belay
119 pages
An English-Assamese Machine Translation System: Moirangthem Tiken Singh Rajdeep Borgohain
No ratings yet
An English-Assamese Machine Translation System: Moirangthem Tiken Singh Rajdeep Borgohain
6 pages
NLP M5 Part-2 SPP
No ratings yet
NLP M5 Part-2 SPP
62 pages
Machine Translation Mondal 2023
No ratings yet
Machine Translation Mondal 2023
90 pages
Group 7 453
No ratings yet
Group 7 453
52 pages
English - Module 4 BK 2
No ratings yet
English - Module 4 BK 2
134 pages
Machine Translation
No ratings yet
Machine Translation
58 pages
Gold Exp B2 U1to3 Review Lang Test A
No ratings yet
Gold Exp B2 U1to3 Review Lang Test A
3 pages
Class 12 The Last Lesson Notes CH - 1'
No ratings yet
Class 12 The Last Lesson Notes CH - 1'
24 pages
Unit 5
No ratings yet
Unit 5
42 pages
Machine Translation
No ratings yet
Machine Translation
38 pages
NLP Unit V
No ratings yet
NLP Unit V
18 pages
Chat
No ratings yet
Chat
23 pages
Text Operations 2021
No ratings yet
Text Operations 2021
45 pages
JETIR1806940
No ratings yet
JETIR1806940
12 pages
Seminar Sample Report
No ratings yet
Seminar Sample Report
20 pages
Reflect Ev Ls L3u2 Test
No ratings yet
Reflect Ev Ls L3u2 Test
8 pages
Machine Translation Approaches and Survey For Indian Languages
No ratings yet
Machine Translation Approaches and Survey For Indian Languages
18 pages
Transcript - Ty McGowan - 2nd Interview
No ratings yet
Transcript - Ty McGowan - 2nd Interview
22 pages
Fin Irjmets1702791465
No ratings yet
Fin Irjmets1702791465
5 pages
English HL P1 Nov 2024
No ratings yet
English HL P1 Nov 2024
13 pages
Syntactic and Semantic
No ratings yet
Syntactic and Semantic
4 pages
2016 - An Efficient English To Hindi Machine Translation System Using Hybrid Mechanism
No ratings yet
2016 - An Efficient English To Hindi Machine Translation System Using Hybrid Mechanism
5 pages
Article 16
No ratings yet
Article 16
8 pages
2016 Kituku, Muchemi & Nganga - Review On Machine Translation Approaches
No ratings yet
2016 Kituku, Muchemi & Nganga - Review On Machine Translation Approaches
8 pages
JSeva-ODEP-PhD - PristupniRad - Automatic Language Translation
No ratings yet
JSeva-ODEP-PhD - PristupniRad - Automatic Language Translation
13 pages
Lattice Based Lexical Transfer in Bengal
No ratings yet
Lattice Based Lexical Transfer in Bengal
8 pages
Machine Translation With Statistical Approach
No ratings yet
Machine Translation With Statistical Approach
33 pages
A Case Study On English-Malayalam Machine Translat
No ratings yet
A Case Study On English-Malayalam Machine Translat
8 pages
Individual Learner Differences
No ratings yet
Individual Learner Differences
15 pages
2018 - Generating Noun Declension-Case Markers For English To Indian Languages in Declension Rule Based MT Systems
No ratings yet
2018 - Generating Noun Declension-Case Markers For English To Indian Languages in Declension Rule Based MT Systems
7 pages
Bilingual Machine Translation
No ratings yet
Bilingual Machine Translation
8 pages
Multilingual Translator and Interpreter
No ratings yet
Multilingual Translator and Interpreter
6 pages
A Sanskrit-to-English Machine Translation Using Hybridization of Direct and Rule-Based Approach
No ratings yet
A Sanskrit-to-English Machine Translation Using Hybridization of Direct and Rule-Based Approach
20 pages
G20.1.405 Level 4 Ex.05
No ratings yet
G20.1.405 Level 4 Ex.05
3 pages
Common Devices in Poetry
No ratings yet
Common Devices in Poetry
3 pages
Sanskrit-English Translator With NLP
No ratings yet
Sanskrit-English Translator With NLP
4 pages
JETIR2211403
No ratings yet
JETIR2211403
6 pages
Evaluating Letter of Request
No ratings yet
Evaluating Letter of Request
4 pages
Referance 3
No ratings yet
Referance 3
4 pages
Termpaper
No ratings yet
Termpaper
6 pages
Machine Translation of Vedic Sanskrit Using Deep Learning Algorithm
No ratings yet
Machine Translation of Vedic Sanskrit Using Deep Learning Algorithm
4 pages
IJSRET V10 Issue3 125
No ratings yet
IJSRET V10 Issue3 125
3 pages
Machine Translation For English To Kanna
No ratings yet
Machine Translation For English To Kanna
8 pages
Comparative Study of Machine Translation Techniques
No ratings yet
Comparative Study of Machine Translation Techniques
16 pages
Machine Translation Approaches Issues An
No ratings yet
Machine Translation Approaches Issues An
7 pages
Extending Capabilities of English To Marathi Machi PDF
No ratings yet
Extending Capabilities of English To Marathi Machi PDF
8 pages
Ielts Academic Top 40 Language Frequency 2022
No ratings yet
Ielts Academic Top 40 Language Frequency 2022
2 pages
Machine Translation Development For Indian Languages and Its Approaches
No ratings yet
Machine Translation Development For Indian Languages and Its Approaches
21 pages
Atatürk Eğitim Fakültesi İngilizce Öğretmenliği 4 EN 2021 WEB
No ratings yet
Atatürk Eğitim Fakültesi İngilizce Öğretmenliği 4 EN 2021 WEB
1 page
1.1 General: Resourced" Languages. To Enhance The Translation Performance of Dissimilar Language
No ratings yet
1.1 General: Resourced" Languages. To Enhance The Translation Performance of Dissimilar Language
18 pages
Temp Research Paper
No ratings yet
Temp Research Paper
5 pages
Vietnam Has Witnessed The Boom of English Since The Initiation of The Economic Reform Known As Doi Moi in 1986
No ratings yet
Vietnam Has Witnessed The Boom of English Since The Initiation of The Economic Reform Known As Doi Moi in 1986
2 pages
Systematic Review On Techniques of Machine Translation For Indian Languages
No ratings yet
Systematic Review On Techniques of Machine Translation For Indian Languages
6 pages
Narrative Text - Google Formulir
No ratings yet
Narrative Text - Google Formulir
6 pages
Machine Translation and Its Approaches: Vanlalmuansangi Khenglawt, Lal Anpuia
No ratings yet
Machine Translation and Its Approaches: Vanlalmuansangi Khenglawt, Lal Anpuia
5 pages
Cameron Chaudhry - 2021 AP Close Reading Exercise Henry's Speech
No ratings yet
Cameron Chaudhry - 2021 AP Close Reading Exercise Henry's Speech
4 pages
2017 Oct Conf Machine Translation PDF
No ratings yet
2017 Oct Conf Machine Translation PDF
9 pages
Automated Machine Translation For Regional Languages: Problem Statement
No ratings yet
Automated Machine Translation For Regional Languages: Problem Statement
2 pages
English To Yorùbá Machine Translation System Using Rule-Based Approach
No ratings yet
English To Yorùbá Machine Translation System Using Rule-Based Approach
6 pages
(IJCST-V9I1P20) :T. Madhavi Kumari, Dr. A. Vinaya Babu
No ratings yet
(IJCST-V9I1P20) :T. Madhavi Kumari, Dr. A. Vinaya Babu
6 pages
Interactive English To Urdu Machine Translation Using Example-Based Approach
100% (2)
Interactive English To Urdu Machine Translation Using Example-Based Approach
8 pages
Learning Translation Rules From Bilingual English - Filipino Corpus
No ratings yet
Learning Translation Rules From Bilingual English - Filipino Corpus
10 pages
Graduate Diploma in Tesol - Assignment Brief For Module 1 - Section A - B - Tesol O3
No ratings yet
Graduate Diploma in Tesol - Assignment Brief For Module 1 - Section A - B - Tesol O3
8 pages
Machine Translation Using Open NLP and Rules Based System English To Marathi Translator
No ratings yet
Machine Translation Using Open NLP and Rules Based System English To Marathi Translator
4 pages
Hindi To English Machine Translation
No ratings yet
Hindi To English Machine Translation
4 pages
Voice Based Translator
No ratings yet
Voice Based Translator
4 pages
Extending Capabilities of English To Marathi Machine Translator
No ratings yet
Extending Capabilities of English To Marathi Machine Translator
8 pages
First Conditional Advice Interactive Worksheet
No ratings yet
First Conditional Advice Interactive Worksheet
2 pages
Spelling Menu For November
No ratings yet
Spelling Menu For November
1 page
The Newbie’s Guidebook to ChatGPT: A Beginner's Tutorial: The Newbie’s Guidebook
From Everand
The Newbie’s Guidebook to ChatGPT: A Beginner's Tutorial: The Newbie’s Guidebook
Timothy King
No ratings yet
Python Text Mining: Perform Text Processing, Word Embedding, Text Classification and Machine Translation
From Everand
Python Text Mining: Perform Text Processing, Word Embedding, Text Classification and Machine Translation
Alexandra George
No ratings yet

Proceedings of International Ethical Hacking Conference 2018

Uploaded by

Proceedings of International Ethical Hacking Conference 2018

Uploaded by

Advances in Intelligent Systems and Computing 811

Sauvik Bal, Supriyo Mahanta, Lopa Mandal and Ranjan Parekh

Abstract The present work proposes a methodology of machine translation system

Keywords POS tagging · Machine translation · Parse tree · Rule-based system

S. Bal (B) · S. Mahanta (B)

© Springer Nature Singapore Pte Ltd. 2019 247

Language translation is one of the important applications in the present scenario as

I (sub) am going (verb) to school (obj)

I (sub) school to (obj) am going (verb)

Fetch corresponding Bengali words.

What (sub) is the capital (obj) of India (Sub)

Now the pattern as per the Bengali language is reconstructed.

You might also like