2 Methodology
The first step is the conversion of words into a representative form that computational algorithms can work with. The simplest method, which has been explored in this work, is to attach a unique identifier to each word under consideration. A Keras tokenizer (Chollet et al., 2015) has been used for the Malayalam text-to-integer conversion, but in the case of Hindi it gave poorer results, with a single number representing multiple words. This will result in ambiguity and a considerable reduction in accuracy. Once the sentences are converted from a sequence of words into a sequence of numbers, they need to be padded to vectors of equal length in order to facilitate training by neural networks.
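A minimal sketch of this tokenization and padding step using the Keras tokenizer; the sentence list and variable names are illustrative:

```python
from tensorflow.keras.preprocessing.text import Tokenizer
from tensorflow.keras.preprocessing.sequence import pad_sequences

# Illustrative input; in practice this is the Malayalam corpus.
sentences = ["first example sentence", "second example"]

tokenizer = Tokenizer()              # assigns a unique integer id to each word
tokenizer.fit_on_texts(sentences)
sequences = tokenizer.texts_to_sequences(sentences)

# Pad every sequence to the length of the longest sentence so that
# all input vectors have equal length.
max_len = max(len(s) for s in sequences)
padded = pad_sequences(sequences, maxlen=max_len, padding='post')
```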
The number of sentences in the text classification task is much lower than the number of sentences in the Named Entity Recognition task. The Named Entity Recognition task also has a considerably higher number of words classified as 'others' than all other classes combined: the class distribution in the text classification task (Figure 1a) is balanced, while the word counts in Figure 1b tilt heavily in favour of 'others'. This is a big challenge in classification tasks, since the 'others' class is trained on over and over again.

[Figure 1: Number of classes for Malayalam text classification and Hindi Named Entity Recognition. (a) Malayalam news task classes; (b) Hindi Named Entity Recognition categories.]
A 64-unit LSTM is used for the classification of Malayalam text, with a 256-neuron layer with ReLU activation; the final layer, with 3 neurons representing the classes, is given a sigmoid activation, as in Figure 2a. The word embedding is chosen on the basis of the input dimension of the vector under consideration. Dropout, meaning the deactivation of certain neurons during the training process, is applied to prevent overfitting. The difference between sigmoid and ReLU is that ReLU does not have a negative branch.
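A minimal Keras sketch of this classifier; the layer sizes and activations follow the text, while the vocabulary size, embedding dimension, input length, and dropout rate are placeholder assumptions:

```python
from tensorflow.keras import Sequential, layers

VOCAB_SIZE = 20000   # assumption: not specified in the text
EMBED_DIM = 100      # assumption
MAX_LEN = 120        # assumption: length of the longest sentence

model = Sequential([
    layers.Input(shape=(MAX_LEN,)),
    layers.Embedding(VOCAB_SIZE, EMBED_DIM),
    layers.LSTM(64),                        # 64-unit LSTM
    layers.Dense(256, activation='relu'),   # 256-neuron ReLU layer
    layers.Dropout(0.5),                    # assumption: dropout rate not given
    layers.Dense(3, activation='sigmoid'),  # one neuron per class
])
model.compile(optimizer='rmsprop',          # RMSprop, as used for the smaller data set
              loss='categorical_crossentropy', metrics=['accuracy'])
```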
For the bidirectional LSTM, a single training epoch is carried out on batches; its architecture is given in Figure 2a. That single epoch took one hour, and it was considerably difficult to train the network further, in contrast with the LSTM, which was trained for 10 epochs. The word embedding dimension has been chosen on the basis of the total number of words, and a time-distributed output (see, for example, Altché and de La Fortelle, 2017) is used, which produces an output for every time step of the training. The input vector length is chosen as the size of the longest sentence in both cases. For optimization, RMSprop (Dauphin et al., 2015), which divides the learning rate by a running average of recent gradient magnitudes, was used to suit the smaller data set, while the Hindi NER used the Adam optimizer (Balles and Hennig, 2018) for training, since that data set is considerably bigger. Adam varies the learning rate adaptively and is known to perform well on sparse matrices, which is especially relevant for Natural Language Processing, since we work with sparse matrices owing to all the padding.
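Concretely, RMSprop keeps a running average of squared gradients, v_t = rho * v_{t-1} + (1 - rho) * g_t^2, and scales each update by lr / sqrt(v_t + eps). A minimal sketch of the NER model with a time-distributed output, compiled with Adam as described above; the unit count, output activation, and tag count are placeholder assumptions:

```python
from tensorflow.keras import Sequential, layers

VOCAB_SIZE = 30000   # assumption: not specified in the text
EMBED_DIM = 100      # assumption
MAX_LEN = 150        # placeholder: length of the longest sentence
NUM_TAGS = 10        # assumption: number of NER categories

model = Sequential([
    layers.Input(shape=(MAX_LEN,)),
    layers.Embedding(VOCAB_SIZE, EMBED_DIM),
    # return_sequences=True keeps one hidden state per time step
    layers.Bidirectional(layers.LSTM(64, return_sequences=True)),
    # the same classifier is applied at every time step,
    # giving one tag prediction per word
    layers.TimeDistributed(layers.Dense(NUM_TAGS, activation='softmax')),
])
model.compile(optimizer='adam', loss='sparse_categorical_crossentropy',
              metrics=['accuracy'])
```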
3 Results and Discussion

The results for the text classification and the Named Entity Recognition are encouraging, as depicted in 3a and 3b respectively, where a heat plot of different classification metrics is shown for each of the classes. One can easily see that although the accuracy is high, the precision comes down considerably for the text classification, and the recall is very low for the Named Entity Recognition. It is interesting to note that the 'datenum' class has the lowest recall; this can be intuitively understood from the fact that although many of the words the algorithm encountered are dates and numbers, the algorithm

[Figure 3(a): Malayalam news task]
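Per-class precision and recall of the kind shown in these heat plots can be computed, for instance, with scikit-learn; the library choice and label names here are illustrative, not from the text:

```python
from sklearn.metrics import classification_report

# Illustrative gold and predicted labels for the classification task.
y_true = ["sports", "politics", "sports", "business"]
y_pred = ["sports", "sports",   "sports", "business"]

# Per-class precision, recall, and F1 -- the metrics visualized in the heat plots.
print(classification_report(y_true, y_pred, zero_division=0))
```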