Deep Learning for Hindi Text Classification: A Comparison

Department of Computer Science and Engineering, Indian Institute of Technology Madras
{rbjoshi1309, goyalpoorvi, ravirajoshi}@gmail.com
1 Introduction
Natural language processing (NLP) refers to computational techniques used for processing human language, which can be represented either as text or as speech. NLP based on deep learning has become very popular because of its ability to handle text that is far from grammatically correct. The ability to learn from data has made machine learning systems powerful enough to process any type of unstructured text. Machine learning approaches have been used to achieve state-of-the-art results on NLP tasks such as text classification, machine translation, question answering, text summarization, text ranking, and relation classification.
The focus of our work is text classification of Hindi language. Text classifica-
tion is the most widely used NLP task. It finds application in sentiment analy-
sis, spam detection, email classification, and document classification to name a
ii R. Joshi et al.
word vectors externally and using them in the target task. These approaches
represent transfer learning in the context of NLP. Models like Skip-Thought
Vectors, Universal Sentence Encoder by Google, InferSent, and BERT have been used to learn sentence embeddings. Using pre-trained sentence embeddings lowers the training time and is more robust on small target data sets. In this work, we also evaluate pre-trained multi-lingual sentence embeddings obtained using BERT and LASER to draw a better comparison.
The main contributions of this paper are:
– Compare variations of CNN and LSTM models for Hindi text classification.
– Effectiveness of Hindi fastText word embeddings is evaluated.
– Effectiveness of multi-lingual pre-trained sentence embeddings based on BERT and LASER is evaluated on a Hindi corpus.
2 Related Work
There has been limited literature on Hindi text classification. In his early work [1], Arora used traditional n-gram and weighted n-gram methods for sentiment analysis of Hindi text. Tummalapalli et al. [9] used deep learning techniques, namely basic CNN, LSTM, and multi-input CNN, for evaluating the classification accuracy of Hindi and Telugu texts. Their main focus was capturing morphological variations in the Hindi language using word-level and character-level features. CNN based models performed better as compared to LSTM and SVM using n-gram features. The datasets used were created using translation. In this work, we are concerned with the performance of different model architectures and word vectors, so we do not consider character-level or subword-level features.
In general, there has been a lot of research on text classification and sentiment analysis employing supervised and semi-supervised techniques. Kim [6] proposed a CNN based architecture for classification of English sentences. A simple bag of words model based on averaging of fastText word vectors was proposed in [5], serving as a simple and fast baseline for sentence classification tasks. Usage of RNNs for text classification was introduced in [7], and Bi-LSTM was augmented with simple attention in [10]. Classification results of these models on Hindi text are reported in this work.
Sentence embeddings evaluated in this work include multi-lingual LASER embeddings [2] and multi-lingual BERT based embeddings [4]. LASER uses a Bi-LSTM encoder to generate embeddings whereas BERT is based on the Transformer architecture. LASER takes a neural machine translation approach to learning sentence representations: it builds a sequence-to-sequence model with a Bi-LSTM encoder-decoder architecture, and the encoder Bi-LSTM is used to generate sentence representations. BERT, on the other hand, uses a bi-directional transformer encoder for learning word and sentence representations. It uses a masked language model as the pre-training objective to mitigate the unidirectional training inherent in the simple next-word-prediction language modeling task.
3 Datasets
– TREC question dataset, which involves classifying a question sentence into one of six types. The dataset has a predefined train-test split, with 5452 training samples and 500 testing samples. 10% of the training data was randomly held out for validation.
– Stanford Sentiment Treebank datasets SST-1 and SST-2. SST-1 contains single-sentence movie reviews rated on a five-point sentiment scale. The dataset has a predefined train-test-dev split, with 8544 training samples, 2210 testing samples, and 1101 validation samples. SST-2 is a binary version of SST-1 with only two labels, positive and negative. It has 6920 training samples, 1821 testing samples, and 872 validation samples.
The original English versions of these datasets are translated to Hindi using Google Translate. A language model trained on a Hindi wiki corpus was used to filter out noisy translated sentences. We assume no out-of-vocabulary words, as the fastText model generates word embeddings for unknown words as well. A common vocabulary of 31k words is created and fastText vectors are used to initialize the embedding matrix.
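As an illustration, a minimal sketch of building such an embedding matrix from pre-trained Hindi fastText vectors is given below; the model file name (cc.hi.300.bin) and the vocabulary construction are assumptions and not necessarily the exact pipeline used in this work.

import numpy as np
import fasttext

# Pre-trained Hindi fastText model (assumed file name). Its subword n-grams
# allow it to produce a vector even for words never seen during pre-training.
ft = fasttext.load_model("cc.hi.300.bin")

def build_embedding_matrix(vocab, dim=300):
    # vocab: dict mapping word -> integer index (index 0 reserved for padding).
    matrix = np.zeros((len(vocab) + 1, dim), dtype=np.float32)
    for word, idx in vocab.items():
        matrix[idx] = ft.get_word_vector(word)  # works for unknown words as well
    return matrix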
4 Model Architectures
The data samples consist of a sequence of words, so different sequence processing models are explored in this work. While the most natural sequence processing model is the LSTM, other models are equally applicable as the sequence lengths are short.
– BOW: The bag of words model does not consider the order of words. The word vectors of the input sentence are averaged to get a sentence embedding of size 300. This is followed by a dense layer of size equal to the number of output classes. The softmax output is fed to a cross-entropy loss function and Adam is used as the optimizer.
– BOW + Attention: In this model, instead of simply averaging, a weighted average of the word vectors is taken to generate the sentence embedding. The size of the sentence embedding is 300, and it is followed by a dense layer as in the BOW model. The weight for each time step is learned by passing the corresponding word vector through a linear layer of size 300 × 1. A softmax over these computed weights gives the probabilistic attention scores. This attention approach is described in [10]; a sketch of the mechanism is given after this list.
– CNN: The sequence of word embeddings is passed through three 1-D convolutions with kernel sizes 2, 3, and 4. Each convolution uses 128 filters. The output of each 1-D convolution is max-pooled over time and the results are concatenated to get the sentence representation, which has 384 dimensions. A final dense layer of size equal to the number of output classes follows; a code sketch of this model is given after this list.
– LSTM: The word vectors are passed as input to a two-layer stacked LSTM. The output of the final time step is given as input to a dense layer for classification. The LSTM cell size is 128, and the final time step output, which is treated as the sentence representation, is of size 128.
– Bi-LSTM: The sequence of word embeddings is passed through two stacked bi-directional LSTMs. The output is max-pooled over time and followed by a dense layer of size equal to the number of output classes. The LSTM cell size is 128, and the max-pooled output, which is treated as the sentence representation, is of size 256.
– CNN + Bi-LSTM: The sequence of word embeddings is passed through a 1-D convolution with kernel size 3 and 256 filters. The output is passed through a bi-directional LSTM. The output of the Bi-LSTM is max-pooled over time and followed by a final dense layer.
– Bi-LSTM + Attention: This is similar to the Bi-LSTM model. The difference is that instead of max-pooling over the output of the Bi-LSTM, the attention mechanism described above is employed.
– LASER and BERT: Single pre-trained models for learning multilingual sentence representations, BERT and LASER, were released by Google and Facebook respectively. BERT is a 12-layer transformer based model trained on multilingual data of 104 languages. LASER is a 5-layer Bi-LSTM model pre-trained on multilingual data of 93 languages. Both of these models have Hindi as one of the training languages. The sentence embeddings extracted from these models are used without any fine-tuning or modifications. The pre-trained sentence embeddings are extracted from the corresponding models and passed through a dense layer of 512 units, followed by a dense layer of size equal to the number of output classes over which softmax is computed (see the sketch after this list). BERT generates 768-dimensional embeddings whereas LASER embeddings are 1024-dimensional.
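The CNN model referred to above can be expressed as the following minimal tf.keras sketch; hyper-parameters not stated in the text (maximum sequence length, activation functions) are assumptions.

import tensorflow as tf
from tensorflow.keras import layers, initializers

def build_cnn(num_classes, embedding_matrix, max_len=50):
    vocab_size, dim = embedding_matrix.shape
    inp = layers.Input(shape=(max_len,), dtype="int32")
    emb = layers.Embedding(vocab_size, dim,
                           embeddings_initializer=initializers.Constant(embedding_matrix),
                           trainable=True)(inp)           # fastText initialization
    pooled = []
    for k in (2, 3, 4):                                   # three 1-D convolutions
        conv = layers.Conv1D(filters=128, kernel_size=k, activation="relu")(emb)
        pooled.append(layers.GlobalMaxPooling1D()(conv))  # max-pool over time
    sent = layers.Concatenate()(pooled)                   # 3 x 128 = 384-d sentence vector
    out = layers.Dense(num_classes, activation="softmax")(sent)
    model = tf.keras.Model(inp, out)
    model.compile(optimizer="adam", loss="sparse_categorical_crossentropy",
                  metrics=["accuracy"])
    return model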
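The attention mechanism used in the BOW + Attention and Bi-LSTM + Attention models can be sketched as a small custom layer, assuming the same tf.keras setting as above.

import tensorflow as tf
from tensorflow.keras import layers

class TimeStepAttention(layers.Layer):
    # Scores each time step with a d x 1 linear layer, applies a softmax over
    # time to obtain probabilistic attention weights, and returns the weighted
    # sum of the inputs as the sentence representation.
    def __init__(self, **kwargs):
        super().__init__(**kwargs)
        self.score = layers.Dense(1, use_bias=False)

    def call(self, x):                                   # x: (batch, time, dim)
        weights = tf.nn.softmax(self.score(x), axis=1)   # (batch, time, 1)
        return tf.reduce_sum(weights * x, axis=1)        # (batch, dim)

In the BOW + Attention model this layer is applied directly to the word embeddings, whereas in the Bi-LSTM + Attention model it replaces the max-pooling over the Bi-LSTM outputs.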
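Similarly, the lightweight classifier placed on top of the pre-computed LASER or BERT sentence embeddings is a small feed-forward network; the ReLU activation of the 512-unit layer is an assumption, as the text does not specify it.

import tensorflow as tf
from tensorflow.keras import layers

def build_sentence_embedding_classifier(embedding_dim, num_classes):
    # embedding_dim is 1024 for LASER and 768 for multilingual BERT embeddings.
    inp = layers.Input(shape=(embedding_dim,))
    hidden = layers.Dense(512, activation="relu")(inp)   # 512-unit dense layer
    out = layers.Dense(num_classes, activation="softmax")(hidden)
    model = tf.keras.Model(inp, out)
    model.compile(optimizer="adam", loss="sparse_categorical_crossentropy",
                  metrics=["accuracy"])
    return model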
5 Results

In the reported results, word vectors are indicated as random for random initialization, fast text for trainable fastText initialization, and fast text-static for non-trainable fastText initialization. Out of all the models, the vanilla CNN performs the best on all the datasets. CNNs are known to perform well on short texts, and the same is visible here as the datasets under consideration do not have long sentences. There is only a small difference in the performance of the different LSTM models. However, Bi-LSTM with max-pooling performed better than its attention version and the unidirectional LSTM. The bag of words model with attention fared better than the simple bag of words model. Attention was particularly helpful when static fastText word vectors were used. Stacked CNN-LSTM models were somewhere between the LSTM and CNN based models. We did not see a large drop in performance due to random initialization of word vectors, but the performance across different epochs was more stable with fastText initialization. Finally, as compared to the generic sentence embeddings obtained from BERT and LASER, specific embeddings obtained from custom models performed better. LASER was able to come close to the best performing model. This shows that LASER was able to capture important discriminative features of a sentence required for the task at hand, whereas BERT failed to capture the same.
6 Conclusion
In this work, we compared different deep learning approaches for Hindi sentence classification. The word vectors were initialized either with fastText word vectors trained on a Hindi corpus or with random word vectors. This work therefore also serves as an evaluation of fastText word embeddings for the Hindi sentence classification task. CNN models perform better than LSTM based models on the datasets considered in this paper. Although we would expect BOW to perform the worst, its numbers are comparable to those of LSTM and CNN. Therefore, if accuracy can be traded off for speed, BOW is useful. LSTMs do not do better than CNNs, possibly because word order is relatively free in Hindi. Sentence representations captured by the LASER multilingual model were richer than those from BERT. However, overall, custom models trained on the specific datasets performed better than lightweight models directly utilizing pre-trained sentence encodings. The real advantage of multi-lingual embeddings can be better evaluated on tasks involving text from multiple languages.
References
1. Arora, P.: Sentiment analysis for Hindi language (2013)
2. Artetxe, M., Schwenk, H.: Massively multilingual sentence embeddings for zero-
shot cross-lingual transfer and beyond. arXiv preprint arXiv:1812.10464 (2018)
3. Conneau, A., Lample, G., Rinott, R., Williams, A., Bowman, S.R., Schwenk, H., Stoyanov, V.: XNLI: Evaluating cross-lingual sentence representations. arXiv preprint arXiv:1809.05053 (2018)
4. Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)
5. Joulin, A., Grave, E., Bojanowski, P., Mikolov, T.: Bag of tricks for efficient text
classification. arXiv preprint arXiv:1607.01759 (2016)
6. Kim, Y.: Convolutional neural networks for sentence classification. arXiv preprint
arXiv:1408.5882 (2014)
7. Lai, S., Xu, L., Liu, K., Zhao, J.: Recurrent convolutional neural networks for text
classification. In: Twenty-ninth AAAI conference on artificial intelligence (2015)
8. Pennington, J., Socher, R., Manning, C.: GloVe: Global vectors for word representation. In: Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP). pp. 1532–1543 (2014)
9. Tummalapalli, M., Chinnakotla, M., Mamidi, R.: Towards better sentence classifi-
cation for morphologically rich languages (2018)
10. Zhou, P., Shi, W., Tian, J., Qi, Z., Li, B., Hao, H., Xu, B.: Attention-based bidirec-
tional long short-term memory networks for relation classification. In: Proceedings
of the 54th Annual Meeting of the Association for Computational Linguistics (Vol-
ume 2: Short Papers). pp. 207–212 (2016)