CNN LSTM Hybrid Approach For Sentiment Analysis
https://doi.org/10.22214/ijraset.2023.52191
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 11 Issue V May 2023- Available at www.ijraset.com
Abstract: In recent years, sentiment analysis has been one of the most popular research topics. It is employed to ascertain the actual intent behind a text and is primarily concerned with the processing and analysis of natural language data. The development of technology and the phenomenal rise of social media have produced a vast volume of unstructured textual information, and it is critical to examine the sentiments that underlie such texts. Sentiment analysis reveals the subjective opinions held in enormous volumes of text. The primary objective is to make the computer comprehend the context of the data so that it can be classified as positive or negative. In this study, (i) several machine learning models, including Naive Bayes, XGBoost, Random Forest, and LightGBM, are trained; (ii) the deep learning model Bi-LSTM, whose accuracy has shown promise, is implemented; and (iii) Bidirectional Encoder Representations from Transformers (BERT), a pre-trained language model combined with an external Bi-LSTM model, is implemented. Finally, a new CNN-LSTM hybrid approach is applied to the IMDb dataset and performs better than all the other models.
Keywords: Sentiment analysis, natural language processing, machine learning, deep learning, BERT, CNN-LSTM.
I. INTRODUCTION
Nowadays, individuals like to make decisions based on recommendations to save time, whether they are purchasing a product or watching a movie. Understanding customer behavior is crucial for successful marketing, so companies allow customers to leave reviews in order to better understand the decisions their customers make. Managing such a massive volume of data, however, is a difficult process. Sentiment analysis is a practical approach for resolving the question of whether a product achieves its goal or not. Besides the benefits that consumers gain from this user-generated material, a large number of business sectors are also effectively utilizing this developing technology and employing sentiment analysis to examine the preferences of their clients. This is why it is crucial to understand the motivation underlying any text. Figure 1 illustrates three different approaches to sentiment analysis: machine learning methods, and deep learning methods such as Bidirectional LSTM and BERT.
A. Challenges
In sentiment analysis there are some considerable challenges that must be addressed to obtain the best results.
The majority of the available data is written in English, while other languages are severely underrepresented, which makes analyzing and training on such data troublesome. Relying on previously stored data can also be a problem, since customer opinions may change over time.
Traditional machine learning algorithms do not perform up to the mark because of the way they handle larger datasets; their performance on large datasets is lower than that of deep learning models.
2) BERT Approach
BERT, which stands for Bidirectional Encoder Representations from Transformers, is an advanced deep learning technique used in
the field of natural language processing (NLP). Created by Google AI Language, BERT is a model based on neural networks that
utilizes the Transformer architecture to understand the contextual relationships among words in a given text dataset. Unlike
conventional NLP models that process text in a unilateral, sequential manner, BERT is a bidirectional model that accounts for the
entire input sentence or paragraph to generate context-sensitive word embeddings. BERT undergoes pre-training using massive
volumes of textual data, followed by fine-tuning on particular natural language processing (NLP) assignments like text
categorization, answering questions, and identifying named entities. BERT has attained impressive results on a diverse range of
NLP benchmarks and has emerged as a prevalent choice for various NLP applications.
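As an illustration, the following is a minimal sketch of fine-tuning a pre-trained BERT model for binary sentiment classification with the Hugging Face Transformers library. The model name, learning rate, and the single training step shown are illustrative assumptions; the external Bi-LSTM head used in this work is not reproduced here.

```python
# Minimal sketch: fine-tuning a pre-trained BERT encoder for binary
# sentiment classification with Hugging Face Transformers. Model name
# and hyperparameters are illustrative assumptions, not the exact
# configuration used in this paper.
import torch
from transformers import BertTokenizerFast, BertForSequenceClassification

tokenizer = BertTokenizerFast.from_pretrained("bert-base-uncased")
model = BertForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2)  # 2 classes: negative / positive

texts = ["A wonderful, moving film.", "Dull plot and wooden acting."]
labels = torch.tensor([1, 0])

# Tokenize into context-aware sub-word ids with padding and truncation.
batch = tokenizer(texts, padding=True, truncation=True,
                  max_length=128, return_tensors="pt")

optimizer = torch.optim.AdamW(model.parameters(), lr=2e-5)
model.train()
outputs = model(**batch, labels=labels)   # forward pass with cross-entropy loss
outputs.loss.backward()                   # one illustrative training step
optimizer.step()
```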
4) CNN-LSTM Model
This hybrid model is trained on the IMDb dataset, which contains English and French language text, because of its large size. More data provides more information to the model and therefore yields a more generalized model.
III. METHODOLOGY
A. Dataset and Preprocessing
The datasets used were the Amazon Reviews dataset and the IMDb Reviews dataset; both were created by combining data from different sources. The Amazon Reviews data consists of a total of 10,000 English and German language reviews with three feature columns: text, sentiment, and title. In this research, the text column, which contains the full review of the product, is used.
The IMDb dataset contains a total of 75,000 English and French language user reviews of various movies and consists of two columns: review and sentiment. For both datasets, the task is a binary classification problem with two classes, positive and negative, encoded as 1 and 0 respectively.
Although reviews of a product or movie originally range from 1 to 5 stars, the metadata provided with both datasets states that the reviews have already been compiled into the two classes, positive and negative.
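As a small illustration, the following sketch shows how such class labels could be encoded as 0/1 targets with pandas, assuming the column names described above; the CSV file names are hypothetical.

```python
# Hedged sketch: encoding the sentiment column as 0/1 labels with pandas.
# File names are hypothetical; column names follow the dataset description above.
import pandas as pd

imdb = pd.read_csv("imdb_reviews.csv")       # columns: review, sentiment
amazon = pd.read_csv("amazon_reviews.csv")   # columns: text, sentiment, title

# Map the textual class labels to the binary targets used for training.
imdb["label"] = imdb["sentiment"].map({"positive": 1, "negative": 0})
amazon["label"] = amazon["sentiment"].map({"positive": 1, "negative": 0})

print(imdb["label"].value_counts())
```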
1) Data Cleaning
The initial step in training a model involves data cleaning, which aims to eliminate redundant words and phrases from texts. The
objective of this process is to enhance the machine learning model's performance by removing unnecessary elements from the data.
For example, text in the raw data: “#5 star is My review regarding the movie Titanic! which I watched @ hall/cinema.</p>”
The following items are eliminated at this stage:
● Punctuation: redundant punctuation is removed. After this step: “5 star is My review regarding the movie Titanic which I watched hall cinema </p>”
● HTML tags and emojis: HTML tags and emojis are removed from the text, and it is converted to lower case. After this step: “5 star is my review regarding the movie titanic which i watched hall cinema”
After performing the above steps, pre-processing is carried out on the cleaned text data; a minimal cleaning sketch is shown below.
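The exact cleaning rules used in this work are not specified, so the regular expressions below are illustrative assumptions that reproduce the example above.

```python
# Hedged sketch of the cleaning step: strip HTML tags, emojis and
# punctuation, then lower-case the text. The patterns are assumptions.
import re

def clean_text(text: str) -> str:
    text = re.sub(r"<[^>]+>", " ", text)             # drop HTML tags such as </p>
    text = text.encode("ascii", "ignore").decode()   # drop emojis / non-ASCII symbols
    text = re.sub(r"[^\w\s]", " ", text)             # drop punctuation
    text = re.sub(r"\s+", " ", text).strip()         # collapse extra whitespace
    return text.lower()

raw = "#5 star is My review regarding the movie Titanic! which I watched @ hall/cinema.</p>"
print(clean_text(raw))
# -> "5 star is my review regarding the movie titanic which i watched hall cinema"
```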
2) Text Pre-processing
The text pre-processing step is also a crucial step in natural language processing, since textual data is not directly recognized by a machine learning model and must be transformed into numerical data. Some preprocessing steps are:
● Lemmatization: A lemmatizer from the nltk.stem.wordnet library is used to remove tenses from sentences, and, as the example shows, stop words such as “is” and “the” are also dropped. It is used instead of stemming when the dataset size is large. After this step: “5 star review regarding movie titanic watch hall cinema.”
● TF-IDF Vectorizer: TF-IDF (Term Frequency–Inverse Document Frequency) is a mathematical technique in natural language processing that is applied to the cleaned text columns, after splitting the data into training and testing sets, to tokenize the text and generate word-frequency scores. The TF-IDF vectorizer takes an array of documents as input and assigns each unique word a weight scaled by its importance across all documents or sentences in the corpus. The output is an array containing a value for each word relative to all the words present in the corpus or document. A short sketch of both steps is given after this list.
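A minimal sketch of these two pre-processing steps, assuming NLTK's WordNet lemmatizer and scikit-learn's TfidfVectorizer; the split ratio and example documents are illustrative.

```python
# Hedged sketch of the pre-processing described above: WordNet
# lemmatization followed by TF-IDF vectorization on the training split.
import nltk
from nltk.stem.wordnet import WordNetLemmatizer
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.model_selection import train_test_split

nltk.download("wordnet")  # one-time download of the WordNet corpus

lemmatizer = WordNetLemmatizer()

def lemmatize(text: str) -> str:
    # Lemmatize each token as a verb so tense is stripped ("watched" -> "watch").
    return " ".join(lemmatizer.lemmatize(tok, pos="v") for tok in text.split())

docs = ["5 star review regarding movie titanic watched hall cinema",
        "terrible movie wasted evening"]
labels = [1, 0]
docs = [lemmatize(d) for d in docs]

X_train, X_test, y_train, y_test = train_test_split(docs, labels, test_size=0.5)

# Fit TF-IDF only on the training split, then transform both splits.
vectorizer = TfidfVectorizer(stop_words="english")
X_train_vec = vectorizer.fit_transform(X_train)
X_test_vec = vectorizer.transform(X_test)
print(X_train_vec.shape)
```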
1) CNN
The convolutional layer's primary job is to extract meaningful features from the text data. This is accomplished by performing convolution operations on the word vectors generated by the embedding layer. The Rectified Linear Unit (ReLU) is used as the nonlinear activation function. It is defined as f(x) = max(0, x): ReLU returns x if the value is positive; otherwise, it returns 0.
2) MaxPooling
The convolution operation generates feature maps that form a high-level vector representation. A max-pooling layer is placed after the CNN layer to aid in the selection of meaningful information by discarding weak activations. This helps avoid overfitting caused by outlier text.
3) LSTM
Long Short-Term Memory (LSTM) is a particular type of RNN designed to integrate current and past information. It consists of a memory block and three gates that govern the flow of data through the LSTM module at a given time step. These gates control how the current memory cell and the current hidden state are updated. A sketch of the complete hybrid architecture is given below.
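The following is a minimal Keras sketch of such a CNN-LSTM hybrid for binary sentiment classification; the vocabulary size, sequence length, and layer widths are illustrative assumptions rather than the exact hyperparameters used in this work.

```python
# Hedged sketch of a CNN-LSTM hybrid for binary sentiment classification.
# Layer sizes are illustrative assumptions.
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import (Input, Embedding, Conv1D, MaxPooling1D,
                                     LSTM, Dense)

VOCAB_SIZE, MAX_LEN = 20000, 200

model = Sequential([
    Input(shape=(MAX_LEN,)),                       # padded sequences of word ids
    Embedding(VOCAB_SIZE, 128),                    # word vectors
    Conv1D(64, kernel_size=5, activation="relu"),  # local n-gram features (ReLU)
    MaxPooling1D(pool_size=2),                     # keep the strongest activations
    LSTM(64),                                      # sequential context
    Dense(1, activation="sigmoid"),                # positive / negative
])
model.compile(optimizer="adam", loss="binary_crossentropy",
              metrics=["accuracy"])
model.summary()
```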
F. Evaluation Metrics
Once the model is built, its efficacy is evaluated using several performance indicators such as accuracy, precision score, ROC, and AUC score. Evaluation metrics are used to assess any statistical or machine learning model, and such an assessment is necessary for any undertaking. A model can be evaluated using a variety of measures. In this research, models are compared on the basis of accuracy and ROC score, since a single metric alone cannot determine the optimal strategy. The measures used are described below:
● Accuracy Score: the proportion of correct predictions out of all predictions, i.e., Accuracy = (TP + TN) / (TP + TN + FP + FN).
● Precision Score: the proportion of predicted positive-class instances that are actually positive, i.e., Precision = TP / (TP + FP).
● Receiver Operating Characteristic (ROC) Score: the area under the ROC curve, which measures how well a classification system can distinguish between the distinct classes. Its value ranges from 0 to 1, with 1 denoting optimal performance and 0 signifying the lowest possible effectiveness.
● Early Stopping: a technique used to avoid overfitting. By specifying a patience value, the model continues to train only until the validation loss stops decreasing. A short sketch of these metrics and the early-stopping callback is given after this list.
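The sketch below shows how these metrics could be computed with scikit-learn and how early stopping could be configured in Keras; the patience value is an assumption.

```python
# Hedged sketch: accuracy, precision and ROC-AUC with scikit-learn,
# plus an early-stopping callback for Keras training.
from sklearn.metrics import accuracy_score, precision_score, roc_auc_score
from tensorflow.keras.callbacks import EarlyStopping

y_true = [1, 0, 1, 1, 0]
y_prob = [0.9, 0.2, 0.6, 0.4, 0.1]               # predicted positive-class probabilities
y_pred = [1 if p >= 0.5 else 0 for p in y_prob]  # threshold at 0.5

print("Accuracy :", accuracy_score(y_true, y_pred))
print("Precision:", precision_score(y_true, y_pred))
print("ROC AUC  :", roc_auc_score(y_true, y_prob))

# Stop training once validation loss has not improved for 3 consecutive epochs.
early_stop = EarlyStopping(monitor="val_loss", patience=3,
                           restore_best_weights=True)
# model.fit(X_train, y_train, validation_split=0.2, callbacks=[early_stop])
```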
Table 1: Accuracy and ROC score of machine learning models on the Amazon Reviews dataset
2) IMDb Reviews dataset: Table 2 clearly shows that the three most effective models when trained on the IMDb Reviews dataset were LightGBM, Naive Bayes, and Linear SVC. An ensemble of these three models was built to obtain the best possible results among the machine learning models for the IMDb Reviews dataset; a sketch of such an ensemble is given after Table 2.
With the ensemble model we obtain an accuracy of 0.894, a precision score of 0.898, and an ROC score of 0.894.
Table 2: Accuracy and ROC score of machine learning models on the IMDb Reviews dataset
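The following is a hedged sketch of how the three models could be combined with a scikit-learn voting ensemble over the TF-IDF features; the voting scheme and hyperparameters are assumptions, as the exact ensembling method is not specified.

```python
# Hedged sketch: ensemble of the three best machine learning models
# (LightGBM, Naive Bayes, Linear SVC) over TF-IDF features.
from sklearn.ensemble import VotingClassifier
from sklearn.naive_bayes import MultinomialNB
from sklearn.svm import LinearSVC
from lightgbm import LGBMClassifier

ensemble = VotingClassifier(
    estimators=[
        ("lgbm", LGBMClassifier()),
        ("nb", MultinomialNB()),
        ("svc", LinearSVC()),
    ],
    voting="hard",  # majority vote; LinearSVC has no predict_proba for soft voting
)
# ensemble.fit(X_train_vec, y_train)
# y_pred = ensemble.predict(X_test_vec)
```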
The BERT approach gave an accuracy score of 0.86 and an ROC score of 0.935 on the IMDb dataset, while on the Amazon dataset it gave an accuracy score of 0.79 and an ROC score of 0.86, both better than the Bi-LSTM numbers. For classifying IMDb reviews, an integrated approach constructed from CNN and LSTM components can be used. The experimental findings show that the proposed deep learning model, built on a combination of CNN and LSTM blocks, beat all other models with about 90% accuracy and an ROC score of 0.96, revealing its outstanding effectiveness. This method could greatly help in differentiating positive from negative feedback, in order to better comprehend the preferences and interests of individuals from varied backgrounds, and to strengthen the link between consumers and enterprises. The hybrid model was not applied to the Amazon dataset because of its small size.
For future work, better accuracy could be obtained by tuning and by adding methods such as dropout to avoid overfitting, and by applying different optimizers such as SGD and Adam. The proposed model can also be trained on multi-class sentiment analysis problems with a slight modification of the last layer, i.e., using a softmax function instead of the sigmoid function.