Sentiment Analysis On Large Scale Amazon Product Review
All content following this page was uploaded by Tanjim Ul Haque on 03 June 2019.
Abstract—The world we see nowadays is becoming more digitalized. In this digitalized world, e-commerce is gaining ascendancy by making products available within the reach of customers, who do not have to leave their homes. As people now rely on buying products online, the importance of reviews is growing. To select a product, a customer needs to go through thousands of reviews to understand it. But in this prospering age of machine learning, going through thousands of reviews would be much easier if a model were used to polarize those reviews and learn from them. We used a supervised learning method on a large-scale Amazon dataset to polarize it and obtained satisfactory accuracy.

Keywords—Sentiment analysis, pool based active learning, feature extraction, text classification, machine learning.

I. INTRODUCTION

As commerce around the world has moved almost entirely to online platforms, people trade products through different e-commerce websites. For that reason, reading reviews before buying has also become common. Customers nowadays are more inclined to rely on reviews when buying a product. So analyzing the data from those customer reviews to make the data more dynamic is an essential field nowadays. In this age of increasingly capable machine learning algorithms, reading thousands of reviews to understand a product is rather time consuming, whereas we can polarize the reviews in a particular category to understand a product's popularity among buyers all over the world.

The objective of this paper is to categorize the positive and negative feedback of customers on different products and to build a supervised learning model that polarizes a large amount of reviews. A study on Amazon last year revealed that over 88% of online shoppers trust reviews as much as personal recommendations. Any online item with a large number of positive reviews provides a powerful comment on the legitimacy of the item. Conversely, books, or any other online items, without reviews put potential prospects in a state of distrust. Quite simply, more reviews look more convincing. People value the consent and experience of others, and the reviews of an item are the only way to understand others' impressions of the product. Opinions, collected from users' experiences regarding specific products or topics, straightforwardly influence future customer purchase decisions [1]. Similarly, negative reviews often cause sales loss [2]. Hence, understanding customer feedback and polarizing it accordingly over a large amount of data is the goal. Some similar works have been done on Amazon datasets. The work in [5] performed opinion mining over a small dataset of Amazon product reviews to understand the polarized attitudes towards the products.

In our model, we used both manual and active learning approaches to label our datasets. In the active learning process, different classifiers are used to provide accuracy until a satisfactory level is reached. After getting a satisfactory result, we took those labeled datasets and processed them. From the processed dataset we extracted features that are then classified by different classifiers. To get higher accuracy, we used a combination of two kinds of approaches to extract features: the bag of words approach and the tf-idf & Chi square approach.

II. RELATED WORKS

Much research related to product reviews, sentiment analysis, or opinion mining has been done recently. In [3], Elli and Wang extracted sentiment from reviews and analyzed the results to build a business model. They claimed that the demonstrated tools were robust enough to give them high accuracy. The use of business analytics made their decisions more appropriate. They also worked on detecting emotions from reviews, predicting gender based on names, and detecting fake reviews. The programming languages commonly used were Python and R. They mainly used Multinomial Naïve Bayes (MNB) and support vector machine (SVM) as their main classifiers. In [4], the authors applied existing supervised learning algorithms to predict a review's rating on a given numerical scale using only text. They used hold-out cross validation with 70% of the data as training data and 30% as testing data. In that paper the authors used different classifiers to determine the precision and recall values. The author in [5] applied and extended the current work in the field of natural language processing and sentiment analysis to data from Amazon review datasets. Naïve Bayes and decision list classifiers were used to tag a given review as positive or negative. The book and Kindle sections of Amazon reviews were selected. The author in [6] aimed to build a system that visualizes the reviews sentiment in the form
FP (False Positive) is the number of negative instances incorrectly classified as positive.

FN (False Negative) is the number of positive instances incorrectly classified as negative.

TN (True Negative) is the number of negative instances correctly classified as negative.

Precision: Precision measures the exactness of a classifier, i.e., how many of the returned documents are correct. Higher precision means fewer false positives, while lower precision means more false positives. Precision (P) is the ratio of the number of instances correctly classified as positive to the total number of instances classified as positive. It can be defined as

$$P = \frac{TP}{TP + FP}$$

Recall: Recall measures the sensitivity of a classifier, i.e., how much of the positive data it returns. Higher recall means fewer false negatives. Recall (R) is the ratio of the number of instances correctly classified as positive to the total number of actual positive instances. This can be shown as

$$R = \frac{TP}{TP + FN}$$

Accuracy: Accuracy measures how often the classifier makes the correct prediction. It is the ratio between the number of correct predictions and the total number of predictions.

| Dataset | Classifier | Accuracy 10 Fold (%) | Accuracy 5 Fold (%) | Precision | Recall | F1 score |
|---|---|---|---|---|---|---|
| Cellphone & Accessories | Naïve Bayes | | | | | |
| | Stochastic Gradient Descent | 91.88 | 84.93 | 0.9 | 0.93 | 0.91 |
| | Random Forest | 92.72 | 88.20 | 0.967 | 0.967 | 0.97 |
| | Logistic regression | 88.2 | 81.99 | 0.87 | 0.88 | 0.88 |
| | Decision tree | 91.45 | 83.71 | 0.95 | 0.95 | 0.95 |

Table-1: Experiment result for cellphone & accessories data

| Dataset | Classifier | Accuracy 10 Fold (%) | Accuracy 5 Fold (%) | Precision | Recall | F1 score |
|---|---|---|---|---|---|---|
| Musical | Linear support Vector machine | 94.02 | 89.76 | 0.9889 | 0.971 | 0.98 |
| | Multinomial Naïve Bayes | 91.57 | 89.77 | 0.98 | 0.93 | 0.96 |
| | Stochastic Gradient Descent | 92.89 | 88.264 | 0.99 | 0.96 | 0.98 |
| | Random Forest | 93.56 | 88.51 | 0.98 | 0.97 | 0.975 |
| | Logistic regression | 91.34 | 87.14 | 0.96 | 0.95 | 0.95 |
| | Decision tree | 92.45 | 86.27 | 0.969 | 0.96 | 0.96 |

| Dataset | Classifier | Accuracy 10 Fold (%) | Accuracy 5 Fold (%) | Precision | Recall | F1 score |
|---|---|---|---|---|---|---|
| | Naïve Bayes | | | | | |
| | Stochastic Gradient Descent | 92.61 | 90.96 | 0.964 | 0.988 | 0.975 |
| | Random Forest | 92.89 | 91.14 | 0.968 | 0.988 | 0.978 |
| | Logistic regression | 88.96 | 87.843 | 0.919 | 0.955 | 0.937 |
| | Decision tree | 91.569 | 87.50 | 0.962 | 0.9669 | 0.96 |
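The precision, recall, accuracy, and F1 measures described and reported above can all be computed from the four confusion-matrix counts. The following is a minimal Python sketch under stated assumptions, not the authors' implementation: the function name `classification_metrics` and the example counts are hypothetical and chosen only for illustration, and F1 is taken to be the standard harmonic mean of precision and recall, which this section reports but does not define.

```python
def classification_metrics(tp, fp, fn, tn):
    """Compute precision, recall, accuracy and F1 from confusion-matrix counts.

    tp, fp, fn, tn are the numbers of true positives, false positives,
    false negatives and true negatives. Zero denominators yield 0.0.
    """
    # Precision: share of predicted positives that are truly positive.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    # Recall: share of actual positives that were found.
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # Accuracy: share of all predictions that are correct.
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total if total else 0.0
    # F1: harmonic mean of precision and recall (standard definition).
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {
        "precision": precision,
        "recall": recall,
        "accuracy": accuracy,
        "f1": f1,
    }


# Hypothetical counts for illustration only (not taken from the paper).
example = classification_metrics(tp=90, fp=10, fn=5, tn=95)
```

With these hypothetical counts, precision is 90/100, recall is 90/95, and accuracy is 185/200. In a k-fold setting such as the 10-fold and 5-fold runs in the tables, one would typically compute these values per fold and average them.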
| Paper | Year | Dataset | Accuracy |
|---|---|---|---|
| | | | 72.95% |
| Amazon Reviews, business analytics with sentiment analysis [11] | 2016 | Review of cellphone & accessories | 80.11% |
| Sentiment Analysis in Amazon Reviews Using Probabilistic Machine Learning [5] | 2013 (6) | Reviews of books | 84.44% |
| | | Reviews of Kindle | 87.33% |
| Mining comparative opinions from customer reviews for competitive intelligence [12] | 2011 (234) | Customer product reviews | 61.00% |
| Amazing: A sentiment mining & Retrieval System [13] | 2009 (125) | E commerce reviews | 87.60% |
| "Feature Selection Methods in Sentiment Analysis and Sentiment Classification of Amazon Product Reviews" [8] | 2016 | Review on books | 70.00%, 70.00%, 80.00% |
| | | Review on music | 62.00%, 80.00%, 68.00% |
| | | Review on Camera | 62.00%, 80.00%, 68.00% |
| Proposed Model | 2018 | Review of cellphone & accessories | 93.57% |
| | | Review of Electronics | 93.52% |
| | | Reviews of music Instruments | 94.02% |

Table-4: Comparative Analysis

The studies listed in the table used different pre-processing steps and feature extraction processes. In our research we tried to improve all the extraction processes and pre-processing steps and picked the best accuracy among them. The pool-based active learning process contributed to labeling and selecting the best reviews as our training and testing data. Use of different preprocessing process helped

In this research we proposed a supervised learning model to polarize a large, unlabeled product review dataset. The model uses a mix of two kinds of feature extraction approaches. We described the basic theory behind the model, the approaches we used in our research, and the performance measures for the experiments conducted over quite a large amount of data. We also compared our results with some similar works on product reviews, and went through different kinds of research papers on sentiment analysis over text-based datasets. We were able to achieve accuracy over 90%, with F1 measure, precision, and recall also over 90%. We tried different simulations using cross validation, training-testing ratios, and different feature extraction processes, comparing varying amounts of data, to achieve promising results. In most cases 10-fold cross validation provided better accuracy, while Support Vector Machine (SVM) provided the best classification results. It is hard to gather a huge gold-standard dataset for this purpose, as e-commerce sites limit the data they release publicly. Scraping data can also be a problem, as we can't scrape enough data to consider it representative of real-life public reviews over different products.

Some future work can improve the model and make it more effective in practical cases. Our future work includes applying PCA (Principal Component Analysis) in the active learning process to fully automate data labeling with less assistance from the oracle. The model can be incorporated into programs that interact with customers seeking a score for a particular product. As we used a large-scale dataset, we can apply the model to local market sites to get better accuracy and usability. Lastly, we will try to continue this research until we generalize this model to all kinds of text-based reviews and comments.

REFERENCES

[1] Samha, Xu, Xia, Wong & Li, "Opinion Annotation in Online Chinese Product Reviews." In Proceedings of LREC conference, 2008.
[2] Nina Isabel Holleschovsky, "The social influence factor: Impact of online product review characteristics on consumer purchasing decisions", 5th IBA Bachelor Thesis Conference, Enschede, The Netherlands, 2015.
[3] Elli, Maria Soledad, and Yi-Fan Wang. "Amazon Reviews, business analytics with sentiment analysis." 2016.
[4] Xu, Yun, Xinhui Wu, and Qinxia Wang. "Sentiment Analysis of Yelp's Ratings Based on Text Reviews." (2015).
[5] Rain, Callen. "Sentiment Analysis in Amazon Reviews Using Probabilistic Machine Learning." Swarthmore College (2013).
[6] Bhatt, Aashutosh, et al. "Amazon Review Classification and Sentiment Analysis." International Journal of Computer Science and Information Technologies 6.6 (2015): 5107-5110.
[7] Chen, Weikang, Chihhung Lin, and Yi-Shu Tai. "Text-Based Rating Predictions on Amazon Health & Personal Care Product Review." (2015).
[8] Shaikh, Tahura, and Deepa Deshpande. "Feature Selection Methods in Sentiment Analysis and Sentiment Classification of Amazon Product Reviews." (2016).
[9] Nasr, Mona Mohamed, Essam Mohamed Shaaban, and Ahmed Mostafa Hafez. "Building Sentiment analysis Model using Graphlab." IJSER, 2017.
[10] Wang, Mingshan. "Text mining for yelp dataset challenge." University of California San Diego (2017).
[11] Elli, Maria Soledad, and Yi-Fan Wang. "Amazon Reviews, business analytics with sentiment analysis." 2016.
[12] Xu, Kaiquan, et al. "Mining comparative opinions from customer reviews for Competitive Intelligence." Decision support systems 50.4 (2011): 743-754.
[13] Miao, Q., Li, Q., & Dai, R. (2009). AMAZING: A sentiment mining and retrieval system. Expert Systems with Applications, 36(3), 7192-7198.
[14] He, Ruining, and Julian McAuley. "Ups and downs: Modeling the visual evolution of fashion trends with one-class collaborative filtering." Proceedings of the 25th International Conference on World Wide Web. International World Wide Web Conferences Steering Committee, 2016.