Sentiment Analysis Tool For Amazon Product Reviews
https://doi.org/10.22214/ijraset.2023.50223
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 11 Issue IV Apr 2023- Available at www.ijraset.com
Abstract: Sentiment analysis has become a vital component of modern data analysis, particularly for businesses that rely on
customer input to improve their products and services. In this study, we employ Natural Language Processing (NLP) techniques to
analyze sentiment in Amazon product reviews. Our main goal is to categorize the reviews as positive, negative, or neutral.
We use Amazon product review data, which includes a large number of reviews from various categories, such as books,
electronics, and clothes.
Keywords: Web Scraping, Sentiment Analysis, Amazon, Product Reviews, Positive, Negative, Neutral, Beautiful Soup, Natural
Language Toolkit (NLTK), Flask, VADER.
I. INTRODUCTION
The project is based on Amazon product reviews because Amazon is one of the largest online marketplaces, with millions of products
and billions of customer reviews. As a result, analyzing the sentiment of Amazon customer reviews may give valuable insight into
how people act, what they prefer, and how they feel about certain products. In this research, we apply natural language processing
algorithms for sentiment analysis on Amazon product reviews.
The power of sentiment analysis to extract key insights from massive amounts of textual data has lately attracted wide interest
in natural language processing. A customer review or social media post, for example, might be the subject of a sentiment analysis,
which categorizes the text’s sentiment, attitude, and opinion as positive, negative, or neutral. Businesses want this data because it
may help them better grasp the viewpoints of their customers and improve the quality of their goods and services.
The project outputs can help both consumers and businesses understand the product better, whether the aim is to buy a product or
to improve the product’s experience.
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 | 859
III. OBJECTIVES
The primary goal of this study is to gain a better understanding of the sentiment towards a product as expressed through text reviews.
The assumption is that people will not spend much time reading and comparing the text reviews one by one.
IV. IMPLEMENTATION
Amazon product review data were obtained via web scraping. We used the Beautiful Soup package to parse the HTML pages and
extract the reviews. The next step is to clean and prepare the data by removing superfluous information such as URLs,
HTML elements, and non-alphanumeric characters. Stop words are also removed, and words are stemmed or lemmatized to
standardize the language. We used the NLTK package, whose stemming and tokenization tools handle these steps.
To categorize the reviews as positive, negative, or neutral, we used the VADER package, whose sentiment analyzer generates
scores for individual sentences, which in this case are the review titles. The review titles retrieved for a product are the
data used for sentiment analysis.
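The steps above can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the CSS selector `a.review-title`, the small inline stop-word set, and all function names are assumptions introduced here. The ±0.05 cut-off on VADER's compound score is the commonly used convention and is not a value stated in the paper.

```python
import re

# Hypothetical selector: the paper does not give Amazon's markup, which also changes over time.
TITLE_SELECTOR = "a.review-title"

# Tiny illustrative stop-word set (assumption); NLTK's stop-word corpus would normally be used.
STOP_WORDS = {"a", "an", "the", "is", "it", "this", "and", "to", "of", "for"}


def extract_titles(html):
    """Return the text of every element matching TITLE_SELECTOR."""
    from bs4 import BeautifulSoup  # third-party package: beautifulsoup4

    soup = BeautifulSoup(html, "html.parser")
    return [tag.get_text(strip=True) for tag in soup.select(TITLE_SELECTOR)]


def clean_text(text):
    """Remove URLs, HTML tags, and non-alphanumeric characters, then drop stop words."""
    text = re.sub(r"https?://\S+", " ", text)    # URLs
    text = re.sub(r"<[^>]+>", " ", text)         # leftover HTML tags
    text = re.sub(r"[^A-Za-z0-9\s]", " ", text)  # punctuation and symbols
    return [tok for tok in text.lower().split() if tok not in STOP_WORDS]


def stem_tokens(tokens):
    """Reduce each token to its stem with NLTK's Porter stemmer."""
    from nltk.stem import PorterStemmer  # third-party package: nltk

    stemmer = PorterStemmer()
    return [stemmer.stem(tok) for tok in tokens]


def label_from_compound(compound, threshold=0.05):
    """Map a VADER compound score in [-1, 1] to a three-way label."""
    if compound >= threshold:
        return "positive"
    if compound <= -threshold:
        return "negative"
    return "neutral"


def score_title(title):
    """Score one review title with NLTK's VADER analyzer."""
    # Requires a one-time nltk.download("vader_lexicon").
    from nltk.sentiment.vader import SentimentIntensityAnalyzer

    compound = SentimentIntensityAnalyzer().polarity_scores(title)["compound"]
    return label_from_compound(compound)
```

In this sketch `score_title` receives the raw title rather than the cleaned tokens, because VADER uses punctuation and capitalization as intensity cues; whether the original tool scores raw or cleaned titles is not stated in the paper.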
The sentiment analysis data are visualized using pie charts to give insight into the overall sentiment of the product reviews. The
pie chart shows the distribution of ratings across the reviews on the retrieved pages, expressed as percentages. This is achieved
using the pandas, Plotly Express, and Seaborn libraries to generate pie charts in a visually appealing format.
The outcomes are therefore a representation and analyzed version of the data, which helps identify opportunities for
product or marketing plan improvement. This sentiment analysis procedure may be automated for new Amazon product reviews,
and the sentiment analysis tool can be deployed as a web application so that any potential consumer can use it in the future.
It should be noted that these processes are not necessarily linear and that iteration may be required to improve the model’s
performance. Several tools and frameworks, such as NLTK, are also available to aid the implementation process.
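As a rough sketch of the percentage calculation behind the pie chart, the snippet below counts star ratings and converts them to shares. The function names and the chart title are assumptions made for illustration; the paper does not show how the chart is built, and only the use of Plotly Express is taken from the text.

```python
from collections import Counter


def rating_percentages(ratings):
    """Return {star: percent of reviews}, rounded to one decimal place."""
    counts = Counter(ratings)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {star: round(100 * counts[star] / total, 1) for star in sorted(counts)}


def plot_rating_pie(ratings):
    """Build a pie chart of the rating shares with Plotly Express."""
    import plotly.express as px  # third-party package: plotly

    shares = rating_percentages(ratings)
    return px.pie(
        names=[f"{star} star" for star in shares],
        values=list(shares.values()),
        title="Share of reviews per rating",  # hypothetical title
    )
```

For example, `rating_percentages([5, 5, 4, 1])` gives half of the reviews to the 5-star slice and a quarter each to the 4-star and 1-star slices; calling `.show()` on the figure returned by `plot_rating_pie` would display the chart.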
V. RESULTS
The results generated by this tool include an overview of the extracted reviews, followed by the results of the sentiment analysis
for the review titles and a comparison of ratings (out of 5) in the form of a pie chart.
The pie chart shows each rating’s share as a percentage.
The review titles are the inputs considered by this sentiment analysis tool because the title is where the user gives a short phrase
summarizing what they are trying to express. Generating individual sentiment scores for those titles can give us an overview of
what the user is probably thinking. After giving the title, the user elaborates on their thoughts regarding the
product itself. Hence we decided to take the title as input for the tool to generate the sentiment analysis and display appropriate
results.
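One simple way to turn the per-title labels into such an overview is to count them, as in the hedged sketch below. The function name, the label strings, and the tie-breaking rule are assumptions for illustration and are not taken from the paper.

```python
from collections import Counter

LABELS = ("positive", "negative", "neutral")


def summarize_labels(labels):
    """Count each sentiment label and report the most frequent one.

    Ties are broken by the order of LABELS; an empty input yields None.
    """
    counts = Counter(labels)
    summary = {label: counts.get(label, 0) for label in LABELS}
    overall = max(LABELS, key=lambda label: summary[label]) if labels else None
    return summary, overall
```

The returned counts can be shown alongside the individual title scores so that a reader sees the overall tendency without reading every review.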
Portraying the rating comparison as percentages also helps the user reach a conclusion, as this is a better
representation of ratings than the usual individual bars on the Amazon review page.