Sentiment Analysis Using Machine Learning Algorithms

The document discusses a research project focused on sentiment analysis using machine learning algorithms to classify tweets as positive or negative. It outlines the methodology, including data extraction, preprocessing, and classification using the NLTK dataset, and presents experimental results demonstrating the model's accuracy. Future work aims to enhance the model's capabilities, including predicting sarcasm and applying the technique to Arabic tweets.

Uploaded by

Nivashini G

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

55 views23 pages

Sentiment Analysis Using Machine Learning Algorithms

Uploaded by

Nivashini G

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 23

SENTIMENT ANALYSIS USING

MACHINE LEARNING ALGORITHMS

Ms.G.NIVASHINI - RESEARCH SCHOLAR
Ms. M. SHIMA - RESEARCH SCHOLAR
Dr.R.HEMALATHA - RESEARCH SUPERVISOR
PG & RESEARCH DEPARTMENT OF COMPUTER SCIENCE,
TIRUPPUR KUMARAN COLLEGE FOR WOMEN
TIRUPPUR, TAMILNADU
CONTENTS
 ABSTRACT
 INTRODUCTION
 RELATED WORK
 PROPOSED SYSTEM
 EXPERIMENTATION AND RESULTS
 CONCLUSION AND FUTURE WORK
 REFERENCES
ABSTRACT
The goal of this work is to use Machine
Learning (ML) methods to create a classifier
that can predict a comment's polarity.
Our work is essentially divided into three
tasks: data extraction, processing and
modelling. Our model is constructed using the
NLTK dataset.
Based on a supervised probabilistic machine
learning algorithm, we tended to create a
classifier to classify our tweets into positive
and negative sentiments then we opt for two
experiments to evaluate the performance of
our model.
INTRODUCTION

 The transition from web 1.0 to web 2.0 has made it

simpler for individuals to create and exchange ideas,
opinions, and methods online.
 As a result, the amount of subjective information, or
opinion data, on the Internet is growing rapidly.
 Sentiment analysis (SA), which focuses on opinion
mining (identification and classification) from textual
data, is one of the concepts that have emerged from
the growing purpose of gathering, analyzing, and
using this data.
 More people are using the internet and different
social media platforms to voice their thoughts and
ideas.
 As a result, there are now more user-generated
sentences. Including sentiment information that is
too complex for humans to read and comprehend.
 automatic analysis of opinions expressed on
various web platforms is becoming more and more
important for making effective decisions.
RELATED WORK

Sentiment analysis has garnered significant

scholarly interest in recent years as a result of
the widespread distribution of internet
evaluations.
As a result, a great deal of research has been
done in this field. Data preparation to
eliminate data noise has been covered by
some writers.
The findings demonstrated that sentiment
trending words in sentiment analysis have
some bearing on the prediction's outcome.
The accuracy of the prediction findings
declines after the high frequency words are
eliminated, particularly for the distinct high
frequency terms of each class.
Additional research on the classification of
Malayalam tweets as either positive or
negative using various machine learning
techniques, including NB, SVM, and RF
Comparing the classification performance and
accuracy of algorithms is the main focus of the
majority of this field's research (ML).
PROPOSED SYSTEM

 Algorithms for machine learning. To get to the

evaluation phase, we must first complete the
Tweet collection phase, followed by the
preprocessing, data preparation, and classification
stages.
 Python provides high-level tools and an easy-to-
use syntax, and because Anaconda is the best
way to install machine learning packages, we
decided to utilize it as our development
environment.
• A. Phase of Data Collection
The data utilized in this work consists of a
dataset of sample English tweets from the NLTK
package. NLTK’s Twitter corpus currently
contains a sample of 20k (20,000 non
sentimental tweets) Tweets.
• B. Preprocessing Phase
The language also in its original form can not be
processed accurately by a machine, so we have to clean
up our tweets to make it easier to understand and use
by a supervised machine learning algorithm.
 Data tokenization
 Delete stop words
 Remove URL
 Remove @ mentions
 Change to lowercase
• C. Preparing data
In the data preparation step, we will
convert the tokens to Python dictionary format
using words as keywords and True as values,
mixed at random, in order to get the data ready
for sentiment analysis.
• D. Phase of Classification
– The machine learning algorithm can be used to
learn from the training data once the data has
been separated into training and test sets.
– The algorithms listed below were applied: Naïve
Bayes (NB) classification (supervised, probabilistic
classification)
EXPERIMENTATION AND RESULTS

 Every experiment is conducted using an Intel (R)

Core (TM) i3-6006U processor with a CPU running
at 2.00 GHZ and 4.00 GB of RAM.
 Using supervised learning, we were able to classify
tweeter reviews with a sufficient degree of
accuracy.
 The primary goal of this project is to educate the
computer to read and comprehend human-typed
English sentences and to categorize them as either
positive or negative emotions.
We will be using the NLTK package in Python
for all of the NLP tasks in this tutorial.
Once the samples have been downloaded, we
are ready to begin processing the data. The
first part of understanding data is to use a
process called tokenization, or splitting strings
into smaller parts called tokens.
The basic way to divide language into tokens
is to divide text based on spaces and
punctuation. First of all we need to download
the punkt module which helps us tokenize
words and phrases.
We will construct a sentiment analysis model
that would link tweets with either a good or
negative attitude.
By default, all good tweets are included in the
data, followed by all negative tweets.
We should supply an unbiased sample of our
data for the model to be trained on. We've
included code to randomly arrange the data
using the.shuffle() method of random in order
to prevent bias.
CONCLUSION AND FUTURE WORK

Sentiment detection is a developing field with a

number of difficulties. The study of strategies
and tactics that guarantee the automatic
categorization of emotions into positive or
negative polarity is the goal of this endeavor.
This article employs a variety of methods.
The most recent ones are produced using
information from NLTK's Twitter corpus, which at
the moment comprises 30,000 tweet samples.
Before converting all tweets to lowercase, we
preprocess the data using tokenization,
lemmatization, the removal of stop words,
URLs, @ mentions, punctuation, and special
characters.
We must supply enough training data to train
our model appropriately since future work will
also involve enhancing it to predict sarcasm
Future plans call for applying the classification
technique to evaluate its efficacy with Arabic
tweets, given the high volume of generated
per minute, many of which are in the Arabic
language.
REFERENCES

• [1] J. Li, S. Fong, Y. Zhuang, and R. Khoury, “ Hierarchical

Classification in Text Mining for Sentiment Analysis,” in 2014
International Conference on Soft Computing and Machine
Intelligence, september. 2014, p. 46-51, doi: 10.1109/ISCMI.201
• 4.37.
• [2] H. Parveen and S. Pandey, “Sentiment analysis on Twitter
Dataset using Naive Bayes algorithm ,” in 2016 2nd International
Conference on Applied and Theoretical Computing and
Communication Technology (iCATccT), july. 2016, p. 416-419, doi:
10.1109/ICATCCT.2016.7912034.
• [3] S. S. and P. K.v., “ Sentiment analysis of malayalam tweets using
machine learning techniques ,” ICT Express, april. 2020, doi:
10.1016/j.icte.2020.04.003.
THANK YOU

Sentiment Analysis for Data Scientists
No ratings yet
Sentiment Analysis for Data Scientists
22 pages
Twitte Analysis
No ratings yet
Twitte Analysis
53 pages
Batch-6c Minipro Doc Rev-2
No ratings yet
Batch-6c Minipro Doc Rev-2
33 pages
Micro-Blogging Sentimental Analysis On Twitter Data Using Naïve Bayes
No ratings yet
Micro-Blogging Sentimental Analysis On Twitter Data Using Naïve Bayes
7 pages
Introduction
No ratings yet
Introduction
27 pages
Se Write-Up
No ratings yet
Se Write-Up
2 pages
Real-Time Twitter Sentiment Analysis
100% (1)
Real-Time Twitter Sentiment Analysis
19 pages
Sentiment Analysis Final Documentation Report
50% (2)
Sentiment Analysis Final Documentation Report
21 pages
Abstract
No ratings yet
Abstract
2 pages
IC-RTETM Final Sentiment Analysis
No ratings yet
IC-RTETM Final Sentiment Analysis
13 pages
Fin Ijprems1714118825
No ratings yet
Fin Ijprems1714118825
6 pages
10 1109@icaccs48705 2020 9074208
No ratings yet
10 1109@icaccs48705 2020 9074208
3 pages
MP 1
No ratings yet
MP 1
14 pages
2944 Suhashini Chaurasia
No ratings yet
2944 Suhashini Chaurasia
15 pages
MINI
No ratings yet
MINI
9 pages
Twitter Sentiment Analysis System
No ratings yet
Twitter Sentiment Analysis System
5 pages
Sentiment Analysis On Twitter Data Using Machine Learning Algorithms in Python
No ratings yet
Sentiment Analysis On Twitter Data Using Machine Learning Algorithms in Python
14 pages
Twitter Sentiment Analysis Survey
No ratings yet
Twitter Sentiment Analysis Survey
7 pages
Sentiment Analysis On Twitter Data Using Machine Learning Algorithms in Python
No ratings yet
Sentiment Analysis On Twitter Data Using Machine Learning Algorithms in Python
15 pages
Sentiment Analysis of Twitter Data My
75% (4)
Sentiment Analysis of Twitter Data My
14 pages
Twitter Sentiment Analysis
100% (2)
Twitter Sentiment Analysis
10 pages
Machine Learning For Sentiment Analysis of Twitter Data
No ratings yet
Machine Learning For Sentiment Analysis of Twitter Data
9 pages
Sentiment Analysis
No ratings yet
Sentiment Analysis
11 pages
Twitter Sentiment Analysis Project
No ratings yet
Twitter Sentiment Analysis Project
18 pages
Twiiter Sentiment Analysis
No ratings yet
Twiiter Sentiment Analysis
15 pages
Machine Learning With Advance Model
No ratings yet
Machine Learning With Advance Model
19 pages
Urdu Sentiment Analysis Guide
No ratings yet
Urdu Sentiment Analysis Guide
18 pages
CMU Qatar CS Senior Thesis 2015
No ratings yet
CMU Qatar CS Senior Thesis 2015
38 pages
Sentiment Analysis for Students
No ratings yet
Sentiment Analysis for Students
26 pages
Social Media Sentiment Ppt1
No ratings yet
Social Media Sentiment Ppt1
16 pages
Pre Processing
No ratings yet
Pre Processing
9 pages
NLP Project (Documentation)
No ratings yet
NLP Project (Documentation)
8 pages
Mini Project
No ratings yet
Mini Project
16 pages
Twitter Sentiment Analysis Project
No ratings yet
Twitter Sentiment Analysis Project
7 pages
Digital Assignment-1 Literature Review On Twitter Sentiment Analysis Name: G.Tirumala Reg No: 16BCE0202 1)
No ratings yet
Digital Assignment-1 Literature Review On Twitter Sentiment Analysis Name: G.Tirumala Reg No: 16BCE0202 1)
9 pages
Sentimental Analysis On Twitter Data Using Naive Bayes: Ijarcce
No ratings yet
Sentimental Analysis On Twitter Data Using Naive Bayes: Ijarcce
4 pages
6 Project Report Sem6
No ratings yet
6 Project Report Sem6
13 pages
Project Review
No ratings yet
Project Review
17 pages
Natural Language Processing (Ue16Cs333) MINI-PROJECT (2019) Sentiment Analysis
No ratings yet
Natural Language Processing (Ue16Cs333) MINI-PROJECT (2019) Sentiment Analysis
2 pages
Sentiment Analysis of Social Media With Python - by Haaya Naushan - Towards Data Science
No ratings yet
Sentiment Analysis of Social Media With Python - by Haaya Naushan - Towards Data Science
9 pages
Sentiment Analysis for Airlines
No ratings yet
Sentiment Analysis for Airlines
4 pages
Project Proposal Machine Learning: Title: Team Members
No ratings yet
Project Proposal Machine Learning: Title: Team Members
2 pages
Synopsis 6th Sem
No ratings yet
Synopsis 6th Sem
5 pages
Lab Report - CSE 816
No ratings yet
Lab Report - CSE 816
17 pages
(Soft Computing)
No ratings yet
(Soft Computing)
20 pages
Projec Niraj Nishad
No ratings yet
Projec Niraj Nishad
11 pages
Natural Language Processing For Sentiment Analysis - Ankur Shukla
No ratings yet
Natural Language Processing For Sentiment Analysis - Ankur Shukla
27 pages
Machine Learning Sentiment Analysis
No ratings yet
Machine Learning Sentiment Analysis
5 pages
Comment Analyser Thesis
No ratings yet
Comment Analyser Thesis
63 pages
Senti bp1
No ratings yet
Senti bp1
2 pages
Thesis - Aru Omarali
No ratings yet
Thesis - Aru Omarali
34 pages
Improvement in Sentiment Analysis of Twitter Texts Using Machine Learning Algorithms
No ratings yet
Improvement in Sentiment Analysis of Twitter Texts Using Machine Learning Algorithms
21 pages
SentA Russir Day2
No ratings yet
SentA Russir Day2
33 pages
Minor Project Report
No ratings yet
Minor Project Report
29 pages
Project Review On The Opinion Minin
No ratings yet
Project Review On The Opinion Minin
4 pages
Session 7
No ratings yet
Session 7
17 pages
Python Twitter Sentiment Analysis
No ratings yet
Python Twitter Sentiment Analysis
20 pages
Projec Niraj Nishad
No ratings yet
Projec Niraj Nishad
11 pages
GCSE/GCE Physics Mark Scheme
No ratings yet
GCSE/GCE Physics Mark Scheme
16 pages
PFP Project Proposal Most Recent
No ratings yet
PFP Project Proposal Most Recent
6 pages
Dsdm-Unit1 241031 194317
No ratings yet
Dsdm-Unit1 241031 194317
38 pages
Worksheet - Week 2 - Day 5
No ratings yet
Worksheet - Week 2 - Day 5
3 pages
English Vocabulary Booster: Family
No ratings yet
English Vocabulary Booster: Family
2 pages
Midterm Exam in Hbo
No ratings yet
Midterm Exam in Hbo
5 pages
Supervisory Plan for Modular Learning
No ratings yet
Supervisory Plan for Modular Learning
2 pages
Character Reference for Quinn & Inok
0% (1)
Character Reference for Quinn & Inok
2 pages
How Could I Hide My Face
No ratings yet
How Could I Hide My Face
5 pages
Mark Scheme (Results) January 2023
No ratings yet
Mark Scheme (Results) January 2023
17 pages
Eco 245
No ratings yet
Eco 245
6 pages
General Biology 1
No ratings yet
General Biology 1
4 pages
Introduction To Socio Cultural and Anthropological Concepts
No ratings yet
Introduction To Socio Cultural and Anthropological Concepts
17 pages
EFL Lesson Planning for Secondary Schools
No ratings yet
EFL Lesson Planning for Secondary Schools
19 pages
Policy Insights for Parenting Support
No ratings yet
Policy Insights for Parenting Support
30 pages
Edukasyong Pantahanan at Pangkabuhayan and Technology and Livelihood Education Grades 4-6 December 2013
No ratings yet
Edukasyong Pantahanan at Pangkabuhayan and Technology and Livelihood Education Grades 4-6 December 2013
10 pages
COLLEGE Week 2
No ratings yet
COLLEGE Week 2
5 pages
CSC388 Syllabus
No ratings yet
CSC388 Syllabus
8 pages
Yhills Intern-8
No ratings yet
Yhills Intern-8
26 pages
History Exam Marking Guide
No ratings yet
History Exam Marking Guide
9 pages
1000 Most Common English Phrases
No ratings yet
1000 Most Common English Phrases
2 pages
SHRM Alignment of HR Function With Business Strategy
No ratings yet
SHRM Alignment of HR Function With Business Strategy
5 pages
Checklist
No ratings yet
Checklist
3 pages
Notary Renewal Petition
No ratings yet
Notary Renewal Petition
3 pages
Understanding Verb Tenses and Aspects
No ratings yet
Understanding Verb Tenses and Aspects
19 pages
Self-Discipline Guide for Teens
No ratings yet
Self-Discipline Guide for Teens
11 pages
Andrea Redinger Keynote at KDP Event
No ratings yet
Andrea Redinger Keynote at KDP Event
2 pages
Discrete Mathematics, 1ma462, Spring 2021
No ratings yet
Discrete Mathematics, 1ma462, Spring 2021
2 pages
Ideal Gases Lecture 3
No ratings yet
Ideal Gases Lecture 3
34 pages
Axis Education More Functional Skillbuilders English
100% (2)
Axis Education More Functional Skillbuilders English
12 pages

Sentiment Analysis Using Machine Learning Algorithms

Uploaded by

Sentiment Analysis Using Machine Learning Algorithms

Uploaded by

SENTIMENT ANALYSIS USING

MACHINE LEARNING ALGORITHMS

 The transition from web 1.0 to web 2.0 has made it

Sentiment analysis has garnered significant

 Algorithms for machine learning. To get to the

 Every experiment is conducted using an Intel (R)

Sentiment detection is a developing field with a

• [1] J. Li, S. Fong, Y. Zhuang, and R. Khoury, “ Hierarchical

You might also like