Sentiment Analysis Poster
v Text Classification:
[Illustration: accuracy of the two models under three test–train splits (25-75%, 20-80%, 10-90%); y-axis from 0.82 to 0.85. Legend: Dependency model (% accuracy), Bag of Words model (% accuracy).]
Methodology
• The p-value for the difference in the accuracy rates under all three splits is less than 0.00001.
• Since α = 0.05 was chosen, any p-value below 0.05 indicates that the results are statistically significant.
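The significance claim can be sketched as a two-proportion z-test on the accuracy rates. This is illustrative only: the poster does not state the test-set size, so the sample sizes below (and the 0.85 vs 0.82 accuracies, read off the chart) are hypothetical.

```python
import math

def two_proportion_p_value(acc_a, acc_b, n_a, n_b):
    """Two-sided two-proportion z-test for a difference in accuracy rates."""
    # Pooled proportion of correct predictions across both models.
    pooled = (acc_a * n_a + acc_b * n_b) / (n_a + n_b)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    z = (acc_a - acc_b) / se
    # Two-sided p-value via the standard normal survival function:
    # 2 * (1 - Phi(|z|)) = erfc(|z| / sqrt(2)).
    return math.erfc(abs(z) / math.sqrt(2))

# Hypothetical test-set sizes; the poster does not state the corpus size.
p = two_proportion_p_value(0.85, 0.82, 10_000, 10_000)
print(p < 0.05)  # → True
```

With hypothetical test sets of 10,000 reviews each, a three-point accuracy gap yields a p-value far below α = 0.05.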
v Traditional Method of Text Representation: Bag of Words (BoW) approach
• What it is: A set of words that is chosen before the text classification. The selection can be made in multiple ways, e.g. it could be the n most frequent words in the entire training corpus.
• How it is used: The words from the text are matched to the existing words (and the sentiments that they denote) in the BoW, and the classifier then gives a prediction of the sentiment.
• Why it is used: Since this approach does not involve the employment of any linguistic structure, it is simple, and this simplicity makes BoW popular.
• Example: Classification of 'The movie was great' (extraction of nouns and adjectives):
BoW = {movie, film, great, horrible, tedious}
Text representation = {movie:1, film:0, great:1, horrible:0, tedious:0}. This information is passed on to the training algorithm, which will be trained to associate individual features (e.g. great:1) with sentiment labels – in this case "positive".

• Steps taken to add the dependency feature and calculate the accuracy score:
v Step 1: Movie review with sentiment → Dependency Parser → movie review representation: dependency parsed. Example: Dependent: 'never'; Governor: 'failed'.
v Step 2: Dependency-parsed movie review representation → Feature Extraction → formation of dependency pairs. Example: dependent + governor: 'never + failed'.
v Step 3: Movie review representation (dependency pairs, individual words) → Scikit-Learn Naïve Bayes Classifier.
v Step 4: Test–Train Split → Classifier → Accuracy rates.

Cross-validation scores (10 folds):
Fold  Dependency model  BoW model
1     0.877             0.843
2     0.842             0.849
3     0.836             0.852
4     0.850             0.853
5     0.857             0.849
6     0.854             0.844
7     0.839             0.858
8     0.838             0.844
9     0.860             0.848
10    0.828             0.845

• The observations under the cross validation are puzzling for the dependency model because the deviations from the mean accuracy score are high.
• The same cross validation for the BoW model has very small deviations in accuracy scores, if any.

v Conclusion:
• Adding the feature of dependencies significantly improved the accuracy rates.
• Further research should look into:
  • Why the cross validation shows anomalous behavior in the case of the dependency model.
  • Tokenizing the corpus for HTML tags and re-running the experiment.
  • The most informative features, to see how the dependency pairs are classified.
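The feature-building part of the pipeline (Steps 1–3) can be sketched as follows. The toy vocabulary and the hard-coded dependency pair are stand-ins for the real corpus and for actual parser output; a real run would feed the resulting dicts to a Naïve Bayes classifier.

```python
# Minimal sketch of Steps 1-3: a combined feature dict built from
# individual words (binary BoW features) plus 'dependent+governor' pairs.
# The dependency pair is hard-coded here; a real pipeline would obtain
# (dependent, governor) pairs from a dependency parser.

BOW = {"movie", "film", "great", "horrible", "tedious", "failed", "never"}

def extract_features(tokens, dependency_pairs):
    """Binary BoW features plus dependency-pair features."""
    features = {word: int(word in tokens) for word in sorted(BOW)}
    for dependent, governor in dependency_pairs:
        features[f"{dependent}+{governor}"] = 1
    return features

tokens = ["the", "movie", "never", "failed", "to", "amaze"]
pairs = [("never", "failed")]   # stand-in for parser output
feats = extract_features(tokens, pairs)
print(feats["never+failed"], feats["movie"], feats["horrible"])  # → 1 1 0
```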
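The "high deviations" observation can be checked directly from the ten fold scores in the cross-validation tables: the sample standard deviation of the dependency model's scores is roughly three times that of the BoW model's, even though the fold means are nearly identical.

```python
from statistics import mean, stdev

# Fold scores copied from the cross-validation tables.
dep = [0.877, 0.842, 0.836, 0.850, 0.857, 0.854, 0.839, 0.838, 0.860, 0.828]
bow = [0.843, 0.849, 0.852, 0.853, 0.849, 0.844, 0.858, 0.844, 0.848, 0.845]

# Means are close, but the fold-to-fold spread differs sharply.
print("dependency:", round(mean(dep), 4), "+/-", round(stdev(dep), 4))
print("bow:       ", round(mean(bow), 4), "+/-", round(stdev(bow), 4))
```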