
NAIVE BAYES AND SENTIMENT CLASSIFICATION
CS6431 Natural Language Processing, Spring 2023
B1: Speech and Language Processing (Third Edition draft, January 2022)
Daniel Jurafsky, James H. Martin
Credits
1. B1
Assignment
Read:
B1: Chapter 4

Problems: Exercise problems of Chapter 4


Text Categorization
 Assigning a label/category to an entire sentence/document
 Sentiment analysis
 Assigning the positive or negative orientation that a writer expresses toward some object
 Book reviews, movie reviews, product reviews, etc.

 Spam detection
 Authorship attribution
 Subject category assignment
Supervised Learning Approach
 Input:
 A training set of labelled documents (𝑑1, 𝑐1), (𝑑2, 𝑐2), …, (𝑑𝑁, 𝑐𝑁)
 An unknown document 𝑑

 Output
 The class label for 𝑑
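For concreteness, a toy instance of this input/output contract in Python (the two training documents are borrowed from a later slide; the variable names and the test document are illustrative assumptions):

# Training set: (document, class) pairs
train_docs = [
    ("I really like this movie", "positive"),
    ("I didn't like this movie", "negative"),
]
# Unknown document d whose class label we want to predict
test_doc = "I really really like this film"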
Naive Bayes Classifiers
Bag-of-words

Position is ignored, only frequencies are used


Naïve Bayes
 Returns the class ĉ that has the maximum posterior probability given the document:
ĉ = argmax_{𝑐∈𝐶} 𝑃(𝑐|𝑑)

 Plugging Bayes' rule into the above:
ĉ = argmax_{𝑐∈𝐶} 𝑃(𝑑|𝑐) 𝑃(𝑐) / 𝑃(𝑑)

 Dropping the denominator (it is the same for every class):
ĉ = argmax_{𝑐∈𝐶} 𝑃(𝑑|𝑐) 𝑃(𝑐)

 Let document 𝑑 be represented as a set of features 𝑓1, 𝑓2, …, 𝑓𝑛
 Two simplifying assumptions
 Position of the word is not considered (does not matter)
 Naïve Bayes assumption: the features are conditionally independent given the class, so 𝑃(𝑓1, …, 𝑓𝑛 | 𝑐) = 𝑃(𝑓1|𝑐) · 𝑃(𝑓2|𝑐) · … · 𝑃(𝑓𝑛|𝑐)

 The final equation, with the word tokens 𝑤𝑖 as the features:
𝑐NB = argmax_{𝑐∈𝐶} 𝑃(𝑐) ∏_{𝑖 ∈ positions} 𝑃(𝑤𝑖|𝑐)

 The calculations are done in log space, to avoid underflow and increase speed; the rule becomes
𝑐NB = argmax_{𝑐∈𝐶} [ log 𝑃(𝑐) + Σ_{𝑖 ∈ positions} log 𝑃(𝑤𝑖|𝑐) ]
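A minimal Python sketch of this log-space decision rule; the logprior, loglikelihood and vocab structures are assumptions that stand in for whatever comes out of training, as described on the next slides:

import math

def classify(doc_tokens, logprior, loglikelihood, vocab, classes):
    """Pick the class maximising log P(c) + sum_i log P(w_i | c)."""
    best_class, best_score = None, -math.inf
    for c in classes:
        score = logprior[c]
        for w in doc_tokens:
            if w in vocab:                      # unknown words are simply ignored
                score += loglikelihood[(w, c)]
        if score > best_score:
            best_class, best_score = c, score
    return best_class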
Training the Naive Bayes Classifier
 How to compute 𝑃(𝑐) and 𝑃(𝑤𝑖|𝑐)?
 𝑃(𝑐) = 𝑁𝑐 / 𝑁𝑑𝑜𝑐
 𝑁𝑐: number of documents labelled with 𝑐
 𝑁𝑑𝑜𝑐: total number of documents

 We’ll assume a feature is just the existence of a word in the document’s bag of words
 𝑃(𝑤𝑖|𝑐) = count(𝑤𝑖, 𝑐) / Σ_{𝑤∈𝑉} count(𝑤, 𝑐)
 count(𝑤𝑖, 𝑐): number of occurrences of 𝑤𝑖 in the documents of class 𝑐
 𝑐: topic/class label
 𝑉: vocabulary of the dataset
A problem
 Consider the problem of movie reviews
 Imagine no positive review in the training set contains “fantastic”, but the test set does
 Then 𝑃(“fantastic” | positive) = 0, so the whole product for the class “positive” will be zero

 Solution: Laplace (add-one) smoothing
𝑃(𝑤𝑖|𝑐) = (count(𝑤𝑖, 𝑐) + 1) / (Σ_{𝑤∈𝑉} count(𝑤, 𝑐) + |𝑉|)

 Note: vocabulary 𝑉 consists of the union of all the word types in all classes, not just the words in one class 𝑐 (why?)
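A compact training sketch with add-one smoothing, one plausible rendering of the formulas above (the function and variable names are my own, not from the slides):

import math
from collections import Counter

def train_nb(labelled_docs):
    """labelled_docs: list of (token_list, class) pairs."""
    classes = {c for _, c in labelled_docs}
    vocab = {w for tokens, _ in labelled_docs for w in tokens}
    logprior, loglikelihood = {}, {}
    for c in classes:
        docs_c = [tokens for tokens, label in labelled_docs if label == c]
        logprior[c] = math.log(len(docs_c) / len(labelled_docs))   # log P(c) = log(N_c / N_doc)
        counts = Counter(w for tokens in docs_c for w in tokens)   # count(w, c)
        denom = sum(counts.values()) + len(vocab)                  # add-one smoothing denominator
        for w in vocab:
            loglikelihood[(w, c)] = math.log((counts[w] + 1) / denom)
    return logprior, loglikelihood, vocab, classes

Together with the classify sketch earlier, this gives the full pipeline: estimate log priors and smoothed log likelihoods, then score a test document in log space.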
More things to remove
 Unknown Words: words in test data but not in training data
 Ignore them / remove them from test document/sentence
 Stop words removal
 Very frequent words like ‘the’ and ‘a’.
 Sort by frequency and take the top 10–100 entries as stop words
 Or, use a pre-defined list (see the sketch below)


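A small preprocessing sketch for these two steps (the stop-word list here is only an illustrative placeholder, not from the slides):

STOP_WORDS = {"the", "a", "an", "and", "of", "to"}   # illustrative; use a fuller list in practice

def preprocess(tokens, vocab, remove_stop_words=True):
    """Drop unknown words (not in the training vocabulary) and, optionally, stop words."""
    kept = [w for w in tokens if w in vocab]
    if remove_stop_words:
        kept = [w for w in kept if w not in STOP_WORDS]
    return kept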
Improvements
 For a text classification task, whether a word occurs or not seems to matter more than its frequency
 Clip the word counts in each document at 1
 Binary Naïve Bayes
◼ Note: counts can still be greater than 1 in Binary NB, because the clipped per-document counts are summed over all documents of a class
 Dealing with negation
◼ I really like this movie (+ve)
◼ I didn’t like this movie (-ve)
◼ Negation can flip the sentiment of the words that follow it; a simple baseline is to prepend NOT_ to every word between a negation token and the next punctuation mark (see the sketch after this slide)
 Insufficient training data
◼ Leads to inaccurate training of Naïve Bayes
◼ Instead, derive features using sentiment lexicons
◼ Lists of words that are pre-annotated with positive or negative sentiment
◼ Add one feature for +ve words and one for –ve words
◼ Count of the +ve/–ve feature = count of words from the corresponding lexicon
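A sketch of the two tricks above: per-document count clipping for Binary NB and NOT_ marking for negation (the negator and punctuation sets are illustrative assumptions, not from the slides):

from collections import Counter

PUNCT = {".", ",", "!", "?", ";", ":"}
NEGATORS = {"not", "no", "never", "didn't", "don't", "isn't"}   # illustrative subset

def binarize(tokens):
    """Binary NB: each word counts at most once per document."""
    return Counter(set(tokens))

def mark_negation(tokens):
    """Prepend NOT_ to every token between a negation word and the next punctuation."""
    out, negating = [], False
    for tok in tokens:
        if tok in PUNCT:
            negating = False
            out.append(tok)
        elif negating:
            out.append("NOT_" + tok)
        else:
            out.append(tok)
            if tok.lower() in NEGATORS:
                negating = True
    return out

For example, mark_negation(["I", "didn't", "like", "this", "movie"]) yields ["I", "didn't", "NOT_like", "NOT_this", "NOT_movie"], so NOT_like becomes a feature distinct from like.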
Naive Bayes as a Language Model
 With individual words as features, Naive Bayes assigns a probability 𝑃(𝑤|𝑐) to each word and hence to whole sentences, so it can be viewed as a set of class-specific unigram language models
 e.g., 𝑃(“I love this fun film” | +) = 𝑃(I|+) · 𝑃(love|+) · 𝑃(this|+) · 𝑃(fun|+) · 𝑃(film|+)
Evaluation Metrics
Confusion Matrix
 Precision 𝑃 = TP / (TP + FP), Recall 𝑅 = TP / (TP + FN)
 Precision and Recall alone are not sufficient (why?)
F-measure
 𝐹𝛽 = (𝛽² + 1) 𝑃𝑅 / (𝛽²𝑃 + 𝑅)
 𝛽 > 1 favors recall
 𝛽 < 1 favors precision
 𝛽 = 1 gives equal importance to precision and recall
 𝐹𝛽=1 or 𝐹1 = 2𝑃𝑅 / (𝑃 + 𝑅)
 The harmonic mean is more conservative than the arithmetic mean
◼ Closer to the smaller of the two numbers
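A small sketch computing these metrics from raw confusion-matrix counts (binary case; the function name is my own):

def precision_recall_f(tp, fp, fn, beta=1.0):
    """Precision, recall and F_beta from true-positive, false-positive and false-negative counts."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    b2 = beta ** 2
    f_beta = (b2 + 1) * precision * recall / (b2 * precision + recall)
    return precision, recall, f_beta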
Evaluating more than two classes
 Combine per-class scores by macroaveraging (average precision/recall/F over the classes) or microaveraging (pool all decisions into a single confusion matrix first)
𝑘-fold Cross Validation
 Split the data into 𝑘 folds; train on 𝑘 − 1 folds, test on the held-out fold, rotate through all 𝑘 folds, and average the results (see the sketch below)
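A minimal cross-validation sketch, assuming the caller supplies whatever training-and-scoring function is being evaluated (none of these names come from the slides):

def cross_validate(labelled_docs, train_and_score, k=10):
    """Average a score over k train/test splits.

    train_and_score(train, test) is any function that trains a classifier on
    `train` and returns its score (e.g. accuracy or F1) on `test`.
    """
    folds = [labelled_docs[i::k] for i in range(k)]   # simple round-robin split
    scores = []
    for i in range(k):
        test = folds[i]
        train = [ex for j, fold in enumerate(folds) if j != i for ex in fold]
        scores.append(train_and_score(train, test))
    return sum(scores) / k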
Statistical Significance Testing
 How to decide if model/classifier 𝐴 is better than 𝐵?
 𝑀(𝐴, 𝑥): performance of model/classifier 𝐴 on test set 𝑥
 𝑀(𝐵, 𝑥): performance of model/classifier 𝐵 on test set 𝑥
 𝛿(𝑥) (effect size) = 𝑀(𝐴, 𝑥) − 𝑀(𝐵, 𝑥)
 Consider 𝛿(𝑥) = .04
 We want to check if 𝐴’s superiority over 𝐵 is likely to hold again if we checked another test set 𝑥′
 We define two hypotheses
 𝐻0: 𝛿(𝑥) ≤ 0 (Null Hypothesis, 𝐴 is not better than 𝐵)
 𝐻1: 𝛿(𝑥) > 0 (𝐴 is better than 𝐵)
 We want to test if we can confidently rule out the null hypothesis and instead support 𝐻1, i.e., 𝐴 is better
 Let 𝑋 be a random variable ranging over all test sets

 p-value: the probability, assuming the null hypothesis 𝐻0 is true, of seeing the 𝛿(𝑥) that we saw or one even greater
 If 𝐻0 is indeed true
◼ Large 𝛿(𝑥): highly surprising, p-value should be low, reject null hypothesis
◼ Small (+ve) 𝛿(𝑥): less surprising even if 𝐻0 is true, p-value should be high
 Threshold (like .01)
◼ p-value < .01, reject null hypothesis
 We say that a result (e.g., “𝐴 is better than 𝐵”) is statistically significant if the 𝛿 we saw has a probability below the threshold, and we therefore reject the null hypothesis
 How to estimate p-values?
 Create multiple test sets and measure 𝛿 on each
 Use a threshold to accept/reject the null hypothesis
The Paired Bootstrap Test
 Bootstrapping: repeatedly drawing a large number of smaller samples with replacement
 Create 𝑏 bootstrapped test sets 𝑥(1), …, 𝑥(𝑏) by sampling 𝑛 examples with replacement from the original test set 𝑥, and compute 𝛿(𝑥(𝑖)) on each
 𝟙(𝑥) = 1 if 𝑥 is true, and 0 otherwise
[Figure: distribution of 𝛿 values over the bootstrapped test sets]
 Goal: assume 𝐻0 and estimate how accidental/surprising 𝛿(𝑥) is
 Since the above distribution is centered on the observed 𝛿(𝑥) = .2 rather than on 0, to capture how surprising 𝛿(𝑥) is we compute:
p-value(𝑥) = (1/𝑏) Σ_{𝑖=1}^{𝑏} 𝟙( 𝛿(𝑥(𝑖)) − 𝛿(𝑥) ≥ 𝛿(𝑥) )
 Suppose
 10,000 bootstrapped test sets 𝑥(𝑖) are created
 The threshold is .01
 The formula above gives a p-value of .0047 (< threshold)
◼ Thus, we reject the null hypothesis
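A sketch of the paired bootstrap test under the assumptions above; a_correct and b_correct are hypothetical per-example 0/1 correctness lists for classifiers A and B on the same test set, and accuracy is used as the metric M:

import random

def paired_bootstrap_pvalue(a_correct, b_correct, b_samples=10_000):
    """Estimate the p-value for 'A is better than B' from paired per-example results."""
    n = len(a_correct)
    delta_x = (sum(a_correct) - sum(b_correct)) / n      # observed effect size delta(x)
    exceed = 0
    for _ in range(b_samples):
        idx = [random.randrange(n) for _ in range(n)]    # sample n examples with replacement
        delta_i = sum(a_correct[j] - b_correct[j] for j in idx) / n
        if delta_i - delta_x >= delta_x:                 # i.e. delta(x^(i)) >= 2 * delta(x)
            exceed += 1
    return exceed / b_samples

If the returned value is below the chosen threshold (e.g. .01), the null hypothesis is rejected.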
