
Text Classification

Slides adapted from Lyle Ungar and Dan Jurafsky


Example: Positive or negative movie review?
Example: What is the subject of this article?
Text Classification
Text Classification: Definition
Supervised learning: classification methods

Any kind of classifier can be used, for example (a minimal sketch follows the list):

• Naïve Bayes
• Logistic Regression
• Support-vector machines
• K-Nearest Neighbors
• Neural Networks
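
All of these drop into the same supervised pipeline: featurize the text, fit on labeled examples, predict on new ones. A minimal sketch, assuming scikit-learn; the toy corpus and labels are hypothetical, not from the slides:

# Hypothetical toy corpus; any classifier can be swapped into the same pipeline.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.linear_model import LogisticRegression

docs = ["fun couple love love", "fast furious shoot",
        "couple fly fast fun fun", "furious shoot shoot fun"]
labels = ["comedy", "action", "comedy", "action"]

X = CountVectorizer().fit_transform(docs)             # bag-of-words counts
for clf in (MultinomialNB(), LogisticRegression()):   # swap in any classifier
    clf.fit(X, labels)
    print(type(clf).__name__, clf.predict(X[:1]))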
Text Classification: Naïve Bayes

Naïve Bayes Intuition


Text: Bag of words representation
Text: Bag of words using a subset of words
Text: Bag of words representation (vectors)
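
A minimal sketch of the representation in plain Python (the example sentence is hypothetical): word order is discarded and only counts remain, which is exactly what the vector form records.

from collections import Counter

doc = "great movie great acting but a slow plot"
bow = Counter(doc.split())        # order is discarded; only counts remain
print(bow["great"])               # -> 2

vocab = sorted(set(doc.split()))            # fixed vocabulary
vector = [bow[w] for w in vocab]            # count vector over the vocabulary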
Bayes’ Rule for document and classes
Bayes’ Rule and MAP (I)
Bayes’ Rule and MAP (II)
Bayes’ Rule and MAP (III)
Naïve Bayes Independence Assumptions
Naïve Bayes Classifier
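
The derivation behind these headings, in standard form. Bayes’ rule for a document d and a class c:

P(c|d) = P(d|c) P(c) / P(d)

The MAP (maximum a posteriori) class; P(d) is the same for every class, so it drops out:

c_MAP = argmax_c P(c|d) = argmax_c P(d|c) P(c)

With the naïve independence assumption over word positions x1, ..., xn:

c_NB = argmax_c P(c) ∏i P(xi|c)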
Learning Naïve Bayes Model: Prior

First attempt: maximum likelihood estimation of the parameters; simply use the frequencies in the data.

The prior is the fraction of documents belonging to topic j:

P(cj) = (number of documents labeled cj) / (total number of documents)


Learning Naïve Bayes Model: Conditional Probabilities

The likelihood of word wi given class cj (with xi = wi for a word position) is the fraction of times wi appears among all words in documents of topic cj:

P(wi|cj) = count(wi, cj) / Σw count(w, cj)

In practice: create a mega-document for topic j by concatenating all documents in the topic, then use its word frequencies.


Zero probability problems

If a word never occurs with a class in the training data, its maximum likelihood estimate is 0, and a single zero wipes out the entire product.

Laplace (add-1) smoothing for Naïve Bayes

P(wi|cj) = (count(wi, cj) + 1) / (Σw count(w, cj) + |V|)

where |V| is the vocabulary size.

Algorithm with smoothing parameter
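
A minimal sketch of the full algorithm in plain Python with add-1 smoothing; the helper names train_nb and classify are my own, not from the slides.

import math
from collections import Counter, defaultdict

def train_nb(docs, labels):
    """docs: list of token lists; labels: parallel list of class names."""
    n = len(docs)
    priors, counts, totals = {}, defaultdict(Counter), Counter()
    vocab = set()
    for tokens, c in zip(docs, labels):
        counts[c].update(tokens)          # "mega-document" word counts per class
        totals[c] += len(tokens)
        vocab.update(tokens)
    for c in set(labels):
        priors[c] = labels.count(c) / n   # fraction of documents with class c
    return priors, counts, totals, vocab

def classify(tokens, priors, counts, totals, vocab):
    def score(c):
        s = math.log(priors[c])
        for w in tokens:
            if w in vocab:                # words unseen in training are skipped
                s += math.log((counts[c][w] + 1) / (totals[c] + len(vocab)))
        return s
    return max(priors, key=score)         # argmax over classes, in log space

Scores are summed in log space to avoid floating-point underflow, and out-of-vocabulary words are simply skipped; both are common choices rather than the only possible ones.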
Example

Training documents and a test document d5 (word columns as in the classic Jurafsky example this deck adapts; they match the counts below):

Doc  Words                                Class
d1   Chinese Beijing Chinese              c0
d2   Chinese Chinese Shanghai             c0
d3   Chinese Macao                        c0
d4   Tokyo Japan Chinese                  c1
d5   Chinese Chinese Chinese Tokyo Japan  ?

Priors:

P(c0) = 3/4
P(c1) = 1/4

Conditional probabilities (add-1 smoothing; c0 has 8 words, c1 has 3 words, |V| = 6):

P(Chinese|c0) = (5+1)/(8+6) = 6/14 = 3/7    P(Chinese|c1) = (1+1)/(3+6) = 2/9
P(Tokyo|c0)   = (0+1)/(8+6) = 1/14          P(Tokyo|c1)   = (1+1)/(3+6) = 2/9
P(Japan|c0)   = (0+1)/(8+6) = 1/14          P(Japan|c1)   = (1+1)/(3+6) = 2/9

Choosing a class for d5:

P(c0|d5) ∝ P(c0) · P(Chinese|c0)^3 · P(Tokyo|c0) · P(Japan|c0)
         = 3/4 · (3/7)^3 · 1/14 · 1/14 ≈ 0.0003

P(c1|d5) ∝ P(c1) · P(Chinese|c1)^3 · P(Tokyo|c1) · P(Japan|c1)
         = 1/4 · (2/9)^3 · 2/9 · 2/9 ≈ 0.0001

So the classifier chooses c0.
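
Plugging the example into the hypothetical train_nb/classify sketch from earlier reproduces this choice:

docs = [["Chinese", "Beijing", "Chinese"],
        ["Chinese", "Chinese", "Shanghai"],
        ["Chinese", "Macao"],
        ["Tokyo", "Japan", "Chinese"]]
labels = ["c0", "c0", "c0", "c1"]
priors, counts, totals, vocab = train_nb(docs, labels)

d5 = ["Chinese", "Chinese", "Chinese", "Tokyo", "Japan"]
print(classify(d5, priors, counts, totals, vocab))   # -> "c0"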
Summary
