Tutorial 4

This document discusses Naive Bayes and Gaussian Bayes classifiers. It begins by introducing Bayes' rule and the Naive Bayes assumption of conditional independence between features given the class. It then provides an example of using a Bernoulli Naive Bayes model for spam classification. The document derives the maximum likelihood estimators for the Naive Bayes model parameters. It also discusses using a Gaussian distribution instead of conditional independence, and derives the maximum likelihood estimators for the Gaussian Bayes model parameters.

Naive Bayes and Gaussian Bayes Classifier

Mengye Ren
[email protected]

October 18, 2015

Naive Bayes

Bayes' rule:

$$p(t|x) = \frac{p(x|t)\,p(t)}{p(x)}$$

Naive Bayes assumption:

$$p(x|t) = \prod_{j=1}^{D} p(x_j|t)$$

Likelihood function:

$$L(\theta) = p(x, t|\theta) = p(x|t, \theta)\,p(t|\theta)$$
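To make Bayes' rule concrete, here is a small worked example with made-up numbers (not from the tutorial): suppose $p(\text{spam}) = 0.4$, $p(x|\text{spam}) = 0.3$, and $p(x|\neg\text{spam}) = 0.05$. Then

$$p(\text{spam}|x) = \frac{0.3 \times 0.4}{0.3 \times 0.4 + 0.05 \times 0.6} = \frac{0.12}{0.15} = 0.8.$$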

Example: Spam Classification

Each vocabulary word is one feature dimension.

We encode each email as a feature vector $x \in \{0,1\}^{|V|}$, where
$x_j = 1$ iff vocabulary word $j$ appears in the email.

We want to model the probability of any word $x_j$ appearing in an
email, given that the email is spam or not.
Example words: $10,000, Toronto, Piazza, etc.

Idea: use a Bernoulli distribution to model $p(x_j|t)$.
Example: $p(\text{``\$10,000''}\,|\,\text{spam}) = 0.3$
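As a concrete sketch of this encoding (the vocabulary and email below are made-up examples, not from the tutorial):

```python
import numpy as np

vocab = ["$10,000", "toronto", "piazza", "free", "winner"]
word_index = {w: j for j, w in enumerate(vocab)}

def encode(email_words):
    """Return x in {0,1}^|V| with x_j = 1 iff vocabulary word j appears."""
    x = np.zeros(len(vocab), dtype=int)
    for w in email_words:
        if w in word_index:
            x[word_index[w]] = 1
    return x

print(encode(["you", "are", "a", "winner", "free", "$10,000"]))  # [1 0 0 1 1]
```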

Bernoulli Naive Bayes

Assuming all data points $x^{(i)}$ are i.i.d. samples, and $p(x_j|t)$ follows a
Bernoulli distribution with parameter $\mu_{jt}$:

$$p(x^{(i)}|t^{(i)}) = \prod_{j=1}^{D} \mu_{jt^{(i)}}^{x_j^{(i)}} \left(1 - \mu_{jt^{(i)}}\right)^{1 - x_j^{(i)}}$$

$$p(t|x) \propto \prod_{i=1}^{N} p(t^{(i)})\,p(x^{(i)}|t^{(i)}) = \prod_{i=1}^{N} p(t^{(i)}) \prod_{j=1}^{D} \mu_{jt^{(i)}}^{x_j^{(i)}} \left(1 - \mu_{jt^{(i)}}\right)^{1 - x_j^{(i)}}$$

where $p(t) = \pi_t$. The parameters $\pi_t$ and $\mu_{jt}$ can be learnt using maximum
likelihood.
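For instance, the per-example log of this joint could be computed as follows (a minimal sketch; `mu[k, j]` and `pi[k]` are assumed given, and the names are ours):

```python
import numpy as np

def log_joint(x, k, mu, pi):
    """log pi_k + sum_j [x_j log mu_kj + (1 - x_j) log(1 - mu_kj)]."""
    return np.log(pi[k]) + np.sum(
        x * np.log(mu[k]) + (1 - x) * np.log(1 - mu[k]))
```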

Derivation of maximum likelihood estimator (MLE)

$$\theta = [\mu, \pi]$$

$$\log L(\theta) = \log p(x, t|\theta) = \sum_{i=1}^{N} \left[ \log \pi_{t^{(i)}} + \sum_{j=1}^{D} x_j^{(i)} \log \mu_{jt^{(i)}} + \left(1 - x_j^{(i)}\right) \log\left(1 - \mu_{jt^{(i)}}\right) \right]$$

Want: $\arg\max_\theta \log L(\theta)$ subject to $\sum_k \pi_k = 1$.

Derivation of maximum likelihood estimator (MLE)
Take the derivative w.r.t. $\mu$:

$$\frac{\partial \log L(\theta)}{\partial \mu_{jk}} = 0 \;\Rightarrow\; \sum_{i=1}^{N} \mathbb{1}\left[t^{(i)} = k\right] \left( \frac{x_j^{(i)}}{\mu_{jk}} - \frac{1 - x_j^{(i)}}{1 - \mu_{jk}} \right) = 0$$

$$\sum_{i=1}^{N} \mathbb{1}\left[t^{(i)} = k\right] \left[ x_j^{(i)} \left(1 - \mu_{jk}\right) - \left(1 - x_j^{(i)}\right) \mu_{jk} \right] = 0$$

$$\sum_{i=1}^{N} \mathbb{1}\left[t^{(i)} = k\right] \mu_{jk} = \sum_{i=1}^{N} \mathbb{1}\left[t^{(i)} = k\right] x_j^{(i)}$$

$$\mu_{jk} = \frac{\sum_{i=1}^{N} \mathbb{1}\left[t^{(i)} = k\right] x_j^{(i)}}{\sum_{i=1}^{N} \mathbb{1}\left[t^{(i)} = k\right]}$$

Derivation of maximum likelihood estimator (MLE)

Use a Lagrange multiplier to derive $\pi$:

$$\frac{\partial L(\theta)}{\partial \pi_k} + \lambda \frac{\partial \sum_\kappa \pi_\kappa}{\partial \pi_k} = 0 \;\Rightarrow\; \lambda = -\frac{1}{\pi_k} \sum_{i=1}^{N} \mathbb{1}\left[t^{(i)} = k\right]$$

$$\pi_k = -\frac{\sum_{i=1}^{N} \mathbb{1}\left[t^{(i)} = k\right]}{\lambda}$$

Apply the constraint $\sum_k \pi_k = 1 \Rightarrow \lambda = -N$:

$$\pi_k = \frac{\sum_{i=1}^{N} \mathbb{1}\left[t^{(i)} = k\right]}{N}$$
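In code, both closed-form estimators are just counts; a minimal sketch for binary data (variable names are ours, not from the tutorial):

```python
import numpy as np

def fit_bernoulli_nb(X, t, K):
    """MLE for pi_k and mu_jk; X is (N, D) binary, t is (N,) ints in {0..K-1}."""
    N, D = X.shape
    one_hot = np.eye(K)[t]                  # one_hot[i, k] = 1[t_i = k]
    counts = one_hot.sum(axis=0)            # sum_i 1[t_i = k]
    pi = counts / N                         # pi_k = count_k / N
    mu = one_hot.T @ X / counts[:, None]    # mu_kj = sum_i 1[t_i = k] x_ij / count_k
    return pi, mu
```

(In practice a small smoothing count is often added so that no $\mu_{jk}$ is exactly 0 or 1.)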

Spam Classification Demo
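The slides defer to a live demo here; the original demo code is not included in this document. A toy stand-in using the estimators above might look like this (data and names are made up):

```python
import numpy as np

X = np.array([[1, 1, 0], [1, 0, 0], [0, 0, 1], [0, 1, 1]])  # 4 toy "emails"
t = np.array([1, 1, 0, 0])                                   # 1 = spam

one_hot = np.eye(2)[t]
pi = one_hot.sum(axis=0) / len(t)                  # class priors
mu = one_hot.T @ X / one_hot.sum(axis=0)[:, None]  # per-class word probabilities

eps = 1e-9                                         # guard against log(0)
log_post = (np.log(pi)
            + X @ np.log(mu + eps).T
            + (1 - X) @ np.log(1 - mu + eps).T)    # unnormalized log-posterior
print(log_post.argmax(axis=1))                     # [1 1 0 0] on this toy data
```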

Gaussian Bayes Classifier

Instead of assuming conditional independence of the $x_j$, we model $p(x|t)$ as a
Gaussian distribution, and the dependence between the $x_j$ is encoded in the
covariance matrix.

Multivariate Gaussian distribution:

$$f(x) = \frac{1}{\sqrt{(2\pi)^D \det(\Sigma)}} \exp\left( -\frac{1}{2} (x - \mu)^T \Sigma^{-1} (x - \mu) \right)$$

$\mu$: mean, $\Sigma$: covariance matrix, $D$: $\dim(x)$
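A direct transcription of this density (a sketch; the function and variable names are ours):

```python
import numpy as np

def gaussian_pdf(x, mu, Sigma):
    """Multivariate Gaussian density at x, computed exactly as above."""
    D = x.shape[0]
    diff = x - mu
    norm = np.sqrt((2 * np.pi) ** D * np.linalg.det(Sigma))
    return np.exp(-0.5 * diff @ np.linalg.solve(Sigma, diff)) / norm
```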

Derivation of maximum likelihood estimator (MLE)

$$\theta = [\mu, \Sigma, \pi], \qquad Z = \sqrt{(2\pi)^D \det(\Sigma)}$$

$$p(x|t) = \frac{1}{Z} \exp\left( -\frac{1}{2} (x - \mu)^T \Sigma^{-1} (x - \mu) \right)$$

$$\log L(\theta) = \log p(x, t|\theta) = \log p(t|\theta) + \log p(x|t, \theta)$$

$$= \sum_{i=1}^{N} \left[ \log \pi_{t^{(i)}} - \log Z - \frac{1}{2} \left(x^{(i)} - \mu_{t^{(i)}}\right)^T \Sigma_{t^{(i)}}^{-1} \left(x^{(i)} - \mu_{t^{(i)}}\right) \right]$$

Want: $\arg\max_\theta \log L(\theta)$ subject to $\sum_k \pi_k = 1$.

Derivation of maximum likelihood estimator (MLE)

Take the derivative w.r.t. $\mu$:

$$\frac{\partial \log L}{\partial \mu_k} = \sum_{i=1}^{N} \mathbb{1}\left[t^{(i)} = k\right] \Sigma_k^{-1} \left(x^{(i)} - \mu_k\right) = 0$$

$$\mu_k = \frac{\sum_{i=1}^{N} \mathbb{1}\left[t^{(i)} = k\right] x^{(i)}}{\sum_{i=1}^{N} \mathbb{1}\left[t^{(i)} = k\right]}$$

Derivation of maximum likelihood estimator (MLE)

Take the derivative w.r.t. $\Sigma^{-1}$ (not $\Sigma$).

Note:

$$\frac{\partial \det(A)}{\partial A} = \det(A) \left(A^{-1}\right)^T$$

$$\det(A)^{-1} = \det\left(A^{-1}\right)$$

$$\frac{\partial x^T A x}{\partial A} = x x^T$$

$$\Sigma^T = \Sigma$$

$$\frac{\partial \log L}{\partial \Sigma_k^{-1}} = -\sum_{i=1}^{N} \mathbb{1}\left[t^{(i)} = k\right] \left[ \frac{\partial \log Z_k}{\partial \Sigma_k^{-1}} + \frac{1}{2} \left(x^{(i)} - \mu_k\right) \left(x^{(i)} - \mu_k\right)^T \right] = 0$$

Derivation of maximum likelihood estimator (MLE)
$$Z_k = \sqrt{(2\pi)^D \det(\Sigma_k)}$$

$$\frac{\partial \log Z_k}{\partial \Sigma_k^{-1}} = \frac{1}{Z_k} \frac{\partial Z_k}{\partial \Sigma_k^{-1}} = (2\pi)^{-\frac{D}{2}} \det(\Sigma_k)^{-\frac{1}{2}}\, (2\pi)^{\frac{D}{2}} \frac{\partial \det\left(\Sigma_k^{-1}\right)^{-\frac{1}{2}}}{\partial \Sigma_k^{-1}}$$

$$= \det\left(\Sigma_k^{-1}\right)^{\frac{1}{2}} \left( -\frac{1}{2} \right) \det\left(\Sigma_k^{-1}\right)^{-\frac{3}{2}} \det\left(\Sigma_k^{-1}\right) \Sigma_k^T = -\frac{1}{2} \Sigma_k$$

$$\frac{\partial \log L}{\partial \Sigma_k^{-1}} = \sum_{i=1}^{N} \mathbb{1}\left[t^{(i)} = k\right] \left[ \frac{1}{2} \Sigma_k - \frac{1}{2} \left(x^{(i)} - \mu_k\right) \left(x^{(i)} - \mu_k\right)^T \right] = 0$$

$$\Sigma_k = \frac{\sum_{i=1}^{N} \mathbb{1}\left[t^{(i)} = k\right] \left(x^{(i)} - \mu_k\right) \left(x^{(i)} - \mu_k\right)^T}{\sum_{i=1}^{N} \mathbb{1}\left[t^{(i)} = k\right]}$$

Derivation of maximum likelihood estimator (MLE)

$$\pi_k = \frac{\sum_{i=1}^{N} \mathbb{1}\left[t^{(i)} = k\right]}{N}$$

(Same as the Bernoulli case.)
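Putting the three estimators together in code (a minimal sketch; real-valued `X` of shape (N, D), integer labels `t`, and the names are ours):

```python
import numpy as np

def fit_gaussian_bayes(X, t, K):
    """MLE for pi_k, mu_k, Sigma_k as derived above."""
    N, D = X.shape
    pi = np.bincount(t, minlength=K) / N
    mu = np.stack([X[t == k].mean(axis=0) for k in range(K)])
    Sigma = np.stack([
        (X[t == k] - mu[k]).T @ (X[t == k] - mu[k]) / (t == k).sum()
        for k in range(K)])
    return pi, mu, Sigma
```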

Gaussian Bayes Classifier Demo
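As with the spam demo, the original code is not part of this document; a synthetic stand-in (made-up data, our names) might be:

```python
import numpy as np

rng = np.random.default_rng(0)
X = np.vstack([rng.normal([0.0, 0.0], 1.0, size=(50, 2)),
               rng.normal([3.0, 3.0], 0.5, size=(50, 2))])
t = np.repeat([0, 1], 50)

# Inline MLE fit (np.cov with bias=True divides by N, matching the MLE).
pi = np.bincount(t) / len(t)
mu = np.stack([X[t == k].mean(axis=0) for k in range(2)])
Sigma = np.stack([np.cov(X[t == k].T, bias=True) for k in range(2)])

def log_posterior(x):
    """Unnormalized log p(t = k | x) for each class k."""
    scores = []
    for k in range(2):
        diff = x - mu[k]
        scores.append(np.log(pi[k])
                      - 0.5 * np.log((2 * np.pi) ** 2 * np.linalg.det(Sigma[k]))
                      - 0.5 * diff @ np.linalg.solve(Sigma[k], diff))
    return np.array(scores)

print(log_posterior(np.array([2.0, 2.0])).argmax())  # classify one point
```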

Gaussian Bayes Classifier

If we constrain $\Sigma$ to be diagonal, then we can rewrite $p(x|t)$ as a product
of the $p(x_j|t)$:

$$p(x|t) = \frac{1}{\sqrt{(2\pi)^D \det(\Sigma_t)}} \exp\left( -\frac{1}{2} (x - \mu_t)^T \Sigma_t^{-1} (x - \mu_t) \right)$$

$$= \prod_{j=1}^{D} \frac{1}{\sqrt{2\pi \Sigma_{t,jj}}} \exp\left( -\frac{(x_j - \mu_{jt})^2}{2\Sigma_{t,jj}} \right) = \prod_{j=1}^{D} p(x_j|t)$$

A diagonal covariance matrix satisfies the naive Bayes assumption.
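A quick numeric check of this factorization, with toy numbers of our choosing:

```python
import numpy as np

x = np.array([0.5, -1.2, 2.0])
mu = np.array([0.0, 1.0, 1.5])
var = np.array([1.0, 0.5, 2.0])    # diagonal entries of Sigma_t

joint = np.exp(-0.5 * np.sum((x - mu) ** 2 / var)) / np.sqrt(
    (2 * np.pi) ** 3 * np.prod(var))
per_dim = np.prod(np.exp(-0.5 * (x - mu) ** 2 / var)
                  / np.sqrt(2 * np.pi * var))
print(np.isclose(joint, per_dim))  # True
```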

Gaussian Bayes Classifier

Case 1: the covariance matrix is shared among classes,

$$p(x|t) = \mathcal{N}(x|\mu_t, \Sigma)$$

Case 2: each class has its own covariance,

$$p(x|t) = \mathcal{N}(x|\mu_t, \Sigma_t)$$

Gaussian Bayes Binary Classifier Decision Boundary

If the covariance is shared between classes, the decision boundary is where

$$p(t = 1|x) = p(t = 0|x)$$

$$\log \pi_1 - \frac{1}{2} (x - \mu_1)^T \Sigma^{-1} (x - \mu_1) = \log \pi_0 - \frac{1}{2} (x - \mu_0)^T \Sigma^{-1} (x - \mu_0)$$

$$C + x^T \Sigma^{-1} x - 2\mu_1^T \Sigma^{-1} x + \mu_1^T \Sigma^{-1} \mu_1 = x^T \Sigma^{-1} x - 2\mu_0^T \Sigma^{-1} x + \mu_0^T \Sigma^{-1} \mu_0$$

$$2(\mu_0 - \mu_1)^T \Sigma^{-1} x - \left( \mu_0^T \Sigma^{-1} \mu_0 - \mu_1^T \Sigma^{-1} \mu_1 \right) = C$$

$$\Rightarrow a^T x - b = 0$$

The decision boundary is a linear function (a hyperplane in general).
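In code, with the prior terms written out explicitly (a sketch; names are ours, and the sign convention is chosen so that $a^T x - b > 0$ selects class 1):

```python
import numpy as np

def linear_boundary(mu0, mu1, Sigma, pi0, pi1):
    """Coefficients of the boundary a @ x - b = 0 for shared covariance."""
    Sinv = np.linalg.inv(Sigma)
    a = 2 * Sinv @ (mu1 - mu0)
    b = (2 * np.log(pi0 / pi1)
         - mu0 @ Sinv @ mu0 + mu1 @ Sinv @ mu1)
    return a, b  # a @ x - b > 0  <=>  class 1 is more probable
```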

Relation to Logistic Regression

We can write the posterior distribution $p(t = 0|x)$ as

$$\frac{p(x, t = 0)}{p(x, t = 0) + p(x, t = 1)} = \frac{\pi_0 \mathcal{N}(x|\mu_0, \Sigma)}{\pi_0 \mathcal{N}(x|\mu_0, \Sigma) + \pi_1 \mathcal{N}(x|\mu_1, \Sigma)}$$

$$= \left[ 1 + \frac{\pi_1}{\pi_0} \exp\left( -\frac{1}{2} (x - \mu_1)^T \Sigma^{-1} (x - \mu_1) + \frac{1}{2} (x - \mu_0)^T \Sigma^{-1} (x - \mu_0) \right) \right]^{-1}$$

$$= \left[ 1 + \exp\left( \log \frac{\pi_1}{\pi_0} + (\mu_1 - \mu_0)^T \Sigma^{-1} x - \frac{1}{2} \left( \mu_1^T \Sigma^{-1} \mu_1 - \mu_0^T \Sigma^{-1} \mu_0 \right) \right) \right]^{-1}$$

$$= \frac{1}{1 + \exp(-w^T x - b)}$$
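A quick numeric check of this identity on made-up parameters (our names; the shared Gaussian normalizer cancels, so it is omitted):

```python
import numpy as np

mu0, mu1 = np.array([0.0, 0.0]), np.array([2.0, 1.0])
Sigma = np.array([[1.0, 0.3], [0.3, 1.0]])
pi0, pi1 = 0.6, 0.4
Sinv = np.linalg.inv(Sigma)

w = -Sinv @ (mu1 - mu0)
b = -np.log(pi1 / pi0) + 0.5 * (mu1 @ Sinv @ mu1 - mu0 @ Sinv @ mu0)

def gauss(x, mu):
    d = x - mu
    return np.exp(-0.5 * d @ Sinv @ d)   # unnormalized; normalizers cancel

x = np.array([1.5, -0.5])
direct = pi0 * gauss(x, mu0) / (pi0 * gauss(x, mu0) + pi1 * gauss(x, mu1))
sigmoid = 1 / (1 + np.exp(-(w @ x + b)))
print(np.isclose(direct, sigmoid))       # True
```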

Gaussian Bayes Binary Classifier Decision Boundary

If the covariance is not shared between classes, the decision boundary is where

$$p(t = 1|x) = p(t = 0|x)$$

$$\log \pi_1 - \frac{1}{2} (x - \mu_1)^T \Sigma_1^{-1} (x - \mu_1) = \log \pi_0 - \frac{1}{2} (x - \mu_0)^T \Sigma_0^{-1} (x - \mu_0)$$

$$x^T \left( \Sigma_1^{-1} - \Sigma_0^{-1} \right) x - 2 \left( \mu_1^T \Sigma_1^{-1} - \mu_0^T \Sigma_0^{-1} \right) x + \left( \mu_1^T \Sigma_1^{-1} \mu_1 - \mu_0^T \Sigma_0^{-1} \mu_0 \right) = C$$

$$\Rightarrow x^T Q x - 2b^T x + c = 0$$

The decision boundary is a quadratic function. In the 2-d case, it looks
like an ellipse, a parabola, or a hyperbola.
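In code, with the priors and normalizers folded into the constant term (a sketch; names are ours):

```python
import numpy as np

def quadratic_boundary(mu0, mu1, Sigma0, Sigma1, pi0, pi1):
    """Coefficients of x @ Q @ x - 2 * b @ x + c = 0 for unshared covariances."""
    S0, S1 = np.linalg.inv(Sigma0), np.linalg.inv(Sigma1)
    Q = S1 - S0
    b = S1 @ mu1 - S0 @ mu0
    c = (mu1 @ S1 @ mu1 - mu0 @ S0 @ mu0
         - 2 * np.log(pi1 / pi0)
         + np.log(np.linalg.det(Sigma1) / np.linalg.det(Sigma0)))
    return Q, b, c
```

Note the $\log\det$ terms: unlike the shared case, the Gaussian normalizers no longer cancel, so they appear in the constant.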

Thanks!
