Unit 3: Bayesian Logistic Regression

Bayesian logistic regression models the probability of class membership p(C|φ) as a sigmoid function of a linear predictor. Exact Bayesian inference is intractable, so Laplace approximation is used to fit a Gaussian distribution q(w) approximating the posterior p(w|t). The predictive distribution is obtained by convolving this Gaussian with the sigmoid, which is approximated using the probit function for computational simplicity.

Bayesian Logistic Regression

• Logistic regression is a discriminative probabilistic linear classifier: $p(C_1 | \phi) = \sigma(w^T \phi)$
• Exact Bayesian inference for logistic regression, $p(C_1 | \phi) = \int \sigma(w^T \phi)\, p(w|t)\, dw$, is intractable, because:
1. Evaluation of the posterior distribution p(w|t)
   – Needs normalization of the prior $p(w) = N(w \mid m_0, S_0)$ times the likelihood (a product of sigmoids) $p(t|w) = \prod_{n=1}^{N} y_n^{t_n} (1 - y_n)^{1 - t_n}$
   • Solution: use the Laplace approximation to get a Gaussian $q(w)$
2. Evaluation of the predictive distribution $p(C_1 | \phi) \simeq \int \sigma(w^T \phi)\, q(w)\, dw$
   – Convolution of a sigmoid and a Gaussian
   • Solution: approximate the sigmoid by a probit
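For reference, the predictive integral can always be estimated by brute-force Monte Carlo over samples of w; this self-contained sketch with made-up numbers is only meant to make the target of the closed-form approximations below concrete, and is not part of the slides.

```python
import numpy as np

def sigmoid(a):
    return 1.0 / (1.0 + np.exp(-a))

rng = np.random.default_rng(0)
phi = np.array([1.0, 2.0])                  # feature vector phi(x), illustrative
w_mean, w_cov = np.zeros(2), np.eye(2)      # stand-in Gaussian q(w), illustrative

# p(C1 | phi) = integral of sigma(w^T phi) q(w) dw, estimated by sampling w ~ q(w)
w_samples = rng.multivariate_normal(w_mean, w_cov, size=10_000)
p_c1 = sigmoid(w_samples @ phi).mean()
print(p_c1)
```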
Laplace Approximation (summary)
• Need mode w0 of posterior distribution p(w|t)
– Done by a numerical optimization algorithm
• Fit a Gaussian centered at the mode
  $q(w) = \dfrac{|A|^{1/2}}{(2\pi)^{M/2}} \exp\left\{ -\tfrac{1}{2}(w - w_0)^T A (w - w_0) \right\} = N(w \mid w_0, A^{-1})$
  – Needs second derivatives of the log posterior: $A = -\nabla\nabla \ln f(w)\,\big|_{w = w_0}$
• Equivalent to finding the Hessian matrix
  $S_N^{-1} = -\nabla\nabla \ln p(w|t) = S_0^{-1} + \sum_{n=1}^{N} y_n (1 - y_n)\, \phi_n \phi_n^T$
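A minimal NumPy sketch of this Hessian computation; the names Phi, y, and S0_inv are illustrative assumptions, not from the slides.

```python
import numpy as np

def posterior_precision(Phi, y, S0_inv):
    """Hessian of the negative log posterior.

    Phi    : (N, M) design matrix of basis-function vectors phi_n
    y      : (N,)   sigmoid outputs y_n = sigma(w^T phi_n)
    S0_inv : (M, M) prior precision S_0^{-1}
    Returns S_N^{-1} = S_0^{-1} + sum_n y_n (1 - y_n) phi_n phi_n^T.
    """
    R = y * (1.0 - y)                          # weights y_n (1 - y_n)
    return S0_inv + Phi.T @ (R[:, None] * Phi)
```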
Evaluation of Posterior Distribution
• Gaussian prior
  $p(w) = N(w \mid m_0, S_0)$
  – where $m_0$ and $S_0$ are hyperparameters
• Posterior distribution
  $p(w|t) \propto p(w)\, p(t|w)$, where $t = (t_1, \ldots, t_N)^T$
  – Substituting the likelihood $p(t|w) = \prod_{n=1}^{N} y_n^{t_n} \{1 - y_n\}^{1 - t_n}$ gives
    $\ln p(w|t) = -\tfrac{1}{2}(w - m_0)^T S_0^{-1}(w - m_0) + \sum_{n=1}^{N} \left\{ t_n \ln y_n + (1 - t_n)\ln(1 - y_n) \right\} + \text{const}$
    where $y_n = \sigma(w^T \phi_n)$
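The log posterior above translates directly into code; the following sketch evaluates it up to the additive constant (the names Phi, t, m0, S0_inv are illustrative assumptions).

```python
import numpy as np

def sigmoid(a):
    return 1.0 / (1.0 + np.exp(-a))

def log_posterior(w, Phi, t, m0, S0_inv):
    """ln p(w|t) up to an additive constant.

    Gaussian prior term  : -1/2 (w - m0)^T S0^{-1} (w - m0)
    Bernoulli likelihood : sum_n t_n ln y_n + (1 - t_n) ln(1 - y_n)
    """
    y = sigmoid(Phi @ w)
    eps = 1e-12                                   # guard against log(0)
    prior = -0.5 * (w - m0) @ S0_inv @ (w - m0)
    lik = np.sum(t * np.log(y + eps) + (1 - t) * np.log(1 - y + eps))
    return prior + lik
```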
Gaussian Approximation of Posterior
• Maximize the posterior p(w|t) to give the MAP solution $w_{MAP}$
  – Done by numerical optimization
  – Defines the mean of the Gaussian
• Covariance given by the inverse of the matrix of second derivatives of the negative log posterior:
  $S_N^{-1} = -\nabla\nabla \ln p(w|t) = S_0^{-1} + \sum_{n=1}^{N} y_n (1 - y_n)\, \phi_n \phi_n^T$
• Gaussian approximation to the posterior:
  $q(w) = N(w \mid w_{MAP}, S_N)$
• Need to marginalize with respect to this distribution to make predictions
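Putting the pieces together, a sketch of the Laplace fit under the assumptions above; it reuses the log_posterior and posterior_precision helpers from the earlier sketches and scipy.optimize.minimize for the numerical optimization.

```python
import numpy as np
from scipy.optimize import minimize

def laplace_fit(Phi, t, m0, S0):
    """Fit q(w) = N(w | w_MAP, S_N) by Laplace approximation."""
    S0_inv = np.linalg.inv(S0)

    # 1. Numerical optimization: maximize ln p(w|t), i.e. minimize its negative.
    neg_log_post = lambda w: -log_posterior(w, Phi, t, m0, S0_inv)
    w_map = minimize(neg_log_post, x0=m0, method="BFGS").x

    # 2. Curvature at the mode: S_N^{-1} = S_0^{-1} + sum_n y_n(1-y_n) phi_n phi_n^T.
    y = sigmoid(Phi @ w_map)
    SN = np.linalg.inv(posterior_precision(Phi, y, S0_inv))
    return w_map, SN
```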
Predictive Distribution
• Predictive distribution for class $C_1$, given a new feature vector $\phi(x)$
  – Obtained by marginalizing with respect to the posterior p(w|t):
    $p(C_1 | \phi, t) = \int p(C_1, w | \phi, t)\, dw$   (sum rule)
    $\quad = \int p(C_1 | \phi, t, w)\, p(w|t)\, dw$   (product rule)
    $\quad = \int p(C_1 | \phi, w)\, p(w|t)\, dw$   (given $\phi$ and w, $C_1$ is independent of t)
    $\quad \simeq \int \sigma(w^T \phi)\, q(w)\, dw$   (approximate p(w|t) by the Gaussian q(w))
  – The corresponding probability for class $C_2$ is $p(C_2 | \phi, t) = 1 - p(C_1 | \phi, t)$
Predictive Distribution is a Convolution
• $p(C_1 | \phi, t) \simeq \int \sigma(w^T \phi)\, q(w)\, dw$
  – The function $\sigma(w^T \phi)$ depends on w only through its projection onto $\phi$
  – Denoting $a = w^T \phi$, we have $\sigma(w^T \phi) = \int \delta(a - w^T \phi)\, \sigma(a)\, da$, where $\delta$ is the Dirac delta function
  – Thus $\int \sigma(w^T \phi)\, q(w)\, dw = \int \sigma(a)\, p(a)\, da$, where $p(a) = \int \delta(a - w^T \phi)\, q(w)\, dw$
• Can evaluate p(a) because
  – the delta function imposes a linear constraint on w
  – since q(w) is Gaussian, its marginal is also Gaussian
• Evaluate its mean and variance:
  $\mu_a = \mathbb{E}[a] = \int p(a)\, a\, da = \int q(w)\, w^T \phi\, dw = w_{MAP}^T \phi$
  $\sigma_a^2 = \mathrm{var}[a] = \int p(a)\left\{a^2 - \mathbb{E}[a]^2\right\} da = \int q(w)\left\{(w^T \phi)^2 - (w_{MAP}^T \phi)^2\right\} dw = \phi^T S_N \phi$
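In code, these two moments are a pair of inner products; a minimal sketch, where phi, w_map, and SN are the quantities defined above represented as NumPy arrays.

```python
import numpy as np

def predictive_moments(phi, w_map, SN):
    """Mean and variance of a = w^T phi under q(w) = N(w | w_MAP, S_N)."""
    mu_a = float(w_map @ phi)        # mu_a      = w_MAP^T phi
    var_a = float(phi @ SN @ phi)    # sigma_a^2 = phi^T S_N phi
    return mu_a, var_a
```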
Variational Approximation to Predictive Distribution

• Predictive distribution is
  $p(C_1 | t) = \int \sigma(a)\, p(a)\, da = \int \sigma(a)\, N(a \mid \mu_a, \sigma_a^2)\, da$
• Convolution of a sigmoid with a Gaussian is intractable
• Use the probit instead of the logistic sigmoid
  [Figure: logistic sigmoid, values 0 to 1, plotted on the range -5 to 5]
Approximation using Probit

$p(C_1 | t) = \int \sigma(a)\, N(a \mid \mu_a, \sigma_a^2)\, da$
• Use the probit function, which is similar to the logistic sigmoid
  – Defined as $\Phi(a) = \int_{-\infty}^{a} N(\theta \mid 0, 1)\, d\theta$
• Approximate $\sigma(a)$ by $\Phi(\lambda a)$
• Find $\lambda$ such that the two functions have the same slope at the origin
  [Figure: logistic sigmoid and rescaled probit $\Phi(\lambda a)$ on the same axes, range -5 to 5]
• Requiring that the two functions have the same slope at the origin yields $\lambda^2 = \pi/8$

• Convolution of a probit with a Gaussian is another probit:
  $\int \Phi(\lambda a)\, N(a \mid \mu, \sigma^2)\, da = \Phi\!\left( \dfrac{\mu}{(\lambda^{-2} + \sigma^2)^{1/2}} \right)$
  – Thus $p(C_1 | \phi, t) = \int \sigma(a)\, N(a \mid \mu_a, \sigma_a^2)\, da \simeq \sigma\!\left(\kappa(\sigma_a^2)\, \mu_a\right)$,
    where $\kappa(\sigma^2) = (1 + \pi\sigma^2/8)^{-1/2}$
Probit Classification
Applying it to $p(C_1 | t) = \int \sigma(a)\, N(a \mid \mu_a, \sigma_a^2)\, da$, we have
  $p(C_1 | \phi, t) \simeq \sigma\!\left(\kappa(\sigma_a^2)\, \mu_a\right)$
  where $\mu_a = w_{MAP}^T \phi$ and $\sigma_a^2 = \phi^T S_N \phi$
• The decision boundary corresponding to $p(C_1 | \phi, t) = 0.5$ is given by $\mu_a = 0$
• This is the same solution as $w_{MAP}^T \phi = 0$
• Thus marginalization has no effect when minimizing the misclassification rate with equal prior probabilities
• For more complex decision criteria it plays an important role
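To see the effect, a small worked check with made-up numbers (reusing the predictive_moments and predictive_probability sketches above): the 0.5 contour is unchanged, but probabilities away from it are moderated toward 0.5 when the variance $\phi^T S_N \phi$ is large.

```python
import numpy as np
from scipy.special import expit

# Made-up MAP estimate and posterior covariance, for illustration only.
w_map = np.array([1.0, -0.5])
SN = np.array([[0.5, 0.1],
               [0.1, 0.8]])

for phi in [np.array([1.0, 2.0]),    # on the MAP decision boundary: w_MAP^T phi = 0
            np.array([2.0, 1.0]),    # off the boundary
            np.array([-2.0, 1.0])]:
    mu_a, var_a = predictive_moments(phi, w_map, SN)
    p_map = expit(mu_a)                             # plug-in MAP probability
    p_bayes = predictive_probability(mu_a, var_a)   # moderated Bayesian probability
    print(phi, round(float(p_map), 3), round(float(p_bayes), 3))
```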


Summary
• Logistic regression is a linear probabilistic discriminative model: $p(C_1 | x) = \sigma(w^T \phi)$
• Exact Bayesian logistic regression is intractable
• Using the Laplace approximation, the posterior parameter distribution p(w|t) can be approximated as a Gaussian q(w)
• The predictive distribution is a convolution of a sigmoid with a Gaussian: $p(C_1 | \phi) \simeq \int \sigma(w^T \phi)\, q(w)\, dw$
  – Approximating the sigmoid by a probit makes this convolution another probit, giving a closed-form result
