PR Mod1
• Bayesian Decision Theory is a fundamental statistical approach to the problem of pattern classification.
• It is considered the ideal pattern classifier and is often used as a benchmark for other algorithms,
because its decision rule minimizes the expected loss (risk).
• It involves making decisions based on probabilities and the cost of decisions.
• The core idea is to use the probability of different outcomes to make optimal decisions.
• The entire purpose of the Bayes Decision Theory is to help us select decisions that will cost us the least
‘risk’. There is always some sort of risk attached to any decision we choose.
Example
Basic Decision:
According to the previous customer records, the probability of a customer buying, P(w1), and the
probability of a customer not buying, P(w2), are calculated.
If P(w1) > P(w2), then the customer will buy a computer (w1)
And, if P(w2) > P(w1), then the customer will not buy a computer (w2)
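A minimal sketch of this prior-only rule in Python, assuming hypothetical counts from the previous
customer records (the numbers are made up for illustration):

```python
# Prior-only Bayes decision: compare P(w1) and P(w2) estimated
# from hypothetical purchase records (counts are made up).
bought, not_bought = 60, 40
total = bought + not_bought

p_w1 = bought / total        # P(w1): prior probability of buying
p_w2 = not_bought / total    # P(w2): prior probability of not buying

# Because only the priors are compared, every future customer
# receives the same decision.
decision = "w1 (buys)" if p_w1 > p_w2 else "w2 (does not buy)"
print(decision)              # -> w1 (buys)
```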
However, based on just the previous records, this rule gives the same decision for every future customer,
which is clearly inadequate. We need something that helps us make better decisions for individual future
customers, and we do that by introducing features.
Let’s say we add a feature ‘x’ where ‘x’ denotes the age of the customer. Now with this added feature, we will be
able to make better decisions. To do this, we need to know what Bayes Theorem is.
Bayes Theorem:

P(wi | x) = [P(x | wi) · P(wi)] / P(x)
• Prior – P(w1) is the Prior Probability that w1 is true before the data is observed
• Posterior – P(w1 | x) is the Posterior Probability that w1 is true after the data is observed.
• Evidence – P(x) is the Total Probability of the Data: P(x) = P(x | w1) P(w1) + P(x | w2) P(w2)
• Likelihood – P(x | w1) is the information about w1 provided by ‘x’
Thus, for a new customer, if P(w1 | x) > P(w2 | x), then the customer will buy a computer (w1); and if
P(w2 | x) > P(w1 | x), then the customer will not buy a computer (w2).
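As a sketch, the posterior rule with the age feature x, assuming made-up priors and Gaussian
class-conditional densities P(x | w1) and P(x | w2) (none of these numbers come from the example's
records):

```python
from math import exp, pi, sqrt

def gaussian(x, mean, std):
    # Normal density, standing in for an assumed class-conditional P(x | w).
    return exp(-0.5 * ((x - mean) / std) ** 2) / (std * sqrt(2 * pi))

p_w1, p_w2 = 0.6, 0.4                        # assumed priors P(w1), P(w2)

def decide(age):
    lik_w1 = gaussian(age, mean=35, std=8)   # assumed likelihood P(x | w1)
    lik_w2 = gaussian(age, mean=55, std=10)  # assumed likelihood P(x | w2)
    evidence = lik_w1 * p_w1 + lik_w2 * p_w2 # P(x), the total probability
    post_w1 = lik_w1 * p_w1 / evidence       # P(w1 | x) by Bayes Theorem
    post_w2 = lik_w2 * p_w2 / evidence       # P(w2 | x)
    return "w1 (buys)" if post_w1 > post_w2 else "w2 (does not buy)"

print(decide(30))   # young customer -> w1 (buys)
print(decide(60))   # older customer -> w2 (does not buy)
```

Unlike the prior-only rule, the decision now changes with the observed feature x.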
FACTS:
• If P(w1) = P(w2) , decision will be based on P(x | w1) and P(x | w2).
• If P(x | w1) = P(x | w2) , then decision will be based on P(w1) and P(w2).
Risk calculation
There is always going to be some amount of ‘risk’ or error made in the decision. So, we also need to determine the
probability of error made in a decision.
But, as the graph of the two class densities shows, there is some non-zero density of w2 to the left of the
decision boundary, and some non-zero density of w1 to the right of it. This overlap of one class's density
into the other class's region is what constitutes the risk, or probability of error.
To calculate the probability of error for class w1, we need to find the probability that the class is w2 in the area that
is to the left of the decision boundary. Similarly, the probability of error for class w2 is the probability that the class is
w1 in the area that is to the right of the decision boundary.
Mathematically, if we decide in favour of w2, the probability of error is P(w1 | x), and vice versa.
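Written out as the standard two-class identity, consistent with the definitions above:

```latex
P(\text{error} \mid x) =
\begin{cases}
P(\omega_1 \mid x) & \text{if we decide } \omega_2, \\
P(\omega_2 \mid x) & \text{if we decide } \omega_1,
\end{cases}
\qquad
P(\text{error}) = \int \min\big[P(\omega_1 \mid x),\, P(\omega_2 \mid x)\big]\, p(x)\, dx
```

The Bayes decision rule picks the larger posterior at every x, so it makes P(error | x), and hence the
total error, as small as possible.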
Loss function
There can also be more actions than just Yes or No for a particular class, as in the example. Let there be
'a' actions, denoted by α:

{α1, α2, α3, …, αa}
λ(αi | wj) = loss incurred for taking action αi when the true state of nature is wj …… (1)

Risk function:

R(αi | x) = Σ(j = 1 to c) λ(αi | wj) · P(wj | x)
→ Here λ11 and λ22 are the losses incurred for taking correct decisions, and λ21, λ12 are the losses
incurred for wrong decisions.
→ So λ21 and λ12 must be greater than λ11 and λ22 (ideally λ11 and λ22 should be zero).
→ So (λ21 − λ11) > 0, and likewise (λ12 − λ22) > 0. Hence, in the two-category case, decide w1 if
(λ21 − λ11) P(w1 | x) > (λ12 − λ22) P(w2 | x), and w2 otherwise.
→Loss functions are predefined based on the application.
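A minimal sketch of choosing the minimum-risk action, assuming an illustrative loss matrix λ(αi | wj)
with zero diagonal and already-computed posteriors (all numbers made up):

```python
# lam[i][j] = lambda(alpha_i | w_j): loss for taking action alpha_i
# when the true state of nature is w_j. Correct decisions cost zero.
lam = [[0.0, 2.0],   # action a1: lambda11, lambda12
       [5.0, 0.0]]   # action a2: lambda21, lambda22

posteriors = [0.7, 0.3]   # assumed P(w1 | x) and P(w2 | x)

# Conditional risk R(alpha_i | x) = sum over j of lam[i][j] * P(w_j | x)
risks = [sum(l * p for l, p in zip(row, posteriors)) for row in lam]

best = min(range(len(risks)), key=lambda i: risks[i])
print(f"R(a1|x)={risks[0]:.2f}, R(a2|x)={risks[1]:.2f} -> take action a{best + 1}")
# -> R(a1|x)=0.60, R(a2|x)=3.50 -> take action a1
```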
For the zero-one loss, R(αi | x) reduces to 1 − P(wi | x) (derived below), so to minimize R(αi | x),
P(wi | x) has to be maximized. This proves that whenever P(wi | x) is maximum, the risk is minimum, and
the action αi taken agrees with simple Bayes decision theory.
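The derivation, using the zero-one (symmetrical) loss:

```latex
\lambda(\alpha_i \mid \omega_j) =
\begin{cases} 0 & i = j \\ 1 & i \neq j \end{cases}
\quad\Longrightarrow\quad
R(\alpha_i \mid x) = \sum_{j \neq i} P(\omega_j \mid x) = 1 - P(\omega_i \mid x)
```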
Discriminant Functions
The classifier can be considered a box that computes c functional models, giving c functions gi(x), one
for each class/state of nature. Applying the max criterion to the results of these functions, the decision
about the true state of nature is taken: assign x to wi if gi(x) > gj(x) for all j ≠ i.
Thus, according to the Minimum Risk classifier, the risk R(αi | x) has to be minimum to take action αi,
so we can take gi(x) = −R(αi | x).
According to the Minimum Error Rate classifier, 1 − P(wi | x) has to be minimum, which means P(wi | x)
should be maximum, so we can take gi(x) = P(wi | x).
Since any monotonically increasing function of gi(x) gives the same decision, gi(x) = p(x | wi) P(wi) and
gi(x) = ln p(x | wi) + ln P(wi) are equivalent choices.
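A minimal sketch of such a discriminant-function classifier, using gi(x) = ln p(x | wi) + ln P(wi) with
assumed one-dimensional Gaussian class-conditionals (all parameters are made up):

```python
from math import log, pi

def log_gaussian(x, mean, var):
    # ln N(x; mean, var), standing in for an assumed ln p(x | w_i).
    return -0.5 * (log(2 * pi * var) + (x - mean) ** 2 / var)

# Assumed (mean, variance, prior) for c = 3 classes/states of nature.
classes = [(0.0, 1.0, 0.5), (3.0, 1.5, 0.3), (6.0, 2.0, 0.2)]

def classify(x):
    # Compute g_i(x) = ln p(x | w_i) + ln P(w_i) for each class,
    # then apply the max criterion to decide the state of nature.
    g = [log_gaussian(x, m, v) + log(p) for m, v, p in classes]
    return max(range(len(g)), key=lambda i: g[i]) + 1

print(classify(0.5))   # -> 1
print(classify(5.0))   # -> 3
```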