Homework 1 Solutions
Jason J. Corso
Computer Science and Engineering
SUNY at Buffalo
[email protected]
Date Assigned 24 Jan 2011
Date Due 14 Feb 2011
Homework must be submitted in class. No late work will be accepted.
Remember, you are permitted to discuss this assignment with other students (whether in the class or not), but you must write up your own work from scratch.
I am sure the answers to some or all of these questions can be found on the internet. Copying from any other source is indeed cheating.
This class has a zero tolerance policy toward cheaters and cheating. Don’t do it.
Suppose we replace the Bayes decision rule by a randomized decision rule, which classifies x to class i following the posterior probability, i.e., it selects class i with probability P(ω = i|x).
Solution:
Maximizing the posterior probability is equivalent to minimizing the overall risk.
Using the zero-one loss function, the overall risk for the Bayes Decision Rule is:
$$R_{\text{Bayes}} = \int R(\alpha_{\text{Bayes}}(x)\,|\,x)\, p(x)\, dx = \int \Big\{ 1 - \max_{j=1,\dots,k} P(\omega_j|x) \Big\}\, p(x)\, dx$$
For simplicity, the class with the maximum posterior probability is abbreviated as $\omega_{\max}$, and we get:
$$R_{\text{Bayes}} = \int \big(1 - P(\omega_{\max}|x)\big)\, p(x)\, dx.$$
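For instance, with an illustrative posterior vector (not one taken from the assignment): if $k = 3$ and the posteriors at some $x$ are $(0.5, 0.3, 0.2)$, the Bayes rule selects $\omega_1$ and the conditional risk at that $x$ is $1 - 0.5 = 0.5$.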
1. What is the overall risk Rrand for this decision rule? Derive it in terms of the posterior probability
using the zero-one loss function.
Solution:
For any given x, the probability that class j = 1, ..., k is the correct class is P(ωj|x). The randomized rule selects class j with probability P(ωj|x), and when class j is selected the zero-one loss is incurred with probability 1 − P(ωj|x). Thus the conditional risk, averaged over the randomness of the rule, becomes $\sum_j P(\omega_j|x)\big(1 - P(\omega_j|x)\big)$. Therefore,
$$R_{\text{rand}} = \int \Big\{ \sum_j P(\omega_j|x)\big(1 - P(\omega_j|x)\big) \Big\}\, p(x)\, dx
= \int \Big\{ \sum_j P(\omega_j|x) - P(\omega_j|x)^2 \Big\}\, p(x)\, dx
= \int \Big[\, 1 - \sum_j P(\omega_j|x)^2 \,\Big]\, p(x)\, dx.$$
2. Show that this risk Rrand is always no smaller than the Bayes risk RBayes . Thus, we cannot benefit
from the randomized decision.
Solution:
Proving $R_{\text{rand}} \ge R_{\text{Bayes}}$ is equivalent to proving $\sum_j P(\omega_j|x)^2 \le P(\omega_{\max}|x)$, which holds because
$$\sum_j P(\omega_j|x)^2 \le \sum_j P(\omega_j|x)\, P(\omega_{\max}|x) = P(\omega_{\max}|x).$$
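As a quick numerical sanity check of this inequality, one can compare the two conditional risks at a single x. This is only a sketch; the posterior vector below is made up for illustration.

```python
# Compare the Bayes and randomized conditional risks at one point x,
# using a made-up posterior vector (k = 3 classes).
posteriors = [0.5, 0.3, 0.2]

bayes_risk = 1 - max(posteriors)                  # 1 - P(w_max | x)
rand_risk = sum(p * (1 - p) for p in posteriors)  # sum_j P(w_j|x) (1 - P(w_j|x))

print(bayes_risk, rand_risk)  # 0.5 and 0.62: rand_risk >= bayes_risk
```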
Problem 2: Bayesian Classification Boundaries for the Normal Distribution (30%)
Suppose we have a two-class recognition problem with salmon (ω = 1) and sea bass (ω = 2).
1. First, assume we have one feature, and the pdfs are the Gaussians $N(0, \sigma^2)$ and $N(1, \sigma^2)$ for the two classes, respectively. Show that the threshold $\tau$ minimizing the average risk is equal to
$$\tau = \frac{1}{2} - \sigma^2 \ln \frac{\lambda_{12} P(\omega_2)}{\lambda_{21} P(\omega_1)} \qquad (1)$$
Define $R(\tau)$ as the average risk for the threshold $\tau$:
$$R(\tau) = \int_{-\infty}^{\tau} \lambda_{12} P(\omega_2)\, p(x|\omega = 2)\, dx + \int_{\tau}^{+\infty} \lambda_{21} P(\omega_1)\, p(x|\omega = 1)\, dx$$
Take the derivative of $R(\tau)$ with respect to $\tau$; the minimum is attained where the derivative vanishes, so set it to zero:
$$\lambda_{12} P(\omega_2) \cdot \frac{1}{\sqrt{2\pi}\,\sigma}\, e^{-\frac{(\tau-1)^2}{2\sigma^2}} - \lambda_{21} P(\omega_1) \cdot \frac{1}{\sqrt{2\pi}\,\sigma}\, e^{-\frac{\tau^2}{2\sigma^2}} = 0$$
Therefore,
$$\tau = \frac{1}{2} - \sigma^2 \ln \frac{\lambda_{12} P(\omega_2)}{\lambda_{21} P(\omega_1)}.$$
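A small numerical check of equation (1) is to minimize R(τ) on a grid and compare with the closed form. This is only a sketch; the loss, prior, and σ values below are chosen purely for illustration.

```python
# Grid-minimize the average risk R(tau) for the 1D two-Gaussian problem and
# compare against the closed-form threshold in equation (1).
# Parameter values (losses, priors, sigma) are illustrative, not from the assignment.
import numpy as np
from scipy.stats import norm

lam12, lam21 = 1.0, 2.0   # losses lambda_12, lambda_21
P1, P2 = 0.7, 0.3         # priors P(omega_1), P(omega_2)
sigma = 0.8

def risk(tau):
    # R(tau) = lam12 P2 * P(x < tau | omega=2) + lam21 P1 * P(x > tau | omega=1)
    return (lam12 * P2 * norm.cdf(tau, loc=1, scale=sigma)
            + lam21 * P1 * (1 - norm.cdf(tau, loc=0, scale=sigma)))

taus = np.linspace(-2, 3, 100001)
tau_grid = taus[np.argmin(risk(taus))]
tau_closed = 0.5 - sigma**2 * np.log((lam12 * P2) / (lam21 * P1))

print(tau_grid, tau_closed)  # the two values should agree closely
```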
2. Next, suppose we have two features x = (x1 , x2 ) and the two class-conditional densities, p(x|ω = 1)
and p(x|ω = 2), are 2D Gaussian distributions centered at points (4, 11) and (10, 3) respectively
with the same covariance matrix Σ = 3I (where I is the identity matrix). Suppose the priors are
P (ω = 1) = 0.6 and P (ω = 2) = 0.4.
(a) Suppose we use a Bayes decision rule; write the two discriminant functions g1(x) and g2(x).
Solution:
According to the Bayes decision rule:
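One standard way to write these (a sketch under the stated assumptions: shared spherical covariance $\Sigma = \sigma^2 I$ with $\sigma^2 = 3$, dropping the additive terms common to both classes) is
$$g_i(x) = -\frac{\|x - \mu_i\|^2}{2\sigma^2} + \ln P(\omega_i),$$
so that
$$g_1(x) = -\frac{(x_1 - 4)^2 + (x_2 - 11)^2}{6} + \ln 0.6, \qquad g_2(x) = -\frac{(x_1 - 10)^2 + (x_2 - 3)^2}{6} + \ln 0.4.$$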
When all the covariance matrices are the same, the decision boundary is a straight line in the two-dimensional case, and a plane or hyperplane in three or more dimensions. In particular, if the covariance matrices have the special diagonal form Σ = σ² · I, the decision surface is perpendicular to the line joining the two means. If the covariance matrices differ between the classes, then the quadratic term in the equation of the decision surface does not cancel, and the boundary is no longer a flat plane. Let us restrict ourselves to the simpler case in which all the covariance matrices are the same and analyze the influence of the class priors on the position of the decision boundary. The decision boundary can then be written as:
$$w^T (x - x_0) = 0$$
where
$$w = \Sigma^{-1}(\mu_i - \mu_j)$$
and
$$x_0 = \frac{1}{2}(\mu_i + \mu_j) - \frac{\ln\big[P(\omega_i)/P(\omega_j)\big]}{(\mu_i - \mu_j)^T \Sigma^{-1} (\mu_i - \mu_j)}\,(\mu_i - \mu_j).$$
In general, when the distance between the class means and the spread of the covariance are of comparable scale, increasing the prior of one class moves the decision boundary toward the other class. But if the variance is relatively small compared to the distance between the two means, i.e., the denominator in the equation above is relatively large, the class priors have relatively little influence on the position of the decision boundary, e.g., in the case of two well-separated Gaussians, each sharply peaked at its mean. On the other hand, if the variance is relatively large compared to the distance between the two means, the position of the decision boundary is determined mainly by the class priors (intuitively, when the two classes overlap heavily, the decision is based mainly on the prior knowledge we have).
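Plugging the numbers of this problem into the formula above (a worked check added for illustration, with i = 1, j = 2):
$$w = \Sigma^{-1}(\mu_1 - \mu_2) = \tfrac{1}{3}\begin{pmatrix}-6\\ 8\end{pmatrix} = \begin{pmatrix}-2\\ 8/3\end{pmatrix}, \qquad (\mu_1 - \mu_2)^T \Sigma^{-1} (\mu_1 - \mu_2) = \tfrac{36 + 64}{3} = \tfrac{100}{3},$$
$$x_0 = \begin{pmatrix}7\\ 7\end{pmatrix} - \frac{\ln(0.6/0.4)}{100/3}\begin{pmatrix}-6\\ 8\end{pmatrix} \approx \begin{pmatrix}7.07\\ 6.90\end{pmatrix}.$$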
(d) Using computer software, sample 100 points from each of the two densities. Draw them and
draw the boundary on the feature space (the 2D plane).
Solution:
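A minimal Python sketch of one way to produce such a figure (NumPy and Matplotlib assumed; the random seed, plotting range, and marker choices are arbitrary):

```python
# Sample 100 points per class from the two Gaussians of Problem 2 and draw
# the linear Bayes decision boundary w^T (x - x0) = 0 derived above.
import numpy as np
import matplotlib.pyplot as plt

rng = np.random.default_rng(0)

mu1, mu2 = np.array([4.0, 11.0]), np.array([10.0, 3.0])
Sigma = 3.0 * np.eye(2)           # shared covariance, Sigma = 3I
P1, P2 = 0.6, 0.4                 # class priors

# 100 samples from each class-conditional density
X1 = rng.multivariate_normal(mu1, Sigma, size=100)
X2 = rng.multivariate_normal(mu2, Sigma, size=100)

# Linear boundary parameters for the equal-covariance case
Sigma_inv = np.linalg.inv(Sigma)
w = Sigma_inv @ (mu1 - mu2)
x0 = 0.5 * (mu1 + mu2) - (np.log(P1 / P2) /
     ((mu1 - mu2) @ Sigma_inv @ (mu1 - mu2))) * (mu1 - mu2)

# Plot the samples and the boundary line
plt.scatter(X1[:, 0], X1[:, 1], marker='o', label='salmon (class 1)')
plt.scatter(X2[:, 0], X2[:, 1], marker='x', label='sea bass (class 2)')
xs = np.linspace(-2, 16, 200)
# Boundary: w[0]*(x - x0[0]) + w[1]*(y - x0[1]) = 0, solved for y
ys = x0[1] - (w[0] / w[1]) * (xs - x0[0])
plt.plot(xs, ys, 'k-', label='Bayes decision boundary')
plt.legend(); plt.xlabel('x1'); plt.ylabel('x2'); plt.show()
```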
1. Formulate the problem using the Bayes rule, i.e., what are the random variables and the input data? What is the meaning of the prior and posterior probabilities in this problem?
Solution:
Since who is going to be executed tomorrow was already decided before A asked the janitor - otherwise the janitor would not be able to tell A which one of A's fellow inmates will live - we have the following.
The random variable ranges over the janitor's possible answers; the input data (observation) is the janitor's actual answer; the prior probability is the chance of A being executed before observing the janitor's answer; and the posterior probability is the chance of A being executed after observing the janitor's answer.
2. What are the probability values for the prior?
Solution:
Let $E_X$, where $X \in \{A, B, C\}$, denote the event that X is going to be executed. The prior probabilities of being executed tomorrow are $P(E_A) = P(E_B) = P(E_C) = \frac{1}{3}$ (supposing the prisoner to be executed is chosen at random).