Bayesian Networks Slides

The document discusses Bayesian learning and Bayes' theorem. Specifically, it provides definitions and explanations of key Bayesian concepts like prior and posterior probabilities, maximum a posteriori hypotheses, and naive Bayes classifiers. It also provides an example application of Bayes' theorem to calculate the probability that a patient has cancer given a positive test result.



Bayesian Learning

Bayes Theorem

MAP, ML hypotheses

MAP learners

Bayes optimal classifier

Naive Bayes learner

Bayesian belief networks



Bayesian Learning - Advantages

Bayesian reasoning provides a probabilistic approach to inference.

It is based on the assumption that the quantities of interest are governed by probability distributions, and that optimal decisions can be made by reasoning about these probabilities together with observed data.

It is important to machine learning because it provides a quantitative approach to weighing the evidence supporting alternative hypotheses.

Bayesian reasoning provides the basis for learning algorithms that directly manipulate probabilities, as well as a framework for analyzing the operation of other algorithms that do not explicitly manipulate probabilities.


Bayesian Learning - Relevance

Bayesian learning algorithms that calculate explicit probabilities for hypotheses, such as the naive Bayes classifier, are among the most practical approaches to certain types of learning problems.

They provide a useful perspective for understanding many learning algorithms that do not explicitly manipulate probabilities.


Bayes Theorem

P(h|D) = P(D|h) P(h) / P(D)

P(h) = prior probability of hypothesis h: the initial probability that h holds, before we have observed the training data.

P(D) = prior probability of training data D: the probability that D will be observed, given no knowledge about which hypothesis holds.

P(D|h) = probability of D given h: the probability of observing data D in some world in which hypothesis h holds.

P(h|D) = posterior probability of h given D: it reflects our confidence that h holds after we have seen the training data D.


Observation

P(h|D) = P(D|h) P(h) / P(D)

P(h|D) increases with P(h) and with P(D|h), according to Bayes theorem.

P(h|D) decreases as P(D) increases: the more probable it is that D will be observed independently of h, the less evidence D provides in support of h.


Choosing Hypotheses

P(h|D) = P(D|h) P(h) / P(D)

In many learning scenarios, the learner considers some set of candidate hypotheses H and is interested in finding the most probable hypothesis h ∈ H given the observed data D (or at least one of the maximally probable, if there are several). Any such maximally probable hypothesis is called a maximum a posteriori (MAP) hypothesis. We can determine the MAP hypotheses by using Bayes theorem to calculate the posterior probability of each candidate hypothesis.

Maximum a posteriori hypothesis h_MAP:

h_MAP = argmax_{h ∈ H} P(h|D)
      = argmax_{h ∈ H} P(D|h) P(h) / P(D)
      = argmax_{h ∈ H} P(D|h) P(h)

If we assume P(h_i) = P(h_j) for all i, j, we can simplify further and choose the maximum likelihood (ML) hypothesis:

h_ML = argmax_{h_i ∈ H} P(D|h_i)

(Notation: argmax_{x ∈ X} f(x) is the value of x ∈ X that maximises f(x); for example, argmax_{x ∈ {-3, 1, 2}} x² = -3.)
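To make the argmax concrete, here is a minimal Python sketch (not from the slides); the hypothesis names and the prior/likelihood values are assumed purely for illustration.

```python
# Minimal sketch (not from the slides): choosing MAP and ML hypotheses
# from a toy hypothesis space with assumed priors P(h) and likelihoods P(D|h).

priors = {"h1": 0.5, "h2": 0.3, "h3": 0.2}          # P(h), assumed values
likelihoods = {"h1": 0.10, "h2": 0.40, "h3": 0.30}  # P(D|h), assumed values

# MAP hypothesis: argmax_h P(D|h) P(h)  (P(D) is constant over h, so it can be dropped)
h_map = max(priors, key=lambda h: likelihoods[h] * priors[h])

# ML hypothesis: argmax_h P(D|h)  (equivalent to MAP under a uniform prior)
h_ml = max(likelihoods, key=likelihoods.get)

print("MAP:", h_map)  # h2: 0.40 * 0.3 = 0.12 beats 0.05 and 0.06
print("ML: ", h_ml)   # h2
```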
Bayes Theorem
Does the patient have cancer or not?

A patient takes a lab test and the result comes back positive. The test returns a correct positive result in only 98% of the cases in which the disease is actually present, and a correct negative result in only 97% of the cases in which the disease is not present. Furthermore, .008 of the entire population have this cancer.

P(cancer) = .008          P(¬cancer) = .992
P(+|cancer) = .98         P(−|cancer) = .02
P(+|¬cancer) = .03        P(−|¬cancer) = .97

P(cancer|+) = P(+|cancer) P(cancer) / P(+) = (.98)(.008) / .0376 = .209
P(¬cancer|+) = P(+|¬cancer) P(¬cancer) / P(+) = (.03)(.992) / .0376 = .791

where P(+) = P(+|cancer) P(cancer) + P(+|¬cancer) P(¬cancer) = .0376

So even after a positive test, the MAP hypothesis is that the patient does not have cancer.
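The same calculation can be checked numerically; the following sketch simply plugs the slide's numbers into Bayes theorem (the variable names are my own).

```python
# Minimal sketch (not from the slides): Bayes theorem for the cancer test example.

p_cancer = 0.008                    # prior P(cancer)
p_not_cancer = 1 - p_cancer         # P(not cancer) = 0.992
p_pos_given_cancer = 0.98           # test sensitivity, P(+|cancer)
p_pos_given_not = 0.03              # false positive rate, P(+|not cancer)

# Total probability of a positive test result
p_pos = p_pos_given_cancer * p_cancer + p_pos_given_not * p_not_cancer

# Posteriors via Bayes theorem
p_cancer_given_pos = p_pos_given_cancer * p_cancer / p_pos
p_not_given_pos = p_pos_given_not * p_not_cancer / p_pos

print(f"P(+) = {p_pos:.4f}")                       # ~0.0376
print(f"P(cancer|+) = {p_cancer_given_pos:.3f}")   # ~0.209
print(f"P(not cancer|+) = {p_not_given_pos:.3f}")  # ~0.791
```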
Most Probable Classification of New Instances

So far we've sought the most probable hypothesis given the data D (i.e., h_MAP).

Given a new instance x, what is its most probable classification?

h_MAP(x) is not the most probable classification!

Consider three possible hypotheses:
P(h1|D) = .4, P(h2|D) = .3, P(h3|D) = .3
Given a new instance x,
h1(x) = +, h2(x) = −, h3(x) = −
What is the most probable classification of x?


Taking all hypotheses into account, the probability that x is positive is .4 (the probability associated with h1), and the probability that it is negative is therefore .6.

The most probable classification (negative) in this case is different from the classification generated by the MAP hypothesis.

In general, the most probable classification of the new instance is obtained by combining the predictions of all hypotheses, weighted by their posterior probabilities.

If the possible classification of the new example can take on any value v_j from some set V, then the probability P(v_j|D) that the correct classification for the new instance is v_j is just


Bayes Optimal Classifier

Bayes optimal classification:

argmax_{v_j ∈ V} Σ_{h_i ∈ H} P(v_j|h_i) P(h_i|D)

Example:

P(h1|D) = .4, P(−|h1) = 0, P(+|h1) = 1
P(h2|D) = .3, P(−|h2) = 1, P(+|h2) = 0
P(h3|D) = .3, P(−|h3) = 1, P(+|h3) = 0

therefore

Σ_{h_i ∈ H} P(+|h_i) P(h_i|D) = .4
Σ_{h_i ∈ H} P(−|h_i) P(h_i|D) = .6

and

argmax_{v_j ∈ V} Σ_{h_i ∈ H} P(v_j|h_i) P(h_i|D) = −
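A small Python sketch of the weighted vote above (the posteriors and prediction tables come from the example; the function name is my own).

```python
# Minimal sketch (not from the slides): Bayes optimal classification by
# weighting each hypothesis's prediction by its posterior P(h|D).

posteriors = {"h1": 0.4, "h2": 0.3, "h3": 0.3}   # P(h_i|D)
predictions = {                                   # P(v_j|h_i) for v_j in {+, -}
    "h1": {"+": 1.0, "-": 0.0},
    "h2": {"+": 0.0, "-": 1.0},
    "h3": {"+": 0.0, "-": 1.0},
}

def bayes_optimal(values, posteriors, predictions):
    # For each class value, sum P(v|h) * P(h|D) over all hypotheses,
    # then return the value with the largest weighted vote.
    score = {v: sum(predictions[h][v] * posteriors[h] for h in posteriors)
             for v in values}
    return max(score, key=score.get), score

label, score = bayes_optimal(["+", "-"], posteriors, predictions)
print(label, score)   # '-' with scores {'+': 0.4, '-': 0.6}
```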


Naive Bayes Classifier

Along with decision trees, neural networks, and nearest neighbour, this is one of the most practical learning methods.

When to use:
  A moderate or large training set is available.
  The attributes that describe instances are conditionally independent given the classification.
Successful applications:
  Diagnosis
  Classifying text documents


Naive Bayes Classifier

Assume a target function f : X → V, where each instance x is described by attributes ⟨a1, a2, ..., an⟩.

The most probable value of f(x) is:

v_MAP = argmax_{v_j ∈ V} P(v_j|a1, a2, ..., an)
      = argmax_{v_j ∈ V} P(a1, a2, ..., an|v_j) P(v_j) / P(a1, a2, ..., an)
      = argmax_{v_j ∈ V} P(a1, a2, ..., an|v_j) P(v_j)

Naive Bayes assumption:

P(a1, a2, ..., an|v_j) = Π_i P(a_i|v_j)

which gives the Naive Bayes classifier:

v_NB = argmax_{v_j ∈ V} P(v_j) Π_i P(a_i|v_j)


Naive Bayes Algorithm

Naive_Bayes_Learn(examples)
  For each target value v_j
    P̂(v_j) ← estimate P(v_j)
    For each attribute value a_i of each attribute a
      P̂(a_i|v_j) ← estimate P(a_i|v_j)

Classify_New_Instance(x)
  v_NB = argmax_{v_j ∈ V} P̂(v_j) Π_{a_i ∈ x} P̂(a_i|v_j)
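The algorithm above can be sketched in Python roughly as follows; this is a minimal illustration assuming discrete attributes and simple frequency estimates (all names are my own), not a production implementation.

```python
# Minimal sketch (not from the slides): a tiny Naive Bayes learner for
# discrete attributes, estimating P(v) and P(a_i|v) by relative frequencies.
from collections import Counter, defaultdict

def naive_bayes_learn(examples):
    # examples: list of (attribute_tuple, target_value)
    class_counts = Counter(v for _, v in examples)
    attr_counts = defaultdict(Counter)        # counts keyed by (position, class)
    for attrs, v in examples:
        for i, a in enumerate(attrs):
            attr_counts[(i, v)][a] += 1
    priors = {v: c / len(examples) for v, c in class_counts.items()}
    def cond_prob(i, a, v):                   # estimate of P(a_i = a | v)
        return attr_counts[(i, v)][a] / class_counts[v]
    return priors, cond_prob

def naive_bayes_classify(x, priors, cond_prob):
    # v_NB = argmax_v P(v) * prod_i P(a_i|v)
    def score(v):
        p = priors[v]
        for i, a in enumerate(x):
            p *= cond_prob(i, a, v)
        return p
    return max(priors, key=score)
```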


Naive Bayes: Example
Training Dataset

Age        Income   Student  Credit_rating  Buys_computer?
<=30       high     no       fair           no
<=30       high     no       excellent      no
31...40    high     no       fair           yes
>40        medium   no       fair           yes
>40        low      yes      fair           yes
>40        low      yes      excellent      no
31...40    low      yes      excellent      yes
<=30       medium   no       fair           no
<=30       low      yes      fair           yes
>40        medium   yes      fair           yes
<=30       medium   yes      excellent      yes
31...40    medium   no       excellent      yes
31...40    high     yes      fair           yes
>40        medium   no       excellent      no

Data sample:
X = (Age <= 30, Income = medium, Student = yes, Credit_rating = fair)
Classes:
C1: Buys_computer = yes
C2: Buys_computer = no
Naive Bayes: Example

Compute P(X|Ci) for each class, where X = (Age <= 30, Income = medium, Student = yes, Credit_rating = fair):

P(Age = '<=30' | Buys_computer = 'yes') = 2/9 = 0.222
P(Age = '<=30' | Buys_computer = 'no') = 3/5 = 0.6
P(Income = 'medium' | Buys_computer = 'yes') = 4/9 = 0.444
P(Income = 'medium' | Buys_computer = 'no') = 2/5 = 0.4
P(Student = 'yes' | Buys_computer = 'yes') = 6/9 = 0.667
P(Student = 'yes' | Buys_computer = 'no') = 1/5 = 0.2
P(Credit_rating = 'fair' | Buys_computer = 'yes') = 6/9 = 0.667
P(Credit_rating = 'fair' | Buys_computer = 'no') = 2/5 = 0.4

P(X|Ci):
P(X | Buys_computer = 'yes') = 0.222 × 0.444 × 0.667 × 0.667 = 0.044
P(X | Buys_computer = 'no') = 0.6 × 0.4 × 0.2 × 0.4 = 0.019

P(X|Ci) P(Ci):
P(X | Buys_computer = 'yes') P(Buys_computer = 'yes') = 0.044 × 9/14 = 0.028
P(X | Buys_computer = 'no') P(Buys_computer = 'no') = 0.019 × 5/14 = 0.007

X belongs to class Buys_computer = 'yes'.
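As a quick arithmetic check of the numbers above, a sketch using exact fractions (counts taken from the training table):

```python
# Minimal sketch (not from the slides): verifying the example's products.
from fractions import Fraction as F

p_yes, p_no = F(9, 14), F(5, 14)                  # class priors from the 14-row table
px_yes = F(2, 9) * F(4, 9) * F(6, 9) * F(6, 9)    # Age, Income, Student, Credit | yes
px_no  = F(3, 5) * F(2, 5) * F(1, 5) * F(2, 5)    # same attributes | no

print(float(px_yes), float(px_no))                 # ~0.044, ~0.019
print(float(px_yes * p_yes), float(px_no * p_no))  # ~0.028, ~0.007 -> predict 'yes'
```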


Naive Bayes: Subtleties

1. The conditional independence assumption is often violated:

P(a1, a2, ..., an|v_j) = Π_i P(a_i|v_j)

...but it works surprisingly well anyway. Note that we don't need the estimated posteriors P̂(v_j|x) to be correct; we need only that

argmax_{v_j ∈ V} P̂(v_j) Π_i P̂(a_i|v_j) = argmax_{v_j ∈ V} P(v_j) P(a1, ..., an|v_j)

See [Domingos & Pazzani, 1996] for analysis.

Naive Bayes posteriors are often unrealistically close to 1 or 0.


Naive Bayes: Subtleties

2. What if none of the training instances with target value v_j have attribute value a_i? Then P̂(a_i|v_j) = 0, and...

P̂(v_j) Π_i P̂(a_i|v_j) = 0

The typical solution is a Bayesian estimate for P̂(a_i|v_j):

P̂(a_i|v_j) ← (n_c + m p) / (n + m)

where
  n is the number of training examples for which v = v_j,
  n_c is the number of examples for which v = v_j and a = a_i,
  p is a prior estimate for P̂(a_i|v_j),
  m is the weight given to the prior (i.e., the number of "virtual" examples).
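A short Python version of this estimate (the example numbers at the end are assumed, not from the slides):

```python
# Minimal sketch (not from the slides): the m-estimate for P(a_i|v_j).
def m_estimate(n_c, n, p, m):
    """Smoothed estimate (n_c + m*p) / (n + m).

    n_c: count of examples with class v_j and attribute value a_i
    n:   count of examples with class v_j
    p:   prior estimate for P(a_i|v_j), e.g. 1/k for k attribute values
    m:   equivalent sample size (weight given to the prior)
    """
    return (n_c + m * p) / (n + m)

# Assumed example: no matching examples (n_c = 0) among n = 5 class examples,
# 2 possible attribute values so p = 1/2, and m = 3 virtual examples.
print(m_estimate(0, 5, 0.5, 3))  # 0.1875 instead of a hard zero
```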


Learning to Classify Text

Why?
  Learn which news articles are of interest.
  Learn to classify web pages by topic.

Naive Bayes is among the most effective algorithms.

What attributes shall we use to represent text documents?


Learning to Classify Text

Target concept Interesting?: Document → {+, −}

1. Represent each document by a vector of words: one attribute per word position in the document.
2. Learning: use training examples to estimate
   P(+), P(−), P(doc|+), P(doc|−)

Naive Bayes conditional independence assumption:

P(doc|v_j) = Π_{i=1}^{length(doc)} P(a_i = w_k|v_j)

where P(a_i = w_k|v_j) is the probability that the word in position i is w_k, given v_j.

One more assumption: P(a_i = w_k|v_j) = P(a_m = w_k|v_j) for all i, m.


Learn_naive_Bayes_text(Examples, V)
1. Collect all words and other tokens that occur in Examples:
   Vocabulary ← all distinct words and other tokens in Examples
2. Calculate the required P(v_j) and P(w_k|v_j) probability terms:
   For each target value v_j in V do
     docs_j ← subset of Examples for which the target value is v_j
     P(v_j) ← |docs_j| / |Examples|
     Text_j ← a single document created by concatenating all members of docs_j
     n ← total number of words in Text_j (counting duplicate words multiple times)
     for each word w_k in Vocabulary
       n_k ← number of times word w_k occurs in Text_j
       P(w_k|v_j) ← (n_k + 1) / (n + |Vocabulary|)


Classify_naive_Bayes_text(Doc)
  positions ← all word positions in Doc that contain tokens found in Vocabulary
  Return v_NB, where
    v_NB = argmax_{v_j ∈ V} P(v_j) Π_{i ∈ positions} P(a_i|v_j)
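A minimal Python sketch of the two procedures above; it follows the pseudocode but sums log-probabilities in the classifier to avoid numerical underflow (a deviation from the slides), and all function names are my own.

```python
# Minimal sketch (not from the slides) of Learn_naive_Bayes_text /
# Classify_naive_Bayes_text with the (n_k + 1) / (n + |Vocabulary|) estimate.
import math
from collections import Counter

def learn_naive_bayes_text(examples):
    # examples: list of (list_of_tokens, target_value)
    vocabulary = {w for doc, _ in examples for w in doc}
    classes = {v for _, v in examples}
    priors, word_probs = {}, {}
    for v in classes:
        docs_v = [doc for doc, label in examples if label == v]
        priors[v] = len(docs_v) / len(examples)
        text_v = Counter(w for doc in docs_v for w in doc)   # concatenated class text
        n = sum(text_v.values())                             # total words in Text_j
        word_probs[v] = {w: (text_v[w] + 1) / (n + len(vocabulary))
                         for w in vocabulary}
    return vocabulary, priors, word_probs

def classify_naive_bayes_text(doc, vocabulary, priors, word_probs):
    # Sum log-probabilities instead of multiplying, for numerical stability.
    def log_score(v):
        return math.log(priors[v]) + sum(
            math.log(word_probs[v][w]) for w in doc if w in vocabulary)
    return max(priors, key=log_score)
```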


Bayesian Belief Networks

Interesting because:
  The Naive Bayes assumption of conditional independence is too restrictive.
  But it is intractable without some such assumptions...
  Bayesian belief networks describe conditional independence among subsets of variables.
  This allows combining prior knowledge about (in)dependencies among variables with observed training data.

(also called Bayes Nets)


Conditional Independence

Definition: X is conditionally independent of Y given Z if the probability distribution governing X is independent of the value of Y given the value of Z; that is, if

(∀ x_i, y_j, z_k)  P(X = x_i | Y = y_j, Z = z_k) = P(X = x_i | Z = z_k)

More compactly, we write

P(X|Y, Z) = P(X|Z)

Example: Thunder is conditionally independent of Rain, given Lightning:

P(Thunder|Rain, Lightning) = P(Thunder|Lightning)

Naive Bayes uses conditional independence to justify

P(X, Y|Z) = P(X|Y, Z) P(Y|Z) = P(X|Z) P(Y|Z)
Bayesian Belief Network

Network represents a set of conditional independence assertions:


Each node is asserted to be conditionally independent of its nondescendants,
given its immediate predecessors.
Directed acyclic graph



Bayesian Belief Network

Represents the joint probability distribution over all variables, e.g., P(Storm, BusTourGroup, ..., ForestFire).

In general,

P(y1, ..., yn) = Π_{i=1}^{n} P(y_i | Parents(Y_i))

where Parents(Y_i) denotes the immediate predecessors of Y_i in the graph.

So the joint distribution is fully defined by the graph, plus the P(y_i | Parents(Y_i)).
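To illustrate the factored joint, here is a minimal sketch; the two-node structure and the CPT values are assumed purely for illustration (only the variable names Storm and Campfire appear in later slides, and their actual network and probabilities are not given here).

```python
# Minimal sketch (not from the slides): evaluating the factored joint
# P(y1,...,yn) = prod_i P(y_i | Parents(Y_i)) for a toy network with
# an assumed structure (Storm -> Campfire) and assumed CPT values.

# Each CPT maps a tuple of parent values to P(variable = True | parents).
network = {
    "Storm":    {"parents": [],        "cpt": {(): 0.2}},                       # assumed
    "Campfire": {"parents": ["Storm"], "cpt": {(True,): 0.1, (False,): 0.4}},   # assumed
}

def joint_probability(assignment, network):
    """P(assignment) as a product of P(y_i | Parents(Y_i)) terms."""
    p = 1.0
    for var, node in network.items():
        parent_vals = tuple(assignment[par] for par in node["parents"])
        p_true = node["cpt"][parent_vals]
        p *= p_true if assignment[var] else (1 - p_true)
    return p

print(joint_probability({"Storm": True, "Campfire": False}, network))  # 0.2 * 0.9 = 0.18
```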


Example

[Network figure: Smoking (S) and Pneumonia (Pn) are parents of Cough (C); the CPT gives P(Pn) = .1, P(S) = .2, and P(C | Pn, S) as used below.]

What is P(cough | smoking ∧ pneumonia)?

From the table, P(C | S ∧ Pn) = .95.

Example

What is P(smoking | cough)?

P(S|C) = P(C|S) P(S) / P(C)

P(C|S) P(S) = [P(C|S ∧ Pn) P(Pn) + P(C|S ∧ ¬Pn) P(¬Pn)] P(S)
            = [(.95)(.1) + (.6)(.9)](.2) = .127

P(C) = P(C|Pn ∧ S) P(Pn) P(S) + P(C|Pn ∧ ¬S) P(Pn) P(¬S)
     + P(C|¬Pn ∧ S) P(¬Pn) P(S) + P(C|¬Pn ∧ ¬S) P(¬Pn) P(¬S)
     = (.95)(.1)(.2) + (.8)(.1)(.8) + (.6)(.9)(.2) + (.05)(.9)(.8) = .227

P(S|C) = .127 / .227 = .56
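The same answer can be reproduced by brute-force enumeration over the unobserved variable; this sketch reuses the CPT values from the example (the code layout and names are my own).

```python
# Minimal sketch (not from the slides): computing P(smoking | cough) by
# enumeration, using the CPT values from the example above.
p_pn, p_s = 0.1, 0.2                      # priors P(Pneumonia), P(Smoking)
p_c_given = {                             # P(Cough | Pneumonia, Smoking)
    (True, True): 0.95, (True, False): 0.8,
    (False, True): 0.6, (False, False): 0.05,
}

def p(b, prob):                           # P(var = b) for a Boolean with P(True) = prob
    return prob if b else 1 - prob

# P(C) and P(C, S) by summing the joint over the unobserved variable(s)
p_c = sum(p_c_given[(pn, s)] * p(pn, p_pn) * p(s, p_s)
          for pn in (True, False) for s in (True, False))
p_c_and_s = sum(p_c_given[(pn, True)] * p(pn, p_pn) * p_s for pn in (True, False))

print(p_c)              # 0.227
print(p_c_and_s / p_c)  # P(S|C) ~ 0.56
```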
Yet Another Example

[Network figure: Cloudy (C) is a parent of Sprinkler (S) and Rain (R); Sprinkler and Rain are parents of WetGrass (W).]

What is P(C, R, ¬S, W)?

P(C, R, ¬S, W) = P(C) P(R|C) P(¬S|C) P(W|R, ¬S) = (.5)(.8)(.9)(.9) = .324
Suppose you observe that it is cloudy and raining. What is the probability that the grass is wet?

Since wet grass is conditionally independent of cloudy given rain and sprinkler, we have

P(W|C, R) = P(W|R, S) P(S|C) + P(W|R, ¬S) P(¬S|C)
          = (.99)(.1) + (.9)(.9) = .909
Suppose you observe the sprinkler to be on and the grass is wet. What is the probability that it is raining?

Suppose you observe that the grass is wet and it is raining. What is the probability that it is cloudy?


Inference in Bayesian Networks

How can one infer the (probabilities of) values of one or more network variables, given observed values of others?
  The Bayes net contains all information needed for this inference.
  If only one variable has an unknown value, it is easy to infer it.
  In the general case, the problem is NP-hard.
In practice, inference can succeed in many cases:
  Exact inference methods work well for some network structures.
  Monte Carlo methods simulate the network randomly to calculate approximate solutions (see the sketch below).
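As an illustration of the Monte Carlo idea, here is a minimal rejection-sampling sketch for the smoking/cough network from the earlier example; the sample count and function names are my own assumptions.

```python
# Minimal sketch (not from the slides): approximate inference by rejection
# sampling, estimating P(smoking | cough) for the earlier example network.
import random

def sample_once():
    pn = random.random() < 0.1                 # P(Pneumonia) = .1
    s = random.random() < 0.2                  # P(Smoking) = .2
    p_c = {(True, True): 0.95, (True, False): 0.8,
           (False, True): 0.6, (False, False): 0.05}[(pn, s)]
    c = random.random() < p_c                  # P(Cough | Pn, S)
    return pn, s, c

def estimate_p_smoking_given_cough(n_samples=200_000):
    kept = smoking = 0
    for _ in range(n_samples):
        _, s, c = sample_once()
        if c:                                  # keep only samples consistent with the evidence
            kept += 1
            smoking += s
    return smoking / kept

print(estimate_p_smoking_given_cough())        # ~0.56 (exact answer .127/.227)
```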


Learning of Bayesian Networks

Several variants of this learning task:
  The network structure might be known or unknown.
  Training examples might provide values of all network variables, or just some.
If the structure is known and we observe all variables:
  Then it is as easy as training a Naive Bayes classifier.


Learning Bayes Nets

Suppose the structure is known and the variables are partially observable, e.g., we observe ForestFire, Storm, BusTourGroup, Thunder, but not Lightning, Campfire...
  This is similar to training a neural network with hidden units.
  In fact, we can learn the network conditional probability tables using gradient ascent!
  We converge to a network h that (locally) maximizes P(D|h).


Summary: Bayesian Belief Networks

Combine prior knowledge with observed data.
The impact of prior knowledge (when correct!) is to lower the sample complexity.
Active research area:
  Extend from Boolean to real-valued variables
  Parameterized distributions instead of tables
  Extend to first-order instead of propositional systems
  More effective inference methods
  ...