
6. Bayesian Learning

Introduction
– Bayesian learning algorithms calculate explicit probabilities for hypotheses
– They provide a practical approach to certain learning problems
– They provide a useful perspective for understanding learning algorithms

Real-life applications
• Text-based classification, such as spam or junk-mail filtering, author identification, or topic categorization
• Medical diagnosis, such as estimating the probability that a new patient has a disease given a set of observed symptoms
• Network security, such as detecting illegal intrusions or anomalies in computer networks
Drawbacks:
– Typically requires initial knowledge of many probabilities
– In some cases, significant computational cost is required to determine the Bayes optimal hypothesis (linear in the number of candidate hypotheses)
Bayes Theorem
Best hypothesis ≡ most probable hypothesis
Notation
P(h): prior probability of hypothesis h
P(D): prior probability that training data D will be observed
P(D|h): probability of observing D given that h holds
P(h|D): posterior probability of h given D

• Bayes Theorem
  P(h|D) = P(D|h) P(h) / P(D)

• Maximum a posteriori (MAP) hypothesis
  h_MAP ≡ argmax_{h∈H} P(h|D)
        = argmax_{h∈H} P(D|h) P(h)

• Maximum likelihood (ML) hypothesis
  h_ML = argmax_{h∈H} P(D|h)
       = h_MAP if we assume P(h) = constant
• Example
  P(cancer) = 0.008            P(¬cancer) = 0.992
  P(+|cancer) = 0.98           P(-|cancer) = 0.02
  P(+|¬cancer) = 0.03          P(-|¬cancer) = 0.97

For a new patient the lab test returns a positive result. Should we diagnose cancer or not?
  P(+|cancer) P(cancer) = 0.0078        P(+|¬cancer) P(¬cancer) = 0.0298
⇒ h_MAP = ¬cancer
6.3 Bayes Theorem and Concept Learning
What is the relationship between Bayes theorem and concept learning?

– Brute-Force Bayes Concept Learning
  1. For each hypothesis h ∈ H, calculate P(h|D)
  2. Output h_MAP ≡ argmax_{h∈H} P(h|D)



– We must choose P(h) and P(D|h) from prior knowledge
Let's assume:
  1. The training data D is noise free
  2. The target concept c is contained in H
  3. We consider all hypotheses a priori equally probable
⇒ P(h) = 1/|H|   ∀ h ∈ H


Since the data is assumed noise free:
  P(D|h) = 1 if d_i = h(x_i) ∀ d_i ∈ D
  P(D|h) = 0 otherwise

Brute-force MAP learning
– If h is inconsistent with D:
  P(h|D) = P(D|h)·P(h)/P(D) = 0·P(h)/P(D) = 0

– If h is consistent with D:
  P(h|D) = 1·(1/|H|) / (|VS_H,D| / |H|) = 1/|VS_H,D|
  where VS_H,D is the version space: the subset of hypotheses in H that are consistent with D
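
A minimal sketch of this brute-force procedure in Python, assuming (hypothetically) that each hypothesis is a callable and the data is a list of (x, d) pairs:

def brute_force_map(hypotheses, data):
    # Noise-free data and uniform prior P(h) = 1/|H|;
    # assumes the target concept is in H, so at least one hypothesis is consistent.
    prior = 1.0 / len(hypotheses)
    unnormalized = []
    for h in hypotheses:
        likelihood = 1.0 if all(h(x) == d for x, d in data) else 0.0   # P(D|h)
        unnormalized.append(likelihood * prior)                        # P(D|h) P(h)
    p_data = sum(unnormalized)                         # P(D) = |VS_H,D| / |H|
    posteriors = [u / p_data for u in unnormalized]    # every consistent h gets 1/|VS_H,D|
    best = max(range(len(hypotheses)), key=lambda i: posteriors[i])
    return hypotheses[best], posteriors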

⇒ P(h|D) = 1/|VS_H,D| if h is consistent with D
   P(h|D) = 0 otherwise

⇒ Every consistent hypothesis is a MAP hypothesis

Consistent Learners
– Learning algorithms that output hypotheses committing zero errors over the training examples (consistent hypotheses)


Under the assumed conditions, Find-S is a consistent learner.

The Bayesian framework makes it possible to characterize the behavior of learning algorithms by identifying the P(h) and P(D|h) under which they output optimal (MAP) hypotheses.


6.4 Maximum Likelihood and LSE Hypotheses

Learning a continuous-valued target function (regression or curve fitting)

H = class of real-valued functions defined over X
  h: X → ℝ learns f: X → ℝ
  (x_i, d_i) ∈ D,   d_i = f(x_i) + ε_i,   i = 1,...,m
  f: noise-free target function     ε: white noise, ε ~ N(0, σ)


Under these assumptions, any learning algorithm that minimizes the squared error between the hypothesis predictions and the training data outputs an ML hypothesis:

  h_ML = argmax_{h∈H} p(D|h)
       = argmax_{h∈H} ∏_{i=1..m} p(d_i|h)
       = argmax_{h∈H} ∏_{i=1..m} exp{ -[d_i - h(x_i)]² / 2σ² }
       = argmin_{h∈H} ∑_{i=1..m} [d_i - h(x_i)]²  =  h_LSE
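
A small numerical illustration of this equivalence, under the (hypothetical) assumption that H is the class of straight lines h(x) = w0 + w1·x; the least-squares fit is then the ML hypothesis:

# Least-squares fit as ML hypothesis under Gaussian noise (sketch).
import numpy as np

rng = np.random.default_rng(0)
x = np.linspace(0.0, 1.0, 20)
d = 2.0 + 3.0 * x + rng.normal(0.0, 0.1, size=x.shape)    # d_i = f(x_i) + e_i

A = np.column_stack([np.ones_like(x), x])                 # design matrix for w0 + w1*x
w, *_ = np.linalg.lstsq(A, d, rcond=None)                 # minimizes sum_i [d_i - h(x_i)]^2
print(w)                                                  # roughly [2.0, 3.0]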


6.5 ML Hypotheses for Predicting Probabilities

– We wish to learn a nondeterministic function
  f: X → {0,1}
  that is, the probabilities that f(x) = 1 and f(x) = 0

– Training data D = {(x_i, d_i)}

– We assume that any particular instance x_i is independent of the hypothesis h


Then
  P(D|h) = ∏_{i=1..m} P(x_i, d_i|h) = ∏_{i=1..m} P(d_i|h, x_i) P(x_i)

  P(d_i|h, x_i) = h(x_i)       if d_i = 1
  P(d_i|h, x_i) = 1 - h(x_i)   if d_i = 0

⇒ P(d_i|h, x_i) = h(x_i)^{d_i} [1 - h(x_i)]^{1-d_i}


h_ML = argmax_{h∈H} ∏_{i=1..m} h(x_i)^{d_i} [1 - h(x_i)]^{1-d_i}
     = argmax_{h∈H} ∑_{i=1..m} d_i log h(x_i) + (1 - d_i) log[1 - h(x_i)]
     = argmin_{h∈H} [Cross Entropy]

Cross Entropy ≡ -∑_{i=1..m} d_i log h(x_i) + (1 - d_i) log[1 - h(x_i)]
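
A direct sketch of the quantity being minimized; h_of_x stands for the hypothesis' predicted probabilities h(x_i), d for the observed labels d_i, and the values in the example call are hypothetical:

import math

def cross_entropy(d, h_of_x):
    # -sum_i d_i log h(x_i) + (1 - d_i) log[1 - h(x_i)]
    return -sum(di * math.log(hi) + (1 - di) * math.log(1.0 - hi)
                for di, hi in zip(d, h_of_x))

print(cross_entropy([1, 0, 1], [0.9, 0.2, 0.8]))   # lower value = more probable data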


6.6 Minimum Description Length Principle

  h_MAP = argmax_{h∈H} P(D|h) P(h)
        = argmin_{h∈H} { -log₂ P(D|h) - log₂ P(h) }

⇒ short hypotheses are preferred

Description length L_C(h): number of bits required to encode message h using code C


– -log₂ P(h) = L_{C_H}(h): description length of h under the optimal (most compact) encoding C_H of H
– -log₂ P(D|h) = L_{C_{D|h}}(D|h): description length of the training data D given hypothesis h, under its optimal encoding C_{D|h}

⇒ h_MAP = argmin_{h∈H} { L_{C_H}(h) + L_{C_{D|h}}(D|h) }

MDL Principle:
  Choose h_MDL = argmin_{h∈H} { L_{C1}(h) + L_{C2}(D|h) }
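
A tiny sketch of the MDL choice itself; the candidate hypotheses and their code lengths (in bits) are hypothetical:

# Pick the hypothesis minimizing L_C1(h) + L_C2(D|h).
candidates = [
    ("small tree", 12, 40),   # (name, bits to encode h, bits to encode D given h)
    ("large tree", 55, 3),
]
h_mdl = min(candidates, key=lambda c: c[1] + c[2])
print(h_mdl[0])   # 'small tree' (12 + 40 = 52 bits vs 55 + 3 = 58 bits)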


6.7 Bayes Optimal Classifier

What is the most probable classification of a new instance, given the training data?

Answer:  argmax_{v_j∈V} ∑_{h∈H} P(v_j|h) P(h|D)
  where the v_j ∈ V are the possible classes

⇒ Bayes Optimal Classifier

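A toy sketch with three hypothetical hypotheses and V = {+, -}; it also shows that the Bayes optimal classification can differ from the prediction of the single MAP hypothesis:

posteriors = {"h1": 0.4, "h2": 0.3, "h3": 0.3}        # P(h|D); h1 is the MAP hypothesis
class_given_h = {"h1": {"+": 1.0, "-": 0.0},          # P(v_j|h)
                 "h2": {"+": 0.0, "-": 1.0},
                 "h3": {"+": 0.0, "-": 1.0}}

scores = {v: sum(class_given_h[h][v] * posteriors[h] for h in posteriors)
          for v in ("+", "-")}
print(scores)                        # {'+': 0.4, '-': 0.6}
print(max(scores, key=scores.get))   # '-': the Bayes optimal classification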

6.9 Naïve Bayes Classifier

Given an instance x = (a_1, a_2, ..., a_n):
  v_MAP = argmax_{v_j∈V} P(x|v_j) P(v_j)

The Naïve Bayes classifier assumes conditional independence of the attribute values:
  v_NB = argmax_{v_j∈V} P(v_j) ∏_{i=1..n} P(a_i|v_j)
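
A minimal sketch of training and applying such a classifier over discrete attributes; examples (a list of (attribute_tuple, class) pairs) is a hypothetical placeholder, and the plain frequency estimates are unsmoothed (smoothing appears in the text-classification example below):

from collections import Counter, defaultdict

def train_nb(examples):
    # examples: list of (attribute_tuple, class_label)
    class_counts = Counter(c for _, c in examples)
    value_counts = defaultdict(Counter)            # (class, attr_index) -> value counts
    for attrs, c in examples:
        for i, a in enumerate(attrs):
            value_counts[(c, i)][a] += 1
    return class_counts, value_counts, len(examples)

def classify_nb(x, model):
    class_counts, value_counts, n = model
    def score(c):
        p = class_counts[c] / n                                  # P(v_j)
        for i, a in enumerate(x):
            p *= value_counts[(c, i)][a] / class_counts[c]       # P(a_i|v_j)
        return p
    return max(class_counts, key=score)                          # v_NB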


6.10 An Example: Learning to Classify Text

Task: "Filter WWW pages that discuss ML topics"
• The instance space X contains all possible text documents
• Training examples are classified as "like" or "dislike"

How do we represent an arbitrary document?
• Define an attribute for each word position
• Define the value of that attribute to be the English word found in that position


v_NB = argmax_{v_j∈V} P(v_j) ∏_{i=1..N_words} P(a_i|v_j)

  V = {like, dislike}     a_i ranges over ~50,000 distinct English words

⇒ We would have to estimate ~ 2 × 50,000 × N_words conditional probabilities P(a_i|v_j)

This can be reduced to 2 × 50,000 terms by assuming that word probabilities are independent of position:
  P(a_i = w_k|v_j) = P(a_m = w_k|v_j)   ∀ i, j, k, m

– How do we choose the conditional probabilities?

m-estimate (with uniform priors):
  P(w_k|v_j) = (n_k + 1) / (n + |Vocabulary|)

  n: total number of word positions in the training documents of class v_j
  n_k: number of times word w_k occurs among those positions
  |Vocabulary|: total number of distinct words in the training data

Concrete example: assigning articles to 20 Usenet newsgroups ⇒ accuracy: 89%
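
A sketch of this estimate applied to tokenized documents (lists of words); the function names word_probs and classify_doc are hypothetical:

from collections import Counter
import math

def word_probs(docs_of_class, vocabulary):
    # P(w_k|v_j) = (n_k + 1) / (n + |Vocabulary|) over all word positions of class v_j
    counts = Counter(w for doc in docs_of_class for w in doc)
    n = sum(counts.values())
    return {w: (counts[w] + 1) / (n + len(vocabulary)) for w in vocabulary}

def classify_doc(doc, class_priors, class_word_probs):
    # Summing logs avoids underflow when multiplying many small P(a_i|v_j).
    return max(class_priors,
               key=lambda v: math.log(class_priors[v])
                             + sum(math.log(class_word_probs[v][w])
                                   for w in doc if w in class_word_probs[v]))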

6.11 Bayesian Belief Networks

Bayesian belief networks assume conditional independence only between subsets of the attributes.

– Conditional independence
  • Discrete-valued random variables X, Y, Z
  • X is conditionally independent of Y given Z if
    P(X|Y,Z) = P(X|Z)


Representation
• A Bayesian network represents the joint probability distribution of a set of variables
• Each variable is represented by a node
• Conditional independence assumptions are indicated by a directed acyclic graph
• Each variable is conditionally independent of its nondescendants in the network, given its immediate predecessors (parents)


The joint probabilities are calculated as
  P(Y_1, Y_2, ..., Y_n) = ∏_{i=1..n} P[Y_i | Parents(Y_i)]

The values P[Y_i | Parents(Y_i)] are stored in conditional probability tables associated with the nodes Y_i.

Example:
  P(Campfire=True | Storm=True, BusTourGroup=True) = 0.4
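
A sketch of this factorization for the Storm / BusTourGroup / Campfire fragment; apart from the 0.4 entry quoted above, the priors and table values below are hypothetical:

parents = {"Storm": (), "BusTourGroup": (), "Campfire": ("Storm", "BusTourGroup")}
cpt = {   # P(node = True | parent values); complements give the False entries
    "Storm":        {(): 0.4},
    "BusTourGroup": {(): 0.5},
    "Campfire":     {(True, True): 0.4,     # the entry quoted above
                     (True, False): 0.1,
                     (False, True): 0.8,
                     (False, False): 0.2},
}

def joint(assignment):
    # P(Y1,...,Yn) = prod_i P(Yi | Parents(Yi))
    p = 1.0
    for var, value in assignment.items():
        parent_vals = tuple(assignment[q] for q in parents[var])
        p_true = cpt[var][parent_vals]
        p *= p_true if value else (1.0 - p_true)
    return p

print(joint({"Storm": True, "BusTourGroup": True, "Campfire": True}))  # 0.4 * 0.5 * 0.4 = 0.08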


Inference
• We wish to infer the probability distribution for
some variable given observed values for (a subset
of) the other variables
• Exact (and sometimes approximate) inference of
probabilities for an arbitrary BN is NP-hard
• There are numerous methods for probabilistic
inference in BN (for instance, Monte Carlo), which
have been shown to be useful in many cases
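
As one concrete (hypothetical) illustration of such a Monte Carlo approach, the sketch below reuses the parents and cpt tables from the joint-probability sketch above and estimates a conditional probability by rejection sampling:

import random

def sample_once():
    # Sample each variable in topological order, given its parents' sampled values.
    a = {}
    for var in ("Storm", "BusTourGroup", "Campfire"):
        parent_vals = tuple(a[q] for q in parents[var])
        a[var] = random.random() < cpt[var][parent_vals]
    return a

samples = [sample_once() for _ in range(100_000)]
kept = [s for s in samples if s["Campfire"]]          # evidence: Campfire = True
print(sum(s["Storm"] for s in kept) / len(kept))      # estimate of P(Storm | Campfire=True)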


Learning Bayesian Belief Networks

Task: devising effective algorithms for learning BBNs from training data
– A focus of much current research interest
– For a given network structure, gradient ascent can be used to learn the entries of the conditional probability tables
– Learning the structure of a BBN is much more difficult, although successful approaches exist for some particular problems
