0% found this document useful (0 votes)

84 views70 pages

Computer Science CPSC 322: Bayesian Networks: Construction

This document provides an overview of Bayesian networks and their construction. It discusses how the chain rule and conditional independence allow Bayesian networks to compactly represent joint probability distributions. The chain rule decomposes a joint distribution into a product of conditional distributions. Conditional independence further simplifies distributions by allowing some conditions to be omitted. Bayesian networks graphically represent dependencies between variables using a directed acyclic graph and define conditional probability distributions for each variable given its parents.

Uploaded by

minemine

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

84 views70 pages

Computer Science CPSC 322: Bayesian Networks: Construction

Uploaded by

minemine

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 70

Computer Science CPSC 322

Lecture 20
Bayesian Networks:
Construction

1
Lecture Overview

• Recap lecture 19
• Bayesian networks: construction
• Defining Conditional Probabilities in a Bnet
• Considerations on Network Structure (time
permitting)

2
Chain Rule
• Allows representing a Join Probability Distribution
(JPD) as the product of conditional probability
distributions

Theorem: Chain Rule

𝑛𝑛

𝑃𝑃(𝑓𝑓1⋀ … ⋀𝑓𝑓𝑛𝑛) = � 𝑃𝑃(𝑓𝑓𝑓𝑓|𝑓𝑓𝑖𝑖 − 1 ⋀ … ⋀𝑓𝑓1)

𝑖𝑖=1

3
Chain Rule example
𝑛𝑛

𝑃𝑃(𝑓𝑓1⋀ … ⋀𝑓𝑓𝑛𝑛) = � 𝑃𝑃(𝑓𝑓𝑓𝑓|𝑓𝑓𝑖𝑖 − 1 ⋀ … ⋀𝑓𝑓1)

𝑖𝑖=1

• We can represent the JPD as a product of marginal

distributions
• We can simplify some terms when the variables
involved are marginally independent or conditionally
independent

5
Marginal Independence

• Intuitively: if X ╨ Y, then
• learning that Y=y does not change your belief in X
• and this is true for all values y that Y could take

• For example, weather is marginally independent

of the result of a coin toss 6
Exploiting marginal independence
• Recall the product rule
p(X=x ˄ Y=y) = p(X=x | Y=y) × p(Y=y)
• If X and Y are marginally independent,
p(X=x | Y=y) = p(X=x)
• Thus we have
p(X=x ˄ Y=y) = p(X=x) × p(Y=y)
• In distribution form
p(X,Y) = p(X) × p(Y)

7
Exploiting marginal independence

Exponentially fewer than the JPD!

8
Given the binary variables A,B,C,D,
To specify P(A,B,C,D) one needs the JDP below To specify P(A)×P(B) ×P(C)×P(D)
one needs the JDPs below
A B C D P(A,B,C,D)
T T T T A P(A)
T T T F T
T T F T F
T T F F
T F T T B P(B)
T F T F
T
T F F T
F
T F F F
F T T T C P(C)
F T T F T
F T F T
F
F T F F
F F T T D P(D)
F F T F
T
F F F T
F
F F F F 9
Conditional Independence

• Intuitively: if X and Y are conditionally independent given Z,

then
• learning that Y=y does not change your belief in X
when we already know Z=z
• and this is true for all values y that Y could take
and all values z that Z could take
10
Example for Conditional Independence
• Whether light l1 is lit (Lit-l1 ) and the position of switch s2
(Up-s2 ) are not marginally independent
• The position of the switch determines whether there is
power in the wire w0 connected to the light
Up-s2

Lit-l1

• However, whether light l1 is lit is conditionally independent from the

position of switch s2 given whether there is power in wire w0 (Power-w0)
• Once we know Power-w0, learning values for Up-s2 does not change our
beliefs about Lit-l1
• I.e., Lit-l1 is conditionally independent of Up-s2 given Power-w0

Up-s2

Power-w0

Lit-l1
11
Conditional vs. Marginal Independence
Two variables can be
Understood
Material
Conditionally but not marginally independent Assignment Exam
• ExamGrade and AssignmentGrade Grade Grade

• ExamGrade and AssignmentGrade given UnderstoodMaterial

Up-s2
• Lit-l1 and Up-s2
Power_w0
• Lit-l1 and Up-s2 given Power_w0
Lit_l1

Marginally but not conditionally independent Smoking Fire

At Sensor
• SmokingAtSensor and Fire
• SmokingAtSensor and Fire given Alarm Alarm

Both marginally and conditionally independent Power_w0

Canucks Win
• CanucksWinStanleyCup and Lit_l1
• CanucksWinStanleyCup and Lit_l1 given Power_w0 Lit_l1

Neither marginally nor conditionally independent

Cloudiness Wind
• Temperature and Cloudiness
• Temperature and Cloudiness given Wind Temperature 12
Exploiting Conditional Independence
Example 2: Boolean variables A,B,C,D
• D is conditionally independent of both A and B given C
 We can rewrite P(D | A,B,C) as P(D|C)
• P(D|C) is much simpler to specify than P(D | A,B,C) !

13
If A, B, C, D are Boolean variables
P(D | A,B,C) is given by the following table
A B C P(D=T|A,B,C) P(D=F|A,B,C)
T T T
T T F
T F T
T F F
F T T
F T F
F F T
F F F

8 – each row represents the probability distribution for D given

the values that A, B and C take in that row
P(D|C) is given by the following table
2 – each row represents the
C P(D=T|C) P(D=F|C)
probability distribution for D given
the value that C takes in that row
T
F 14
Putting It All Together
• Given the JPD P(A,B,C,D),
we can apply the chain rule to get
P( A, B, C , D) = P( A) × P( B | A) × P(C | A, B) × P( D | A, B, C )

• If D is conditionally independent of A and B given C, we

can rewrite the above as
P( A, B, C , D) = P( A) × P( B | A) × P(C | A, B) × P( D | C )

Under independence we gain compactness (fewer/smaller

distributions to deal with)
• The chain rule allows us to write the JPD as a product of conditional
distributions
• Conditional independence allows us to write them more compactly
15
Bayesian (or Belief) Networks

• Bayesian networks and their extensions are

Representation & Reasoning systems
explicitly defined to exploit independence in
probabilistic reasoning

16
Bayesian Networks: Intuition

• A graphical representation for a joint probability distribution

• Nodes are random variables
• Directed edges between nodes reflect dependence
Up-s2
• Some informal examples:
Power-w0

Lit-l1

Understood
Material Smoking
Fire
At Sensor
Assignment Exam
Grade Grade
Alarm
17
Belief (or Bayesian) networks
Def. A Belief network consists of
• a directed, acyclic graph (DAG) where each node is associated
with a random variable Xi
• A domain for each variable Xi
• a set of conditional probability distributions for each node Xi given
its parents Pa(Xi) in the graph
P (Xi | Pa(Xi))

• The parents Pa(Xi) of a variable Xi are those Xi directly

depends on
• A Bayesian network is a compact representation of the
JDP for a set of variables (X1, …,Xn )
P(X1, …,Xn) = ∏ni= 1 P (Xi | Pa(Xi))
18
Lecture Overview

• Recap lecture 19
• Bayesian networks: construction
• Defining Conditional Probabilities in a Bnet
• Considerations on Network Structure (time
permitting)

19
How to build a Bayesian network
1. Define a total order over the random variables: (X1, …,Xn)
2. Apply the chain rule Predecessors of Xi in
the total order defined
P(X1, …,Xn) = ∏ni= 1 P(Xi | X1, … ,Xi-1) over the variables

3. For each Xi, , select the smallest set of predecessors Pa(Xi)

such that
Xi is conditionally
independent from all its
P(Xi | X1, … ,Xi-1) = P (Xi | Pa(Xi)) other predecessors
given Pa(Xi)

4. Then we can rewrite

P(X1, …,Xn) = ∏ni= 1 P (Xi | Pa(Xi))
• This is a compact representation of the initial JPD
• factorization of the JPD based on existing conditional independencies
20
among the variables
How to build a Bayesian network (cont’d)
5. Construct the Bayesian Net (BN)
• Nodes are the random variables
• Draw a directed arc from each variable in Pa(Xi) to Xi
• Define a conditional probability table (CPT) for each
variable Xi:
• P(Xi | Pa(Xi))

21
Example for BN construction: Fire Diagnosis
You want to diagnose whether there is a fire in a building
• You can receive reports (possibly noisy) about whether everyone is
leaving the building
• If everyone is leaving, this may have been caused by a fire alarm
• If there is a fire alarm, it may have been caused by a fire or by
tampering
• If there is a fire, there may be smoke
Start by choosing the random variables for this domain, here all are Boolean:
• Tampering (T) is true when the alarm has been tampered with
• Fire (F) is true when there is a fire
• Alarm (A) is true when there is an alarm
• Smoke (S) is true when there is smoke
• Leaving (L) is true if there are lots of people leaving the building
• Report (R) is true if the sensor reports that lots of people are leaving the
building
Next apply the procedure described earlier
22
Example for BN construction: Fire Diagnosis
1. Define a total ordering of variables:
- Let’s chose an order that follows the causal sequence of events
- Fire (F), Tampering (T), Alarm, (A), Smoke (S) Leaving (L) Report
(R)
2. Apply the chain rule

P(F,T,A,S,L,R) =

23
24
Example for BN construction: Fire Diagnosis
1. Define a total ordering of variables:
- Let’s chose an order that follows the causal sequence of events
- Fire (F), Tampering (T), Alarm, (A), Smoke (S) Leaving (L) Report
(R)
2. Apply the chain rule

Fire

Fire (F) is the first variable in the ordering, X1. It does not have
parents.

Tampering Fire

• Tampering (T) is independent of fire (learning that one is

true/false would not change your beliefs about the
probability of the other)

27
Example
P(F)P (T ) P (A | F,T) P (S | F,T,A) P (L | F,T,A,S) P (R | F,T,A,S,L)

Tampering Fire

• Tampering (T) is independent of fire (learning that one is

true/false would not change your beliefs about the
probability of the other)

28
Fire Diagnosis Example
P(F)P (T ) P (A | F,T) P (S | F,T,A) P (L | F,T,A,S) P (R | F,T,A,S,L)

Tampering Fire

Alarm

• Alarm (A) depends on both Fire and Tampering: it could

be caused by either or both

Tampering Fire

Alarm
Smoke

• Smoke (S) is caused by Fire, and so is independent of

Tampering and Alarm given whether there is a Fire

Tampering Fire

Alarm
Smoke

• Smoke (S) is caused by Fire, and so is independent of

Tampering and Alarm given whether there is a Fire

Tampering Fire

Alarm
Smoke

Leaving

• Leaving (L) is caused by Alarm, and thus is independent

of the other variables given Alarm

32
Fire Diagnosis Example
P(F)P (T ) P (A | F,T) P (S | F) P (L | A) P (R | F,T,A,S,L)

Tampering Fire

Alarm
Smoke

Leaving

Report

• Report ( R) is caused by Leaving, and thus is independent

of the other variables given Leaving

33
Fire Diagnosis Example
P(F)P (T ) P (A | F,T) P (S | F) P (L | A) P (R | L)

Tampering Fire

Alarm
Smoke

Leaving

Report

The result is the Bayesian network above, and its corresponding, very
compact factorization of the original JPD

P(F,T,A,S,L,R)= P(F)P (T ) P (A | F,T) P (S | F) P (L | A) P (R | L)

34
Example for BN construction: Fire Diagnosis

• Note that we intermixed steps 3, 4 and 5, just because sometime it is

easier to reason about conditional dependencies graphically
• However, you can do step 3 and 4 first
• That this, you can simplify the product before building the network
• Still have to reason about dependencies between each node and its
predecessors in the total order

P(F)P (T | F) P (A | F,T) P (S | F,T,A) P (L | F,T,A,S) P (R | F,T,A,S,L)

35
36
Fire Diagnosis Example
P(F)P (T ) P (A | F,T) P (S | F) P (L | A) P (R | L)
5. Construct the Bayesian Net (BN)
• Nodes are the random variables
• Draw a directed arc from each variable in Pa(Xi) to Xi
• Define a conditional probability table (CPT) for each variable Xi:
• P(Xi | Pa(Xi))

Tamperi
Fire
ng

Alarm
Smoke

Leaving

Report

37
Lecture Overview

• Recap lecture 19
• Bayesian networks: construction
• Defining Conditional Probabilities in a Bnet
• Considerations on Network Structure (time
permitting)

38
Example for BN construction: Fire Diagnosis

A. 1 B. 2 C. 4 D. 8
39
Example for BN construction: Fire Diagnosis

• We are not done yet: must specify the Conditional Probability Table
(CPT) for each variable. All variables are Boolean.
• How many probabilities do we need to specify for this Bayesian network?
• For instance, how many probabilities do we need to explicitly specify
for Fire? P(Fire): 1 probability –> P(Fire = T)
Because P(Fire = F) = 1 - P(Fire = T)
40
Example for BN construction: Fire Diagnosis
P(Fire=t)
0.01

• How many probabilities do we need to explicitly specify

for Alarm?

41
Example for BN construction: Fire Diagnosis
P(Fire=t)
0.01

• How many probabilities do we need to explicitly specify

for Alarm?
P(Alarm|Tampering, Fire): 4 probabilities, 1 probability
for each of the 4 instantiations of the parents
42
Example for BN construction: Fire Diagnosis
P(Fire=t)
0.01

Tampering T Fire F P(Alarm=t|T,F) P(Alarm=f|T,F) We don’t need to speficy

t t 0.5 0.5 explicitly P(Alarm=f|T,F)
t f 0.85 0.15 since probabilities in each
f t 0.99 0.01 row must sum to 1
f f 0.0001 0.9999

Each row of this table is a conditional probability distribution

43
Example for BN construction: Fire Diagnosis

• How many probabilities do we need to explicitly specify for the whole

Bayesian network?

A. 6 B. 12 C. 20 D. 26-1

44
Example for BN construction: Fire Diagnosis
P(Tampering=t) P(Fire=t)
0.02 0.01

Tampering T Fire F P(Alarm=t|T,F) Fire F P(Smoke=t |F)

t t 0.5 t 0.9
t f 0.85 f 0.01
f t 0.99
f f 0.0001 Alarm P(Leaving=t|A)
t 0.88
Leaving P(Report=t|L) f 0.001
t 0.75
f 0.01

……..probabilities in total, compared to the of the JPD for

P(T,F,A,S,L,R)

45
Example for BN construction: Fire Diagnosis
P(Tampering=t) P(Fire=t)
0.02 0.01

Tampering T Fire F P(Alarm=t|T,F) Fire F P(Smoke=t |F)

t t 0.5 t 0.9
t f 0.85 f 0.01
f t 0.99
f f 0.0001 Alarm P(Leaving=t|A)
t 0.88
Leaving P(Report=t|L) f 0.001
t 0.75
f 0.01

12 probabilities in total, compared to the 26 -1= 63 of the JPD for

P(T,F,A,S,L,R)

46
Example for BN construction: Fire Diagnosis

How many probabilities do we need to specify for this Bayesian network?

 P(Tampering): 1 probability P(T = t)
 P(Alarm|Tampering, Fire): 4 (independent)
– 1 probability for each of the 4 instantiations of the parents
 For all other variables with only one parent
– 2 probabilities: one for the parent being true and one for the parent
being false
47
 In total: 1+1+4+2+2+2 = 12 (compared to 26 -1= 63 for full JPD!)
Example for BN construction: Fire Diagnosis
P(Tampering=t) P(Fire=t)
0.02 0.01

Tampering T Fire F P(Alarm=t|T,F) Fire F P(Smoke=t |F)

t t 0.5 t 0.9

t f 0.85 f 0.01

f t 0.99
Alarm P(Leaving=t|A)
f f 0.0001
t 0.88
f 0.001
Leaving P(Report=t|L)
t 0.75 Once we have the CPTs in the network,
f 0.01 we can compute any entry of the JPD
P(Tampering=t, Fire=f, Alarm=t, Smoke=f, Leaving=t, Report=t) =

P(Tampering=t) x P(Fire=f)xP(Alarm=t| Tampering=t, Fire=f)xP(Smoke=f| Fire = f)xP(Leaving=t|

Alarm=t) x P(Report=t|Leaving=t) =
= 0.02 x (1-0.01) x 0.85 x (1-0.01) x 0.88 x 0.75 = 0.126
48
In Summary
• In a Belief network, the JPD of the variables
involved is defined as the product of the local
conditional distributions
P (X1, … ,Xn) = ∏i P(Xi | X1, … ,Xi-1) = ∏ i P (Xi | Parents(Xi))

• Any entry of the JPD can be computed given the

CPTs in the network

Once we know the JPD, we can answer any

query about any subset of the variables
- (see Inference by Enumeration topic)
Thus, a Belief network allows one to
answer any query on any subset of the
variables

49
Bayesian Networks: Types of Query/Inference
Diagnostic Predictive Mixed Intercausal

Person smokes
Fire Fire happens There is no fire
next to sensor
P(F=t)=1 F=f
P(F|L=t)=? S=t
Fire Fire P(F|A=t,T=t)=?
Alarm
Smoking
at
Alarm Alarm Fire Sensor

Leaving P(A|F=f,L=t)=?

People are leaving Leaving Leaving Alarm

P(L=t)=1
P(L|F=t)=? People are leaving Alarm goes off
P(L=t)=1 P(A=T) = 1

There are algorithms that leverage the Bnet structure to perform

query answer efficiently
- For instance variable elimination, which we will cover soon
- First, however, we will think a bit more about network structure
50
Learning Goals so Far
• Given a JPD
• Marginalize over specific variables
• Compute distributions over any subset of the variables
• Use inference by enumeration
• to compute joint posterior probability distributions over any subset
of variables given evidence
• Define and use marginal and conditional independence
• Build a Bayesian Network for a given domain (structure)
• Specify the necessary conditional probabilities
• Compute the representational savings in terms of number
of probabilities required

51
Compactness
• In a Bnet, how many rows do we need to explicitly
store for the CPT of a Boolean variable Xi with k
Boolean parents?
Compactness
• A CPT for a Boolean variable Xi with k Boolean parents
has 2k rows for the combinations of parent values
• If each variable has no more than k parents, the complete
network requires to specify n2k numbers
• For k<< n, this is a substantial improvement,
• the numbers required grow linearly with n, vs. O(2n) for
the full joint distribution
• E.g., if we have a Bnets with 30 boolean variables, each
with 5 parents
• Need to specify 30*25 probability
• But we need 230 for JPD
Realistic BNet: Liver Diagnosis
Source: Onisko et al., 1999

~ 60 nodes, max 4 parents per node

Need ~ 60 x 24 = 15 x 26 probabilities instead of 260 probabilities for the JPD
Compactness
• What happens if the network is fully connected?
• Or k ≈ n
• Not much saving compared to the numbers needed to
specify the full JPD
• Bnets are useful in sparse (or locally structured)
domains
• Domains in with each component interacts with (is related
to) a small fraction of other components
• What if this is not the case in a domain we need to reason
about?

May need to make simplifying assumptions to reduce

the dependencies in a domain
“Where do the numbers (CPTs) come
from?”
From experts
• Tedious
• Costly
• Not always reliable
From data => Machine Learning
• There are algorithms to learn both structures and
numbers (CPSC 340, CPSC 422)
• Can be hard to get enough data

Still, usually better than specifying the full JPD

What if we use a different ordering?
• What happens if we use the following order:
• Leaving; Tampering; Report; Smoke; Alarm; Fire.

• We end up with a completely different network structure! (try

it as an exercise)
Leaving Tampering

Report Alarm

Smoke Fire

• Which of the two structures is better?

Which Structure is Better?
Leaving Tampering

Report Alarm

Smoke Fire

• Non-causal network is less compact: 1+2+2+4+8+8 = 25 numbers

needed
• Deciding on conditional independence is hard in non-causal directions
• Causal models and conditional independence seem hardwired for
humans!
• Specifying the conditional probabilities may be harder than in causal
direction
• For instance, we have lost the direct dependency between alarm and one
of its causes, which essentially describes the alarm’s reliability (info often
provided by the maker) 58
Example contd.
• Other than that, our two Bnets for the Alarm problem are
equivalent as long as they represent the same probability
distribution
Leaving Tampering

Report Alarm

Smoke Fire

Variable ordering: L,T,R,S,A,F

Variable ordering: T,F,A,S,L,R

P(T,F,A,S,L,R) = P (T) P (F) P (A | T,F) P (L | A) P (R|L) =

= P(L)P(T|L)P(R|L)P(S|L,T)P(A|S,L,T) P(F|S,A,T)
i.e., they are equivalent if the corresponding CPTs are
specified so that they satisfy the equation above
Are there wrong network structures?
• Given an order of variables, a network with arcs in excess
to those required by the direct dependencies implied by
that order are still ok
• Just not as efficient Leaving Tampering

Alarm
Report

Smoke Fire

P (L)P(T|L)P(R|L) P(S|L,R,T) P(A|S,L,T) P(F|S,A,T) =

Leaving Tampering

Alarm
Report

Smoke Fire
Are there wrong network structures?
• How can a network structure be wrong?
• If it misses directed edges that are required
• E.g. an edge is missing below, making Fire conditionally
independent of Alarm given Tampering and Smoke

Leaving Tampering

Report Alarm

Smoke Fire

But they are not:

for instance, P(Fire = t| Smoke = f, Tampering = F, Alarm = T) should
be
higher than P(Fire = t| Smoke = f, Tampering = f),
Are there wrong network structures?
• How can a network structure be wrong?
• If it misses directed edges that are required
• E.g. an edge is missing below: Fire is not conditionally
independent of Alarm | {Tampering, Smoke}

Leaving Tampering

Report Alarm

Smoke Fire

But remember what we said a few slides back.

Sometimes we may need to make simplifying
assumptions - e.g. assume conditional
independence when it does not actually hold – in
order to reduce complexity
Summary of Dependencies in a Bayesian Network
In 1, 2 and 3, X and Y are dependent (grey areas represent existing
evidence/observations)
Y X
1 E Z

2 E

3 E
Z

• In 3, X and Y become dependent as soon as there is evidence on Z or on any

of its descendants.
• This is because knowledge of one possible cause given evidence of the effect
explains away the other cause
Dependencies in a Bayesian Network:
summary
In 1, 2 and 3, X and Y are dependent (grey areas represent existing
evidence/observations)
1
Up(s2)

Y X Power(w0)
1 E Z
Lit(l1)
Z 2
Understood
Material
2 E
Assignment Exam
Grade Grade

3 E
Z
3
Smoking Fire
At Sensor

Alarm

• In 3, X and Y become dependent as soon as there is evidence on Z or on any

of its descendants.
• This is because knowledge of one possible cause given evidence of the effect
explains away the other cause
Or Conditional Independencies
Or, blocking paths for probability propagation. Three ways in which a path
between Y to X (or viceversa) can be blocked, given evidence E

Y E X
1 Z

3
Z

• In 3, X and Y are independent if there is no evidence on their common effect

(recall fire and tampering in the alarm example
Or Conditional Independencies
Or, blocking paths for probability propagation. Three ways in which a path
between Y to X (or viceversa) can be blocked, given evidence E
Up(s2) 1
Y E X
1 Power(w0)
Z
Lit(l1) 2
Z Understood
Material

2 Assignment Exam
Grade Grade

Z Smoking 3
Fire
At Sensor

Alarm

• In 3, X and Y are independent if there is no evidence on their common effect

(recall fire and tampering in the alarm example
Practice in the AISpace Applet
• Open the Belief and Decision Networks applet
• Load the problem: Conditional Independence Quiz
• Click on Independence Quiz
Practice in the AISpace Applet
• Answer Quizzes in the Conditional Independence Quiz Panel
Learning Goals so Far
• Given a JPD
• Marginalize over specific variables
• Compute distributions over any subset of the variables
• Use inference by enumeration
• to compute joint posterior probability distributions over any subset
of variables given evidence
• Define and use marginal and conditional independence
• Build a Bayesian Network for a given domain (structure)
• Specify the necessary conditional probabilities
• Compute the representational savings in terms of number of
probabilities required
• Identify dependencies/independencies between nodes in a Bayesian
Network

Now we will see how to do inference in BNETS

22cse61 Module 4
No ratings yet
22cse61 Module 4
110 pages
Bayesian Networks
No ratings yet
Bayesian Networks
48 pages
12.uncertainty Reasoning Class
No ratings yet
12.uncertainty Reasoning Class
68 pages
Lec7 - Bayesian Network I
No ratings yet
Lec7 - Bayesian Network I
62 pages
BN DBN SSM HMM - Ghahramani
No ratings yet
BN DBN SSM HMM - Ghahramani
30 pages
Bayesian Neworks
No ratings yet
Bayesian Neworks
32 pages
BayesianNetwork in Ai For 6th Sem
No ratings yet
BayesianNetwork in Ai For 6th Sem
17 pages
Lecture 4-5 Reasoning With Uncertainty-2
No ratings yet
Lecture 4-5 Reasoning With Uncertainty-2
34 pages
Unit-V POAI
No ratings yet
Unit-V POAI
50 pages
13 Bayes-Net
No ratings yet
13 Bayes-Net
19 pages
COMP538: Introduction To Bayesian Networks
No ratings yet
COMP538: Introduction To Bayesian Networks
53 pages
Baysian Belief Networks
No ratings yet
Baysian Belief Networks
32 pages
Bayesian Network
No ratings yet
Bayesian Network
33 pages
L12 Bayesian Network
No ratings yet
L12 Bayesian Network
35 pages
Chapter 4 Bayesian Networks
No ratings yet
Chapter 4 Bayesian Networks
62 pages
Bayesian Belief Network
No ratings yet
Bayesian Belief Network
53 pages
2021 Lecture09 BayesianNetworks
No ratings yet
2021 Lecture09 BayesianNetworks
60 pages
Bayesian Networks
No ratings yet
Bayesian Networks
45 pages
Ba Yes Network
No ratings yet
Ba Yes Network
73 pages
Bayesian Networks
No ratings yet
Bayesian Networks
16 pages
Bayesian Networks: A Tutorial
No ratings yet
Bayesian Networks: A Tutorial
73 pages
Bayesian Belief Network
No ratings yet
Bayesian Belief Network
41 pages
Bayesian Networks: (Aka Bayes Nets, Belief Nets) (One Type of Graphical Model)
No ratings yet
Bayesian Networks: (Aka Bayes Nets, Belief Nets) (One Type of Graphical Model)
18 pages
Lecture Bayesian Networks
No ratings yet
Lecture Bayesian Networks
50 pages
4.2 Bayes-Nets
No ratings yet
4.2 Bayes-Nets
33 pages
Bayesian Networks and Inference
No ratings yet
Bayesian Networks and Inference
50 pages
Monte Carlo Artificial Intelligence: Bayesian Networks
No ratings yet
Monte Carlo Artificial Intelligence: Bayesian Networks
26 pages
CS480 Lecture October24th
No ratings yet
CS480 Lecture October24th
90 pages
Bayesian Network
No ratings yet
Bayesian Network
20 pages
BayesianNetworks Reduced
No ratings yet
BayesianNetworks Reduced
14 pages
Bayesian Networks
No ratings yet
Bayesian Networks
24 pages
Bayesian Networks Analysis
No ratings yet
Bayesian Networks Analysis
51 pages
Unit V - Graphical Models
No ratings yet
Unit V - Graphical Models
43 pages
Bayesian Network
No ratings yet
Bayesian Network
21 pages
Bays Theorem
No ratings yet
Bays Theorem
42 pages
Lecture-8 Machine Learning With Python
No ratings yet
Lecture-8 Machine Learning With Python
35 pages
BayesianNetworks Reduced
No ratings yet
BayesianNetworks Reduced
14 pages
13 Bayes Nets
No ratings yet
13 Bayes Nets
38 pages
Libpgm For Bayesian Networks: Dr. A. Obulesh Associate Professor
No ratings yet
Libpgm For Bayesian Networks: Dr. A. Obulesh Associate Professor
59 pages
Artificial Intelligence: Adina Magda Florea
No ratings yet
Artificial Intelligence: Adina Magda Florea
36 pages
AIFA 25 Bayesian Logic 120324
No ratings yet
AIFA 25 Bayesian Logic 120324
33 pages
PPT06-Probabilistic Reasoning
No ratings yet
PPT06-Probabilistic Reasoning
31 pages
The Economic Definition of Ore
100% (1)
The Economic Definition of Ore
161 pages
Good BayesianNetworksPrimer
No ratings yet
Good BayesianNetworksPrimer
23 pages
Module 5 2
No ratings yet
Module 5 2
41 pages
Bayesian Networks: Section 1 - 2
No ratings yet
Bayesian Networks: Section 1 - 2
16 pages
2 Information Theory
No ratings yet
2 Information Theory
40 pages
Unit 6
No ratings yet
Unit 6
126 pages
Bayesian Belief Network
No ratings yet
Bayesian Belief Network
6 pages
Bayesian Network - Problem
100% (1)
Bayesian Network - Problem
4 pages
Power Apps Guidance Fusion Dev Ebook
No ratings yet
Power Apps Guidance Fusion Dev Ebook
278 pages
5 Uncertainity Problems
No ratings yet
5 Uncertainity Problems
30 pages
Aiml Unit 2
No ratings yet
Aiml Unit 2
15 pages
Unit Iv Learning
No ratings yet
Unit Iv Learning
40 pages
EECS6895 AdvancedBigDataAnalytics Lecture6
No ratings yet
EECS6895 AdvancedBigDataAnalytics Lecture6
81 pages
Unit-5 Bayes' Rule and Bayesian Network
No ratings yet
Unit-5 Bayes' Rule and Bayesian Network
9 pages
An Introduction To Artificial Intelligence: Chapter 13 &14.1-14.2: Uncertainty & Bayesian Networks
No ratings yet
An Introduction To Artificial Intelligence: Chapter 13 &14.1-14.2: Uncertainty & Bayesian Networks
31 pages
Bayesian Belief Network
100% (1)
Bayesian Belief Network
7 pages
Bayesian Networks: Machine Learning, Lecture (Jaakkola)
No ratings yet
Bayesian Networks: Machine Learning, Lecture (Jaakkola)
8 pages
Cloud Computing: Shailendra Singh Professor Department of Computer Science & Engineering NITTTR, Bhopal
100% (2)
Cloud Computing: Shailendra Singh Professor Department of Computer Science & Engineering NITTTR, Bhopal
24 pages
Bayesian Networks
No ratings yet
Bayesian Networks
7 pages
Bayesian Belief Network in Artificial Intelligence
No ratings yet
Bayesian Belief Network in Artificial Intelligence
10 pages
CS
No ratings yet
CS
15 pages
Google Analytics
No ratings yet
Google Analytics
7 pages
Computer Implementation For 1D and 2D Problems: 4.1 MATLAB Code For 1D FEM (Steady1d.m)
No ratings yet
Computer Implementation For 1D and 2D Problems: 4.1 MATLAB Code For 1D FEM (Steady1d.m)
41 pages
Cbus Reverse Engineered Documentation
No ratings yet
Cbus Reverse Engineered Documentation
66 pages
Factoring Polynomials: Be Sure Your Answers Will Not Factor Further!
No ratings yet
Factoring Polynomials: Be Sure Your Answers Will Not Factor Further!
5 pages
Remote Procedure Call (RPC)
No ratings yet
Remote Procedure Call (RPC)
50 pages
Face Mask Detection
No ratings yet
Face Mask Detection
44 pages
Upfc PHD Thesis
100% (3)
Upfc PHD Thesis
7 pages
20211706271839tangazo La Kazi Bodi Ya Nit, Ticd, Kuwasa, Mof & Moh
No ratings yet
20211706271839tangazo La Kazi Bodi Ya Nit, Ticd, Kuwasa, Mof & Moh
12 pages
MDT Presentation Set 2
No ratings yet
MDT Presentation Set 2
6 pages
Grade 9 Pre June 2024 Marking Guidelines
No ratings yet
Grade 9 Pre June 2024 Marking Guidelines
10 pages
Unit IV Aiml
No ratings yet
Unit IV Aiml
32 pages
The Ethiopian Orthodox Tewahido Church Faith and Order: HOLY THURSDAY A Review Before The Final Exam Readings Appointed
No ratings yet
The Ethiopian Orthodox Tewahido Church Faith and Order: HOLY THURSDAY A Review Before The Final Exam Readings Appointed
3 pages
Bumps and Pothole Detection Report Final
No ratings yet
Bumps and Pothole Detection Report Final
64 pages
Continuous Versions of Firefly Algorithm: A Review
No ratings yet
Continuous Versions of Firefly Algorithm: A Review
48 pages
The Magic Cafe Forums - Red Streamlined Convertible by David Regal
No ratings yet
The Magic Cafe Forums - Red Streamlined Convertible by David Regal
3 pages
Wago Perspecto 762: Manual
No ratings yet
Wago Perspecto 762: Manual
50 pages
2020.11.28.402297v2.full - Using Pumas AI
No ratings yet
2020.11.28.402297v2.full - Using Pumas AI
35 pages
Graph Theory (J) : Grace He March 2021
No ratings yet
Graph Theory (J) : Grace He March 2021
4 pages
Presentation of Final Project 2
No ratings yet
Presentation of Final Project 2
26 pages
Schematic Diagram: 7-1. Circuit Descriptions
No ratings yet
Schematic Diagram: 7-1. Circuit Descriptions
6 pages
Ipv4 Addressing: © 2008 Cisco Systems, Inc. All Rights Reserved. Cisco Confidential Presentation - Id
No ratings yet
Ipv4 Addressing: © 2008 Cisco Systems, Inc. All Rights Reserved. Cisco Confidential Presentation - Id
35 pages
Escalation Points
No ratings yet
Escalation Points
2 pages
29.11.2024 FN Seating
No ratings yet
29.11.2024 FN Seating
4 pages
Understanding The Priority Queue With Custom
No ratings yet
Understanding The Priority Queue With Custom
3 pages
GPU-Co Processing
No ratings yet
GPU-Co Processing
8 pages
Kramer VM 2h2 Um 4
No ratings yet
Kramer VM 2h2 Um 4
16 pages
Tilahun-Tawhid2019 Article SwarmHyperheuristicFramework PDF
No ratings yet
Tilahun-Tawhid2019 Article SwarmHyperheuristicFramework PDF
28 pages
Published Paper
No ratings yet
Published Paper
13 pages
Project Chatbot Using Python
No ratings yet
Project Chatbot Using Python
2 pages
18CS34 CES Questionnaire
No ratings yet
18CS34 CES Questionnaire
2 pages
Light Control Dual LED Flasher Using NOR Gate
From Everand
Light Control Dual LED Flasher Using NOR Gate
GURUPRASAD N H
No ratings yet

Computer Science CPSC 322: Bayesian Networks: Construction

Uploaded by

Computer Science CPSC 322: Bayesian Networks: Construction

Uploaded by

Computer Science CPSC 322

Theorem: Chain Rule

𝑃𝑃(𝑓𝑓1⋀ … ⋀𝑓𝑓𝑛𝑛) = � 𝑃𝑃(𝑓𝑓𝑓𝑓|𝑓𝑓𝑖𝑖 − 1 ⋀ … ⋀𝑓𝑓1)

𝑃𝑃(𝑓𝑓1⋀ … ⋀𝑓𝑓𝑛𝑛) = � 𝑃𝑃(𝑓𝑓𝑓𝑓|𝑓𝑓𝑖𝑖 − 1 ⋀ … ⋀𝑓𝑓1)

• We can represent the JPD as a product of marginal

• For example, weather is marginally independent

Exponentially fewer than the JPD!

• Intuitively: if X and Y are conditionally independent given Z,

• However, whether light l1 is lit is conditionally independent from the

• ExamGrade and AssignmentGrade given UnderstoodMaterial

Marginally but not conditionally independent Smoking Fire

Both marginally and conditionally independent Power_w0

Neither marginally nor conditionally independent

8 – each row represents the probability distribution for D given

• If D is conditionally independent of A and B given C, we

Under independence we gain compactness (fewer/smaller

• Bayesian networks and their extensions are

• A graphical representation for a joint probability distribution

• The parents Pa(Xi) of a variable Xi are those Xi directly

3. For each Xi, , select the smallest set of predecessors Pa(Xi)

4. Then we can rewrite

• Tampering (T) is independent of fire (learning that one is

• Tampering (T) is independent of fire (learning that one is

• Alarm (A) depends on both Fire and Tampering: it could

• Smoke (S) is caused by Fire, and so is independent of

• Smoke (S) is caused by Fire, and so is independent of

• Leaving (L) is caused by Alarm, and thus is independent

• Report ( R) is caused by Leaving, and thus is independent

P(F,T,A,S,L,R)= P(F)P (T ) P (A | F,T) P (S | F) P (L | A) P (R | L)

• Note that we intermixed steps 3, 4 and 5, just because sometime it is

P(F)P (T | F) P (A | F,T) P (S | F,T,A) P (L | F,T,A,S) P (R | F,T,A,S,L)

• How many probabilities do we need to explicitly specify

• How many probabilities do we need to explicitly specify

Tampering T Fire F P(Alarm=t|T,F) P(Alarm=f|T,F) We don’t need to speficy

Each row of this table is a conditional probability distribution

• How many probabilities do we need to explicitly specify for the whole

Tampering T Fire F P(Alarm=t|T,F) Fire F P(Smoke=t |F)

……..probabilities in total, compared to the of the JPD for

Tampering T Fire F P(Alarm=t|T,F) Fire F P(Smoke=t |F)

12 probabilities in total, compared to the 26 -1= 63 of the JPD for

How many probabilities do we need to specify for this Bayesian network?

Tampering T Fire F P(Alarm=t|T,F) Fire F P(Smoke=t |F)

P(Tampering=t) x P(Fire=f)xP(Alarm=t| Tampering=t, Fire=f)xP(Smoke=f| Fire = f)xP(Leaving=t|

• Any entry of the JPD can be computed given the

Once we know the JPD, we can answer any

People are leaving Leaving Leaving Alarm

There are algorithms that leverage the Bnet structure to perform

~ 60 nodes, max 4 parents per node

May need to make simplifying assumptions to reduce

Still, usually better than specifying the full JPD

• We end up with a completely different network structure! (try

• Which of the two structures is better?

• Non-causal network is less compact: 1+2+2+4+8+8 = 25 numbers

Variable ordering: L,T,R,S,A,F

Variable ordering: T,F,A,S,L,R

P(T,F,A,S,L,R) = P (T) P (F) P (A | T,F) P (L | A) P (R|L) =

P (L)P(T|L)P(R|L) P(S|L,R,T) P(A|S,L,T) P(F|S,A,T) =

But they are not:

But remember what we said a few slides back.

• In 3, X and Y become dependent as soon as there is evidence on Z or on any

• In 3, X and Y become dependent as soon as there is evidence on Z or on any

• In 3, X and Y are independent if there is no evidence on their common effect

• In 3, X and Y are independent if there is no evidence on their common effect

Now we will see how to do inference in BNETS

You might also like