Lecture 12
Lecture 12
Hypothesis averaging:
Compute the probability that C applies to some
new object y by averaging the predictions of all
hypotheses h, weighted by p(h|X):
p( y Î C | X ) = å$
p( y Î C | h) p(h | X )
!#!"
hÎH é 1 if yÎh
=ê
ë 0 if yÏh
= å p(h | X )
h É{ y , X }
Examples:
16
Examples:
16
8
2
64
Examples:
16
23
19
20
+ Examples Human generalization Bayesian Model
60
60 80 10 30
60 52 57 55
16
16 8 2 64
16 23 19 20
Summary of the Bayesian model
“tufa”
“tufa”
“tufa”
Learning rectangle concepts
Bayesian
concept learning
with tree-structured
hypothesis space
Exploring different models
• Different priors?
– More complex language-like hypothesis spaces, allowing
exceptions, compound concepts, and much more…
Exploring different models
• Different priors?
– More complex language-like hypothesis spaces, allowing
exceptions, compound concepts, and much more…
Another word learning game
Hypothesis
Space: h1
h2
h2
h2
– 60, 80, 10, 30, 40, 20, 90, 80, 60, 40, 10, 20, 80, 30, 90, 60,
40, 30, 60, 80, 20, 90, 10, 30, 40, 90, 10, 60, 20, 80, 30
• Why “multiples of 10 except 50 and 70” more likely here?
– 60, 80, 10, 30, 55, 40, 20, 80, 50, 10,
• Why “multiples of 10, plus 55” more likely here?
Constructing more flexible priors: A
language for thought?
• Start with a base set of regularities R and combination
operators C. Hypothesis space = closure of R under C.
– C = {and, or}: H = unions and intersections of regularities in R (e.g.,
“multiples of 10 between 30 and 70”).
• Defining a prior:
– The Bayesian Occam’s Razor:
• Model classes defined by number of combinations.
• More combinations more hypotheses lower prior
Prior: p (h)
All hypotheses
µc µc
µc
c c
µ c µ
20 µ 200000
2000
– 60, 80, 10, 30, 40, 20, 90, 80, 60, 40, 10, 20, 80, 30, 90, 60,
40, 30, 60, 80, 20, 90, 10, 30, 40, 90, 10, 60, 20, 80, 30
• Why “multiples of 10 except 50 and 70” more likely here?
– 60, 80, 10, 30, 40, 20, 90, 80, 60, 40, 10, 20, 80, 30, 90, 60,
40, 30, 60, 80, 20, 90, 10, 30, 40, 90, 10, 60, 20, 80, 30
• Why “multiples of 10 except 50 and 70” more likely here?