Knowledge in Learning: Fall 2005
Knowledge in Learning: Fall 2005
Chapter 19
Fall 2005
Problems: nondeterministic, no
guarantee for simplest and correct h,
need backtrack
CSE 471/598 by H. Liu 6
Least-commitment search
Keeping only one h as its best guess is the
problem -> Can we keep as many as possible?
Version space (candidate elimination) Algorithm
incremental
least-commitment
From intervals to boundary sets
G-set and S-set
S0 – the most specific set contains nothing <0,0,…,0>
G0 – the most general set covers everything <?,?,…,?>
Everything between is guaranteed to be consistent
with examples.
VS tries to generalize S0 and specialize G0
incrementally
CSE 471/598 by H. Liu 7
Version space
Generalization and specialization (Fig 19.4):
find d-sets that contain only true/+, and true/-;
Sj can only be generalized and Gj can only be
specialized
False positive for Si, too general, discard it
False negative for Si, too specific, generalize it minimally
False positive for Gi, too general, specialize it minimally
False negative for Gi, too specific, discard it
When to stop
One concept left (Si = Gi)
The version space collapses (G is more special than S, or..)
Run out of examples
An example with 4 instances from Tom Mitchell’s
book
Generality
Empirical analysis of efficiency in EBL
study