0% found this document useful (0 votes)

18 views

ML LAB Task-1 Task-2 Notes

The document discusses concept learning, which involves inferring a Boolean-valued function from training examples. It describes the process of learning a target concept, such as predicting enjoyment of a water sport based on various weather attributes, and introduces algorithms like FIND-S and CANDIDATE-ELIMINATION for hypothesis generation and refinement. The goal is to find a hypothesis that accurately classifies instances as positive or negative based on the training data.

Uploaded by

andrajub4u

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

18 views

ML LAB Task-1 Task-2 Notes

Uploaded by

andrajub4u

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 12

VARDHAMAN COLLEGE OF ENGINEERING

(AUTONOMOUS)
Affiliated to JNTUH, Approved by AICTE, Accredited by NAAC with A++ Grade, ISO 9001:2015 Certified
Kacharam, Shamshabad, Hyderabad - 501218, Telangana, India

CONCEPT LEARNING

 Learning involves acquiring general concepts from specific training examples. Example:
People continually learn general concepts or categories such as "bird," "car," "situations
in which I should study more in order to pass the exam," etc.
 Each such concept can be viewed as describing some subset of objects or events defined
over a larger set
 Alternatively, each concept can be thought of as a Boolean-valued function defined over
this larger set. (Example: A function defined over all animals, whose value is true for
birds and false for other animals).

Definition: Concept learning - Inferring a Boolean-valued function from training

examples of its input and output

A CONCEPT LEARNING TASK

Consider the example task of learning the target concept "Days on which Aldo enjoyshis
favorite water sport”

Example Sky AirTemp Humidity Wind Wate Forecast Enjoy

r Sport
1 Sunny Warm Normal Strong Warm Same Yes

2 Sunny Warm High Strong Warm Same Yes

3 Rainy Cold High Strong Warm Change No

4 Sunny Warm High Strong Cool Change Yes

Table: Positive and negative training examples for the target concept EnjoySport.

The task is to learn to predict the value of EnjoySport for an arbitrary day, based on
thevalues of its other attributes?

What hypothesis representation is provided to the learner?

 Let’s consider a simple representation in which each hypothesis consists of

aconjunction of constraints on the instance attributes.
 Let each hypothesis be a vector of six constraints, specifying the values
of the six attributes Sky, AirTemp, Humidity, Wind, Water, and Forecast.

1
For each attribute, the hypothesis will either
 Indicate by a "?' that any value is acceptable for this attribute,
 Specify a single required value (e.g., Warm) for the attribute, or
 Indicate by a "Φ" that no value is acceptable

If some instance x satisfies all the constraints of hypothesis h, then h classifies x as a

positive example (h(x) = 1).

The hypothesis that PERSON enjoys his favorite sport only on cold days with high humidityis
represented by the expression
(?, Cold, High, ?, ?, ?)

The most general hypothesis-that every day is a positive example-is represented by

(?, ?, ?, ?, ?, ?)

The most specific possible hypothesis-that no day is a positive example-is represented by

(Φ, Φ, Φ, Φ, Φ, Φ)
Notation

 The set of items over which the concept is defined is called the set of instances,
which is denoted by X.

Example: X is the set of all possible days, each represented by the attributes: Sky,
AirTemp, Humidity, Wind, Water, and Forecast

 The concept or function to be learned is called the target concept, which is denoted by
c. c can be any Boolean valued function defined over the instances X

c: X→ {O, 1}

Example: The target concept corresponds to the value of the attribute EnjoySport
(i.e., c(x) = 1 if EnjoySport = Yes, and c(x) = 0 if EnjoySport = No).

 Instances for which c(x) = 1 are called positive examples, or members of the target concept.
 Instances for which c(x) = 0 are called negative examples, or non-members of
the target concept.
 The ordered pair (x, c(x)) to describe the training example consisting of the instance
x and its target concept value c(x).
 D to denote the set of available training examples

 The symbol H to denote the set of all possible hypotheses that the learner may consider
regarding the identity of the target concept. Each hypothesis h in H represents a Boolean-
valued function defined over X
h: X→{O, 1}

The goal of the learner is to find a hypothesis h such that h(x) = c(x) for all x in X.
2
 Given:
 Instances X: Possible days, each described by the attributes
 Sky (with possible values Sunny, Cloudy, and Rainy),
 AirTemp (with values Warm and Cold),
 Humidity (with values Normal and High),
 Wind (with values Strong and Weak),
 Water (with values Warm and Cool),
 Forecast (with values Same and Change).

 Hypotheses H: Each hypothesis is described by a conjunction of constraints on the

attributes Sky, AirTemp, Humidity, Wind, Water, and Forecast. The constraints may
be "?" (any value is acceptable), “Φ” (no value is acceptable), or a specific value.

 Target concept c: EnjoySport : X → {0, l}

 Training examples D: Positive and negative examples of the target function

 Determine:
 A hypothesis h in H such that h(x) = c(x) for all x in X.

Table: The EnjoySport concept learning task.

The inductive learning hypothesis

Any hypothesis found to approximate the target function well over a sufficiently large set of
training examples will also approximate the target function well over other
unobservedexamples.

3
CONCEPT LEARNING AS SEARCH

 Concept learning can be viewed as the task of searching through a large

space of hypotheses implicitly defined by the hypothesis representation.
 The goal of this search is to find the hypothesis that best fits the training examples.

Example:
Consider the instances X and hypotheses H in the EnjoySport learning task. The attribute
Sky has three possible values, and AirTemp, Humidity, Wind, Water, Forecast each have
two possible values, the instance space X contains exactly
3.2.2.2.2.2 = 96 distinct instances
5.4.4.4.4.4 = 5120 syntactically distinct hypotheses within H.

Every hypothesis containing one or more "Φ" symbols represents the empty set of instances;
that is, it classifies every instance as negative.
1 + (4.3.3.3.3.3) = 973. Semantically distinct hypotheses

General-to-Specific Ordering of Hypotheses

Consider the two hypotheses

h1 = (Sunny, ?, ?, Strong, ?, ?)
h2 = (Sunny, ?, ?, ?, ?, ?)

 Consider the sets of instances that are classified positive by hl and by h2.
 h2 imposes fewer constraints on the instance, it classifies more instances as positive.
So, any instance classified positive by hl will also be classified positive by h2.
Therefore, h2 is more general than hl.

Given hypotheses hj and hk, hj is more-general-than or- equal do hk if and only if any
instancethat satisfies hk also satisfies hi

Definition: Let hj and hk be Boolean-valued functions defined over X. Then hj is more

general-than- or-equal-to hk (written hj ≥ hk) if and only if

( xX ) [(hk (x) = 1) → (hj (x) = 1)]

4
 In the figure, the box on the left represents the set X of all instances, the box on the
right the set H of all hypotheses.
 Each hypothesis corresponds to some subset of X-the subset of instances that it
classifies positive.
 The arrows connecting hypotheses represent the more - general -than relation, with
the arrow pointing toward the less general hypothesis.
 Note the subset of instances characterized by h2 subsumes the subset
characterized by hl , hence h2 is more - general– than h1

FIND-S: FINDING A MAXIMALLY SPECIFIC HYPOTHESIS

FIND-S Algorithm

1. Initialize h to the most specific hypothesis in H

2. For each positive training instance x
For each attribute constraint a in
i
h
If the constraint a is satisfied by x
i
Then do nothing
Else replace a in h by the next more general constraint that is satisfied by x
i

3. Output hypothesis h

5
To illustrate this algorithm, assume the learner is given the sequence of training
examplesfrom the EnjoySport task

Example Sky AirTemp Humidity Wind Water Forecast EnjoySport

1 Sunny Warm Normal Strong Warm Same Yes
2 Sunny Warm High Strong Warm Same Yes
3 Rainy Cold High Strong Warm Change No
4 Sunny Warm High Strong Cool Change Yes

 The first step of FIND-S is to initialize h to the most specific hypothesis in H

h - (Ø, Ø, Ø, Ø, Ø, Ø)

 Consider the first training example

x1 = <Sunny Warm Normal Strong Warm Same>, +

Observing the first training example, it is clear that hypothesis h is too specific.
None of the "Ø" constraints in h are satisfied by this example, so each is replaced by
the next more general constraint that fits the example
h1 = <Sunny Warm Normal Strong Warm Same>

 Consider the second training example

x2 = <Sunny, Warm, High, Strong, Warm, Same>, +

The second training example forces the algorithm to further generalize h, this
time substituting a "?" in place of any attribute value in h that is not satisfied by
the new example
h2 = <Sunny Warm ? Strong Warm Same>

 Consider the third training example

x3 = <Rainy, Cold, High, Strong, Warm, Change>, -

Upon encountering the third training the algorithm makes no change to h. The FIND-S
algorithm simply ignores every negative example.
h3 = < Sunny Warm ? Strong Warm Same>

 Consider the fourth training example

x4 = <Sunny Warm High Strong Cool Change>, +

6
The fourth example leads to a further generalization of h
h4 = < Sunny Warm ? Strong ? ? >

The key property of the FIND-S algorithm

 FIND-S is guaranteed to output the most specific hypothesis within H that is
consistent with the positive training examples
 FIND-S algorithm’s final hypothesis will also be consistent with the negative
examples provided the correct target concept is contained in H, and provided the
training examples are correct.
Unanswered by FIND-S

1. Has the learner converged to the correct target concept?

2. Why prefer the most specific hypothesis?
3. Are the training examples consistent?
4. What if there are several maximally specific consistent hypotheses?

7
VERSION SPACES AND THE CANDIDATE-ELIMINATION ALGORITHM

The key idea in the CANDIDATE-ELIMINATION algorithm is to output a description of theset of all
hypotheses consistent with the training examples

Representation

Definition: consistent- A hypothesis h is consistent with a set of training examples D if and

only if h(x) = c(x) for each example (x, c(x)) in D.

Consistent (h, D)  ( x, c(x)  D) h(x) =

c(x)) Note difference between definitions of consistent and

satisfies
 An example x is said to satisfy hypothesis h when h(x) = 1, regardless of
whether x is a positive or negative example of the target concept.
 An example x is said to consistent with hypothesis h iff h(x) = c(x)

Definition: version space- The version space, denoted V S with respect to hypothesis
space
H, D
H and training examples D, is the subset of hypotheses from H consistent with the
training examples in D
V S {h  H | Consistent (h, D)}
H, D

The LIST-THEN-ELIMINATION algorithm

The LIST-THEN-ELIMINATE algorithm first initializes the version space to contain

all hypotheses in H and then eliminates any hypothesis found inconsistent with any
training example.

VersionSpace c a list containing every hypothesis in H

1. For each training example, (x, c(x))
remove from VersionSpace any hypothesis h for which h(x) ≠ c(x)
2. Output the list of hypotheses in VersionSpace

The LIST-THEN-ELIMINATE Algorithm

 List-Then-Eliminate works in principle, as long as version space is finite.

 However, since it requires exhaustive enumeration of all hypotheses in practice
it is not feasible.
8
A More Compact Representation for Version Spaces

The version space is represented by its most general and least general members. These
members form general and specific boundary sets that delimit the version space within the
partially ordered hypothesis space.

Definition: The general boundary G, with respect to hypothesis space H and training data D,
is the set of maximally general members of H consistent with D

G {g  H | Consistent (g, D)(g'  H)[(g'  g)  Consistent(g', D)]}

Definition: The specific boundary S, with respect to hypothesis space H and training data D,
is the set of minimally general (i.e., maximally specific) members of H consistent with D.

S {s  H | Consistent (s, D)(s'  H)[(s  s')  Consistent(s', D)]}

g
CANDIDATE-ELIMINATION Learning Algorithm

The CANDIDATE-ELIMINTION algorithm computes the version space containing all

hypotheses from H that are consistent with an observed sequence of training examples .

Initialize G to the set of maximally general hypotheses

in H Initialize S to the set of maximally specific
hypotheses in H For each training example d, do
• If d is a positive example
• Remove from G any hypothesis inconsistent with d
• For each hypothesis s in S that is not consistent with d
• Remove s from S
• Add to S all minimal generalizations h of s such that
• h is consistent with d, and some member of G is more general than h
• Remove from S any hypothesis that is more general than another hypothesis in S

• If d is a negative example
• Remove from S any hypothesis inconsistent with d
• For each hypothesis g in G that is not consistent with d
• Remove g from G
• Add to G all minimal specializations h of g such that
• h is consistent with d, and some member of S is more specific than h
• Remove from G any hypothesis that is less general than another hypothesis in G

CANDIDATE- ELIMINTION algorithm using version spaces

9
An Illustrative Example
Example Sky AirTemp Humidity Wind Water Forecast EnjoySport
1 Sunny Warm Normal Strong Warm Same Yes
2 Sunny Warm High Strong Warm Same Yes
3 Rainy Cold High Strong Warm Change No
4 Sunny Warm High Strong Cool Change Yes

CANDIDATE-ELIMINTION algorithm begins by initializing the version space to the set

ofall hypotheses in H;

Initializing the G boundary set to contain the most general hypothesis in H

G0 ?, ?, ?, ?, ?, ?

Initializing the S boundary set to contain the most specific (least general) hypothesis
S0 , , , , , 

 When the first training example is presented, the CANDIDATE-ELIMINTION algorithm
checks the S boundary and finds that it is overly specific and it fails to cover the positive
example.
 The boundary is therefore revised by moving it to the least more general hypothesis that
covers this new example
 No update of the G boundary is needed in response to this training example because G o
correctly covers this example

 When the second training example is observed, it has a similar effect of generalizing
S further to S2, leaving G again unchanged i.e., G2 = G1 = G0

10
 Consider the third training example. This negative example reveals that the G
boundaryof the version space is overly general, that is, the hypothesis in G
incorrectly predicts that this new example is a positive example.
 The hypothesis in the G boundary must therefore be specialized until it correctly
classifies this new negative example

Given that there are six attributes that could be specified to specialize G 2, why are there
only three new hypotheses in G3?
For example, the hypothesis h = (?, ?, Normal, ?, ?, ?) is a minimal specialization of
G2 that correctly labels the new example as a negative example, but it is not included
in G3. The reason this hypothesis is excluded is that it is inconsistent with the
previously encountered positive examples

11
 Consider the fourth training example.

 This positive example further generalizes the S boundary of the version space. It
also results in removing one member of the G boundary, because this member fails
to cover the new positive example

After processing these four examples, the boundary sets S 4 and G4 delimit the version space

of all hypotheses consistent with the set of incrementally observed training examples.

Midmark B23 - Service Manual
100% (1)
Midmark B23 - Service Manual
36 pages
Edgardfreitas 2016
No ratings yet
Edgardfreitas 2016
100 pages
module 2 AI n ML notes
No ratings yet
module 2 AI n ML notes
16 pages
Unit 1-Concept Learning
No ratings yet
Unit 1-Concept Learning
59 pages
ML Notes Module2
No ratings yet
ML Notes Module2
16 pages
Unit2_4
No ratings yet
Unit2_4
7 pages
Concept Learning and Genrel To Specific Ordering - 2
No ratings yet
Concept Learning and Genrel To Specific Ordering - 2
46 pages
M2 - Concept Learning
No ratings yet
M2 - Concept Learning
64 pages
CSE543: Machine Learning: Lecture 2: August 6, 2014
No ratings yet
CSE543: Machine Learning: Lecture 2: August 6, 2014
27 pages
UNIT_1_notes
No ratings yet
UNIT_1_notes
16 pages
Concept Learning
No ratings yet
Concept Learning
71 pages
UNIT1
No ratings yet
UNIT1
82 pages
1.concept Learning
No ratings yet
1.concept Learning
50 pages
Artificial Intelligence and Machine Learning 18CS71
No ratings yet
Artificial Intelligence and Machine Learning 18CS71
17 pages
Lecture 5.2
No ratings yet
Lecture 5.2
8 pages
UNIT_1_notes
No ratings yet
UNIT_1_notes
16 pages
Concept Learning - QB - Solutions
No ratings yet
Concept Learning - QB - Solutions
13 pages
Chapter 2 Concept Learning
No ratings yet
Chapter 2 Concept Learning
36 pages
Concept Learning
No ratings yet
Concept Learning
11 pages
ED317 Statistical Machine learning
No ratings yet
ED317 Statistical Machine learning
174 pages
Concept Learning
No ratings yet
Concept Learning
42 pages
Machine Learning Notes Unit 1
No ratings yet
Machine Learning Notes Unit 1
25 pages
Lecture Notes - 18CS71 - Machine Learning - Module 1: Introduction
No ratings yet
Lecture Notes - 18CS71 - Machine Learning - Module 1: Introduction
16 pages
Lecture3 Concept Learning
No ratings yet
Lecture3 Concept Learning
42 pages
1 Concept-Learning
No ratings yet
1 Concept-Learning
25 pages
2 concept-learning
No ratings yet
2 concept-learning
42 pages
ML_Lecture_2_Version_Spaces
No ratings yet
ML_Lecture_2_Version_Spaces
32 pages
Lecture 2
No ratings yet
Lecture 2
31 pages
UNIT-1
No ratings yet
UNIT-1
43 pages
Module 1- Concept Learning (1)
No ratings yet
Module 1- Concept Learning (1)
50 pages
ML 02 Concept
No ratings yet
ML 02 Concept
7 pages
Combined ML
100% (1)
Combined ML
705 pages
Module 1-2
No ratings yet
Module 1-2
19 pages
3 Ml Ch2 Concept Learning Short
No ratings yet
3 Ml Ch2 Concept Learning Short
16 pages
concept learning
No ratings yet
concept learning
18 pages
CH2 ConceptLearning
No ratings yet
CH2 ConceptLearning
38 pages
Chapter 11
No ratings yet
Chapter 11
55 pages
Aiml Lab Exp 1 (Find S)
No ratings yet
Aiml Lab Exp 1 (Find S)
24 pages
Machine Learning: Bilal Khan
No ratings yet
Machine Learning: Bilal Khan
40 pages
Concept
No ratings yet
Concept
43 pages
5 - AIML - Module3 - PPT
No ratings yet
5 - AIML - Module3 - PPT
37 pages
Unit-2 MLT Handouts-Up To Bayesian Learning
No ratings yet
Unit-2 MLT Handouts-Up To Bayesian Learning
16 pages
ML Lab Program - VTU
No ratings yet
ML Lab Program - VTU
4 pages
Find S
No ratings yet
Find S
4 pages
ML unit -I part II
No ratings yet
ML unit -I part II
9 pages
IAT-I Question Paper With Solution of 18CS71 Artificial Intelligence and Machine Learning Oct-2022-Dr - Swathi Y
No ratings yet
IAT-I Question Paper With Solution of 18CS71 Artificial Intelligence and Machine Learning Oct-2022-Dr - Swathi Y
7 pages
ML Unit 1 29 45
No ratings yet
ML Unit 1 29 45
17 pages
Concept Learning and The General-To-Specific Ordering2
No ratings yet
Concept Learning and The General-To-Specific Ordering2
19 pages
03-computational cognitive science
No ratings yet
03-computational cognitive science
42 pages
Ex.no.2_Find S Algorithm
No ratings yet
Ex.no.2_Find S Algorithm
3 pages
Lec01 Conceptlearning
100% (1)
Lec01 Conceptlearning
49 pages
2.concept Learning
No ratings yet
2.concept Learning
21 pages
S Algorithm
No ratings yet
S Algorithm
19 pages
ML 02
No ratings yet
ML 02
25 pages
unit - 1
No ratings yet
unit - 1
29 pages
Candidate Elimination Algo
No ratings yet
Candidate Elimination Algo
13 pages
ML Unit 1
No ratings yet
ML Unit 1
44 pages
Find - S Algorithm
No ratings yet
Find - S Algorithm
17 pages
ML Reference-Material-II
No ratings yet
ML Reference-Material-II
25 pages
Set Theory Essentials
From Everand
Set Theory Essentials
Emil Milewski
No ratings yet
Theory of Approximation
From Everand
Theory of Approximation
N. I. Achieser
No ratings yet
Group Theory I Essentials
From Everand
Group Theory I Essentials
Emil Milewski
No ratings yet
Types of ML
No ratings yet
Types of ML
27 pages
Types of Learning (1)
No ratings yet
Types of Learning (1)
12 pages
Session-2 - BuzzWords ByteCode Programs
No ratings yet
Session-2 - BuzzWords ByteCode Programs
17 pages
Session-4 - Control Statements
No ratings yet
Session-4 - Control Statements
10 pages
Staff Development Programme
No ratings yet
Staff Development Programme
13 pages
ARCHtoolbox - Light Fixture (Luminaire) Types
No ratings yet
ARCHtoolbox - Light Fixture (Luminaire) Types
3 pages
U Bub Ms A PR Instructions 202223
No ratings yet
U Bub Ms A PR Instructions 202223
30 pages
Inmex India Connect
No ratings yet
Inmex India Connect
10 pages
Dsnizkp
No ratings yet
Dsnizkp
20 pages
UfiSpace-Open-Aggregation-Router-S9600-72XC-Datasheet
No ratings yet
UfiSpace-Open-Aggregation-Router-S9600-72XC-Datasheet
2 pages
Elation Artiste Picasso - User Manual
No ratings yet
Elation Artiste Picasso - User Manual
48 pages
Lesson Plan - Electrical Materials and Supplies
100% (1)
Lesson Plan - Electrical Materials and Supplies
5 pages
Cassandra and DataStax Enterprise Essentials
No ratings yet
Cassandra and DataStax Enterprise Essentials
38 pages
I/O Port Structure: by B. Prasanthi, Assistant Professor Department. of ECE AITS, Rajampet
100% (1)
I/O Port Structure: by B. Prasanthi, Assistant Professor Department. of ECE AITS, Rajampet
40 pages
UTS Brochure
No ratings yet
UTS Brochure
20 pages
Dolly Invention Field
No ratings yet
Dolly Invention Field
21 pages
Air Fuel Ratio Control 3516B
No ratings yet
Air Fuel Ratio Control 3516B
4 pages
Owner's Manual: 2 Door Compact Refrigerator
0% (1)
Owner's Manual: 2 Door Compact Refrigerator
24 pages
Standard Procedures Manual (Conduct of EIA)
No ratings yet
Standard Procedures Manual (Conduct of EIA)
3 pages
Bug Scouts GDD
No ratings yet
Bug Scouts GDD
14 pages
Samsung Le40n87bdx Gtu40hen Le46n87bdx Gtu46hen PDF
No ratings yet
Samsung Le40n87bdx Gtu40hen Le46n87bdx Gtu46hen PDF
144 pages
Telecom Egypt Business Trunk Accessing Solution v0.5
No ratings yet
Telecom Egypt Business Trunk Accessing Solution v0.5
17 pages
Chapter 1 Embedded Systems
No ratings yet
Chapter 1 Embedded Systems
10 pages
Ritters Crypto Glossary
No ratings yet
Ritters Crypto Glossary
128 pages
Internet and Computer Virus
No ratings yet
Internet and Computer Virus
12 pages
1 4990045931098341906
No ratings yet
1 4990045931098341906
3 pages
Artificial Intelligence: (Document Subtitle)
No ratings yet
Artificial Intelligence: (Document Subtitle)
9 pages
DS2477 Security UG6751_Rev 4_VER0100h
No ratings yet
DS2477 Security UG6751_Rev 4_VER0100h
62 pages
Biostatistics (HS167) Lab Manual: # Variable Name Variable Label Codes and Parameters (Dots Represent Missing Data)
No ratings yet
Biostatistics (HS167) Lab Manual: # Variable Name Variable Label Codes and Parameters (Dots Represent Missing Data)
15 pages
Chapter 1
No ratings yet
Chapter 1
6 pages
Optima 7x00 Series
No ratings yet
Optima 7x00 Series
12 pages
Career Interest Survey
No ratings yet
Career Interest Survey
1 page
Sharma Et Al. (2022)
No ratings yet
Sharma Et Al. (2022)
17 pages

ML LAB Task-1 Task-2 Notes

Uploaded by

ML LAB Task-1 Task-2 Notes

Uploaded by

VARDHAMAN COLLEGE OF ENGINEERING

Definition: Concept learning - Inferring a Boolean-valued function from training

A CONCEPT LEARNING TASK

Example Sky AirTemp Humidity Wind Wate Forecast Enjoy

2 Sunny Warm High Strong Warm Same Yes

3 Rainy Cold High Strong Warm Change No

4 Sunny Warm High Strong Cool Change Yes

What hypothesis representation is provided to the learner?

 Let’s consider a simple representation in which each hypothesis consists of

If some instance x satisfies all the constraints of hypothesis h, then h classifies x as a

The most general hypothesis-that every day is a positive example-is represented by

The most specific possible hypothesis-that no day is a positive example-is represented by

 Hypotheses H: Each hypothesis is described by a conjunction of constraints on the

 Target concept c: EnjoySport : X → {0, l}

Table: The EnjoySport concept learning task.

The inductive learning hypothesis

 Concept learning can be viewed as the task of searching through a large

General-to-Specific Ordering of Hypotheses

Consider the two hypotheses

Definition: Let hj and hk be Boolean-valued functions defined over X. Then hj is more

( xX ) [(hk (x) = 1) → (hj (x) = 1)]

FIND-S: FINDING A MAXIMALLY SPECIFIC HYPOTHESIS

1. Initialize h to the most specific hypothesis in H

Example Sky AirTemp Humidity Wind Water Forecast EnjoySport

 The first step of FIND-S is to initialize h to the most specific hypothesis in H

 Consider the first training example

 Consider the second training example

 Consider the third training example

 Consider the fourth training example

The key property of the FIND-S algorithm

1. Has the learner converged to the correct target concept?

Definition: consistent- A hypothesis h is consistent with a set of training examples D if and

Consistent (h, D)  ( x, c(x)  D) h(x) =

c(x)) Note difference between definitions of consistent and

The LIST-THEN-ELIMINATION algorithm

The LIST-THEN-ELIMINATE algorithm first initializes the version space to contain

VersionSpace c a list containing every hypothesis in H

The LIST-THEN-ELIMINATE Algorithm

 List-Then-Eliminate works in principle, as long as version space is finite.

G {g  H | Consistent (g, D)(g'  H)[(g'  g)  Consistent(g', D)]}

S {s  H | Consistent (s, D)(s'  H)[(s  s')  Consistent(s', D)]}

The CANDIDATE-ELIMINTION algorithm computes the version space containing all

Initialize G to the set of maximally general hypotheses

CANDIDATE- ELIMINTION algorithm using version spaces

CANDIDATE-ELIMINTION algorithm begins by initializing the version space to the set

Initializing the G boundary set to contain the most general hypothesis in H

You might also like