
CONCEPT LEARNING

It is a process of abstraction and generalization from the data.

Concept learning requires three things:

1. Input - Training dataset, which is a set of training instances.

2. Output - Target concept or target function f. It is a mapping function f(x) from input x to output y.

3. Test - New instances to test the learned model.

Concept learning is defined as: "Given a set of hypotheses, the learner searches through the hypothesis space to identify the best hypothesis that matches the target concept."

Here, in this set of training instances:
The independent attributes considered are 'Horns', 'Tail', 'Tusks', 'Paws', 'Fur', 'Color', 'Hooves' and 'Size'.
The dependent attribute is 'Elephant'. The target concept is to identify whether the animal is an Elephant.
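To make the worked examples below concrete, the five training instances of Table 3.1 can be written as attribute tuples. This is a minimal sketch: the attribute order follows the list above, the values are transcribed from the instances used later in this section, and the variable names are illustrative, not from the text.

```python
# Training instances from Table 3.1 (attribute order:
# Horns, Tail, Tusks, Paws, Fur, Color, Hooves, Size).
# Each entry is (instance, label); label True means 'Elephant = Yes'.
ATTRIBUTES = ["Horns", "Tail", "Tusks", "Paws", "Fur", "Color", "Hooves", "Size"]

TRAINING_DATA = [
    (("No",  "Short", "Yes", "No",  "No",  "Black", "No",  "Big"),    True),   # I1
    (("Yes", "Short", "No",  "No",  "No",  "Brown", "Yes", "Medium"), False),  # I2
    (("No",  "Short", "Yes", "No",  "No",  "Black", "No",  "Medium"), True),   # I3
    (("No",  "Long",  "No",  "Yes", "Yes", "White", "No",  "Medium"), False),  # I4
    (("No",  "Short", "Yes", "Yes", "Yes", "Black", "No",  "Big"),    True),   # I5
]

for instance, is_elephant in TRAINING_DATA:
    print(dict(zip(ATTRIBUTES, instance)), "->", is_elephant)
```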
Representation of a Hypothesis

• A hypothesis 'h' approximates a target function 'f' to represent the relationship between the independent attributes and the dependent attribute of the training instances.

• Each hypothesis is represented as a conjunction of attribute conditions in the antecedent part.

• For example, (Tail = Short) ∧ (Color = Black) ...

• The set of hypotheses in the search space is called hypotheses ('hypotheses' is simply the plural of 'hypothesis').

• 'H' is used to represent the set of hypotheses (the hypothesis space).

• 'h' is used to represent a candidate hypothesis.

• Each attribute can take the value '?', the value 'φ', or a single specific value.

• '?' denotes that the attribute can take any value [e.g., Color = ?].
• 'φ' denotes that the attribute cannot take any value, i.e., it represents a null value [e.g., Horns = φ].
• A single value denotes a specific value from the acceptable values of the attribute; e.g., the attribute 'Tail' can take the value 'Short' [Tail = Short].

• The different hypotheses that can be predicted for the target concept are:

The most general hypothesis allows any value for each of the attributes.
It is represented as <?, ?, ?, ?, ?, ?, ?, ?>.
This hypothesis indicates that any animal can be an elephant.

The most specific hypothesis does not allow any value for any of the attributes: <φ, φ, φ, φ, φ, φ, φ, φ>.
This hypothesis indicates that no animal can be an elephant.
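These three cases can be captured in a small predicate that tests whether a hypothesis classifies an instance as positive. A minimal sketch; the names ANY, PHI, and matches are illustrative choices (PHI standing in for φ), not notation from the text.

```python
ANY = "?"    # '?': the attribute may take any value
PHI = "phi"  # stand-in for 'φ': the attribute may take no value at all

def matches(hypothesis, instance):
    """Return True iff the hypothesis classifies the instance as positive.

    Every attribute condition in the conjunction must hold:
    '?' always holds, 'phi' never holds, and a single value must
    equal the instance's value for that attribute.
    """
    for h_val, x_val in zip(hypothesis, instance):
        if h_val == PHI:                     # phi rejects every value
            return False
        if h_val != ANY and h_val != x_val:  # a fixed value must match exactly
            return False
    return True

# The most general hypothesis accepts any instance;
# the most specific hypothesis (all phi) accepts none.
most_general  = (ANY,) * 8
most_specific = (PHI,) * 8
elephant = ("No", "Short", "Yes", "No", "No", "Black", "No", "Big")
print(matches(most_general, elephant))   # True
print(matches(most_specific, elephant))  # False
```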
Hypothesis space
• Hypothesis space is the set of all possible hypotheses that approximate the target function f.

• The subset of the hypothesis space that is consistent with all observed training instances is called the Version Space.

• The version space contains the only hypotheses that are used for the classification.
• For example, each of the attributes given in Table 3.1 has the following possible set of values.
• Considering these values for each of the attributes, there are (2 x 2 x 2 x 2 x 2 x 3 x 2 x 2) = 384 distinct instances, which cover all the 5 instances in the training dataset.

So, we can generate (4 x 4 x 4 x 4 x 4 x 5 x 4 x 4) = 81,920 distinct hypotheses when including the two extra values [?, φ] for each of the attributes.
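Both counts are simple products over the number of values each attribute can take; a quick check (the per-attribute counts below are read off the products above, with 'Color' as the three-valued attribute):

```python
from math import prod

# Distinct values per attribute in Table 3.1:
# Horns, Tail, Tusks, Paws, Fur, Color, Hooves, Size.
value_counts = [2, 2, 2, 2, 2, 3, 2, 2]

# Distinct instances: one concrete value per attribute.
instances = prod(value_counts)                   # 2*2*2*2*2*3*2*2 = 384

# Distinct syntactic hypotheses: each attribute may also be '?' or phi,
# i.e., two extra "values" per attribute.
hypotheses = prod(n + 2 for n in value_counts)   # 4*4*4*4*4*5*4*4 = 81,920

print(instances, hypotheses)  # 384 81920
```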
Heuristic search space
• Heuristic search is a search strategy that finds an optimized hypothesis/solution to a problem by iteratively improving the hypothesis/solution based on a given heuristic function or cost measure.

• Several commonly used heuristic search methods are hill climbing, constraint satisfaction, best-first search, simulated annealing, the A* algorithm, and genetic algorithms (a minimal hill-climbing sketch follows below).
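The sketch below illustrates only the generic "iteratively improve a candidate using a cost measure" idea behind hill climbing; the function names and the toy problem are illustrative assumptions, not from the text.

```python
import random

def hill_climb(initial, neighbors, cost, max_steps=1000):
    """Generic hill climbing: repeatedly move to a lower-cost neighbor.

    `neighbors(s)` yields candidate solutions near s, and `cost(s)` is the
    heuristic measure being minimized. Stops at a local optimum.
    """
    current = initial
    for _ in range(max_steps):
        candidates = list(neighbors(current))
        if not candidates:
            break
        best = min(candidates, key=cost)
        if cost(best) >= cost(current):   # no improving neighbor: local optimum
            break
        current = best
    return current

# Toy usage: minimize (x - 7)^2 over the integers by stepping +/- 1.
result = hill_climb(
    initial=random.randint(-100, 100),
    neighbors=lambda x: [x - 1, x + 1],
    cost=lambda x: (x - 7) ** 2,
)
print(result)  # 7
```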
Generalization and Specialization

• The hypothesis space can be searched by generalization of the most specific hypothesis and by specialization of the most general hypothesis, for an approximate hypothesis that matches all positive instances but does not match any negative instance.

Searching the Hypothesis Space
• There are two ways of learning a hypothesis consistent with all training instances from the large hypothesis space:

1. Specialization - General to Specific learning

2. Generalization - Specific to General learning

• Generalization - Specific to General Learning:
This learning methodology searches through the hypothesis space for an approximate hypothesis by generalizing the most specific hypothesis.

Example: Consider the training instances shown in Table 3.1 and illustrate Specific to General learning.

Solution: We will start from all false, i.e., the most specific hypothesis, to determine the most restrictive specialization. Consider only the positive instances and generalize the most specific hypothesis; ignore the negative instances.
• The most specific hypothesis is taken first; it will not classify any instance as true.
• h = <φ, φ, φ, φ, φ, φ, φ, φ>
• Read the first instance I1 and generalize the hypothesis h so that this positive instance is classified as true by the resulting hypothesis h1.

I1: <No, Short, Yes, No, No, Black, No, Big> → Yes (Positive instance)

h1 = <No, Short, Yes, No, No, Black, No, Big>

• The second instance I2 is a negative instance, so ignore it: h2 = h1.

I2: <Yes, Short, No, No, No, Brown, Yes, Medium> → No (Negative instance)

h2 = <No, Short, Yes, No, No, Black, No, Big>

• The third instance I3 is a positive instance, so generalize h2 to h3 to accommodate it. The resulting h3 is more general (the mismatching 'Size' value becomes '?').

I3: <No, Short, Yes, No, No, Black, No, Medium> → Yes (Positive instance)

h3 = <No, Short, Yes, No, No, Black, No, ?>

• Ignore I4 since it is a negative instance, so h4 = h3.

I4: <No, Long, No, Yes, Yes, White, No, Medium> → No (Negative instance)

h4 = <No, Short, Yes, No, No, Black, No, ?>

• When reading the fifth instance I5, h4 is further generalized to h5 (the mismatching 'Paws' and 'Fur' values become '?').

I5: <No, Short, Yes, Yes, Yes, Black, No, Big> → Yes (Positive instance)

h5 = <No, Short, Yes, ?, ?, Black, No, ?>

• After observing all the positive instances, an approximate hypothesis h5 is generated, which can now classify any subsequent positive instance as true.
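The generalization steps above are mechanical, so they can be sketched in a few lines of Python: start from the all-φ hypothesis and, for each positive instance, adopt its value where the hypothesis is still φ and relax to '?' on any mismatch. The tuple encoding and names are illustrative assumptions.

```python
ANY, PHI = "?", "phi"

def generalize(h, positive):
    """Minimally generalize h so that it covers the positive instance."""
    return tuple(
        x if hv == PHI else          # phi -> adopt the instance's value
        hv if hv == x else ANY       # mismatch -> relax to '?'
        for hv, x in zip(h, positive)
    )

def specific_to_general(data, n_attrs):
    h = (PHI,) * n_attrs             # most specific hypothesis
    for instance, is_positive in data:
        if is_positive:              # negative instances are ignored
            h = generalize(h, instance)
    return h

TRAINING_DATA = [
    (("No",  "Short", "Yes", "No",  "No",  "Black", "No",  "Big"),    True),
    (("Yes", "Short", "No",  "No",  "No",  "Brown", "Yes", "Medium"), False),
    (("No",  "Short", "Yes", "No",  "No",  "Black", "No",  "Medium"), True),
    (("No",  "Long",  "No",  "Yes", "Yes", "White", "No",  "Medium"), False),
    (("No",  "Short", "Yes", "Yes", "Yes", "Black", "No",  "Big"),    True),
]
print(specific_to_general(TRAINING_DATA, 8))
# ('No', 'Short', 'Yes', '?', '?', 'Black', 'No', '?')  -> h5 above
```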
Example 2:
Consider the sample training instances shown in Table 1, which describe the symptoms of persons and their Covid-19 test results. Apply Specific to General learning to search for an approximate hypothesis in the hypothesis space.
Specialization - General to Specific Learning
• This learning methodology searches through the hypothesis space for an approximate hypothesis by specializing the most general hypothesis.
• Example: Illustrate learning by Specialization - General to Specific learning for the data instances shown in Table 3.1.
• Start from the most general hypothesis, which will classify all positive and negative instances as true.

• h = <?, ?, ?, ?, ?, ?, ?, ?>

I1: <No, Short, Yes, No, No, Black, No, Big> → Yes (Positive instance)

h1 = <?, ?, ?, ?, ?, ?, ?, ?>

I2: <Yes, Short, No, No, No, Brown, Yes, Medium> → No (Negative instance)

h2 = <No, ?, ?, ?, ?, ?, ?, ?>
     <?, Long, ?, ?, ?, ?, ?, ?>
     <?, ?, Yes, ?, ?, ?, ?, ?>
     <?, ?, ?, Yes, ?, ?, ?, ?>
     <?, ?, ?, ?, Yes, ?, ?, ?>
     <?, ?, ?, ?, ?, Black, ?, ?>
     <?, ?, ?, ?, ?, ?, No, ?>
     <?, ?, ?, ?, ?, ?, ?, Big>

h2 imposes constraints so that it will not classify a negative instance as true: each hypothesis in h2 fixes one attribute to a value that contradicts the negative instance I2.


I3: <No, Short, Yes, No, No, Black, No, Medium> → Yes (Positive instance)

• h3 = h2

h3 = <No, ?, ?, ?, ?, ?, ?, ?>
     <?, Long, ?, ?, ?, ?, ?, ?>
     <?, ?, Yes, ?, ?, ?, ?, ?>
     <?, ?, ?, Yes, ?, ?, ?, ?>
     <?, ?, ?, ?, Yes, ?, ?, ?>
     <?, ?, ?, ?, ?, Black, ?, ?>
     <?, ?, ?, ?, ?, ?, No, ?>
     <?, ?, ?, ?, ?, ?, ?, Big>
• I4 is a negative instance:

I4: <No, Long, No, Yes, Yes, White, No, Medium> → No (Negative instance)

Remove any hypothesis inconsistent with this negative instance, i.e., any hypothesis that classifies I4 as true. Only three hypotheses survive:

h4 = <?, ?, Yes, ?, ?, ?, ?, ?>
     <?, ?, ?, ?, ?, Black, ?, ?>
     <?, ?, ?, ?, ?, ?, ?, Big>


I5: <No, Short, Yes, Yes, Yes, Black, No, Big> → Yes (Positive instance)

• h5 = h4

h5 = <?, ?, Yes, ?, ?, ?, ?, ?>
     <?, ?, ?, ?, ?, Black, ?, ?>
     <?, ?, ?, ?, ?, ?, ?, Big>

Thus, h5 is the set of hypotheses generated, which will classify the positive instances as true and the negative instances as false.
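The walkthrough uses two operations on negative instances: specializing hypotheses that cover the negative (by fixing one '?' attribute to a contradicting value), and pruning covering hypotheses later on. Below is a minimal sketch of those two operations; it is a simplification, not the full candidate elimination algorithm, and one detail differs from the slides: specializing on I2 also produces <?, ?, ?, ?, ?, White, ?, ?>, which I4 then prunes, so the final set still matches h5. All names are illustrative.

```python
ANY = "?"

# Possible values per attribute (Horns, Tail, Tusks, Paws, Fur, Color, Hooves, Size).
DOMAINS = [("Yes", "No"), ("Short", "Long"), ("Yes", "No"), ("Yes", "No"),
           ("Yes", "No"), ("Black", "Brown", "White"), ("Yes", "No"),
           ("Big", "Medium")]

def matches(h, x):
    return all(hv == ANY or hv == xv for hv, xv in zip(h, x))

def specializations(h, negative):
    """One-attribute specializations of h that exclude the negative instance."""
    out = []
    for i, nv in enumerate(negative):
        if h[i] == ANY:                    # only '?' positions can be tightened
            out.extend(h[:i] + (v,) + h[i + 1:]
                       for v in DOMAINS[i] if v != nv)
    return out

def general_to_specific(data, n_attrs):
    G = [(ANY,) * n_attrs]                 # start with the most general hypothesis
    for x, positive in data:
        if positive:
            continue                       # as in the walkthrough, positives leave G unchanged
        survivors = [h for h in G if not matches(h, x)]
        if survivors:
            G = survivors                  # later negatives: prune covering hypotheses
        else:                              # every hypothesis covers the negative: specialize
            G = [s for h in G for s in specializations(h, x)]
    return G

TRAINING_DATA = [
    (("No",  "Short", "Yes", "No",  "No",  "Black", "No",  "Big"),    True),
    (("Yes", "Short", "No",  "No",  "No",  "Brown", "Yes", "Medium"), False),
    (("No",  "Short", "Yes", "No",  "No",  "Black", "No",  "Medium"), True),
    (("No",  "Long",  "No",  "Yes", "Yes", "White", "No",  "Medium"), False),
    (("No",  "Short", "Yes", "Yes", "Yes", "Black", "No",  "Big"),    True),
]
for h in general_to_specific(TRAINING_DATA, 8):
    print(h)   # prints the three hypotheses of h5
```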
Example 2:
Consider the sample training instances shown in Table 1, which describe the symptoms of persons and their Covid-19 test results. Apply General to Specific learning to search for an approximate hypothesis in the hypothesis space.
Hypothesis Space Search by Find-S Algorithm

• The Find-S algorithm is guaranteed to converge to the most specific hypothesis in H that is consistent with the positive instances in the training dataset.

• This algorithm considers only the positive instances and eliminates negative instances while generating the hypothesis.

• Consider the training dataset of 4 instances shown in Table 3.2. It contains the details of the performance of students and their likelihood of getting a job offer or not in their final semester. Apply the Find-S algorithm.
• Step 1: Initialize 'h' to the most specific hypothesis. There are 6 attributes, so for each attribute we initially fill 'φ' in the initial hypothesis 'h'.
• h = <φ, φ, φ, φ, φ, φ>
• Step 2: Generalize the initial hypothesis for the first positive instance. I1 is a positive instance, so generalize the most specific hypothesis 'h' to include this positive instance. Hence:

I1: < >=9, Yes, Excellent, Good, Fast, Yes > → Positive instance

h1 = < >=9, Yes, Excellent, Good, Fast, Yes >

• Step 3: Scan the next instance I2. Since I2 is a positive instance, generalize 'h' to include it. For each non-matching attribute value in 'h', put a '?'. The third attribute value of 'h' mismatches with I2, so put a '?' there.

I2: < >=9, Yes, Good, Good, Fast, Yes > → Positive instance

h2 = < >=9, Yes, ?, Good, Fast, Yes >


• Now scan I3. Since it is a negative instance, ignore it. Hence, the hypothesis remains the same without any change after scanning I3.
• h3 = h2

I3: < >=8, No, Good, Good, Fast, No > → Negative instance (ignore it)

h3 = < >=9, Yes, ?, Good, Fast, Yes >
• Now scan I4. Since it is a positive instance, check for mismatches between the hypothesis 'h' and I4.
• The 5th and 6th attribute values mismatch, so put a '?' in those attributes of 'h'.

h3 = < >=9, Yes, ?, Good, Fast, Yes >

I4: < >=9, Yes, Good, Good, Slow, No > → Positive instance

h4 = < >=9, Yes, ?, Good, ?, ? >

• The final hypothesis generated with the Find-S algorithm is:

h = < >=9, Yes, ?, Good, ?, ? >

• It includes all positive instances and ignores every negative instance.
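The whole procedure fits in a few lines. Below is a minimal Python sketch run on the Table 3.2 instances transcribed from the steps above; the tuple encoding and names are assumptions for illustration.

```python
PHI = "phi"

def find_s(data, n_attrs):
    """Find-S: maximally specific hypothesis consistent with the positives."""
    h = [PHI] * n_attrs                    # step 1: most specific hypothesis
    for instance, positive in data:
        if not positive:
            continue                       # negative instances are ignored
        for i, value in enumerate(instance):
            if h[i] == PHI:
                h[i] = value               # first positive: copy its values
            elif h[i] != value:
                h[i] = "?"                 # mismatch: generalize to '?'
    return tuple(h)

# Table 3.2 instances as used in the walkthrough; label True = job offer.
TABLE_3_2 = [
    ((">=9", "Yes", "Excellent", "Good", "Fast", "Yes"), True),   # I1
    ((">=9", "Yes", "Good",      "Good", "Fast", "Yes"), True),   # I2
    ((">=8", "No",  "Good",      "Good", "Fast", "No"),  False),  # I3
    ((">=9", "Yes", "Good",      "Good", "Slow", "No"),  True),   # I4
]

print(find_s(TABLE_3_2, 6))
# ('>=9', 'Yes', '?', 'Good', '?', '?')  -> matches h4 above
```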


Limitations of Find-S Algorithm
• The Find-S algorithm tries to find a hypothesis that is consistent with the positive instances, ignoring all negative instances.
• As long as the training dataset is consistent, the hypothesis found by this algorithm is also consistent.
• The algorithm finds only one unique hypothesis, whereas there may be many other hypotheses that are consistent with the training dataset.
• Often the training dataset may contain some errors; such inconsistent data instances can mislead this algorithm in determining the consistent hypothesis, since it ignores negative instances.
Example: Applying Find-S to the classic EnjoySport training instances.

• Step 1:
• I1 = <Sunny, Warm, Normal, Strong, Warm, Same> → Yes (+ve)
• h1 = <Sunny, Warm, Normal, Strong, Warm, Same>

• Step 2:
• h1 = <Sunny, Warm, Normal, Strong, Warm, Same>
• I2 = <Sunny, Warm, High, Strong, Warm, Same> → Yes (+ve)
• h2 = <Sunny, Warm, ?, Strong, Warm, Same>

• Step 3:
• h2 = <Sunny, Warm, ?, Strong, Warm, Same>
• I3 = <Rainy, Cold, High, Strong, Warm, Change> → No (-ve)
• I3 is a negative example, hence it is ignored.
• h3 = h2
• h3 = <Sunny, Warm, ?, Strong, Warm, Same>

• Step 4:
• h3 = <Sunny, Warm, ?, Strong, Warm, Same>
• I4 = <Sunny, Warm, High, Strong, Cool, Change> → Yes (+ve)
• h4 = <Sunny, Warm, ?, Strong, ?, ?>

• The final maximally specific hypothesis is:
• <Sunny, Warm, ?, Strong, ?, ?>
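Running the same Find-S sketch on these instances reproduces the trace above; the function is repeated here so the snippet runs on its own.

```python
PHI = "phi"

def find_s(data, n_attrs):
    h = [PHI] * n_attrs
    for instance, positive in data:
        if not positive:
            continue                       # Find-S ignores negatives
        for i, value in enumerate(instance):
            if h[i] == PHI:
                h[i] = value
            elif h[i] != value:
                h[i] = "?"
    return tuple(h)

ENJOY_SPORT = [
    (("Sunny", "Warm", "Normal", "Strong", "Warm", "Same"),   True),   # I1
    (("Sunny", "Warm", "High",   "Strong", "Warm", "Same"),   True),   # I2
    (("Rainy", "Cold", "High",   "Strong", "Warm", "Change"), False),  # I3
    (("Sunny", "Warm", "High",   "Strong", "Cool", "Change"), True),   # I4
]

print(find_s(ENJOY_SPORT, 6))
# ('Sunny', 'Warm', '?', 'Strong', '?', '?')
```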
