The Candidate Elimination Algorithm (CEA) incrementally builds a version space based on a hypothesis space and examples, refining hypotheses by removing those inconsistent with the examples. It aims to find all consistent hypotheses while managing general and specific boundaries, making it more accurate and flexible than the Find-S algorithm. However, CEA is more complex, requires more memory, and may be slower with large datasets, potentially leading to overfitting.
Concept Learning
Candidate Elimination Algorithm
• The candidate elimination algorithm incrementally builds the version space given a hypothesis space H and a set E of examples.
• The examples are added one by one; each example possibly shrinks the version space by removing the hypotheses that are inconsistent with that example.
• The algorithm does this by updating the general boundary G and the specific boundary S for each new example.
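Before the details, a minimal Python sketch of how a single hypothesis can be tested for consistency with a training example, assuming the attribute-vector representation used in these notes ('?' matches any value, '0' matches no value). The function names and the sample instance are illustrative only, not part of the original notes.

# A hypothesis is a tuple of attribute constraints, e.g.
# ('Sunny', 'Warm', '?', 'Strong', '?', '?');  '?' = any value, '0' = no value.

def matches(h, x):
    """True if hypothesis h classifies instance x as positive."""
    return all(hv == '?' or hv == xv for hv, xv in zip(h, x))

def consistent(h, x, label):
    """h is consistent with the example (x, c(x)) if h(x) == c(x)."""
    return matches(h, x) == label

# Illustrative check with a hypothetical EnjoySport instance:
x = ('Sunny', 'Warm', 'Normal', 'Strong', 'Warm', 'Same')
h = ('Sunny', 'Warm', '?', '?', '?', '?')
print(consistent(h, x, True))   # True: h predicts positive and the label is positive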
Candidate Elimination Algorithm (CEA): Concept
• The concept to learn is: when does a person (Aldo) enjoy sport?
• The answer is Boolean:
  • Yes – enjoys sport, or
  • No – does not enjoy sport.

Hypothesis Space
• The actual space in which we search is huge, so we restrict the search to a hypothesis representation that uses only the attributes in the training dataset.
• The easiest representation is a conjunction of constraints on all the attributes.
• <Sunny, Warm, Normal, Strong, Warm, Same, Yes> is a training example written as <x, c(x)>; here c(x) is 'Yes'.
• Read as a conjunctive rule, it says: if the day is Sunny and Warm and Normal and Strong and Warm and Same, then Aldo enjoys sport.

Consistent
• A hypothesis h is consistent with a training example x if h(x) = c(x).
• A hypothesis h is consistent with a dataset D if h(x) = c(x) for all x in D.

Version Space and Minimal Generalization
• The goal is to find all consistent hypotheses on D, i.e. every hypothesis in the hypothesis space H that is consistent with the dataset D.
• This set is termed the Version Space.
• Minimal generalization (of a single attribute constraint):
  • 0 is replaced with a specific attribute value, or
  • a specific attribute value is replaced with ?.

Algorithm Overview
• Goal: maintain two sets G and S.
  G = set of maximally general hypotheses consistent with D
  S = set of maximally specific hypotheses consistent with D
• Step 1: Initialise
  G_0 = most general hypothesis = <?, ?, ?, ?, ?, ?>
  S_0 = most specific hypothesis = <0, 0, 0, 0, 0, 0>
  In G_0 and S_0, the subscript 0 gives the number of training instances processed so far.

Algorithm Overview (Contd.)
• Step 2: Perform Step 3 for every instance in the training dataset.
• Step 3: Check whether the instance has a positive or a negative label (EnjoySport = Yes is positive). Perform Step 3.1 if the example is positive and Step 3.2 if it is negative. (A Python sketch of Steps 1–3 is given after this section.)

Step 3.1. The instance x is positive.
  Step 3.1.1. Check G: take each hypothesis g in G; if g is inconsistent with x, remove g from G.
  Step 3.1.2. Check S: take each hypothesis s in S and check it against x. If s is inconsistent with x:
    - Remove s from S.
    - Find all minimal generalizations h of s such that:
      – h is consistent with x (the generalization must be minimal), and
      – h is less general than (or equal to) some hypothesis in G.
    - Insert these h into S.
    - Then check the hypotheses in S: if any hypothesis is more general than another hypothesis in S, remove the more general one.

Step 3.2. The instance x is negative. (Note: the roles of G and S are swapped with respect to Step 3.1.)
  Step 3.2.1. Check S: take each hypothesis s in S; if s is inconsistent with x, remove s from S.
  Step 3.2.2. Check G: take each hypothesis g in G and check it against x. If g is inconsistent with x:
    - Remove g from G.
    - Find all minimal specializations h of g such that:
      • h is consistent with x (the specialization must be minimal), and
      • h is more general than (or equal to) some hypothesis in S.
    - Insert these h into G.
    - Then check the hypotheses in G: if any hypothesis is less general than another hypothesis in G, remove the less general one.

• The Candidate Elimination Algorithm (CEA) is an improvement over the Find-S algorithm for classification tasks.

Will the CEA converge to the correct hypothesis?
• Yes, provided the training examples contain no errors and the target concept is contained in the hypothesis space H.

Partially Learned Concept – Example

How can partially learned concepts be used?
• Partially learned concepts can be used to evaluate new examples and update the hypothesis space accordingly.
• When presented with an example, compare its attribute values with the current hypotheses to determine whether it supports or contradicts them.
• This helps in eliminating hypotheses that are inconsistent with the examples and narrowing down the version space.
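The following is a minimal Python sketch of Steps 1–3 above for the conjunctive attribute-vector representation ('?' = any value, '0' = no value). It is an illustration, not part of the original notes: the helper names (matches, more_general_or_equal, min_generalize, min_specialize) and the explicit list of attribute domains are assumptions made for the sketch.

WILDCARD, EMPTY = '?', '0'

def matches(h, x):
    """h classifies x as positive iff every constraint is '?' or equals x's value."""
    return all(hv == WILDCARD or hv == xv for hv, xv in zip(h, x))

def more_general_or_equal(h1, h2):
    """Per attribute: '?' covers everything and '0' is covered by everything."""
    return all(a == WILDCARD or b == EMPTY or a == b for a, b in zip(h1, h2))

def min_generalize(s, x):
    """The unique minimal generalization of s that covers the positive instance x."""
    return tuple(xv if sv == EMPTY else (sv if sv == xv else WILDCARD)
                 for sv, xv in zip(s, x))

def min_specialize(g, x, domains):
    """All minimal specializations of g that exclude the negative instance x."""
    out = []
    for i, gv in enumerate(g):
        if gv == WILDCARD:
            for v in domains[i]:
                if v != x[i]:
                    out.append(g[:i] + (v,) + g[i + 1:])
    return out

def candidate_elimination(examples, domains):
    n = len(domains)
    S = [(EMPTY,) * n]       # Step 1: most specific boundary S_0
    G = [(WILDCARD,) * n]    # Step 1: most general boundary G_0
    for x, positive in examples:                      # Steps 2 and 3
        if positive:                                  # Step 3.1: positive example
            G = [g for g in G if matches(g, x)]       # 3.1.1: drop inconsistent g
            new_S = []
            for s in S:                               # 3.1.2: generalize S
                if matches(s, x):
                    new_S.append(s)
                else:
                    h = min_generalize(s, x)
                    if any(more_general_or_equal(g, h) for g in G):
                        new_S.append(h)
            # drop any member of S that is more general than another member of S
            S = [s for s in new_S
                 if not any(s != t and more_general_or_equal(s, t) for t in new_S)]
        else:                                         # Step 3.2: negative example
            S = [s for s in S if not matches(s, x)]   # 3.2.1: drop inconsistent s
            new_G = []
            for g in G:                               # 3.2.2: specialize G
                if not matches(g, x):
                    new_G.append(g)
                else:
                    for h in min_specialize(g, x, domains):
                        if any(more_general_or_equal(h, s) for s in S):
                            new_G.append(h)
            # drop any member of G that is less general than another member of G
            G = [g for g in new_G
                 if not any(g != t and more_general_or_equal(t, g) for t in new_G)]
    return S, G

Run on the standard EnjoySport training set from Mitchell's textbook (assumed here to be the dataset behind the Aldo example; the four rows below are not reproduced from these notes), the boundaries converge to the usual result:

domains = [('Sunny', 'Rainy'), ('Warm', 'Cold'), ('Normal', 'High'),
           ('Strong', 'Weak'), ('Warm', 'Cool'), ('Same', 'Change')]
data = [(('Sunny', 'Warm', 'Normal', 'Strong', 'Warm', 'Same'), True),
        (('Sunny', 'Warm', 'High', 'Strong', 'Warm', 'Same'), True),
        (('Rainy', 'Cold', 'High', 'Strong', 'Warm', 'Change'), False),
        (('Sunny', 'Warm', 'High', 'Strong', 'Cool', 'Change'), True)]
S, G = candidate_elimination(data, domains)
print(S)   # [('Sunny', 'Warm', '?', 'Strong', '?', '?')]
print(G)   # [('Sunny', '?', '?', '?', '?', '?'), ('?', 'Warm', '?', '?', '?', '?')]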
Advantages of CEA over Find-S
• Improved accuracy: CEA considers both positive and negative examples when generating hypotheses, which can result in higher accuracy when dealing with noisy or incomplete data.
• Flexibility: CEA can handle more complex classification tasks, such as those with multiple classes or non-linear decision boundaries.
• More efficient: CEA reduces the number of candidate hypotheses by maintaining a set of general hypotheses and eliminating inconsistent ones one by one, which can result in faster processing and improved efficiency.
• Better handling of continuous attributes: CEA can handle continuous attributes by creating boundaries for each attribute, making it suitable for a wider range of datasets.

Disadvantages of CEA in comparison with Find-S
• More complex: CEA is a more complex algorithm than Find-S, which may make it harder to use and understand for beginners or those without a strong background in machine learning.
• Higher memory requirements: CEA must store the sets of boundary hypotheses, which may make it less suitable for memory-constrained environments.
• Slower processing for large datasets: CEA may become slower on larger datasets because of the increased number of hypotheses generated.
• Higher potential for overfitting: the increased complexity of CEA may make it more prone to overfitting the training data, especially if the dataset is small or very noisy.

Exercise – CEA

Exercise 2
• For the dataset given below, find the specific and general boundaries using the candidate elimination algorithm.

Inductive Bias
• Inductive bias can be defined as the set of assumptions or biases that a learning algorithm employs to make predictions on unseen data based on its training data.
• These assumptions are inherent in the algorithm's design and serve as a foundation for learning and generalization.
• The inductive bias of an algorithm influences how it selects a hypothesis (a possible explanation or model) from the hypothesis space (the set of all possible hypotheses) that best fits the training data.
• It helps the algorithm navigate the trade-off between fitting the training data too closely (overfitting) and failing to capture the underlying pattern (underfitting).
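To connect the "How can partially learned concepts be used?" question above with the need for inductive bias, here is a hedged sketch of classifying a new instance using only the boundary sets S and G: if every hypothesis in the version space agrees, the prediction is definite; otherwise the instance remains ambiguous. The boundary values below are the ones produced by the earlier EnjoySport sketch, and the helper name classify is illustrative, not from the original notes.

def matches(h, x):
    return all(hv == '?' or hv == xv for hv, xv in zip(h, x))

def classify(x, S, G):
    """Classify x using a partially learned version space given by its boundaries."""
    if all(matches(s, x) for s in S):
        return True      # every hypothesis in the version space predicts positive
    if not any(matches(g, x) for g in G):
        return False     # every hypothesis in the version space predicts negative
    return None          # the hypotheses disagree: more training data is needed

# Boundaries from the EnjoySport sketch above:
S = [('Sunny', 'Warm', '?', 'Strong', '?', '?')]
G = [('Sunny', '?', '?', '?', '?', '?'), ('?', 'Warm', '?', '?', '?', '?')]
print(classify(('Sunny', 'Warm', 'Normal', 'Strong', 'Cool', 'Change'), S, G))  # True
print(classify(('Rainy', 'Cold', 'Normal', 'Weak', 'Warm', 'Same'), S, G))      # False
print(classify(('Sunny', 'Cold', 'Normal', 'Strong', 'Warm', 'Same'), S, G))    # None (ambiguous)

A learner with no bias at all would return the ambiguous answer for every unseen instance, which is the point of the next section.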
An Unbiased Learner
• An unbiased learner, whose hypothesis space can represent every possible concept, can never generalize: after any set of training examples, its version space still contains hypotheses that disagree on every unseen instance.
• It is therefore impossible to achieve learning without any biases; all learning is inherently influenced by some form of bias.

Inductive Bias – a formal representation
• A |= B ("A entails B") means: for every evaluation in which all elements of A evaluate to true, B also evaluates to true.
• With this notation, the inductive bias of a learner L can be written as a set of assertions B such that, for every new instance x and training data D, the learner's classification follows by entailment: (B ∧ D ∧ x) |= L(x, D).

Modelling Inductive Systems