5 - AIML - Module3 - PPT

The document discusses various topics related to machine learning including learning theory, similarity based learning, and regression analysis. It defines key concepts like learning, concept learning, target functions, and hypothesis representation. It also describes algorithms like Find-S for learning the most specific hypothesis and version spaces for identifying consistent hypotheses.


AIML Module-3

Deepak B L
Assistant Professor
Dept. of CSE,SJCIT
TOPICS

1. Basics of Learning Theory
2. Similarity-Based Learning
3. Regression Analysis
Introduction to learning
The process of acquiring knowledge and expertise through study,
experience, or being taught is called learning.

Learning types/methods
1.Learn by memorization
2.Learn by examples
3.Learn by being taught
4.Learning by critical thinking
5.Self learning
6.Learning to solve problems
7.Learning by generalizing explanations
• According to Tom Mitchell, “A computer program is said to learn
from experience E with respect to some class of tasks T and
performance measure P, if its performance at tasks in T, as
measured by P, improves with experience E.”

• Example: In Spam E-Mail detection,


✔ Task, T: To classify mails into Spam or Not Spam.
✔ Performance measure, P: Percentage of mails correctly
classified as “Spam” or “Not Spam”.
✔ Experience, E: A set of mails labelled “Spam” or “Not Spam”.
Design of a learning system
A system that is built around a learning algorithm is called a learning
system.
Steps:
1. Choosing a training experience
2. Choosing a target function
3. Representation of a target function
4. Function approximation
Training experience

✔ It is important to note that the data or experience fed to the
algorithm has a significant impact on the success or failure of
the model, so the training data or experience must be chosen wisely.
✔ The training experience should provide direct or indirect
feedback about choices. For example, while playing chess, the
training experience gives feedback such as: if this move is
chosen instead of that one, the chances of success increase.
Choosing a target function

✔ Based on the knowledge fed to the algorithm, the machine
learning system chooses a target function, e.g. NextMove, which
describes what legal move should be made.
✔ For example, while playing chess against an opponent, after the
opponent moves, the learning algorithm decides which of the
possible legal moves should be taken in order to succeed.
Representation of the target function

✔ Once the algorithm knows all the possible legal moves, the next
step is to choose the optimal move.
✔ For example, if the machine has 4 possible chess moves, it will
choose the optimized move that is most likely to lead to success.
Function approximation

✔ An optimal move cannot be chosen from the training data alone.
The learner must work through a set of examples; from these
examples it approximates which steps to choose and then receives
feedback on them.
✔ For example, when chess training data is fed to the algorithm,
it is not known in advance whether a move will fail or succeed;
from each failure or success the learner estimates what step
should be chosen for the next move and what its success rate is.
Introduction to concept learning
A Concept is a subset of objects or events defined over a larger set
[Example: The concept of a bird is the subset of all objects (i.e., the set of
all things or all animals) that belong to the category of bird.]

Alternatively, a concept is a boolean-valued function defined over
this larger set [Example: a function defined over all animals whose
value is true for birds and false for every other animal].
What is Concept-Learning?

• Given a set of examples labelled as members or non-members
of a concept, concept-learning consists of automatically
inferring the general definition of this concept.

• In other words, concept-learning consists of approximating a
boolean-valued function from training examples of its input
and output.
Concept learning requires three things
1. Input: training dataset which is a set of training
instances, each labelled with the name of a concept. Use
this past experience to train and build the model.
2. Output: target function f, a mapping f(x) from input x
to output y. It determines the specific features that
identify an object.
3. Test: new instances to test the learned model.
A Concept Learning Task

• Consider the example task of learning the target
concept “days on which my friend Aldo enjoys his
favorite water sport.”
• The table describes a set of example days, each
represented by a set of attributes.
• The attribute EnjoySport indicates whether or not
Aldo enjoys his favorite water sport on that day.
• The task is to learn to predict the value of EnjoySport
for an arbitrary day, based on the values of its other attributes.
Table 2.1 Positive & negative training examples for the target concept EnjoySport.
Representation of Hypothesis
• A hypothesis ‘h’ approximates a target function ‘f’ to represent the
relationship between the independent attributes and the dependent
attribute of the training instances.
• It predicts the model that best maps the inputs to outputs.
• Each hypothesis is represented in conjunction of attribute
conditions.
• Example: (sky=sunny) ^ ( wind=strong)
• Conjunction of constraints on each attribute, where:
✔ “?” means “any value is acceptable” (sky=?)
✔ a single value denotes one specific value from the
acceptable values of the attribute (wind=strong)
✔ “∅” means “no value is acceptable” (rain=∅)
• Example of a hypothesis: <?, Cold, High, ?, ?, ?> (If the air
temperature is cold and the humidity high then it is a good day
for water sports)
• Goal: To infer the “best” concept description from the set of all
possible hypotheses (“best” means the one that best generalizes to
all known or unknown elements of the instance space; in this sense,
concept-learning is an ill-defined task).
• The most general hypothesis, that every day is a positive example
(a good day for water sports), is represented by
<?, ?, ?, ?, ?, ?>
• The most specific hypothesis, that no day is a positive
example, is represented by <∅, ∅, ∅, ∅, ∅, ∅>
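As a small illustration of this representation, the following sketch (function name `satisfies` is illustrative, not from the slides) checks whether an instance is classified positive by a conjunctive hypothesis:

```python
def satisfies(h, x):
    """True if instance x is classified positive by hypothesis h."""
    # "?" matches any value; the empty constraint "∅" matches nothing,
    # since it can never equal an actual attribute value.
    return all(hi == "?" or hi == xi for hi, xi in zip(h, x))

h = ("?", "Cold", "High", "?", "?", "?")
print(satisfies(h, ("Sunny", "Cold", "High", "Strong", "Cool", "Same")))  # True
print(satisfies(h, ("Sunny", "Warm", "High", "Strong", "Cool", "Same")))  # False
```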
Generalization and Specialization

• Searching the hypothesis space:

✔ Specialization – general-to-specific learning
✔ Generalization – specific-to-general learning
Find-S, a Maximally Specific Hypothesis
Learning Algorithm
• Problem1: Implement and demonstrate the FIND-S algorithm for
finding the most specific hypothesis based on a given set of training
data samples. Read the training data from a .CSV file.
Algorithm:
1. Initialize h to the most specific hypothesis in H
2. For each positive training instance x
For each attribute constraint ai in h
If the constraint ai in h is satisfied by x then do nothing
else replace ai in h by the next more general
constraint that is satisfied by x
3. Output hypothesis h
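The steps above can be sketched as follows in Python; this is a minimal version of Find-S with the EnjoySport examples from Table 2.1 inlined as a list (the problem statement asks for a .CSV file, which could be read with the `csv` module, but the data is hard-coded here to keep the sketch self-contained):

```python
# Find-S: learn the maximally specific hypothesis from positive examples.
def find_s(examples):
    h = None                        # most specific hypothesis: nothing accepted yet
    for x, label in examples:
        if label != "Yes":          # Find-S ignores negative examples
            continue
        if h is None:
            h = list(x)             # first positive example: copy it as-is
        else:
            # Generalize each constraint that the example violates to "?"
            h = [hi if hi == xi else "?" for hi, xi in zip(h, x)]
    return h

# EnjoySport training data: (Sky, AirTemp, Humidity, Wind, Water, Forecast)
data = [
    (("Sunny", "Warm", "Normal", "Strong", "Warm", "Same"),   "Yes"),
    (("Sunny", "Warm", "High",   "Strong", "Warm", "Same"),   "Yes"),
    (("Rainy", "Cold", "High",   "Strong", "Warm", "Change"), "No"),
    (("Sunny", "Warm", "High",   "Strong", "Cool", "Change"), "Yes"),
]
print(find_s(data))  # ['Sunny', 'Warm', '?', 'Strong', '?', '?']
```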
Illustration: Find-S applied step by step (iterations 1–3) to the
EnjoySport training examples, generalizing the hypothesis after each
positive instance.
Properties of Find-S
✔ Hypothesis space described by conjunctions of attributes
✔ Find-S will output the most specific hypothesis within H that is consistent
with the positive training examples
✔ The output hypothesis will also be consistent with the negative examples,
provided the target concept is contained in H.

Limitations of Find-S
✔ Find-S finds a hypothesis consistent with the positive instances,
ignoring all negative instances. As long as the training dataset is
consistent, the hypothesis it finds is consistent.
✔ The algorithm finds only one hypothesis, whereas there may be many
other hypotheses that are consistent with the training dataset.
✔ Inconsistent data instances can mislead the algorithm in determining
a consistent hypothesis.
Complaints about Find-S

• Can’t tell whether the learner has converged to the target
concept, in the sense that it is unable to determine
whether it has found the only hypothesis
consistent with the training examples.
• Can’t tell when the training data is inconsistent, as
it ignores negative training examples.
• Why prefer the most specific hypothesis?
• What if there are multiple maximally specific
hypotheses?
Version Spaces
The version space contains the subset of hypotheses from
the hypothesis space that is consistent with all training
instances in the training dataset.
List-Then-Eliminate Algorithm
It eliminates any hypothesis that is inconsistent with a training instance.
Algorithm:
Input: Version space – a list of all hypotheses.
Output: Set of consistent hypotheses.
✔ Initialize the version space with a list of all hypotheses.
✔ For each training instance,
• remove from the version space any hypothesis that is inconsistent with it.
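A minimal sketch of this algorithm, assuming the hypothesis space is small enough to enumerate (here a toy 2-attribute space; the function names are illustrative):

```python
from itertools import product

def satisfies(h, x):
    # "?" matches any value; a concrete constraint must match exactly.
    return all(hi == "?" or hi == xi for hi, xi in zip(h, x))

def list_then_eliminate(hypotheses, examples):
    # Keep only the hypotheses consistent with every labelled example.
    version_space = list(hypotheses)
    for x, label in examples:
        version_space = [h for h in version_space
                         if satisfies(h, x) == (label == "Yes")]
    return version_space

# Toy 2-attribute space: enumerate all conjunctive hypotheses.
values = [["Sunny", "Rainy", "?"], ["Warm", "Cold", "?"]]
H = list(product(*values))
examples = [(("Sunny", "Warm"), "Yes"), (("Rainy", "Cold"), "No")]
print(list_then_eliminate(H, examples))
```

Note that enumerating the full hypothesis space is only feasible for tiny spaces, which is exactly the practical limitation of List-Then-Eliminate.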
Similarity-based Learning
• Similarity-based learning is a supervised learning technique that
predicts the class label of a test instance by determining the
similarity of the test instance to the training instances.

• Similarity-based learning belongs to the family of instance-based
learning, which is used to solve both classification and regression
problems.

• Instance-based learning makes predictions by computing distances
or similarities between the test instance and a specific set of
training instances.

• Similarity-based classifiers use similarity measures to locate the
nearest neighbors and classify the instance.

• Similarity-based learning is also called instance-based or
just-in-time learning, since it does not build an abstract
model of the training instances and performs lazy learning when
classifying a new instance.
One more way to classify machine learning systems is by how
they generalize.
1. Instance-based Learning
• The system learns the examples by heart, then generalizes to new
cases using a similarity measure. For example, a spam filter flags
emails identical to known spam emails, and can also be programmed to
flag emails that are very similar to known spam emails.
2. Model-based Learning
• In model-based learning, a model is built from the training
examples and then used to make predictions. How do we know whether
the model performs well? We either define a utility function (a.k.a.
fitness function) that measures how good the model is, or define a
cost function that measures how bad it is.
Instance-based learning | Model-based learning
--- | ---
Lazy learners | Eager learners
Processing of training instances is done only during the testing phase | Processing of training instances is done only during the training phase
No model is built with the training instances before a test instance is received | Generalizes a model from the training instances before it receives a test instance
Predicts the class of the test instance directly from the training data | Predicts the class of the test instance from the model built
Slow in the testing phase | Fast in the testing phase
Learns by making many local approximations | Learns by creating a global approximation
Some of the instance-based learning algorithms are :
✔ K Nearest Neighbor (KNN)
✔ Self-Organizing Map (SOM)
✔ Learning Vector Quantization (LVQ)
✔ Variants of Nearest Neighbor learning
✔ Locally Weighted Regression
✔ Radial Basis Function (RBF) networks
Nearest-Neighbor Learning
✔ It is a non-parametric method that
makes predictions based on the
similarity of data points in a given
dataset.
✔ The k-NN algorithm works by
finding the k nearest neighbors to a
given data point based on a distance
metric, such as Euclidean distance.
Algorithm: k-NN
Input: training dataset T, distance metric d, Test instance t, no. of
nearest neighbors k
Output: Predicted class or category
Prediction: for test instance t,
✔ For each instance i in T, compute the distance between the test
instance t and instance i using the distance metric d.

✔ Sort the distances in ascending order and select the first k
nearest training instances to the test instance.
✔ Predict the class of the test instance by majority voting
(classification) or the mean of the k selected nearest instances
(regression).
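The prediction steps above can be sketched from scratch in Python (the function name `knn_predict` and the toy dataset are illustrative):

```python
import math
from collections import Counter

def knn_predict(train, t, k=3):
    """Predict the class of test instance t by majority vote of its
    k nearest neighbours under Euclidean distance."""
    # 1. Distance from t to every training instance.
    dists = sorted((math.dist(x, t), label) for x, label in train)
    # 2. Take the first k neighbours and vote.
    top_k = [label for _, label in dists[:k]]
    return Counter(top_k).most_common(1)[0][0]

train = [((1.0, 1.0), "A"), ((1.2, 0.8), "A"),
         ((4.0, 4.0), "B"), ((4.2, 3.9), "B"), ((3.8, 4.1), "B")]
print(knn_predict(train, (1.1, 0.9), k=3))  # "A"
print(knn_predict(train, (4.0, 4.0), k=3))  # "B"
```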
Advantages
•The KNN algorithm is very easy to implement
•Nearly optimal in the large sample limit
•Uses local information which can yield highly adaptive
behavior
•Lends itself very easily to parallel implementation
Disadvantages
•Large storage requirements
•Computationally intensive recall
•Highly susceptible to the curse of dimensionality
Weighted K-Nearest-Neighbor algorithm
✔ Weighted k-NN is a refinement of k-NN in which each
of the k nearest neighbors is given a weight,
typically the inverse of its distance to the test
instance.
✔ Closer neighbors therefore contribute more to the
prediction than farther ones, which makes the
classifier less sensitive to the choice of k.
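A minimal sketch of the weighted variant, using inverse-distance weights (the function name and dataset are illustrative):

```python
import math
from collections import defaultdict

def weighted_knn_predict(train, t, k=3, eps=1e-9):
    """Weighted k-NN: each of the k nearest neighbours votes with
    weight 1/(distance + eps), so closer neighbours count more."""
    dists = sorted((math.dist(x, t), label) for x, label in train)
    votes = defaultdict(float)
    for d, label in dists[:k]:
        votes[label] += 1.0 / (d + eps)   # eps avoids division by zero
    return max(votes, key=votes.get)

train = [((1.0, 1.0), "A"), ((4.0, 4.0), "B"), ((4.2, 3.9), "B")]
print(weighted_knn_predict(train, (1.2, 1.1), k=3))  # "A"
```

Note that with k=3 a plain majority vote over this data would predict "B" (two B neighbours against one A), but the single much closer "A" neighbour outweighs them.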
Regression
• Regression in machine learning is a technique used to capture the
relationships between independent and dependent variables, with the main
purpose of predicting an outcome.
• Regression has a wide range of real-life applications. It is essential for any
machine learning problem that involves continuous numbers – this
includes, but is not limited to, a host of examples, including:
✔ Financial forecasting (like house price estimates, or stock prices)
✔ Sales and promotions forecasting
✔ Testing automobiles
✔ Weather analysis and prediction
✔ Time series forecasting
Types of Regression
Regression methods:
✔ Linear regression methods
• Simple (single) linear regression
• Multiple linear regression
✔ Non-linear regression methods
• Polynomial regression
✔ Logistic regression
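As a small worked example of the simplest of these methods, the closed-form least-squares fit for simple linear regression can be sketched as follows (the function name and data are illustrative):

```python
def simple_linear_regression(xs, ys):
    """Fit y = b0 + b1*x by ordinary least squares (closed form)."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    # slope = covariance(x, y) / variance(x)
    b1 = (sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
          / sum((x - mean_x) ** 2 for x in xs))
    b0 = mean_y - b1 * mean_x     # intercept from the means
    return b0, b1

xs = [1, 2, 3, 4, 5]
ys = [3, 5, 7, 9, 11]             # exactly y = 1 + 2x
b0, b1 = simple_linear_regression(xs, ys)
print(b0, b1)  # 1.0 2.0
```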
