ML Unit IV
In nearest-neighbor learning the target function may be either discrete-valued or real-valued.
Let us first consider learning discrete-valued target functions of the form f : ℝn → V, where V is the finite set {v1, ..., vs}.
The positive and negative training examples are shown by “+” and “-” respectively. A query point
xq is shown as well.
The 1-Nearest Neighbor algorithm classifies xq as a positive example in this figure, whereas the 5-
Nearest Neighbor algorithm classifies it as a negative example.
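A minimal sketch of discrete-valued k-NN classification, assuming Euclidean distance over numeric attribute vectors (the data and function names below are illustrative, not taken from the text):

import math
from collections import Counter

def euclidean(a, b):
    # Standard Euclidean distance between two attribute vectors.
    return math.sqrt(sum((p - q) ** 2 for p, q in zip(a, b)))

def knn_classify(training, xq, k):
    # Majority vote among the k training examples nearest to xq.
    # `training` is a list of (attribute_vector, label) pairs.
    neighbors = sorted(training, key=lambda ex: euclidean(ex[0], xq))[:k]
    votes = Counter(label for _, label in neighbors)
    return votes.most_common(1)[0][0]

# As in the figure, k = 1 and k = 5 can disagree on the same query:
examples = [([1.0, 1.0], '+'), ([1.2, 0.9], '-'), ([0.8, 1.1], '-'),
            ([3.0, 3.0], '-'), ([2.9, 3.1], '-')]
print(knn_classify(examples, [1.0, 1.0], 1))   # '+' (single nearest neighbor)
print(knn_classify(examples, [1.0, 1.0], 5))   # '-' (majority of five)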
The figure below shows the shape of the decision surface induced by 1-Nearest Neighbor over the entire instance space.
The decision surface is a combination of convex polyhedra surrounding each of the training
examples.
For every training example, the polyhedron indicates the set of query points whose classification will
be completely determined by that training example.
Query points outside the polyhedron are closer to some other training example. This kind of diagram
is often called the Voronoi diagram of the set of training examples.
The k-Nearest Neighbor algorithm for approximating a real-valued target function replaces the majority vote with the mean of the values of the k nearest neighbors:

𝑓̂(xq) ← ( f(x1) + f(x2) + ... + f(xk) ) / k
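A sketch of this real-valued variant, reusing the illustrative euclidean helper from the earlier snippet:

def knn_regress(training, xq, k):
    # Mean of the target values of the k nearest training examples.
    neighbors = sorted(training, key=lambda ex: euclidean(ex[0], xq))[:k]
    return sum(fx for _, fx in neighbors) / k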
Terminology
Regression means approximating a real-valued target function.
Residual is the error 𝑓̂(x) − f(x) in approximating the target function.
Kernel function is the function of distance that is used to determine the weight of each training
example.
In other words, the kernel function is the function K such that
wi = K(d(xi, xq))
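For example, with an inverse-square kernel (one common choice; the helper names are illustrative) the prediction becomes a kernel-weighted mean:

def inverse_square_kernel(d, eps=1e-12):
    # wi = K(d(xi, xq)) with K(d) = 1 / d^2; eps guards against d = 0.
    return 1.0 / (d ** 2 + eps)

def weighted_knn_regress(training, xq, k):
    # Weighted mean of the k nearest targets, weights wi = K(d(xi, xq)).
    neighbors = sorted(training, key=lambda ex: euclidean(ex[0], xq))[:k]
    pairs = [(inverse_square_kernel(euclidean(x, xq)), fx) for x, fx in neighbors]
    return sum(w * fx for w, fx in pairs) / sum(w for w, _ in pairs)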
In locally weighted linear regression, the target function is approximated near xq using a linear function of the form

𝑓̂(x) = w0 + w1 a1(x) + ... + wn an(x)

where ai(x) denotes the value of the ith attribute of the instance x.
Gradient descent methods can be used to choose weights that minimize the squared error summed over the set D of training examples.
We need to modify this procedure to derive a local approximation rather than a global one.
The simple way is to redefine the error criterion E to emphasize fitting the local training examples.
Three possible criteria are given below.
1. Minimize the squared error over just the k nearest neighbors:

E1(xq) = 1/2 Σ ( f(x) − 𝑓̂(x) )², summed over the k nearest neighbors of xq

2. Minimize the squared error over the entire set D of training examples, while weighting the error of each training example by some decreasing function K of its distance from xq:

E2(xq) = 1/2 Σx∈D ( f(x) − 𝑓̂(x) )² K(d(xq, x))

3. Combine 1 and 2:

E3(xq) = 1/2 Σ ( f(x) − 𝑓̂(x) )² K(d(xq, x)), summed over the k nearest neighbors of xq
If we choose criterion 3 and re-derive the gradient descent rule, we obtain the following training rule:

Δwj = η Σ K(d(xq, x)) ( f(x) − 𝑓̂(x) ) aj(x), summed over the k nearest neighbors of xq

The differences between this new rule and the rule given by Equation (3) are that the contribution of instance x to the weight update is now multiplied by the distance penalty K(d(xq, x)), and that the error is summed over only the k nearest training examples.
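A compact sketch of locally weighted regression that minimizes this same weighted squared-error criterion in closed form (weighted least squares) rather than by gradient descent; for this convex criterion the two approaches agree at the optimum. NumPy-based, with illustrative names:

import numpy as np

def locally_weighted_regression(X, y, xq, k, kernel):
    # Fit f_hat(x) = w0 + w1*a1(x) + ... + wn*an(x) around the query xq,
    # using only the k nearest examples, each weighted by K(d(xq, x)).
    # Assumes k >= n + 1 so the normal equations are well determined.
    dists = np.linalg.norm(X - xq, axis=1)
    nearest = np.argsort(dists)[:k]
    Xn = np.c_[np.ones(k), X[nearest]]          # prepend the bias attribute
    W = np.diag([kernel(d) for d in dists[nearest]])
    # Weighted normal equations: (Xn' W Xn) w = Xn' W y
    w = np.linalg.solve(Xn.T @ W @ Xn, Xn.T @ W @ y[nearest])
    return np.r_[1.0, xq] @ w                   # evaluate f_hat at xq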
One approach to function approximation that is closely related to distance-weighted regression is learning with radial basis functions. In this approach, the learned hypothesis is a function of the form

𝑓̂(x) = w0 + Σu=1..k wu Ku(d(xu, x))    ... equ (1)

where each xu is an instance from X and where the kernel function Ku(d(xu, x)) is defined so that it decreases as the distance d(xu, x) increases.
Here k is a user-provided constant that specifies the number of kernel functions to be included.
Even though 𝑓̂(x) is a global approximation to f(x), the contribution from each of the Ku(d(xu, x)) terms is localized to a region near the point xu.
It is common to choose each Ku to be a Gaussian function centered at the point xu with some variance 𝜎u²:

Ku(d(xu, x)) = e^( −d²(xu, x) / 2𝜎u² )

The functional form of equ (1) can approximate any function with arbitrarily small error, provided a sufficiently large number k of such Gaussian kernels and provided the width 𝜎² of each kernel can be separately specified.
The function given by equ (1) can be viewed as describing a two-layer network, where the first layer of units computes the values of the various Ku(d(xu, x)) and the second layer computes a linear combination of these first-layer unit values.
Example: Radial basis function (RBF) network:
Given a set of training examples of the target function, RBF networks are typically trained in a two-stage
process.
1. First, the number k of hidden units is determined, and each hidden unit u is defined by choosing the values of xu and 𝜎u² that define its kernel function Ku(d(xu, x)).
2. Second, the weights wu are trained to maximize the fit of the network to the training data, using the global error criterion

E = 1/2 Σx∈D ( f(x) − 𝑓̂(x) )²

Because the kernel functions are held fixed during this second stage, the linear weight values wu can be trained very efficiently.
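A minimal sketch of this two-stage procedure, assuming Gaussian kernels with a shared width and a linear least-squares fit for stage two (NumPy-based; the class and method names are illustrative):

import numpy as np

class RBFNetwork:
    def __init__(self, centers, sigma):
        # Stage 1: the centers xu and width sigma fix the kernel functions.
        self.centers = np.asarray(centers, dtype=float)
        self.sigma = sigma
        self.w = None

    def _design(self, X):
        # One column of ones (for w0), then one Gaussian unit per center.
        d2 = ((X[:, None, :] - self.centers[None, :, :]) ** 2).sum(axis=-1)
        return np.c_[np.ones(len(X)), np.exp(-d2 / (2 * self.sigma ** 2))]

    def fit(self, X, y):
        # Stage 2: with the kernels held fixed, fitting the weights is an
        # ordinary linear least-squares problem, hence very efficient.
        self.w, *_ = np.linalg.lstsq(self._design(np.asarray(X, dtype=float)),
                                     np.asarray(y, dtype=float), rcond=None)
        return self

    def predict(self, X):
        return self._design(np.asarray(X, dtype=float)) @ self.w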
Several alternative methods have been proposed for choosing an appropriate number of hidden units or,
equivalently, kernel functions.
One approach is to allocate a Gaussian kernel function for each training example (xi, f(xi)), centering this Gaussian at the point xi.
CASE-BASED LEARNING
6. WRITE ABOUT CASE BASED LEARNING. (PART – C)
In case-based learning, the system learns from individual instances or cases rather than relying solely on general rules or models.
It is a type of lazy learning in which the system stores specific instances and uses them to make predictions or decisions.
In the context of machine learning, case-based learning involves:
1. Case Representation: Each instance or case is represented by a set of attributes or features. These
attributes describe the characteristics of the case.
2. Case Retrieval: When a new query or instance is presented to the system, it searches through its stored
cases to find the most similar cases to the query.
3. Case Adaptation: The system adapts the information from retrieved cases to generate a prediction or
solution for the new query.
4. Case Base Maintenance: The system might update its case base over time by adding new cases or
removing outdated ones to improve its performance.
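A toy sketch of this retrieve-and-adapt cycle, with attribute vectors as the case representation; every name here is hypothetical and stands in for a domain-specific component:

def retrieve(case_base, query, similarity):
    # Case retrieval: the stored case most similar to the query.
    # `case_base` is a list of (attributes, solution) pairs.
    return max(case_base, key=lambda case: similarity(case[0], query))

def solve(case_base, query, similarity, adapt):
    attributes, solution = retrieve(case_base, query, similarity)
    # Case adaptation: adjust the retrieved solution to fit the query.
    new_solution = adapt(solution, attributes, query)
    # Case base maintenance: keep the solved query as a new case.
    case_base.append((query, new_solution))
    return new_solution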
Case-based learning is particularly useful when there is a lack of clear rules or patterns that can be learned
from data directly.
It’s often employed in areas where domain knowledge and context play a significant role in making
decisions.
The success of case-based learning heavily depends on the quality of the case representation, the similarity measure, and the adaptability of retrieved cases to new situations.
A prototypical example of case-based reasoning:
The CADET system employs case-based reasoning to assist in the conceptual design of simple mechanical
devices such as water faucets.
It uses a library containing approximately 75 previous designs and design fragments to suggest conceptual
designs to meet the specifications of new design problems.
Each instance stored in memory (e.g., a water pipe) is represented by describing both its structure and its qualitative function.
New design problems are then presented by specifying the desired function and requesting the corresponding
structure.
The problem setting is illustrated in the figure below:
The function is represented in terms of the qualitative relationships among the water-flow levels and temperatures at its inputs and outputs.
In the functional description, an arrow with a "+" label indicates that the variable at the arrow head increases
with the variable at its tail.
A "-" label indicates that the variable at the head decreases with the variable at the tail.
Here Qc refers to the flow of cold water into the faucet, Qh to the input flow of hot water, and Qm to the single mixed flow out of the faucet.
Tc, Th, and Tm refer to the temperatures of the cold water, hot water, and mixed water respectively.
The variable Ct denotes the control signal for temperature that is input to the faucet, and Cf denotes the control signal for water flow.
The controls Ct and Cf are to influence the water flows Qc and Qh, thereby indirectly influencing the faucet output flow Qm and temperature Tm.
CADET searches its library for stored cases whose functional descriptions match the design problem.
If an exact match is found, indicating that some stored case implements exactly the desired function, then this
case can be returned as a suggested solution to the design problem.
If no exact match occurs, CADET may find cases that match various subgraphs of the desired functional
specification.
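To make the matching step concrete, a functional description can be encoded as a set of labeled influence edges, with retrieval checking whether a stored case's graph contains the query's edges. The encoding and the signs below are illustrative only, not CADET's actual representation:

# Each edge (tail, head, sign) reads: the head variable increases ("+")
# or decreases ("-") with the tail variable, as in the arrow-label
# convention described above.
query_function = {("Cf", "Qc", "+"), ("Cf", "Qh", "+")}
stored_case = {("Cf", "Qc", "+"), ("Cf", "Qh", "+"),
               ("Ct", "Qc", "-"), ("Ct", "Qh", "+")}

def implements(case_edges, query_edges):
    # Exact match: the stored case realizes every queried relationship.
    return query_edges <= case_edges

print(implements(stored_case, query_function))  # True: suggest this case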
Reference:
1. Tom M. Mitchell, Machine Learning, McGraw-Hill Education (India) Private Limited, 2013.
PART – A (1 MARK)
1. ------------------- is an instance-based learner.
a) Eager Learner b) Lazy Learner c) Both (A) and (B) d) None of the Above
2. Machine Learning uses various function representations. Which of the following is not a numerical function?
a) Case-based b) Neural Network c) Linear Regression d) Support Vector Machines
3. Which of the following is true about k in k-Nearest Neighbor in terms of bias?
a) When you decrease k, the bias increases
b) When you increase k, the bias increases
c) Both (A) and (B)
d) None of the Above
4. Which of the following statements is false about k-Nearest Neighbor algorithm?
a) It stores all available cases and classifies new cases based on a similarity measure
b) It has been used in statistical estimation and pattern recognition
c) It cannot be used for regression
d) The input consists of the k closest training examples in the feature space
5. What are the advantages of the Nearest Neighbor algorithm?
a) Training is very fast b) Can learn complex target functions
c) Don’t lose information d) All of these
6. What if the target function is real-valued in the kNN algorithm?
a) Calculate the mean of the k nearest neighbors
b) Calculate the standard deviation of the k nearest neighbors
c) None of these
d) All of the above
7. What is/are advantage(s) of Locally Weighted Regression?
a) Pointwise approximation of complex target function
b) Earlier data has no influence on the new ones
c) Both A & B d) None of these
8. Which network is more accurate when the training set size is small to medium?
a) PNN/GRNN b) RBF c)K-means clustering d) None of these
9. What is/are true about RBF network?
a) A kind of supervised learning
b) Design of NN as curve fitting problem
c) Use of multidimensional surface to interpolate the test data
d) All of these
10. In the k-NN algorithm, given a set of training examples and a value of k < size of the training set (n), the algorithm predicts the class of a test example to be the:
a) Least frequent class among the classes of k closest training examples.
b) Most frequent class among the classes of k closest training examples.
c) Class of the closest point.
d) Most frequent class among the classes of the k farthest training examples.
PART – B (5 MARKS)
1. Write short notes on instance based learning. (Refer P. No.1, Q. No.1)
2. Write short notes on distance weighted nearest neighbor learning. (Refer P. No.1, Q. No.3)
3. Write down the steps involved in locally weighted regression. (Refer P. No.4, Q. No.4)
4. Give an example of radial basis function. (Refer P. No.5, Q. No.5)