
kNN, LVQ, SOM

• Instance Based Learning
• K-Nearest Neighbor Algorithm
• Learning Vector Quantization (LVQ)
• Self Organizing Maps (SOM)
Instance based learning

• Approximates real-valued or discrete-valued target functions
• Learning consists of simply storing the presented training data
• When a new query instance is encountered, a set of similar (related) instances is retrieved from memory and used to classify the new query instance
• Constructs only a local approximation to the target function that applies in the neighborhood of the new query instance
• Never constructs an approximation designed to perform well over the entire instance space
• Instance-based methods can use vector or symbolic representations
• Requires an appropriate definition of "neighboring" instances
• A disadvantage of instance-based methods is that the cost of classifying new instances can be high
• Nearly all computation takes place at classification time rather than learning time
K-Nearest Neighbor algorithm

• Most basic instance-based method
• Data are represented in a vector space
• Supervised learning
Feature space

$$\langle x^{(1)}, f(x^{(1)})\rangle,\ \langle x^{(2)}, f(x^{(2)})\rangle,\ \ldots,\ \langle x^{(n)}, f(x^{(n)})\rangle$$

$$x = \begin{pmatrix} x_1 \\ x_2 \\ \vdots \\ x_d \end{pmatrix} \in \mathbb{R}^d, \qquad \|x - y\| = \sqrt{\sum_{i=1}^{d} (x_i - y_i)^2}$$
• In nearest-neighbor learning the target function may be either discrete-valued or real-valued
• Learning a discrete-valued function:

$$f : \mathbb{R}^d \to V, \qquad V \text{ is the finite set } \{v_1, \ldots, v_n\}$$

• For a discrete-valued target, k-NN returns the most common value among the k training examples nearest to xq.
• Training algorithm
  • For each training example <x, f(x)>, add the example to the list of training examples
• Classification algorithm
  • Given a query instance xq to be classified
    • Let x1, ..., xk be the k instances which are nearest to xq

$$\hat{f}(x_q) = \underset{v \in V}{\arg\max} \sum_{i=1}^{k} \delta(v, f(x_i))$$

    • where δ(a,b) = 1 if a = b, else δ(a,b) = 0 (Kronecker delta)
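A minimal Python sketch of this voting rule (numpy-based; the function name and the toy data below are illustrative, not from the slides):

```python
import numpy as np
from collections import Counter

def knn_classify(X_train, y_train, x_q, k=3):
    """Return the most common class among the k training examples nearest to x_q."""
    dists = np.linalg.norm(X_train - x_q, axis=1)   # Euclidean distance to every stored example
    nearest = np.argsort(dists)[:k]                 # indices of the k nearest neighbours
    votes = Counter(y_train[i] for i in nearest)    # sum_i delta(v, f(x_i)) for each class v
    return votes.most_common(1)[0][0]               # argmax over v

# toy example: two classes in R^2
X_train = np.array([[0.0, 0.0], [0.2, 0.1], [1.0, 1.0], [0.9, 1.1]])
y_train = np.array(["-", "-", "+", "+"])
print(knn_classify(X_train, y_train, np.array([0.8, 0.9]), k=3))   # -> "+"
```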


Definition of Voronoi diagram

• The decision surface induced by 1-NN for a typical set of training examples.

[Figure: positive (+) and negative (_) training examples around a query point xq, and the resulting Voronoi cells]

• The kNN rule leads to a partition of the space into cells (Voronoi cells), each enclosing the training points labelled as belonging to the same class
• The decision boundary in a Voronoi tessellation of the feature space resembles the surface of a crystal
1-Nearest Neighbor

[Figure: query point qf and its nearest neighbor qi]

3-Nearest Neighbors

[Figure: query point qf and its 3 nearest neighbors: 2 x, 1 o]

7-Nearest Neighbors

[Figure: query point qf and its 7 nearest neighbors: 3 x, 4 o]
How to determine a good value for k?

• Determined experimentally
• Start with k = 1 and use a test set to estimate the error rate of the classifier
• Repeat with k = k + 2
• Choose the value of k for which the error rate is minimum (a small sketch follows below)
• Note: k should be an odd number to avoid ties
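A minimal sketch of this selection loop, assuming a held-out validation split and reusing the illustrative knn_classify function from the earlier sketch:

```python
def choose_k(X_train, y_train, X_val, y_val, k_max=15):
    """Try k = 1, 3, 5, ... and return the value with the lowest validation error."""
    best_k, best_err = 1, float("inf")
    for k in range(1, k_max + 1, 2):                 # odd values only, to avoid ties
        preds = [knn_classify(X_train, y_train, x, k) for x in X_val]
        err = sum(p != t for p, t in zip(preds, y_val)) / len(y_val)
        if err < best_err:
            best_k, best_err = k, err
    return best_k
```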


Continuous-valued target functions

• kNN can also approximate continuous-valued target functions
• Calculate the mean value of the k nearest training examples rather than their most common value

$$f : \mathbb{R}^d \to \mathbb{R}, \qquad \hat{f}(x_q) = \frac{1}{k} \sum_{i=1}^{k} f(x_i)$$
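A minimal sketch of this regression variant (numpy; the function name is illustrative):

```python
import numpy as np

def knn_regress(X_train, y_train, x_q, k=3):
    """Estimate f(x_q) as the mean target value of the k nearest training examples."""
    dists = np.linalg.norm(X_train - x_q, axis=1)
    nearest = np.argsort(dists)[:k]
    return y_train[nearest].mean()        # (1/k) * sum_i f(x_i)
```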
Distance Weighted

• A refinement of kNN is to weight the contribution of each of the k neighbors according to its distance to the query point xq
• Greater weight to closer neighbors
• For discrete target functions:

$$\hat{f}(x_q) = \underset{v \in V}{\arg\max} \sum_{i=1}^{k} w_i\, \delta(v, f(x_i)), \qquad w_i = \begin{cases} \dfrac{1}{d(x_q, x_i)^2} & \text{if } x_q \neq x_i \\[4pt] 1 & \text{else} \end{cases}$$
Distance Weighted

• For real-valued functions:

$$\hat{f}(x_q) = \frac{\sum_{i=1}^{k} w_i\, f(x_i)}{\sum_{i=1}^{k} w_i}, \qquad w_i = \begin{cases} \dfrac{1}{d(x_q, x_i)^2} & \text{if } x_q \neq x_i \\[4pt] 1 & \text{else} \end{cases}$$
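A minimal sketch covering both weighted variants above (numpy; it follows the slides' rule w_i = 1/d(x_q, x_i)^2 for distinct points and w_i = 1 when the query coincides with a training point):

```python
import numpy as np

def weighted_knn(X_train, y_train, x_q, k=3, discrete=True):
    """Distance-weighted kNN for discrete (vote) or real-valued (average) targets."""
    dists = np.linalg.norm(X_train - x_q, axis=1)
    nearest = np.argsort(dists)[:k]
    d = dists[nearest]
    weights = np.ones_like(d)                    # w_i = 1 when x_q equals x_i
    weights[d > 0] = 1.0 / d[d > 0] ** 2         # w_i = 1 / d(x_q, x_i)^2 otherwise
    if discrete:
        votes = {}                               # accumulate sum_i w_i * delta(v, f(x_i))
        for w, label in zip(weights, y_train[nearest]):
            votes[label] = votes.get(label, 0.0) + w
        return max(votes, key=votes.get)         # argmax over classes v
    return np.dot(weights, y_train[nearest]) / weights.sum()   # weighted average
```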
Curse of Dimensionality

• Imagine instances described by 20 features (attributes), but only 3 are relevant to the target function
• Curse of dimensionality: nearest neighbor is easily misled when the instance space is high-dimensional
• The distance is dominated by the large number of irrelevant features

Possible solutions (a feature-weighting sketch follows below):
• Stretch the j-th axis by weight zj, where z1,…,zn are chosen to minimize prediction error (weight different features differently)
• Use cross-validation to automatically choose the weights z1,…,zn
• Note: setting zj to zero eliminates this dimension altogether (feature subset selection)
• PCA
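A minimal sketch of the axis-stretching idea: scale each feature j by a weight z_j before computing distances. The weights here are passed in by hand; choosing them by cross-validation, as the slides suggest, is left out of this illustrative snippet.

```python
import numpy as np

def stretched_knn_classify(X_train, y_train, x_q, z, k=3):
    """kNN on axis-stretched data: feature j is multiplied by z[j] before measuring distance."""
    Xs, xs = X_train * z, x_q * z                # setting z[j] = 0 removes feature j entirely
    dists = np.linalg.norm(Xs - xs, axis=1)
    nearest = np.argsort(dists)[:k]
    labels, counts = np.unique(y_train[nearest], return_counts=True)
    return labels[np.argmax(counts)]             # majority vote among the k neighbours
```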
When to Consider Nearest Neighbors

• Instances map to points in R^d
• Fewer than 20 features (attributes) per instance, typically normalized
• Lots of training data

Advantages:
• Training is very fast
• Can learn complex target functions
• Does not lose information

Disadvantages:
• Slow at query time
  • Presorting and indexing training samples into search trees reduces query time
• Easily fooled by irrelevant features (attributes)
LVQ (Learning Vector Quantization)

• A nearest-neighbor method, because the smallest distance of the unknown vector from a set of reference vectors is sought
• However, not all examples are stored as in kNN, but only a fixed number of reference vectors for each class v (for a discrete function f with values {v1,…,vn})
• The values of the reference vectors are optimized during the learning process
• The supervised learning
  • rewards correct classification
  • punishes incorrect classification
• 0 < α(t) < 1 is a monotonically decreasing scalar function (the learning rate)
LVQ Learning (Supervised)

Initialize the reference vectors m; t = 0;
do
{
    choose xi from the dataset
    mc = the nearest reference vector according to the squared distance d^2
    if classified correctly (the class v of mc is equal to the class v of xi):
        mc(t+1) = mc(t) + α(t) [xi(t) − mc(t)]
    if classified incorrectly (the class v of mc is different from the class v of xi):
        mc(t+1) = mc(t) − α(t) [xi(t) − mc(t)]
    t++;
}
until the number of iterations t reaches max_iterations

• After learning, the space R^d is partitioned by a Voronoi tessellation corresponding to the reference vectors mi
• There exist extensions to the basic LVQ, called LVQ2 and LVQ3 (a sketch of the basic variant follows below)
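A minimal Python sketch of the basic (LVQ1-style) training loop above. The initialization of the reference vectors from perturbed class means and the linearly decaying α(t) are illustrative assumptions, not specified on the slides.

```python
import numpy as np

def train_lvq(X, y, n_per_class=2, max_iter=1000, alpha0=0.3, seed=0):
    """Basic LVQ: move the nearest reference vector toward (correct) or away from (incorrect) xi."""
    rng = np.random.default_rng(seed)
    classes = np.unique(y)
    # illustrative initialization: small perturbations of each class mean
    M = np.vstack([X[y == c].mean(axis=0)
                   + 0.01 * rng.standard_normal((n_per_class, X.shape[1]))
                   for c in classes])
    M_labels = np.repeat(classes, n_per_class)
    for t in range(max_iter):
        alpha = alpha0 * (1.0 - t / max_iter)            # monotonically decreasing alpha(t)
        i = rng.integers(len(X))                         # choose xi from the dataset
        c = np.argmin(np.sum((M - X[i]) ** 2, axis=1))   # nearest reference vector (d^2)
        sign = 1.0 if M_labels[c] == y[i] else -1.0      # reward correct, punish incorrect
        M[c] += sign * alpha * (X[i] - M[c])             # mc(t+1) = mc(t) +/- alpha(t)[xi - mc(t)]
    return M, M_labels
```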
LVQ Classification

• Given a query instance xq to be classified
• Let xanswer be the reference vector which is nearest to xq, and determine the corresponding class vanswer
Kohonen Self Organizing Maps

• Unsupervised learning
• Labeling is supervised
• Performs a topologically ordered mapping from a high-dimensional space onto a two-dimensional space
• The centroids (units) are arranged in a layer (a two-dimensional space); units physically near each other in the two-dimensional space respond to similar inputs
• 0 < α(t) < 1 is a monotonically decreasing scalar function (the learning rate)
• NE(t) is a neighborhood function that decreases with time t
• The topology of the map is defined by NE(t)
• The dimension of the map is smaller than (or equal to) the dimension of the data space
  • Usually the dimension of a map is two
• For a two-dimensional map the number of centroids should have an integer-valued square root
  • a good value to start with is around 10² centroids
Neighborhood on the map
SOM Learning (Unsupervised)

Initialize the center vectors m; t = 0;
do
{
    choose xi from the dataset
    mc = the nearest center vector according to the squared distance d^2
    for all mr near mc on the map:
        mr(t+1) = mr(t) + α(t) [xi(t) − mr(t)]    for r ∈ NEc(t)
    t++;
}
until the number of iterations t reaches max_iterations
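A minimal sketch of this loop for a small two-dimensional map. The Gaussian neighborhood on the grid and the linearly shrinking radius are illustrative choices; the slides only require that α(t) and NE(t) decrease over time.

```python
import numpy as np

def train_som(X, rows=10, cols=10, max_iter=2000, alpha0=0.5, sigma0=3.0, seed=0):
    """SOM: pull the winning unit and its map neighbours toward each presented sample."""
    rng = np.random.default_rng(seed)
    d = X.shape[1]
    M = rng.uniform(X.min(axis=0), X.max(axis=0), size=(rows * cols, d))   # center vectors
    grid = np.array([(r, c) for r in range(rows) for c in range(cols)])    # map coordinates
    for t in range(max_iter):
        frac = t / max_iter
        alpha = alpha0 * (1.0 - frac)                       # decreasing learning rate alpha(t)
        sigma = sigma0 * (1.0 - frac) + 0.5                 # shrinking neighborhood radius NE(t)
        x = X[rng.integers(len(X))]                         # choose xi from the dataset
        winner = np.argmin(np.sum((M - x) ** 2, axis=1))    # nearest center vector (d^2)
        grid_d2 = np.sum((grid - grid[winner]) ** 2, axis=1)
        h = np.exp(-grid_d2 / (2.0 * sigma ** 2))           # weight ~ 0 outside the neighborhood
        M += alpha * h[:, None] * (x - M)                   # mr(t+1) = mr(t) + alpha*h*(xi - mr(t))
    return M.reshape(rows, cols, d)
```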

Supervised labeling

• The network can be labeled in two ways:
• (A) For each known class, represented by a vector, the closest centroid is searched for and labeled accordingly
• (B) For every centroid it is tested to which known class (represented by a vector) it is closest
• Example of labeling of 10 classes, 0,…,9
• 10×10 centroids
• 2-dim map

[Figure: labeled 10×10 SOM for the classes 0–9]
Animal example

[Figure: SOM of animal data]
Poverty map of countries

[Figure: poverty map of countries]
Ordering process of 2-dim data

• random 2-dim points

[Figures: ordering process on a 2-dim map and a 1-dim map]


• Instance Based Learning
• K-Nearest Neighbor Algorithm
• Learning Vector Quantization (LVQ)
• Self Organizing Maps (SOM)
• Bayes Classification
• Naive Bayes
