0% found this document useful (0 votes)

58 views

Machine Learning-Lecture#7-Fall 2020

K-means clustering is an unsupervised learning technique that groups unlabeled data points into a specified number of clusters (K) based on feature similarity. It works by assigning data points to the cluster with the closest centroid and iteratively updating centroids until clusters are stable or the maximum number of iterations is reached. While efficient and easy to apply, K-means clustering has limitations such as being sensitive to initialization and not able to handle clusters of varying shapes, sizes, or densities.

Uploaded by

Syed Ali Raza Naqvi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

58 views

Machine Learning-Lecture#7-Fall 2020

Uploaded by

Syed Ali Raza Naqvi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 18

Lecture#7

Unsupervised Learning
(Clustering)

1
What is Cluster Analysis?
Finding groups of objects such that the objects in a group
will be similar (or related) to one another and different
from (or unrelated to) the objects in other groups

Inter-cluster
Intra-cluster distances are
distances are maximized
minimized
Applications of Cluster Analysis
Understanding
– Group students who
succeed and fails in
the same exercises

Summarization
– Reduce the size of
large data sets

Clustering precipitation
in Australia
What is not Cluster Analysis?

Supervised classification
– Have class label information


Simple segmentation
– Dividing students into different registration groups
alphabetically, by last name
Types of Clusterings
A clustering is a set of clusters

Important distinction between hierarchical and

partitional sets of clusters

 Partitional Clustering
– A division data objects into non-overlapping subsets
(clusters) such that each data object is in exactly
one subset

 Hierarchical clustering
– A set of nested clusters organized as a hierarchical
tree
K-means Clustering

Partitional clustering approach

Each cluster is associated with a centroid (center
point)
Each point is assigned to the cluster with the closest
centroid
Number of clusters, K, must be specified
The basic algorithm is very simple
K-means Clustering
Partitional Clustering

Original Points
Partitional Clustering

Original Points with initial centres

Partitional Clustering

Original Points with clusters iteration 1

Partitional Clustering

Original Points with new centres

Partitional Clustering

Original Points with clusters and new centres iteration 2

Partitional Clustering

Original Points with clusters and new centres iteration 3

Partitional Clustering

Final clusters and centres A Partitional

Clustering
Two different K-means Clusterings
3

2.5

1.5
Original Points

y
1

0.5

-2 -1.5 -1 -0.5 0 0.5 1 1.5 2

3 3

2.5 2 .5

2 2

1.5 1 .5

y
y

1 1

0.5 0 .5

0 0

-2 - 1.5 -1 -0 .5 0 0 .5 1 1.5 2 -2 - 1.5 -1 -0.5 0 0 .5 1 1 .5 2

x x

Optimal Clustering Sub-optimal Clustering

Property of K-means

Sum of Squared Error (SSE) diminishes after each
iteration.

The SSE is not necessarily the optimal one .

K
SSE    (mi , x)
dist 2

i 1 xCi
Advantages of K-means

Is efficient.
 Can be computed in a distributive way.
 Is easy to apply.
Limitations of K-means
 How to determine the best K?
 May give a sub-optimal solution.

K.means has problems when clusters are of
differing
– Sizes
– Densities
– Non-globular shapes


K-means is sensible to outliers.

Ase 747 - Transformative Learning
No ratings yet
Ase 747 - Transformative Learning
2 pages
Clustering in Machine Learning
No ratings yet
Clustering in Machine Learning
20 pages
Unit 4
No ratings yet
Unit 4
74 pages
Chapter 5. Clustering Algorithms-Stud
No ratings yet
Chapter 5. Clustering Algorithms-Stud
44 pages
DM Lecture 06
No ratings yet
DM Lecture 06
32 pages
Introduction To Unsupervised Learning:: Clustering
No ratings yet
Introduction To Unsupervised Learning:: Clustering
21 pages
Week 9
No ratings yet
Week 9
66 pages
ML Unit-4 Final 2024-25
No ratings yet
ML Unit-4 Final 2024-25
28 pages
DSML-ML09. Unsupervised Learning
No ratings yet
DSML-ML09. Unsupervised Learning
69 pages
Lecture 6
No ratings yet
Lecture 6
14 pages
Datamining-lect5 - Clustering. the K-means Algorithm. Hierarchical Clustering. the DBSCAN Algorithm. Clustering Evaluation
No ratings yet
Datamining-lect5 - Clustering. the K-means Algorithm. Hierarchical Clustering. the DBSCAN Algorithm. Clustering Evaluation
110 pages
Cluster Analysis 1731695796
No ratings yet
Cluster Analysis 1731695796
91 pages
Data Mining - Clustering
No ratings yet
Data Mining - Clustering
90 pages
Clustering
No ratings yet
Clustering
84 pages
Lecture 1 (UNIT 1)
No ratings yet
Lecture 1 (UNIT 1)
68 pages
cz4041 10 Clustering
No ratings yet
cz4041 10 Clustering
67 pages
Unit 5
No ratings yet
Unit 5
63 pages
Unit 4 Clustering - K-Means and Hierarchical
No ratings yet
Unit 4 Clustering - K-Means and Hierarchical
40 pages
P-3 1 2-Kmeans
No ratings yet
P-3 1 2-Kmeans
43 pages
IT3080 Lecture04 2023
No ratings yet
IT3080 Lecture04 2023
56 pages
DMDWUNITV
No ratings yet
DMDWUNITV
72 pages
What Is Cluster Analysis?: - Cluster: A Collection of Data Objects
No ratings yet
What Is Cluster Analysis?: - Cluster: A Collection of Data Objects
42 pages
Lecture 9 Clustering
No ratings yet
Lecture 9 Clustering
36 pages
Clustering K-Means
100% (2)
Clustering K-Means
28 pages
Clustering
No ratings yet
Clustering
125 pages
Data Mining Lecture Notes-1: Bsc. (H) Computer Science: Vi Semester Teacher: Ms. Sonal Linda
No ratings yet
Data Mining Lecture Notes-1: Bsc. (H) Computer Science: Vi Semester Teacher: Ms. Sonal Linda
40 pages
22AIP3101A Session 9
No ratings yet
22AIP3101A Session 9
38 pages
datamining-lect8
No ratings yet
datamining-lect8
79 pages
UNIT-5 PPT
No ratings yet
UNIT-5 PPT
85 pages
Clustering Algorithm: An Unsupervised Learning Approach
No ratings yet
Clustering Algorithm: An Unsupervised Learning Approach
23 pages
Clustering Algorithm
No ratings yet
Clustering Algorithm
47 pages
Clustering
No ratings yet
Clustering
104 pages
ML L14 Clustering
No ratings yet
ML L14 Clustering
59 pages
DSV_Unit 3_Data Analysis in Depth
No ratings yet
DSV_Unit 3_Data Analysis in Depth
53 pages
K Mean Clustering1
No ratings yet
K Mean Clustering1
23 pages
Unsupervised Learning: K-Means Clustering
No ratings yet
Unsupervised Learning: K-Means Clustering
23 pages
unsupervised_learning_1
No ratings yet
unsupervised_learning_1
40 pages
Cluster
100% (1)
Cluster
72 pages
Unit V - Clustering
No ratings yet
Unit V - Clustering
19 pages
8. Clustering
No ratings yet
8. Clustering
80 pages
Cluster
No ratings yet
Cluster
50 pages
Machine Learning Unsupervised
No ratings yet
Machine Learning Unsupervised
20 pages
Clustering FinancialData
No ratings yet
Clustering FinancialData
38 pages
Module 5
No ratings yet
Module 5
98 pages
Clustering-Part1.pptx
No ratings yet
Clustering-Part1.pptx
84 pages
07Clustering
No ratings yet
07Clustering
34 pages
Week 10 Lecture - Introduction to Clustering(1)
No ratings yet
Week 10 Lecture - Introduction to Clustering(1)
35 pages
Clustering
No ratings yet
Clustering
39 pages
Machine Learning
No ratings yet
Machine Learning
23 pages
Clustering-Part1
No ratings yet
Clustering-Part1
79 pages
Clustering Algorithms
No ratings yet
Clustering Algorithms
19 pages
What is Unsupervised Learning (1)
No ratings yet
What is Unsupervised Learning (1)
9 pages
Soft Vs Hard Clustering
No ratings yet
Soft Vs Hard Clustering
5 pages
Chap7 Basic Cluster Analysis
No ratings yet
Chap7 Basic Cluster Analysis
82 pages
Unit 4
No ratings yet
Unit 4
40 pages
Week 9. Unsupervised Learning
No ratings yet
Week 9. Unsupervised Learning
32 pages
ML4 Unsupervised Learning
No ratings yet
ML4 Unsupervised Learning
60 pages
Intro Data Science: Cluster Analysis
No ratings yet
Intro Data Science: Cluster Analysis
60 pages
Chapter_6 (2)
No ratings yet
Chapter_6 (2)
54 pages
Java Programming: Algorithms and Structures
From Everand
Java Programming: Algorithms and Structures
Tanushri Kaniyar
No ratings yet
Autodesk Maya 2022: A Comprehensive Guide, 13th Edition
From Everand
Autodesk Maya 2022: A Comprehensive Guide, 13th Edition
Prof. Sham Tickoo
No ratings yet
Chap 8 XI Mathematics
No ratings yet
Chap 8 XI Mathematics
7 pages
English X MCQS Key Chapter 14
No ratings yet
English X MCQS Key Chapter 14
2 pages
MBSD NOTES (PDF 2)
No ratings yet
MBSD NOTES (PDF 2)
25 pages
Lectures To Take VU
No ratings yet
Lectures To Take VU
1 page
CV Ali
No ratings yet
CV Ali
3 pages
Lecture#9: Support Vector Machine (SVM)
No ratings yet
Lecture#9: Support Vector Machine (SVM)
18 pages
No-01 M.S NLC Final Check List Standard PDI For 40 Ft-Half Body Trailer....
No ratings yet
No-01 M.S NLC Final Check List Standard PDI For 40 Ft-Half Body Trailer....
4 pages
PGD Contract MGT Cases - 23-5-21 - Part 2
No ratings yet
PGD Contract MGT Cases - 23-5-21 - Part 2
1 page
Raza CHP 5
No ratings yet
Raza CHP 5
3 pages
Raza
No ratings yet
Raza
4 pages
CHP 7 MCQ
No ratings yet
CHP 7 MCQ
5 pages
Computer Mock Plan
No ratings yet
Computer Mock Plan
1 page
07 Chapter 3
No ratings yet
07 Chapter 3
5 pages
EDUC 5010 Unit 5 Written Assignment
100% (1)
EDUC 5010 Unit 5 Written Assignment
5 pages
REFLECT
No ratings yet
REFLECT
2 pages
Assessment in Learning 2: Preliminary Examination
No ratings yet
Assessment in Learning 2: Preliminary Examination
4 pages
Consultation - Verification - Evaluation Possible Visitor Questions
100% (1)
Consultation - Verification - Evaluation Possible Visitor Questions
7 pages
DME-004 Essentials of Project Design - CARE International
No ratings yet
DME-004 Essentials of Project Design - CARE International
45 pages
Comparative Study of K-Means and Hierarchical Clustering Techniques
No ratings yet
Comparative Study of K-Means and Hierarchical Clustering Techniques
7 pages
Exploring Conceptual Understanding and Problem
No ratings yet
Exploring Conceptual Understanding and Problem
7 pages
Ds4015 Big Data Analytics QB
No ratings yet
Ds4015 Big Data Analytics QB
155 pages
Philippine Culture and Museums
No ratings yet
Philippine Culture and Museums
8 pages
Mini Task Grasps (Reflection Paper)
No ratings yet
Mini Task Grasps (Reflection Paper)
2 pages
One Week FDP Brochure 22.6.23
No ratings yet
One Week FDP Brochure 22.6.23
2 pages
SId Resume Finaledit PDF
No ratings yet
SId Resume Finaledit PDF
2 pages
Video Review 1, Beyza Eylül Ergün, 2353324
No ratings yet
Video Review 1, Beyza Eylül Ergün, 2353324
3 pages
Group 1 Reporting
No ratings yet
Group 1 Reporting
29 pages
Sample Thesis For Mass Communication Students
100% (3)
Sample Thesis For Mass Communication Students
4 pages
MHI 08 June 2011
No ratings yet
MHI 08 June 2011
4 pages
Machine Learning/Ai For Iot, M2M, and Computer Communication
No ratings yet
Machine Learning/Ai For Iot, M2M, and Computer Communication
3 pages
ALSUPGuide2020 1
No ratings yet
ALSUPGuide2020 1
1 page
Module 6: Curriculum Design
No ratings yet
Module 6: Curriculum Design
4 pages
EE-232 Signals and Systems-2
No ratings yet
EE-232 Signals and Systems-2
6 pages
Resma - Video Critique 02
No ratings yet
Resma - Video Critique 02
2 pages
Lesson 36 Logic Propositions
No ratings yet
Lesson 36 Logic Propositions
45 pages
Speaking and Listening Pathway
100% (2)
Speaking and Listening Pathway
320 pages
Kohlberg DEMO
No ratings yet
Kohlberg DEMO
81 pages
IT Officer Job Description
No ratings yet
IT Officer Job Description
8 pages
Board_Question_Paper_Solving_Timetable_for_Grade_X_&_XII_3
No ratings yet
Board_Question_Paper_Solving_Timetable_for_Grade_X_&_XII_3
1 page
Research Paper (Ojhug
No ratings yet
Research Paper (Ojhug
5 pages
Sherry B. Ortner - Is Female To Male As Nature Is To Culture - READING HIGHLIGHTED
No ratings yet
Sherry B. Ortner - Is Female To Male As Nature Is To Culture - READING HIGHLIGHTED
12 pages

Machine Learning-Lecture#7-Fall 2020

Uploaded by

Machine Learning-Lecture#7-Fall 2020

Uploaded by

Lecture#7

Important distinction between hierarchical and

Partitional clustering approach

Original Points with initial centres

Original Points with clusters iteration 1

Original Points with new centres

Original Points with clusters and new centres iteration 2

Original Points with clusters and new centres iteration 3

Final clusters and centres A Partitional

-2 -1.5 -1 -0.5 0 0.5 1 1.5 2

-2 - 1.5 -1 -0 .5 0 0 .5 1 1.5 2 -2 - 1.5 -1 -0.5 0 0 .5 1 1 .5 2

Optimal Clustering Sub-optimal Clustering

You might also like