0% found this document useful (0 votes)

60 views29 pages

Clustering: Unsupervised Learning

This document discusses clustering and the k-means algorithm. It begins with an introduction to clustering and unsupervised learning. It then explains the k-means algorithm, including how it initializes cluster centroids randomly, assigns examples to the closest centroid, and updates the centroids to be the average of each cluster. It discusses how to choose the number of clusters k using the elbow method by plotting cost against k.

Uploaded by

PravinkumarGhodake

We take content rights seriously. If you suspect this is your content, claim it here.

0% found this document useful (0 votes)

60 views29 pages

Clustering: Unsupervised Learning

Uploaded by

PravinkumarGhodake

We take content rights seriously. If you suspect this is your content, claim it here.

You are on page 1/ 29

Clustering

Unsupervised learning
introduction

Machine Learning
Supervised learning

Training set:
Andrew Ng
Unsupervised learning

Training set:
Andrew Ng
Applications of clustering

Market segmentation Social network analysis

Image credit: NASA/JPL-Caltech/E. Churchwell (Univ. of Wisconsin, Madison)

Organize computing clusters Astronomical data analysis

Andrew Ng
Clustering
K-means
algorithm
Machine Learning
Andrew Ng
Andrew Ng
Andrew Ng
Andrew Ng
Andrew Ng
Andrew Ng
Andrew Ng
Andrew Ng
Andrew Ng
K-means algorithm
Input:
- (number of clusters)
- Training set

(drop convention)

Andrew Ng
K-means algorithm

Randomly initialize cluster centroids

Repeat {
for = 1 to
:= index (from 1 to ) of cluster centroid
closest to
for = 1 to
:= average (mean) of points assigned to cluster

}
Andrew Ng
K-means for non-separated clusters

T-shirt sizing

Weight
Height

Andrew Ng
Clustering
Optimization
objective
Machine Learning
K-means optimization objective
= index of cluster (1,2,…, ) to which example is currently
assigned
= cluster centroid ( )
= cluster centroid of cluster to which example has been
assigned
Optimization objective:

Andrew Ng
K-means algorithm

Randomly initialize cluster centroids

Repeat {
for = 1 to
:= index (from 1 to ) of cluster centroid
closest to
for = 1 to
:= average (mean) of points assigned to cluster
}
Andrew Ng
Clustering
Random
initialization
Machine Learning
K-means algorithm

Randomly initialize cluster centroids

Repeat {
for = 1 to
:= index (from 1 to ) of cluster centroid
closest to
for = 1 to
:= average (mean) of points assigned to cluster
}
Andrew Ng
Random initialization
Should have

Randomly pick training

examples.

Set equal to these

examples.

Andrew Ng
Local optima

Andrew Ng
Random initialization
For i = 1 to 100 {

Randomly initialize K-means.

Run K-means. Get .
Compute cost function (distortion)

Pick clustering that gave lowest cost

Andrew Ng
Clustering
Choosing the
number of clusters
Machine Learning
What is the right value of K?

Andrew Ng
Choosing the value of K
Elbow method:
Cost function

Cost function
1 2 3 4 5 6 7 8 1 2 3 4 5 6 7 8

(no. of clusters) (no. of clusters)

Andrew Ng
Choosing the value of K
Sometimes, you’re running K-means to get clusters to use for some
later/downstream purpose. Evaluate K-means based on a metric for
how well it performs for that later purpose.

E.g. T-shirt sizing T-shirt sizing

Weight
Weight

Height Height
Andrew Ng

Chapter 5 - K-Mean Clustering
No ratings yet
Chapter 5 - K-Mean Clustering
32 pages
07 Clustering 2024
No ratings yet
07 Clustering 2024
51 pages
Clustering and K-Means Algorithm
No ratings yet
Clustering and K-Means Algorithm
81 pages
K Mean Clustering
No ratings yet
K Mean Clustering
59 pages
Clustering and K-Mean Algorithm
No ratings yet
Clustering and K-Mean Algorithm
38 pages
Deeplearning - Ai Deeplearning - Ai
No ratings yet
Deeplearning - Ai Deeplearning - Ai
169 pages
Clustering Classification and Intro Neural Network
No ratings yet
Clustering Classification and Intro Neural Network
168 pages
Unit 4
No ratings yet
Unit 4
125 pages
ML Unit III
No ratings yet
ML Unit III
82 pages
M3 - Unsupervised Machine Learning
No ratings yet
M3 - Unsupervised Machine Learning
35 pages
ML Module5 Clustering
No ratings yet
ML Module5 Clustering
71 pages
04 - KMeans Clustering
No ratings yet
04 - KMeans Clustering
56 pages
Kmeans
No ratings yet
Kmeans
92 pages
ML Lecture#04
No ratings yet
ML Lecture#04
40 pages
2021 Clustering
No ratings yet
2021 Clustering
50 pages
Deeplearning - Ai Deeplearning - Ai
No ratings yet
Deeplearning - Ai Deeplearning - Ai
59 pages
CSE 411 ML CH 7
No ratings yet
CSE 411 ML CH 7
24 pages
WINSEM2021-22 ECE6093 ETH VL2021220505450 Reference Material I 23-03-2022 Slides Kmeans
No ratings yet
WINSEM2021-22 ECE6093 ETH VL2021220505450 Reference Material I 23-03-2022 Slides Kmeans
28 pages
K Means Clustering
No ratings yet
K Means Clustering
27 pages
Unit 4
No ratings yet
Unit 4
46 pages
19.1. Partitioning-Based Clustering Algorithms
No ratings yet
19.1. Partitioning-Based Clustering Algorithms
27 pages
Clustering Part1
No ratings yet
Clustering Part1
84 pages
Unsupervised Learning
No ratings yet
Unsupervised Learning
27 pages
K Means - Ipynb - Colab
No ratings yet
K Means - Ipynb - Colab
10 pages
Unsupervised Learning 1
No ratings yet
Unsupervised Learning 1
40 pages
Week 4 - Lecture Slides - K-Means, Mixture Models, & EM
No ratings yet
Week 4 - Lecture Slides - K-Means, Mixture Models, & EM
65 pages
Chapter 3 ML
No ratings yet
Chapter 3 ML
27 pages
Week 9
No ratings yet
Week 9
66 pages
Clustering
No ratings yet
Clustering
28 pages
Lecture 13
No ratings yet
Lecture 13
29 pages
P-3 1 2-Kmeans
No ratings yet
P-3 1 2-Kmeans
43 pages
ML Application in Signal Processing and Communication Engineering
No ratings yet
ML Application in Signal Processing and Communication Engineering
27 pages
Unit 4
No ratings yet
Unit 4
22 pages
Clustering (Class 38-39)
No ratings yet
Clustering (Class 38-39)
45 pages
2 - K-Mean
No ratings yet
2 - K-Mean
39 pages
Machine Learning Chapter 3
No ratings yet
Machine Learning Chapter 3
12 pages
Clustering: Unsupervised Learning
No ratings yet
Clustering: Unsupervised Learning
44 pages
K-MEANS CLUSTERING PPT Kpu
No ratings yet
K-MEANS CLUSTERING PPT Kpu
4 pages
K Means Algorithm
No ratings yet
K Means Algorithm
4 pages
DSML-ML09. Unsupervised Learning
No ratings yet
DSML-ML09. Unsupervised Learning
69 pages
K Means Clustering
No ratings yet
K Means Clustering
22 pages
K Means
No ratings yet
K Means
9 pages
Clustering
No ratings yet
Clustering
4 pages
Clustering Algorithm
No ratings yet
Clustering Algorithm
47 pages
Clustering: Introducción Al Aprendizaje No Supervisado
No ratings yet
Clustering: Introducción Al Aprendizaje No Supervisado
37 pages
WWW Simplilearn Com Tutorials Machine Learning Tutorial K Means Clustering Algor
No ratings yet
WWW Simplilearn Com Tutorials Machine Learning Tutorial K Means Clustering Algor
19 pages
Unit 4 Clustering - K-Means and Hierarchical
No ratings yet
Unit 4 Clustering - K-Means and Hierarchical
40 pages
Clusterin G: Unsupervised Learning
No ratings yet
Clusterin G: Unsupervised Learning
29 pages
Clustering
No ratings yet
Clustering
6 pages
Clustering Kmeans
No ratings yet
Clustering Kmeans
6 pages
Clustering: Unsupervised Learning Introduc3on
No ratings yet
Clustering: Unsupervised Learning Introduc3on
29 pages
Presentation: Operating System Concept CS-582
No ratings yet
Presentation: Operating System Concept CS-582
13 pages
Mod4 - Unsupervised Learning
No ratings yet
Mod4 - Unsupervised Learning
9 pages
Stat 390 Presentation 2
No ratings yet
Stat 390 Presentation 2
14 pages
02.1 K-Means Example
No ratings yet
02.1 K-Means Example
12 pages
Unsupervised Learning
No ratings yet
Unsupervised Learning
12 pages
1 The K-Medoids Algorithm
No ratings yet
1 The K-Medoids Algorithm
5 pages
Unsupervised Learning
No ratings yet
Unsupervised Learning
24 pages
13: Clustering: Unsupervised Learning - Introduction
No ratings yet
13: Clustering: Unsupervised Learning - Introduction
4 pages
Derivative Analytics With Python
No ratings yet
Derivative Analytics With Python
15 pages
Aluminum 6063 T5
100% (1)
Aluminum 6063 T5
3 pages
Machine Learning
No ratings yet
Machine Learning
122 pages
Stiffness Method Beam
No ratings yet
Stiffness Method Beam
8 pages
COMP3711: Design and Analysis of Algorithms: Tutorial 5 Hkust
100% (1)
COMP3711: Design and Analysis of Algorithms: Tutorial 5 Hkust
31 pages
Line Follower LEGO NXT Robot
No ratings yet
Line Follower LEGO NXT Robot
10 pages
Kelley C.T. - Iterative Methods For optimization-SIAM (1999)
No ratings yet
Kelley C.T. - Iterative Methods For optimization-SIAM (1999)
188 pages
Aluminum 6061 T8
No ratings yet
Aluminum 6061 T8
2 pages
Control Systems Notes DEE M2 June
No ratings yet
Control Systems Notes DEE M2 June
34 pages
Community Detection
No ratings yet
Community Detection
72 pages
Tutorial 5
No ratings yet
Tutorial 5
12 pages
Anushka Tech IITK-2
No ratings yet
Anushka Tech IITK-2
1 page
Unit Iii Greedy and Dynamic Programming
No ratings yet
Unit Iii Greedy and Dynamic Programming
120 pages
Acturial Science Unit 1
No ratings yet
Acturial Science Unit 1
39 pages
Chapter 6 - Optimization Models With Integer Variables: Page 1
No ratings yet
Chapter 6 - Optimization Models With Integer Variables: Page 1
14 pages
Linear Regression With Multiple Variables
100% (1)
Linear Regression With Multiple Variables
38 pages
Preview: Comparison of Machine Learning Algorithms and Their Ensembles For Botnet Detection
100% (2)
Preview: Comparison of Machine Learning Algorithms and Their Ensembles For Botnet Detection
11 pages
Docs Slides Lecture15
No ratings yet
Docs Slides Lecture15
37 pages
Lesson2 1
No ratings yet
Lesson2 1
49 pages
Aluminum 6063 T831
No ratings yet
Aluminum 6063 T831
2 pages
Tutorial 2
No ratings yet
Tutorial 2
12 pages
Multidimensional Scaling by Optimizing Goodness of Fit To A Nonmetric Hypothesis
No ratings yet
Multidimensional Scaling by Optimizing Goodness of Fit To A Nonmetric Hypothesis
27 pages
Aluminum 6061 T91
No ratings yet
Aluminum 6061 T91
2 pages
Tutorial 14 - Importing Implicit Into Explicit
No ratings yet
Tutorial 14 - Importing Implicit Into Explicit
6 pages
Support Vector Machines Optimization Objective: Machine Learning
No ratings yet
Support Vector Machines Optimization Objective: Machine Learning
31 pages
Application Example: Photo OCR Problem Description and Pipeline
No ratings yet
Application Example: Photo OCR Problem Description and Pipeline
29 pages
Attacking OpenSSL Implementation of ECDSA With A Few Signatures.
No ratings yet
Attacking OpenSSL Implementation of ECDSA With A Few Signatures.
11 pages
Aluminum 6063 T835
No ratings yet
Aluminum 6063 T835
2 pages
Large Scale Machine Learning
No ratings yet
Large Scale Machine Learning
24 pages
Recommender Systems Problem Formulation: Machine Learning
No ratings yet
Recommender Systems Problem Formulation: Machine Learning
22 pages
Aluminum 6063 T4
No ratings yet
Aluminum 6063 T4
2 pages
An Introduction To Symmetry in TLA+ - Jack Vanlightly
No ratings yet
An Introduction To Symmetry in TLA+ - Jack Vanlightly
15 pages
ML Lab Exp 7 K-Means Clustering
No ratings yet
ML Lab Exp 7 K-Means Clustering
14 pages
Aluminum 6063 T83
No ratings yet
Aluminum 6063 T83
2 pages
Lecture1 Slides
No ratings yet
Lecture1 Slides
10 pages
Composite Delamination
No ratings yet
Composite Delamination
13 pages
ITM Chapter 5 New On Probability Distributions
No ratings yet
ITM Chapter 5 New On Probability Distributions
16 pages
Fretting Simulation For Crankshaft-Counterweight Contact: A. Mäntylä and C. Lönnqvist
No ratings yet
Fretting Simulation For Crankshaft-Counterweight Contact: A. Mäntylä and C. Lönnqvist
17 pages
MTH403 Assignment 231219
No ratings yet
MTH403 Assignment 231219
2 pages
Aluminum 6063 T1
No ratings yet
Aluminum 6063 T1
3 pages
Unit 4 Practice Test
No ratings yet
Unit 4 Practice Test
8 pages
Tutorial 4
No ratings yet
Tutorial 4
8 pages
Cadd Lab
No ratings yet
Cadd Lab
5 pages
The Laws of Thermodynamics - Boundless Chemistry - pdf1
No ratings yet
The Laws of Thermodynamics - Boundless Chemistry - pdf1
4 pages
Modeling Damage
No ratings yet
Modeling Damage
15 pages
Aiml Mahammad Hussain 2
No ratings yet
Aiml Mahammad Hussain 2
6 pages
Microcraking in Composites
No ratings yet
Microcraking in Composites
10 pages
Lesson Plan ME-102 Thermodynamics (EE)
No ratings yet
Lesson Plan ME-102 Thermodynamics (EE)
3 pages
STA4026S 2021 - Continuous Assessment 2 Ver0.0 - 2021!09!29
No ratings yet
STA4026S 2021 - Continuous Assessment 2 Ver0.0 - 2021!09!29
6 pages
Tutorial 8 - Bolts
No ratings yet
Tutorial 8 - Bolts
5 pages
Tutorial 9
No ratings yet
Tutorial 9
6 pages
Maulana Abul Kalam Azad University of Technology, West Bengal (Formerly West Bengal University of Technology)
No ratings yet
Maulana Abul Kalam Azad University of Technology, West Bengal (Formerly West Bengal University of Technology)
2 pages
Controllability, Observability and Multivariable Zeros: Example 1
No ratings yet
Controllability, Observability and Multivariable Zeros: Example 1
7 pages
Solution:: TSN2101/TOS2111 - Tutorial 4 (Process Scheduling) - Solutions
No ratings yet
Solution:: TSN2101/TOS2111 - Tutorial 4 (Process Scheduling) - Solutions
4 pages
Profound Python Data Science
From Everand
Profound Python Data Science
Onder Teker
No ratings yet

Clustering: Unsupervised Learning

Uploaded by

Clustering: Unsupervised Learning

Uploaded by

Clustering

Market segmentation Social network analysis

Image credit: NASA/JPL-Caltech/E. Churchwell (Univ. of Wisconsin, Madison)

Organize computing clusters Astronomical data analysis

Randomly initialize cluster centroids

Randomly initialize cluster centroids

Randomly initialize cluster centroids

Randomly pick training

Set equal to these

Randomly initialize K-means.

Pick clustering that gave lowest cost

(no. of clusters) (no. of clusters)

E.g. T-shirt sizing T-shirt sizing

You might also like