
Unsupervised Learning - Clustering

This document discusses unsupervised machine learning and clustering. It provides an overview of clustering techniques like k-means clustering. K-means clustering partitions unlabeled data into k clusters where each cluster has a centroid. It discusses how k-means works by assigning data points to the closest centroid, recomputing centroids, and repeating until convergence. The document notes some weaknesses of k-means like sensitivity to outliers, initial seeds, and not knowing the optimal number of clusters k beforehand.

Uploaded by

Spandan Rout ms17a058


Unsupervised Machine Learning - Clustering

May 2020

SECRET
Knowledge Share - Session plan

Topic | Application | Schedule
Overview of Machine Learning and Feature Selection | Generic | 19-Feb
Regression - Supervised Machine Learning | Market Share Forecast / Inventory Obsolescence | 13-Mar
Classification - Supervised Machine Learning | Technician Attrition | 10-Apr
Clustering - Unsupervised Machine Learning | Dealer/Parts Clustering | 8-May
Bagging & Boosting - Ensemble Methods | Service Parts Forecasting | 5-Jun
Genetic Algorithm - Reinforcement Learning | Vehicle Route Optimization | 3-Jul
Linear Programming and Mathematical Optimization | Container Loading/Vanning | 31-Jul
Dimension Reduction & Pattern Search - Unsupervised Machine Learning | Generic | 28-Aug

Descriptive, Predictive & Prescriptive Analytics

Machine Learning Universe

What is clustering?

• Clustering is the organization of unlabeled data into similarity groups called clusters.
• A cluster is a collection of data items that are "similar" to one another and "dissimilar" to data items in other clusters.
Historic application of clustering

Clustering techniques

Divisive

K-means
K-Means clustering
• K-means (MacQueen, 1967) is a partitional clustering algorithm.
• Let the set of data points be $D = \{x_1, x_2, \ldots, x_n\}$, where $x_i = (x_{i1}, x_{i2}, \ldots, x_{ir})$ is a vector in $X \subseteq \mathbb{R}^r$ and $r$ is the number of dimensions.
• The k-means algorithm partitions the given data into k clusters:
– Each cluster has a cluster center, called the centroid.
– k is specified by the user.
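The partitioning described above can be sketched in a few lines of plain Python. This is only an illustrative sketch of the standard assign-then-recompute loop, not the presenter's implementation: the function name `kmeans`, the `initial_centroids` parameter, and the toy `points` are assumptions made for the example.

```python
import math


def kmeans(points, initial_centroids, max_iter=100):
    """Assign points to the closest centroid, recompute centroids, repeat.

    points            -- list of equal-length tuples (the vectors x_i)
    initial_centroids -- list of k starting centroids (hypothetical parameter)
    Returns (centroids, labels).
    """
    centroids = list(initial_centroids)
    k = len(centroids)
    labels = None
    for _ in range(max_iter):
        # Assignment step: each point goes to its closest centroid.
        new_labels = [
            min(range(k), key=lambda j: math.dist(p, centroids[j]))
            for p in points
        ]
        if new_labels == labels:
            break  # No assignment changed, so the loop has converged.
        labels = new_labels
        # Update step: each centroid becomes the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # An empty cluster keeps its previous centroid.
                centroids[j] = tuple(sum(dim) / len(members) for dim in zip(*members))
    return centroids, labels


# Toy data (made up for illustration): two well-separated groups, k = 2.
points = [(1.0, 1.0), (1.2, 0.8), (0.8, 1.1), (8.0, 8.0), (8.2, 7.9), (7.9, 8.3)]
centroids, labels = kmeans(points, initial_centroids=[points[0], points[3]])
print(labels)
```

Passing the starting centroids explicitly keeps the run reproducible; in practice they are often drawn at random from the data.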
K-means clustering example: step 1

K-means clustering example: step 2
K-means clustering example: step 3
K-means clustering example

Weaknesses of K-means
• The algorithm is only applicable if the mean is defined.
– For categorical data, use k-modes, where the centroid is represented by the most frequent values.
• The user needs to specify k.
• The result is sensitive to the initial seeds.
• The algorithm is sensitive to outliers.
– Outliers are data points that are very far away from other data points.
– Outliers could be errors in the data recording or special data points with very different values.
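Two of the weaknesses above can be made concrete with a small sketch. It shows how a single outlier drags a mean-based centroid, and how a k-modes style centroid can be formed for categorical records by taking the most frequent value per attribute. The helper names `mean_centroid` and `mode_centroid` and all data values are assumptions made for illustration.

```python
from collections import Counter
from statistics import mean


def mean_centroid(points):
    """Centroid as the per-dimension mean, as k-means uses."""
    return tuple(mean(dim) for dim in zip(*points))


def mode_centroid(records):
    """Centroid as the per-attribute most frequent value (k-modes style)."""
    return tuple(Counter(attr).most_common(1)[0][0] for attr in zip(*records))


# Made-up numeric cluster, then the same cluster with one far-away point.
cluster = [(1.0, 1.0), (2.0, 1.0), (1.0, 2.0), (2.0, 2.0)]
with_outlier = cluster + [(50.0, 50.0)]

clean = mean_centroid(cluster)        # sits in the middle of the four points
pulled = mean_centroid(with_outlier)  # dragged far toward the outlier

# Made-up categorical records: a mean is undefined, a mode is not.
records = [("red", "suv"), ("red", "sedan"), ("blue", "suv")]
typical = mode_centroid(records)

print(clean, pulled, typical)
```

The pulled centroid no longer lies near any of the original four points, which is why outliers are often removed or down-weighted before clustering.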
Optimal number of clusters

Within Cluster Sum of Squares (WCSS)
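WCSS is the sum, over all points, of the squared distance from each point to the centroid of its cluster. A common way to pick k is to compute WCSS for several values of k and look for the "elbow" where further increases stop paying off. The sketch below assumes that reading of the slide; the helper names, the evenly spaced initialization, and the toy data are illustrative choices, not taken from the deck.

```python
import math


def kmeans(points, initial_centroids, max_iter=100):
    """Plain k-means loop; returns (centroids, labels)."""
    centroids = list(initial_centroids)
    k = len(centroids)
    labels = None
    for _ in range(max_iter):
        new_labels = [
            min(range(k), key=lambda j: math.dist(p, centroids[j]))
            for p in points
        ]
        if new_labels == labels:
            break
        labels = new_labels
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                centroids[j] = tuple(sum(dim) / len(members) for dim in zip(*members))
    return centroids, labels


def wcss(points, centroids, labels):
    """Within-cluster sum of squares: squared distance of each point to its centroid."""
    return sum(math.dist(p, centroids[lab]) ** 2 for p, lab in zip(points, labels))


# Made-up data with three visually obvious groups.
points = [
    (0, 0), (0, 1), (1, 0),
    (10, 10), (10, 11), (11, 10),
    (20, 0), (20, 1), (21, 0),
]

wcss_by_k = {}
for k in range(1, 5):
    # Assumption: deterministic seeds taken at evenly spaced positions in the list.
    init = [points[i * len(points) // k] for i in range(k)]
    centroids, labels = kmeans(points, init)
    wcss_by_k[k] = wcss(points, centroids, labels)

for k, value in wcss_by_k.items():
    print(k, round(value, 2))
```

On this data WCSS falls sharply up to k = 3 and only marginally afterwards, so k = 3 is the elbow.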

Sensitivity to initial seeds

[Figure: two runs with different random selections of seeds (centroids), each shown at iteration 1 and iteration 2]
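The effect of the starting seeds can be reproduced on a tiny example. In the sketch below, the same four points are clustered twice with k = 2, once from each of two different seed pairs, and the two runs settle on different partitions with very different WCSS. The data, the helper names, and the two seed choices are assumptions made for illustration.

```python
import math


def kmeans(points, initial_centroids, max_iter=100):
    """Plain k-means loop; returns (centroids, labels)."""
    centroids = list(initial_centroids)
    k = len(centroids)
    labels = None
    for _ in range(max_iter):
        new_labels = [
            min(range(k), key=lambda j: math.dist(p, centroids[j]))
            for p in points
        ]
        if new_labels == labels:
            break
        labels = new_labels
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                centroids[j] = tuple(sum(dim) / len(members) for dim in zip(*members))
    return centroids, labels


def wcss(points, centroids, labels):
    """Within-cluster sum of squares for a finished clustering."""
    return sum(math.dist(p, centroids[lab]) ** 2 for p, lab in zip(points, labels))


# Corners of a wide rectangle: the natural split is left pair vs right pair.
points = [(0.0, 0.0), (0.0, 1.0), (4.0, 0.0), (4.0, 1.0)]

# Seeds on opposite sides find the left/right split.
good_centroids, good_labels = kmeans(points, [(0.0, 0.0), (4.0, 0.0)])
# Seeds on the same side get stuck in a top/bottom split.
bad_centroids, bad_labels = kmeans(points, [(0.0, 0.0), (0.0, 1.0)])

good_wcss = wcss(points, good_centroids, good_labels)
bad_wcss = wcss(points, bad_centroids, bad_labels)
print(good_labels, good_wcss)
print(bad_labels, bad_wcss)
```

Both runs converge, but only the first reaches the lower WCSS. A common mitigation is to run k-means from several different seeds and keep the result with the lowest WCSS.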

Outliers

K-means summary

• Despite its weaknesses, k-means is still the most popular clustering algorithm due to its simplicity and efficiency.
• There is no clear evidence that any other clustering algorithm performs better in general.
• Comparing different clustering algorithms is a difficult task. No one knows the correct clusters!
