Unsupervised Learning

Uploaded by

Revathi Kalyanasundaram

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

50 views12 pages

Unsupervised Learning

Uploaded by

Revathi Kalyanasundaram

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 12

K-means Clustering

Presented By:
Akanksha Kaushik
Assistant Professor
The NorthCap University
Introduction to Unsupervised
Machine Learning

• Unsupervised Machine Learning is the process of

teaching a computer to use unlabeled, unclassified data
and enabling the algorithm to operate on that data
without supervision.

• Without any previous data training, the machine’s job in

this case is to organize unsorted data according to
parallels, patterns, and variations.
Introduction to K-
means Clustering

• K-Means Clustering is an Unsupervised Machine

Learning algorithm, which groups the unlabeled dataset
into different clusters.
• K means clustering, assigns data points to one of the K
clusters depending on their distance from the center of
the clusters.
• It starts by randomly assigning the clusters centroid in
the space.
• Then each data point assign to one of the cluster based
on its distance from centroid of the cluster.
• After assigning each point to one of the cluster, new
cluster centroids are assigned.
• This process runs iteratively until it finds good cluster.
• In the analysis we assume that number of cluster is given
in advanced, and we must put points in one of the group.
• In some cases, K is not clearly defined, and we must think
about the optimal number of K.
• K Means clustering performs best data is well separated. When
data points overlapped this clustering is not suitable.
• K Means is faster as compared to other clustering technique.
• It provides strong coupling between the data points.
• K Means cluster do not provide clear information regarding the
quality of clusters.
• Different initial assignment of cluster centroid may lead to
different clusters.
• Also, K Means algorithm is sensitive to noise. It may have stuck in
local minima.
Objective of k-means
clustering
• The goal of clustering is to divide the population
or set of data points into a number of groups so
that the data points within each group are more
comparable to one another and different from the
data points within the other groups.
• It is essentially a grouping of things based on how
similar and different they are to one another.
How k-means clustering
works?
• We are given a data set of items, with certain features, and
values for these features (like a vector).
• The task is to categorize those items into groups.
• To achieve this, we will use the K-means algorithm, an
unsupervised learning algorithm.
• ‘K’ in the name of the algorithm represents the number of
groups/clusters we want to classify our items into.
• The algorithm will categorize the items into k
groups or clusters of similarity.
• To calculate that similarity, we will use the
Euclidean distance as a measurement.
• The algorithm works as follows:

• First, we randomly initialize k points, called means

or cluster centroids.
• We categorize each item to its closest mean, and
we update the mean’s coordinates, which are the
averages of the items categorized in that cluster so
far.
• We repeat the process for a given number of
iterations and at the end, we have our clusters.
• The “points” mentioned above are called means because they
are the mean values of the items categorized in them.
• To initialize these means, we have a lot of options.
• An intuitive method is to initialize the means at random
items in the data set.
• Another method is to initialize the means at random values
between the boundaries of the data set (if for a feature x, the
items have values in [0,3], we will initialize the means with
values for x at [0,3]).
• The above algorithm in pseudocode is as follows:
Initialize k means with random values
--> For a given number of iterations:

--> Iterate through items:

--> Find the mean closest to the item by calculating

the euclidean distance of the item with each of the means

--> Assign item to mean

--> Update mean by shifting it to the average of the items in

that cluster
Let’s get started

10.1007 978 3 030 43887 6 PDF
100% (1)
10.1007 978 3 030 43887 6 PDF
755 pages
Inductive Learning and Machine Learning
100% (1)
Inductive Learning and Machine Learning
321 pages
RapidMiner Fact Sheet
No ratings yet
RapidMiner Fact Sheet
11 pages
BI MCQs
33% (3)
BI MCQs
20 pages
Unit 5
No ratings yet
Unit 5
20 pages
K Means Clustering
No ratings yet
K Means Clustering
11 pages
Introduction To Data Science and Machine Learning
No ratings yet
Introduction To Data Science and Machine Learning
23 pages
Ai Board Paper
No ratings yet
Ai Board Paper
6 pages
Clustering K-Means
100% (2)
Clustering K-Means
28 pages
IP™ Modules: Working Together For A Safer World
No ratings yet
IP™ Modules: Working Together For A Safer World
12 pages
Class Xii Summer Holiday Homework All Merged
No ratings yet
Class Xii Summer Holiday Homework All Merged
97 pages
Clustering
No ratings yet
Clustering
125 pages
Unit IV
No ratings yet
Unit IV
96 pages
Clustering Part1
No ratings yet
Clustering Part1
84 pages
Clustering
No ratings yet
Clustering
84 pages
ML Unit 4
No ratings yet
ML Unit 4
110 pages
Unit 4
No ratings yet
Unit 4
125 pages
Unsupervised Learning 2024-PPG
No ratings yet
Unsupervised Learning 2024-PPG
85 pages
Clustering Algorithm
No ratings yet
Clustering Algorithm
47 pages
Eti 22618
0% (1)
Eti 22618
11 pages
04-FSSR DS610 2024 2025T1 Kmeans
No ratings yet
04-FSSR DS610 2024 2025T1 Kmeans
57 pages
Week 9
No ratings yet
Week 9
66 pages
Algo
No ratings yet
Algo
59 pages
3 Hotspot Analysis Module
0% (1)
3 Hotspot Analysis Module
9 pages
Chapter 3 - For Class
No ratings yet
Chapter 3 - For Class
52 pages
Data Science Final Report
No ratings yet
Data Science Final Report
61 pages
DM&BAFall2204 2
No ratings yet
DM&BAFall2204 2
61 pages
Unit 4 Clustering - K-Means and Hierarchical
No ratings yet
Unit 4 Clustering - K-Means and Hierarchical
40 pages
Big Data Journal
No ratings yet
Big Data Journal
50 pages
L7 Clustering
No ratings yet
L7 Clustering
58 pages
Data Mining For Financial Statement Analysis
100% (1)
Data Mining For Financial Statement Analysis
4 pages
Kmea
No ratings yet
Kmea
53 pages
ML UNIT 4 Sir
No ratings yet
ML UNIT 4 Sir
42 pages
DSV - Unit 3 - Data Analysis in Depth
No ratings yet
DSV - Unit 3 - Data Analysis in Depth
53 pages
ML Unit-4 Final 2024-25
No ratings yet
ML Unit-4 Final 2024-25
28 pages
Big Data and Artificial Intelligence in The Fields of Accounting and Auditing A Bibliometric Analysis
No ratings yet
Big Data and Artificial Intelligence in The Fields of Accounting and Auditing A Bibliometric Analysis
28 pages
Lecture - 10 Unsupervised Learning & K-Means Clustering
No ratings yet
Lecture - 10 Unsupervised Learning & K-Means Clustering
31 pages
Swayam Week 4
No ratings yet
Swayam Week 4
4 pages
K Mean Clustering
No ratings yet
K Mean Clustering
32 pages
ML Unit-2
No ratings yet
ML Unit-2
31 pages
Data Mining - IMT Nagpur-Manish
No ratings yet
Data Mining - IMT Nagpur-Manish
82 pages
Unsupervised Learning
No ratings yet
Unsupervised Learning
23 pages
K Mean Clustering
No ratings yet
K Mean Clustering
27 pages
K Mean Clustering1
No ratings yet
K Mean Clustering1
23 pages
DBSCAN Presentation
No ratings yet
DBSCAN Presentation
10 pages
AI, ML & DS Brochure
No ratings yet
AI, ML & DS Brochure
10 pages
Unsupervised Learning
No ratings yet
Unsupervised Learning
27 pages
Week 14 and 15 Machine Learning Unsupervised 2
No ratings yet
Week 14 and 15 Machine Learning Unsupervised 2
25 pages
Clustering Algorithms
No ratings yet
Clustering Algorithms
19 pages
K Means Algorithm
No ratings yet
K Means Algorithm
4 pages
Chapter 4 ML
No ratings yet
Chapter 4 ML
30 pages
Unit 4 Aam
No ratings yet
Unit 4 Aam
26 pages
ADL LAB Manual
No ratings yet
ADL LAB Manual
27 pages
16 K Mean Clustring 1 18052023 095249am 08042024 093324am
No ratings yet
16 K Mean Clustring 1 18052023 095249am 08042024 093324am
20 pages
Kmean
No ratings yet
Kmean
24 pages
ML Unit3
No ratings yet
ML Unit3
21 pages
K Means Clustering
No ratings yet
K Means Clustering
22 pages
Clustering Techniques - Hierarchical, K-Means Clustering
No ratings yet
Clustering Techniques - Hierarchical, K-Means Clustering
22 pages
Clustering
No ratings yet
Clustering
18 pages
Introduction To Unsupervised Learning:: Clustering
No ratings yet
Introduction To Unsupervised Learning:: Clustering
21 pages
ML Unit5 Notes
No ratings yet
ML Unit5 Notes
18 pages
ML 12
No ratings yet
ML 12
19 pages
K - Means Clustering
No ratings yet
K - Means Clustering
13 pages
U1 - KMeans - 5th Sem - DS
No ratings yet
U1 - KMeans - 5th Sem - DS
14 pages
Journal of Parallel and Distributed Computing
No ratings yet
Journal of Parallel and Distributed Computing
13 pages
Unit 3 - KmeansClustering
No ratings yet
Unit 3 - KmeansClustering
17 pages
KMeans Clustering
No ratings yet
KMeans Clustering
16 pages
A Paper With 12pt Global Font Size
No ratings yet
A Paper With 12pt Global Font Size
13 pages
K Means Clustering
No ratings yet
K Means Clustering
11 pages
Lesson 5 - Unsupervised Learning
No ratings yet
Lesson 5 - Unsupervised Learning
11 pages
1 Kmeans
No ratings yet
1 Kmeans
13 pages
Assessing A Single Classification Algorithm and Two Classification Algorithms
No ratings yet
Assessing A Single Classification Algorithm and Two Classification Algorithms
12 pages
Customer Segmentation Literature Review 1
No ratings yet
Customer Segmentation Literature Review 1
8 pages
Mod4 - Unsupervised Learning
No ratings yet
Mod4 - Unsupervised Learning
9 pages
10.lab Activity
No ratings yet
10.lab Activity
11 pages
Machine Learning Chapter 3
No ratings yet
Machine Learning Chapter 3
12 pages
Swayam Week 7 Assignment
No ratings yet
Swayam Week 7 Assignment
7 pages
Machine Learning Algorithms Overview
No ratings yet
Machine Learning Algorithms Overview
6 pages
Minor Project
No ratings yet
Minor Project
10 pages
23080-Article Text-35909-1-10-20210530
No ratings yet
23080-Article Text-35909-1-10-20210530
8 pages
Chapter 9
No ratings yet
Chapter 9
8 pages
Data Mining
No ratings yet
Data Mining
7 pages
Clustering Kmeans
No ratings yet
Clustering Kmeans
6 pages
K Mean
No ratings yet
K Mean
7 pages
Agricultural Crop Yield
No ratings yet
Agricultural Crop Yield
7 pages
ISYE 6740 - (SU22) Syllabus
No ratings yet
ISYE 6740 - (SU22) Syllabus
6 pages
A Tutorial On Clustering Algorithms
No ratings yet
A Tutorial On Clustering Algorithms
4 pages
K Mean
No ratings yet
K Mean
12 pages
Survey Paper On UWSN
No ratings yet
Survey Paper On UWSN
4 pages
Swyam Week2 Assignment
No ratings yet
Swyam Week2 Assignment
3 pages
AI11
No ratings yet
AI11
2 pages
22 - 14.8247 - Muhamad Saka Sotyasaksi - POSTER
No ratings yet
22 - 14.8247 - Muhamad Saka Sotyasaksi - POSTER
1 page
Machine Learning with Clustering: A Visual Guide for Beginners with Examples in Python
From Everand
Machine Learning with Clustering: A Visual Guide for Beginners with Examples in Python
Artem Kovera
No ratings yet
K Nearest Neighbor Algorithm: Fundamentals and Applications
From Everand
K Nearest Neighbor Algorithm: Fundamentals and Applications
Fouad Sabry
No ratings yet

Unsupervised Learning

Uploaded by

Unsupervised Learning

Uploaded by

K-means Clustering

• Unsupervised Machine Learning is the process of

• Without any previous data training, the machine’s job in

• K-Means Clustering is an Unsupervised Machine

• First, we randomly initialize k points, called means

--> Iterate through items:

--> Find the mean closest to the item by calculating

--> Assign item to mean

--> Update mean by shifting it to the average of the items in

You might also like