
Clustering

What is Clustering?

Clustering is the most popular form of unsupervised learning.

In unsupervised learning, the goal is to identify patterns or structures in the data without any prior knowledge of what to expect.

In clustering, the goal is to group data points based on their similarity.
Why is it useful?

Suppose you are the head of a retail store and wish to understand the preferences of your customers. Can you look at the details of each customer and devise a unique business strategy for each one of them?

What you can do is cluster all of your customers into, say, 5 groups based on their purchasing habits and use a separate strategy for each group.
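As an illustration only (not part of the original slides), here is a minimal sketch of that idea in Python, assuming scikit-learn is available and using made-up purchasing features (annual spend and visits per month) as stand-ins for real customer records:

```python
# Hypothetical sketch: segment customers into 5 groups from two
# made-up purchasing features (annual spend, visits per month).
import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(42)
# fake customer data: 200 customers x 2 features (stand-in for real records)
customers = rng.normal(loc=[[500, 4]], scale=[[150, 1.5]], size=(200, 2))

kmeans = KMeans(n_clusters=5, n_init=10, random_state=0)
segments = kmeans.fit_predict(customers)  # one segment label per customer

for k in range(5):
    group = customers[segments == k]
    print(f"segment {k}: {len(group)} customers, "
          f"avg spend {group[:, 0].mean():.0f}, avg visits {group[:, 1].mean():.1f}")
```

Each segment's averages then suggest which strategy to apply to that group.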
Desirable Properties of a Clustering Algorithm

1. Scalability (in terms of both time and space)
2. Ability to deal with different data types
3. Minimal requirements for domain knowledge to determine input parameters
4. Ability to deal with noise and outliers
5. Insensitivity to the order of input records
6. Incorporation of user-specified constraints
7. Interpretability and usability
Clustering Algorithms

a. Exclusive Clustering
Exclusive clustering is a form of grouping that requires a data point to exist in only one cluster. This is also referred to as "hard" clustering. The K-means clustering algorithm is an example of exclusive clustering.
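For concreteness, a minimal NumPy sketch of the K-means (Lloyd's) loop is shown below. It is an illustrative implementation rather than the exact routine from any particular library, and its point is the hard assignment: every point receives exactly one cluster label.

```python
# Minimal sketch of K-means (Lloyd's algorithm) in NumPy.
import numpy as np

def kmeans(X, k, n_iter=50, seed=0):
    rng = np.random.default_rng(seed)
    # start from k randomly chosen data points as initial centroids
    centroids = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(n_iter):
        # hard assignment: index of the nearest centroid for each point
        dists = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # move each centroid to the mean of the points assigned to it
        # (keep the old centroid if a cluster happens to be empty)
        centroids = np.array([
            X[labels == j].mean(axis=0) if np.any(labels == j) else centroids[j]
            for j in range(k)
        ])
    return labels, centroids
```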
Clustering Algorithms

b. Overlapping Clustering
Overlapping clustering differs from exclusive clustering in that it allows data points to belong to multiple clusters with different degrees of membership. "Soft" or fuzzy k-means clustering is an example of overlapping clustering.
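A minimal NumPy sketch of the fuzzy c-means update is shown below, purely for illustration; the function name and parameters (the fuzzifier m, the iteration count) are assumptions. The key difference from hard K-means is that each row of the membership matrix sums to 1 instead of being a single label.

```python
# Illustrative fuzzy c-means: returns cluster centers and a soft
# membership matrix U, where U[i, k] is point i's degree of membership
# in cluster k and each row of U sums to 1.
import numpy as np

def fuzzy_c_means(X, n_clusters=3, m=2.0, n_iter=100, seed=0):
    rng = np.random.default_rng(seed)
    # random initial memberships, normalised per point
    U = rng.random((len(X), n_clusters))
    U /= U.sum(axis=1, keepdims=True)
    for _ in range(n_iter):
        Um = U ** m
        # centers are membership-weighted means of the data
        centers = (Um.T @ X) / Um.sum(axis=0)[:, None]
        # distances from every point to every center (small epsilon avoids 0-division)
        d = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2) + 1e-10
        # membership update: u_ik proportional to d_ik^(-2/(m-1))
        inv = d ** (-2.0 / (m - 1.0))
        U = inv / inv.sum(axis=1, keepdims=True)
    return centers, U
```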
Clustering Algorithms

c. Hierarchical Clustering
Hierarchical clustering can be categorized in two ways: agglomerative or divisive. Agglomerative clustering is considered a "bottom-up" approach: data points start out as separate groupings and are then merged together iteratively on the basis of similarity until a single cluster remains.
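Assuming SciPy is available, a short sketch of agglomerative clustering might look like the following: linkage records the bottom-up merge history (the dendrogram), and fcluster cuts the resulting tree into a chosen number of clusters. The toy data and the choice of Ward linkage are illustrative assumptions.

```python
# Sketch of agglomerative (bottom-up) clustering with SciPy.
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster

rng = np.random.default_rng(0)
X = rng.normal(size=(30, 2))  # toy data

# 'ward' merges, at each step, the pair of clusters that least increases total variance
Z = linkage(X, method="ward")                    # full merge history (dendrogram)
labels = fcluster(Z, t=3, criterion="maxclust")  # cut the tree into 3 clusters
print(labels)
```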
Clustering Algorithms

d. Probabilistic Clustering
In probabilistic clustering, data points are clustered
based on the likelihood that they belong to a particular
distribution. The Gaussian Mixture Model (GMM) is one
of the most commonly used probabilistic clustering
methods.
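Assuming scikit-learn is available, a brief sketch of GMM-based clustering could look like this; predict_proba returns the per-component membership probabilities, and the two-blob toy data is an assumption for illustration.

```python
# Sketch of probabilistic clustering with a Gaussian Mixture Model:
# each point gets a probability of belonging to each component.
import numpy as np
from sklearn.mixture import GaussianMixture

rng = np.random.default_rng(0)
# toy data drawn from two well-separated blobs
X = np.vstack([rng.normal(0, 1, size=(100, 2)),
               rng.normal(5, 1, size=(100, 2))])

gmm = GaussianMixture(n_components=2, random_state=0).fit(X)
probs = gmm.predict_proba(X)  # soft assignments: each row sums to 1
hard = probs.argmax(axis=1)   # most likely component, if a hard label is needed
print(probs[:3])
```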
Applications of Clustering

1. Marketing: To characterize and discover customer segments for marketing purposes.
2. Biology: For classification among different species of plants and animals.
3. Libraries: Clustering different books on the basis of topics and information.
4. City Planning: To group houses and study their values based on their geographical locations and other factors.
Challenges

1. Computational complexity due to a high volume of training data
2. Longer training times
3. Higher risk of inaccurate results
4. Need for human intervention to validate the output
5. Lack of transparency into the basis on which data was clustered
Follow #DataRanch on
LinkedIn for more...
[email protected]

linkedin.com/company/dataranch
