0% found this document useful (0 votes)

51 views9 pages

Clustering

Clustering is an unsupervised machine learning technique used to group unlabeled data points that are similar to each other. It divides the data points into a number of groups where data points within each group are more similar to other data points in the same group than those in other groups. Clustering is commonly used in pattern recognition, image analysis, and machine learning. It is useful for data reduction, finding natural groupings in data, and outlier detection. Common clustering models include connectivity, centroid, distribution, and density models. Clustering has various applications such as market segmentation, data analysis, social network analysis, and image segmentation.

Uploaded by

nikhil shinde

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

51 views9 pages

Clustering

Uploaded by

nikhil shinde

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

Jan 30 2023

Introduction to Clustering
 It is basically a type of unsupervised learning method.
 An unsupervised learning method is a method in
which we draw references from datasets consisting of
input data without labeled responses.
 Generally, it is used as a process to find meaningful
structure, explanatory underlying processes,
generative features, and groupings inherent in a set of
examples
Overview
 Clustering is the task of dividing the population or
data points into a number of groups such that data
points in the same groups are more similar to other
data points in the same group and dissimilar to the
data points in other groups.
 It is basically a collection of objects on the basis of
similarity and dissimilarity between them.
Overview
 It is a main task of exploratory data analysis, and a
common technique for statistical data analysis, used in
many fields, including
 pattern recognition,
 image analysis,
 machine learning.
Overview
Overview
Why Clustering
 Clustering is very much important as it determines the
intrinsic grouping among the unlabelled data present.
 There are no criteria for good clustering. It depends on the
user, what is the criteria they may use which satisfy their
need. For instance, we could be interested in finding
representatives for homogeneous groups (data reduction),
in finding “natural clusters” and describe their unknown
properties (“natural” data types), in finding useful and
suitable groupings (“useful” data classes) or in finding
unusual data objects (outlier detection).
 This algorithm must make some assumptions that
constitute the similarity of points and each assumption
make different and equally valid clusters.
Cluster Model types
Typical cluster models include:
 Connectivity models: for example, hierarchical
clustering builds models based on distance connectivity.
 Centroid models: for example, the k-means
algorithm represents each cluster by a single mean vector.
 Distribution models: clusters are modeled using statistical
distributions, such as multivariate normal
distributions used by the expectation-maximization
algorithm.
 Density models: for example, DBSCAN and OPTICS defines
clusters as connected dense regions in the data space.
Clustering Uses
The clustering technique can be widely used in various
tasks. Some most common uses of this technique are:
 Market Segmentation
 Statistical data analysis
 Social network analysis
 Image segmentation
 Anomaly detection, etc.

Key Difference Between in SAP ECC & S4HANA
No ratings yet
Key Difference Between in SAP ECC & S4HANA
10 pages
A Step by Step Backpropagation Example - Matt Mazur
No ratings yet
A Step by Step Backpropagation Example - Matt Mazur
7 pages
Combining Multiple Sources of Knowledge in Deep Cnns For Action Recognition
No ratings yet
Combining Multiple Sources of Knowledge in Deep Cnns For Action Recognition
8 pages
T05 PDF
No ratings yet
T05 PDF
24 pages
FPGA Implementation of A Convolutional Neural Network For Wake Up Word Detection - Project Assignment - Ole Martin Skafsa - NTNU
No ratings yet
FPGA Implementation of A Convolutional Neural Network For Wake Up Word Detection - Project Assignment - Ole Martin Skafsa - NTNU
120 pages
See If You Can Do This! On Language Focus: WMSU-ISMP-GU-001.00
No ratings yet
See If You Can Do This! On Language Focus: WMSU-ISMP-GU-001.00
4 pages
Lecture 3 - MDPs and Dynamic Programming
No ratings yet
Lecture 3 - MDPs and Dynamic Programming
66 pages
Data Analytic For Accounting (DAFA) Main Reference
No ratings yet
Data Analytic For Accounting (DAFA) Main Reference
448 pages
Data Driven Decisions For Business
100% (1)
Data Driven Decisions For Business
14 pages
Three Keys To Building A Data Driven Strategy
No ratings yet
Three Keys To Building A Data Driven Strategy
4 pages
Thesis Object Tracking
100% (3)
Thesis Object Tracking
4 pages
Builder Chat Log
No ratings yet
Builder Chat Log
124 pages
Slam Checklist
No ratings yet
Slam Checklist
1 page
Efficient Multivariable Submarine Depth-Control System Design PDF
0% (1)
Efficient Multivariable Submarine Depth-Control System Design PDF
12 pages
Machine Learning Yearning
100% (1)
Machine Learning Yearning
9 pages
Edge AI and IoT Integration
No ratings yet
Edge AI and IoT Integration
3 pages
Week 1 Introduction To The Machine Learning Course
No ratings yet
Week 1 Introduction To The Machine Learning Course
10 pages
MLP - Week 6 - MNIST - LogitReg - Ipynb - Colaboratory
No ratings yet
MLP - Week 6 - MNIST - LogitReg - Ipynb - Colaboratory
19 pages
A Review of Dimensionality Reduction Techniques For Efficient INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ADVANCED COMPUTING Computation Computation
No ratings yet
A Review of Dimensionality Reduction Techniques For Efficient INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ADVANCED COMPUTING Computation Computation
8 pages
Lecture 09 ML
No ratings yet
Lecture 09 ML
26 pages
Lec 20
No ratings yet
Lec 20
33 pages
dd2437 Annda
No ratings yet
dd2437 Annda
45 pages
(IJCST-V11I6P2) :ms. Madhuri P. Narkhede, Dr. Harshali B Patil
No ratings yet
(IJCST-V11I6P2) :ms. Madhuri P. Narkhede, Dr. Harshali B Patil
5 pages
Evaluation Metrics: Anand Avati
No ratings yet
Evaluation Metrics: Anand Avati
31 pages
XSS Cross-Site Scripting Attack Detection by Machine Learning Classifiers
No ratings yet
XSS Cross-Site Scripting Attack Detection by Machine Learning Classifiers
5 pages
Sharding in MongoDB
No ratings yet
Sharding in MongoDB
3 pages
Mount Zion College of Engineering & Technology: 100 Marks (Answer All The Questions) PART A - 2 Mark Qs (10x2 20)
100% (1)
Mount Zion College of Engineering & Technology: 100 Marks (Answer All The Questions) PART A - 2 Mark Qs (10x2 20)
1 page
Deep Neural Network Approachesfor Video Based Human Activity Recognition
No ratings yet
Deep Neural Network Approachesfor Video Based Human Activity Recognition
4 pages
CNN Based Features Extraction For Age Estimation A
No ratings yet
CNN Based Features Extraction For Age Estimation A
9 pages
AI Brochure - PDF 3
No ratings yet
AI Brochure - PDF 3
8 pages

Clustering

Uploaded by

Clustering

Uploaded by

Jan 30 2023

You might also like