SHEETAL CHABUKSWAR (Assistant professor)
CLUSTER ANALYSIS
Cluster analysis is a more primitive technique in that no assumptions are
made concerning the number of groups or the group structure. Grouping is done
on the basis of similarity or distance (dissimilarity). The required inputs are
similarities, which can be computed as distances or similarity coefficients for
pairs of items.
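As a sketch of how such a distance input might be computed, the Euclidean distance between two items measured on p variables can be written as follows (the item values are hypothetical, for illustration only):

```python
import math

def euclidean_distance(x, y):
    """Euclidean distance between two items measured on p variables."""
    return math.sqrt(sum((xi - yi) ** 2 for xi, yi in zip(x, y)))

# Two hypothetical items, each measured on three variables.
item_a = [1.0, 2.0, 3.0]
item_b = [4.0, 6.0, 3.0]
print(euclidean_distance(item_a, item_b))  # 5.0
```

Computing this distance for every pair of items gives the symmetric distance matrix that the clustering methods below take as input.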
Suppose five individuals have the following characteristics: height, weight,
eye color, hair color, handedness, and gender.
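For qualitative characteristics such as these, one common similarity measure is the simple matching coefficient: the proportion of attributes on which two individuals agree. A minimal sketch, with entirely hypothetical individuals:

```python
def matching_coefficient(a, b):
    """Proportion of attributes on which two individuals agree."""
    matches = sum(1 for ai, bi in zip(a, b) if ai == bi)
    return matches / len(a)

# Hypothetical individuals described by the six characteristics above
# (height, weight, eye color, hair color, handedness, gender).
p1 = ("tall", 70, "brown", "black", "right", "F")
p2 = ("tall", 65, "brown", "brown", "right", "F")
print(matching_coefficient(p1, p2))  # agrees on 4 of 6 attributes
```

In practice continuous attributes such as height and weight would first be categorized (e.g. tall/short) before such a coefficient is applied.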
Similarity and association measures can likewise be defined for pairs of variables.
Hierarchical Clustering
The goal is to find "reasonable" clusters without having to examine all
configurations. Hierarchical clustering techniques proceed by either a series
of successive mergers or a series of successive divisions.
There are two types of hierarchical methods: agglomerative hierarchical
methods and divisive hierarchical methods.
Agglomerative methods start with the individual objects, so initially
there are as many clusters as objects. The most similar objects are grouped
first, and these initial groups are then merged according to their
similarities. As the similarity decreases, all subgroups are eventually merged
into a single cluster.
Divisive methods work in the opposite direction. An initial single group
of objects is divided into two subgroups such that the objects in one subgroup
are far from the objects in the other. These subgroups are then further divided
into dissimilar subgroups; the process continues until there are as many
subgroups as objects, i.e., until each object forms its own group.
The results of both agglomerative and divisive methods may be displayed
in a two-dimensional diagram known as a dendrogram.
Steps in the agglomerative method (for items or variables):
1) Start with N clusters, each containing a single entity, and an N × N
symmetric matrix of distances (or similarities) D = {dik}.
2) Search the distance matrix for the nearest (most similar) pair of clusters.
Let the distance between the most similar clusters U and V be dUV.
3) Merge clusters U and V, and label the newly formed cluster (UV). Update the
entries in the distance matrix by
i) deleting the rows and columns corresponding to clusters U and V, and
ii) adding a row and column giving the distances between cluster (UV)
and the remaining clusters.
4) Repeat steps 2 and 3 a total of (N − 1) times (all objects will be in a
single cluster at the termination of the algorithm). After the algorithm
terminates, record the identities of the clusters that are merged and the
levels (distances or similarities) at which the mergers take place.
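The steps above can be sketched in Python. The representation of clusters as sets of labels and the `linkage` parameter are illustrative choices, not part of the original notes; for the min and max rules, recomputing the cluster distance over all cross-pairs of members is equivalent to the matrix update in step 3 ii).

```python
def agglomerate(items, d, linkage=min):
    """Agglomerative hierarchical clustering (steps 1-4 above).

    items:   list of object labels (step 1: N singleton clusters).
    d:       dict mapping frozenset pairs of labels to distances.
    linkage: min gives single linkage, max gives complete linkage.
    Returns the merge history as (cluster U, cluster V, level) tuples.
    """
    clusters = [frozenset([i]) for i in items]

    def cdist(u, v):
        # Distance between two clusters under the chosen linkage rule.
        return linkage(d[frozenset([a, b])] for a in u for b in v)

    history = []
    while len(clusters) > 1:
        # Step 2: find the nearest (most similar) pair of clusters.
        u, v = min(((u, v) for i, u in enumerate(clusters)
                    for v in clusters[i + 1:]),
                   key=lambda pair: cdist(*pair))
        level = cdist(u, v)
        history.append((set(u), set(v), level))
        # Step 3: delete U and V, add the merged cluster (UV).
        clusters = [c for c in clusters if c not in (u, v)] + [u | v]
    return history  # step 4: N - 1 mergers recorded with their levels
```

For example, with three objects whose pairwise distances are d(A,B) = 2, d(A,C) = 6, d(B,C) = 3, single linkage merges A and B at level 2, then joins C at level 3.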
Single linkage
Initially, we find the smallest distance in D = {dik} and merge the
corresponding objects, say U and V, to get the cluster (UV). For step 3 of the
general algorithm, the distance between (UV) and any other cluster W is
computed by
d(UV)W = min {dUW, dVW}
Here the quantities dUW and dVW are the distances between the nearest neighbors
of clusters U and W and of clusters V and W, respectively.
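The single-linkage update rule can be checked directly; the distance values below are hypothetical:

```python
# Single-linkage update: the distance from the merged cluster (UV) to
# another cluster W is the smaller of d(U,W) and d(V,W).
d_uw, d_vw = 5.0, 9.0          # hypothetical distances
d_uv_w = min(d_uw, d_vw)
print(d_uv_w)  # 5.0
```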
1) Example: Consider the distance matrix for five objects.
Single-linkage dendrogram for the distances between the five objects.
Complete linkage method
Complete-linkage dendrogram for the distances between the five objects.
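Complete linkage proceeds in the same way as single linkage, except that the distance from a merged cluster to another cluster is the distance between their farthest members: d(UV)W = max {dUW, dVW}. With hypothetical distances:

```python
# Complete-linkage update: the distance from the merged cluster (UV) to
# another cluster W is the larger of d(U,W) and d(V,W), so two clusters
# join only when all their members are mutually close.
d_uw, d_vw = 5.0, 9.0          # hypothetical distances
d_uv_w = max(d_uw, d_vw)
print(d_uv_w)  # 9.0
```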
Suppose we measure two variables X1 and X2 for each of four items A, B, C, and
D. The data are given below.
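The data table is not reproduced in these notes, so the sketch below uses hypothetical (X1, X2) values purely to show how the 4 × 4 Euclidean distance matrix for the four items would be computed:

```python
import math

# Hypothetical (X1, X2) measurements for illustration only; the
# original data table is not reproduced here.
data = {"A": (5, 3), "B": (-1, 1), "C": (1, -2), "D": (-3, -2)}

labels = sorted(data)
# Pairwise Euclidean distances between the four items.
dist = {(i, j): math.dist(data[i], data[j])
        for i in labels for j in labels}

for i in labels:
    print(i, [round(dist[(i, j)], 2) for j in labels])
```

The resulting symmetric matrix (with zeros on the diagonal) is exactly the input D = {dik} required by step 1 of the agglomerative algorithm.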