Hierarchical Clustering Algorithm
Abstract
1. Introduction
Data mining allows us to extract knowledge from historical data and predict outcomes for future situations. Clustering is an important data mining task. It can be
described as the process of organizing objects into groups whose members are similar
in some way. Clustering can also be defined as the process of grouping data into
classes or clusters, so that objects within a cluster have high similarity to one another
but are very dissimilar to objects in other clusters. Clustering is mainly done by two
kinds of methods: hierarchical and partitioning [1].
1226 Yogita Rani & Dr. Harish Rohil
In data mining, hierarchical clustering works by grouping data objects into a tree of
clusters. Hierarchical clustering methods can be further classified into agglomerative
and divisive hierarchical clustering, depending on whether the hierarchical
decomposition is formed in a bottom-up or top-down fashion. Hierarchical
techniques produce a nested sequence of partitions, with a single, all-inclusive cluster
at the top and singleton clusters of individual objects at the bottom. Each intermediate
level can be viewed as combining two clusters from the next lower level or splitting a
cluster from the next higher level. The result of a hierarchical clustering algorithm can
be displayed graphically as a tree, called a dendrogram. This tree shows the merging
process and the intermediate clusters, and how points are progressively merged into a
single cluster.
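As an illustration of the agglomerative (bottom-up) process just described, the following is a minimal Python sketch. The data, the helper names, and the choice of single linkage are illustrative assumptions, not details from the paper; the recorded merge sequence is what a dendrogram displays.

```python
# Minimal agglomerative clustering sketch on 2-D points, using single
# linkage (distance between the closest pair of points across clusters).
from math import dist

def single_linkage(a, b):
    """Smallest pairwise distance between clusters a and b."""
    return min(dist(p, q) for p in a for q in b)

def agglomerate(points):
    """Repeatedly merge the two closest clusters until one remains;
    return the sequence of merges (the dendrogram, as a flat list)."""
    clusters = [[p] for p in points]       # start from singleton clusters
    merges = []
    while len(clusters) > 1:
        # find the pair of clusters with the smallest linkage distance
        i, j = min(
            ((i, j) for i in range(len(clusters))
                    for j in range(i + 1, len(clusters))),
            key=lambda ij: single_linkage(clusters[ij[0]], clusters[ij[1]]),
        )
        merges.append((clusters[i], clusters[j]))
        clusters[i] = clusters[i] + clusters[j]   # merge j into i
        del clusters[j]
    return merges

merges = agglomerate([(0, 0), (0, 1), (5, 5), (5, 6)])
```

Reading the merge list from first to last reproduces the bottom-to-top levels of the dendrogram: the two nearby point pairs merge first, and the final merge produces the single all-inclusive cluster.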
Hierarchical methods suffer from the fact that once a merge or split step has been
performed, it can never be undone. This rigidity is useful in that it leads to smaller
computation costs, since a combinatorial number of different choices need not be
considered. However, such techniques cannot correct erroneous decisions once they
have been made. Two approaches can help improve the quality of hierarchical
clustering: (1) perform careful analysis of object linkages at each hierarchical
partitioning, or (2) integrate hierarchical agglomeration with other approaches, by
first using a hierarchical agglomerative algorithm to group objects into
micro-clusters, and then performing macro-clustering on the micro-clusters using
another clustering method such as iterative relocation [2].
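The second approach can be sketched as follows. This is only an illustration of the two-phase idea, assuming a cheap greedy pass for the micro-clusters and a k-means-style iterative relocation for the macro phase; the thresholds, function names, and data are illustrative.

```python
# Two-phase sketch: (1) cheap greedy micro-clustering,
# (2) iterative relocation (k-means style) on micro-cluster centroids.
from math import dist

def centroid(pts):
    return tuple(sum(x) / len(pts) for x in zip(*pts))

def micro_clusters(points, threshold):
    """Phase 1: a point joins the first micro-cluster whose centroid
    lies within `threshold`, otherwise it starts a new micro-cluster."""
    groups = []
    for p in points:
        for g in groups:
            if dist(p, centroid(g)) <= threshold:
                g.append(p)
                break
        else:
            groups.append([p])
    return groups

def macro_clusters(centroids, k, rounds=10):
    """Phase 2: iterative relocation of k means over the centroids."""
    means = list(centroids[:k])                 # naive initialisation
    for _ in range(rounds):
        buckets = [[] for _ in range(k)]
        for c in centroids:
            i = min(range(k), key=lambda i: dist(c, means[i]))
            buckets[i].append(c)                # assign to nearest mean
        means = [centroid(b) if b else means[i] for i, b in enumerate(buckets)]
    return means

pts = [(0, 0), (0.5, 0), (10, 10), (10, 10.5), (0.2, 0.1), (9.8, 10.1)]
micro = micro_clusters(pts, threshold=2.0)
macro = macro_clusters([centroid(g) for g in micro], k=2)
```

The design point is that the expensive relocation phase runs only on the (few) micro-cluster summaries rather than on every data object.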
2. Related Work
Chris Ding and Xiaofeng He introduced the merging and splitting process in the
hierarchical clustering method. They provide a comprehensive analysis of selection
methods and propose several new methods that determine how best to select the next
cluster for a split or merge operation. The authors performed extensive clustering
experiments to test eight selection methods, and found that average similarity is the
best method in divisive clustering and Min-Max linkage is the best in agglomerative
clustering. Cluster balance was a key factor in achieving good performance. They
also introduced the concepts of objective function saturation and clustering target
distance to effectively assess the quality of clustering [3].
Marjan Kuchaki Rafsanjani et al. give an overview of some specific hierarchical
clustering algorithms. The authors first classified clustering algorithms and then
focused mainly on hierarchical clustering algorithms. One of the main purposes of
describing these algorithms was to minimize disk I/O operations, consequently
reducing time complexity. They also stated the attributes, advantages and
disadvantages of all the considered algorithms. Finally, all of them were compared
according to their similarities and differences [4].
Tian Zhang et al. proposed an agglomerative hierarchical clustering
method named BIRCH (Balanced Iterative Reducing and Clustering using
Hierarchies), and verified that it is especially suitable for large databases [5]. CURE,
by contrast, represents each cluster by a certain fixed number of points that are
generated by selecting well-scattered points from the cluster and then shrinking them
toward the centre of the cluster by a specified fraction [8].
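The representative-point scheme just described can be illustrated with a short sketch. This is a hedged approximation, not the actual CURE implementation: the farthest-point heuristic for picking well-scattered points, the parameter names, and the data are illustrative assumptions.

```python
# Sketch of the representative-point scheme: pick well-scattered points
# from a cluster, then shrink them toward the cluster centre by alpha.
from math import dist

def representatives(cluster, num_points=4, alpha=0.5):
    centre = tuple(sum(x) / len(cluster) for x in zip(*cluster))
    # well-scattered points: start from the point farthest from the
    # centre, then repeatedly add the point farthest from those chosen
    scattered = [max(cluster, key=lambda p: dist(p, centre))]
    while len(scattered) < min(num_points, len(cluster)):
        scattered.append(
            max(cluster, key=lambda p: min(dist(p, s) for s in scattered))
        )
    # shrink each scattered point toward the centre by fraction alpha
    return [
        tuple(c + alpha * (m - c) for c, m in zip(p, centre))
        for p in scattered
    ]

reps = representatives([(0, 0), (4, 0), (0, 4), (4, 4), (2, 2)],
                       num_points=2, alpha=0.5)
```

Shrinking the scattered points dampens the effect of outliers, while using several points per cluster (rather than a single centroid) lets non-spherical cluster shapes be captured.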
The steps involved in clustering using ROCK are described in figure 2. In this
process, after drawing a random sample from the database, a hierarchical clustering
algorithm that employs links is applied to the sampled data points. Finally, the
clusters involving only the sample points are used to assign the remaining data points
on disk to the appropriate clusters.
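The "links" that this hierarchical step employs can be sketched briefly: two points are neighbours if their similarity exceeds a threshold, and the link count of a pair is the number of neighbours they share. Jaccard similarity over sets is a common choice for categorical data; the threshold and the sample data below are illustrative assumptions, not values from the paper.

```python
# Sketch of ROCK-style links: link(p, q) = number of common neighbours.
def jaccard(a, b):
    """Similarity of two sets of categorical attributes."""
    return len(a & b) / len(a | b)

def neighbours(points, theta):
    """For each point, the set of other points with similarity >= theta."""
    return {
        i: {j for j in range(len(points))
            if j != i and jaccard(points[i], points[j]) >= theta}
        for i in range(len(points))
    }

def links(points, theta):
    """Common-neighbour counts for every pair of points."""
    nbr = neighbours(points, theta)
    return {
        (i, j): len(nbr[i] & nbr[j])
        for i in range(len(points)) for j in range(i + 1, len(points))
    }

baskets = [{"a", "b"}, {"a", "b", "c"}, {"a", "c"}, {"x", "y"}]
link_counts = links(baskets, theta=0.4)
```

Note that points 0 and 2 are not direct neighbours here, yet they share a common neighbour (point 1), so they still obtain a positive link count; this global view is what makes links more robust than pairwise similarity alone.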
3.6 Leaders–Subleaders
Leaders–Subleaders is an efficient hierarchical clustering algorithm that is suitable for
large data sets. In order to generate a hierarchical structure for finding the subgroups
or sub-clusters, incremental clustering principles are used within each cluster.
Leaders–Subleaders is an extension of the leader algorithm, an incremental algorithm
in which L leaders, each representing a cluster, are generated using a suitable
threshold value. There are two major features of
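The leader step described above can be sketched in a few lines: a single pass over the data in which each point joins the first leader within the distance threshold, or otherwise becomes a new leader. The threshold value and the sample points are illustrative assumptions.

```python
# Minimal sketch of the incremental leader algorithm.
from math import dist

def leader_clustering(points, threshold):
    leaders = {}                     # leader point -> its cluster members
    for p in points:
        for l in leaders:
            if dist(p, l) <= threshold:
                leaders[l].append(p)     # p follows an existing leader
                break
        else:
            leaders[p] = [p]             # p becomes a new leader
    return leaders

clusters = leader_clustering([(0, 0), (0.5, 0.5), (9, 9), (8.5, 9)],
                             threshold=2.0)
```

Leaders–Subleaders then applies the same one-pass idea again inside each leader's cluster, with a smaller threshold, to obtain the subleaders that form the second level of the hierarchy.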
4. Conclusion
This paper presents an overview of improved hierarchical clustering algorithms.
Hierarchical clustering is a method of cluster analysis which seeks to build a
hierarchy of clusters. The quality of a pure hierarchical clustering method suffers
from its inability to make adjustments once a merge or split decision has been
executed. Such a decision, if not well chosen at some step, may lead to somewhat
low-quality clusters. One promising direction for improving the clustering quality of
hierarchical methods is to integrate hierarchical clustering with other techniques for
multiple-phase clustering. These types of modified algorithms have been discussed in
detail in this paper.
References
[1] Pavel Berkhin (2000), Survey of Clustering Data Mining Techniques, Accrue
Software, Inc.
[2] Jiawei Han and Micheline Kamber (2006), Data Mining: Concepts and
Techniques, Morgan Kaufmann/Elsevier India.
[3] Chris Ding and Xiaofeng He (2002), Cluster Merging and Splitting in
Hierarchical Clustering Algorithms.
[4] Marjan Kuchaki Rafsanjani, Zahra Asghari Varzaneh, Nasibeh Emami
Chukanlo (2012), A Survey of Hierarchical Clustering Algorithms, The Journal
of Mathematics and Computer Science, 5(3), pp. 229-240.
[5] Tian Zhang, Raghu Ramakrishnan, Miron Livny (1996), BIRCH: An Efficient
Data Clustering Method for Large Databases, In Proc. of 1996 ACM-SIGMOD
International Conference on Management of Data, Montreal, Quebec.
[6] Sudipto Guha, Rajeev Rastogi, Kyuseok Shim (1998), CURE: An Efficient
Clustering Algorithm for Large Databases, In Proc. of 1998 ACM-SIGMOD Int.
Conference on Management of Data.
[8] G. Karypis, E.-H. Han and V. Kumar (1999), CHAMELEON: Hierarchical
Clustering Using Dynamic Modeling, IEEE Computer, 32, pp. 68-75.
[9] J.A.S. Almeida, L.M.S. Barbosa, A.A.C.C. Pais and S.J. Formosinho (2007),
Improving Hierarchical Cluster Analysis: A New Method with Outlier Detection
and Automatic Clustering, Chemometrics and Intelligent Laboratory Systems,
87, pp. 208-217.
[10] L. Feng, M.-H. Qiu, Y.-X. Wang, Q.-L. Xiang, Y.-F. Yang and K. Liu (2010), A
Fast Divisive Clustering Algorithm Using an Improved Discrete Particle Swarm
Optimizer, Pattern Recognition Letters, 31, pp. 1216-1225.