CLIQUE is a bottom-up clustering algorithm for high-dimensional data that partitions dimensions into subspaces to efficiently identify meaningful clusters. It uses a grid structure to evaluate density and forms clusters by connecting dense units, applying the a priori principle for density determination. While it automatically identifies subspaces and scales well, its reliance on global thresholds and grid size can limit flexibility and make results less intuitive.
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0 ratings0% found this document useful (0 votes)
7 views2 pages
Overview of CLIQUE Algorithm
CLIQUE is a bottom-up clustering algorithm for high-dimensional data that partitions dimensions into subspaces to efficiently identify meaningful clusters. It uses a grid structure to evaluate density and forms clusters by connecting dense units, applying the a priori principle for density determination. While it automatically identifies subspaces and scales well, its reliance on global thresholds and grid size can limit flexibility and make results less intuitive.
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 2
Overview of CLIQUE Algorithm
CLIQUE is a bottom-up algorithm designed for clustering high-dimensional data
by dividing dimensions into subspaces, leveraging the a priori principle to prune potential clusters efficiently. The algorithm aims to identify meaningful clusters within subspaces rather than considering all dimensions, which can be challenging. Subspace Division and Density Identification CLIQUE partitions each dimension into intervals, creating a grid structure that allows for the exploration of single-dimensional subspaces and their combinations. It identifies dense areas (crowded) and sparse areas (noise) within these subspaces, marking the latter as outliers. Steps in the CLIQUE Algorithm Partitioning: Each dimension is divided into non-overlapping rectangular units, allowing for the counting of data points within each unit. Density Evaluation: A unit is considered dense if the number of data points exceeds a user-defined thresholds. Cluster Formation: Dense units are connected to form clusters, with the algorithm recursively merging higher-dimensional spaces. A Priori Principle Application The a priori principle states that if a k-dimensional unit is dense, all its projections into lower dimensions must also be dense. This principle aids in efficiently determining the density of candidate units. Strengths and Weaknesses of CLIQUE Strengths: Automatically identifies subspaces where data clusters exist. Insensitive to the order of input records and does not require prior knowledge of data distribution. Scales well and can find overlapping clusters. Weaknesses: The algorithm's rigidity is due to the requirement of a global density threshold and grid size, which may limit its flexibility. Clustering results can be less intuitive, especially with large datasets, making it difficult to visualize the overall data grouping. Conclusion CLIQUE is a powerful tool for clustering in high-dimensional spaces, but users must be aware of its limitations and the need for careful parameter selection to ensure meaningful results.