9 Fuzzy Clustering
Content
● Introduction
● FCM: Input, output and restriction
● Fuzzy C-Means Algorithm
● Demonstration and Example (SKFuzzy)
@LimCK 2 / 32
Introduction
● Clustering problem: grouping a set of objects so that objects in the same group are more similar to one another than to objects in other groups.
Introduction
● Applications of clustering:
a) Data analysis b) Image segmentation, etc.
Introduction
● K-Means – hard boundaries: each object belongs to exactly one cluster.
● Fuzzy C-Means (FCM) – soft boundaries: each object belongs to every cluster with some weight (membership degree).
FCM: Input, output and restriction
● Input: an unlabeled data set of objects
X = { x1, x2, …, xj, …, xN }
where N is the number of objects and each xj is a p-dimensional real vector.
● We must also specify the number of clusters, C.
FCM: Input, output and restriction
● Output:
– A fuzzy C-partition of X, visualized as a matrix U of dimension (C × N):
U = [ uij ]
– where 1 ≤ i ≤ C indexes the clusters,
– 1 ≤ j ≤ N indexes the objects,
– and uij is the membership degree of object j in cluster i.
– Some formulations also include the vectors V = { v1, v2, …, vC } that represent the cluster centers.
FCM: Input, output and restriction
● Since uij is a membership degree, 0 ≤ uij ≤ 1.
● The membership degrees of an object across all clusters must sum to 1: Σi uij = 1 for every object j.
Example of output, U (C = 3 clusters, N = 10 objects; each column sums to 1)
0.2 0.5 0.7 0.2 0.1 0.6 0.8 0.4 0.2 0.1
0.7 0.2 0.1 0.8 0.8 0.2 0.1 0.6 0.8 0.8
0.1 0.3 0.2 0.0 0.1 0.2 0.1 0.0 0.0 0.1
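The two restrictions can be checked directly on the example matrix above; a minimal NumPy sketch:

```python
import numpy as np

# The example C-partition U from the slide: 3 clusters x 10 objects.
U = np.array([
    [0.2, 0.5, 0.7, 0.2, 0.1, 0.6, 0.8, 0.4, 0.2, 0.1],
    [0.7, 0.2, 0.1, 0.8, 0.8, 0.2, 0.1, 0.6, 0.8, 0.8],
    [0.1, 0.3, 0.2, 0.0, 0.1, 0.2, 0.1, 0.0, 0.0, 0.1],
])

# Restriction 1: every membership degree lies in [0, 1].
assert ((U >= 0) & (U <= 1)).all()

# Restriction 2: the memberships of each object (each column) sum to 1.
assert np.allclose(U.sum(axis=0), 1.0)
```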
FCM Algorithm
● Minimize the cost function (for a fuzzifier m > 1):
Jm(U, V) = Σ_{i=1..C} Σ_{j=1..N} (uij)^m · ‖xj − vi‖²
● The algorithm alternates between updating the membership degrees U and the cluster centers V until a stopping criterion is met.
● Stopping criteria:
1) the cluster centers no longer change
2) the change in the cost function is below a specified threshold
3) the absolute change in every uij is below a given threshold
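The alternating updates above can be sketched in a few lines of NumPy. This is a minimal illustration, not skfuzzy's implementation; the function name `fcm` and the defaults are ours:

```python
import numpy as np

def fcm(X, C, m=2.0, tol=1e-5, max_iter=100, seed=0):
    """Minimal FCM sketch: X is (N, p); returns centers (C, p) and U (C, N)."""
    rng = np.random.default_rng(seed)
    U = rng.random((C, X.shape[0]))
    U /= U.sum(axis=0)                                    # columns sum to 1
    for _ in range(max_iter):
        W = U ** m                                        # fuzzified weights
        centers = (W @ X) / W.sum(axis=1, keepdims=True)  # weighted means
        d = np.linalg.norm(X[None, :, :] - centers[:, None, :], axis=2)
        d = np.fmax(d, 1e-12)                             # avoid division by zero
        U_new = d ** (-2.0 / (m - 1.0))                   # membership update
        U_new /= U_new.sum(axis=0)                        # renormalize columns
        if np.abs(U_new - U).max() < tol:                 # stopping criterion 3
            U = U_new
            break
        U = U_new
    return centers, U
```

On two well-separated 1-D blobs (e.g. points near 0 and near 5), the centers converge to roughly the blob means and each column of U sums to 1.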
Computing Membership Degrees
● The membership degrees are given by:
uij = 1 / Σ_{k=1..C} ( ‖xj − vi‖ / ‖xj − vk‖ )^( 2/(m−1) )
● The distance from an object to each cluster center is computed; closer centers receive larger membership degrees.
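The update formula above reduces to "take each distance to the power −2/(m−1), then normalize each column"; a small sketch (the function name is ours):

```python
import numpy as np

def membership_degrees(X, centers, m=2.0):
    """U for data X (N, p) and centers (C, p); each column of U sums to 1."""
    d = np.linalg.norm(X[None, :, :] - centers[:, None, :], axis=2)  # (C, N)
    d = np.fmax(d, 1e-12)                  # guard objects that sit on a center
    inv = d ** (-2.0 / (m - 1.0))          # closer centers get larger weights
    return inv / inv.sum(axis=0)

# One object at x = 1 with centers at 0 and 3: distances are 1 and 2, so with
# m = 2 the memberships are proportional to 1/1^2 and 1/2^2, i.e. 0.8 and 0.2.
U = membership_degrees(np.array([[1.0]]), np.array([[0.0], [3.0]]))
```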
Computing the cluster centers
● For a cluster i, the corresponding cluster center vi is defined as:
vi = Σ_{j=1..N} (uij)^m xj / Σ_{j=1..N} (uij)^m
● All points are considered, and the contribution of each point to the cluster center is weighted by its membership degree (raised to the power m).
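The center update is a weighted mean over all points; a minimal sketch (function name ours):

```python
import numpy as np

def cluster_centers(X, U, m=2.0):
    """Centers from data X (N, p) and partition U (C, N); returns (C, p)."""
    W = U ** m                                    # fuzzified membership weights
    return (W @ X) / W.sum(axis=1, keepdims=True)

# Two 1-D points at 0 and 4; cluster 1 weights them 0.9 and 0.1, so its center
# (0.9^2*0 + 0.1^2*4) / (0.9^2 + 0.1^2) stays close to 0, and symmetrically
# cluster 2's center stays close to 4.
X = np.array([[0.0], [4.0]])
U = np.array([[0.9, 0.1], [0.1, 0.9]])
centers = cluster_centers(X, U)
```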
Effect of parameter m
● If m > 2, the exponent 1/(m−1) decreases the weight assigned to clusters that are close to the point.
● If m → ∞, the exponent → 0, so all weights → 1/C: every object belongs equally to every cluster.
● If m → 1, the exponent increases the membership degree of points to which the cluster is close.
As m → 1, the membership degree → 1 for the closest cluster and → 0 for all the other clusters (this corresponds to k-means).
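These three regimes can be seen numerically by varying m for a single object at fixed distances from C = 3 centers (the helper name and distances are illustrative):

```python
import numpy as np

def memberships(dists, m):
    """Memberships of one object, given its distances to each center."""
    inv = np.asarray(dists, dtype=float) ** (-2.0 / (m - 1.0))
    return inv / inv.sum()

d = [1.0, 2.0, 4.0]             # distances from one object to C = 3 centers
u_sharp = memberships(d, 1.1)   # m near 1: winner-take-all, like k-means
u_mid   = memberships(d, 2.0)   # typical choice, e.g. [0.762, 0.190, 0.048]
u_flat  = memberships(d, 20.0)  # large m: memberships approach 1/C
```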
Fuzzy C Means - demonstration
● Simple 1-D case, with 20 data points and 3 clusters.
● Set m = 2. Stop if the difference between steps is < 0.3.
FCM with skfuzzy (Clustering)
● Syntax:
cntr, u, u0, d, jm, p, fpc = fuzz.cmeans(data, c, m, error, maxiter, init=None, seed=None)
or
result = fuzz.cmeans(data, c, m, error, maxiter, init=None, seed=None)
● Inputs:
data – 2D array with size (S, N) holding the data to be clustered. S: features, N: instances
c – integer, the desired number of clusters
m – float; controls the fuzziness of the clustering, typical value 2
error – float; stopping criterion: the algorithm stops once the change between iterations is < error
FCM with skfuzzy (Clustering)
● Inputs (continued):
maxiter – integer; maximum number of iterations allowed
init – 2D array with size (C, N); initial fuzzy c-partition matrix. If none is provided, the algorithm is initialized randomly.
seed – integer; if provided, sets the random seed for init. It has no effect if init is provided; mainly for debugging/testing purposes.
FCM with skfuzzy (Clustering)
● Outputs:
cntr – 2D array of size (C, S) holding the C cluster centers
u – 2D array of size (C, N), the final fuzzy c-partition matrix
u0 – 2D array of size (C, N), the initial guess at u
d – 2D array of size (C, N), the final Euclidean distance matrix
FCM with skfuzzy (Clustering)
● Outputs (continued):
jm – 1D array recording the history of objective-function values
p – number of iterations run
fpc – final fuzzy partition coefficient
result – the tuple of everything from cntr to fpc
FCM with skfuzzy (Prediction)
● Syntax:
u, u0, d, jm, p, fpc = fuzz.cmeans_predict(test_data, cntr_trained, m, error, maxiter, init=None, seed=None)
● test_data is the new data in (S, N) form, and cntr_trained is the array of cluster centers obtained from fuzz.cmeans.
Demo: Iris dataset with 4 features
● Iris dataset – contains 3 classes of iris plant (Setosa, Versicolour, Virginica)
● 50 instances for each class
● One class is linearly separable from the other 2; the latter are NOT linearly separable from each other.
● 4 features are recorded for each instance: sepal length, sepal width, petal length and petal width.
● For better visualization, only 2 classes (100 instances in total) are used for the demo.
Demo: Iris dataset with 4 features
Average (cm) Sepal length Sepal width Petal length Petal width
setosa 5.006 3.418 1.464 0.244
versicolor 5.936 2.77 4.26 1.326
virginica 6.588 2.974 5.552 2.026
Demo: Iris dataset with 4 features
● Data: “iris2b.csv” with dimension 100X2
Demo: Iris dataset with 4 features
● Data: “iris2b.csv” with dimension 100X2
● Command:
cntr, u, u0, d, jm, p, fpc = fuzz.cmeans(data, c=2, m=2, error=0.00001, maxiter=100, init=None, seed=None)
● Output:
cntr
array([[5.97571191, 2.79326814, 4.30582601, 1.33911579],
       [5.00456842, 3.40232882, 1.48743359, 0.25304421]])
Demo: Iris dataset with 4 features
● Other outputs of the same command:
u.shape
(2, 100)
p
11
fpc
0.9236757239275252
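The fpc value of about 0.92 can be read with the standard (Bezdek) partition coefficient, FPC = (1/N) Σi Σj uij²: it is 1 for a crisp partition and 1/C for a completely uniform one, so values near 1 indicate well-separated clusters. A minimal sketch (the function name is ours, not skfuzzy's):

```python
import numpy as np

def fuzzy_partition_coefficient(U):
    """FPC = (1/N) * sum of squared memberships; ranges from 1/C to 1."""
    return float((U ** 2).sum() / U.shape[1])

U_crisp = np.array([[1.0, 0.0], [0.0, 1.0]])   # hard partition   -> FPC = 1
U_flat  = np.array([[0.5, 0.5], [0.5, 0.5]])   # uniform, C = 2   -> FPC = 0.5
```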
Demo: Iris dataset with 4 features
● History of the objective function for the same command:
jm
array([191.57127093, 150.30291193, 139.24608917,  75.65029622,
        41.43916205,  40.88894263,  40.88085657,  40.88063667,
        40.88063043,  40.88063026,  40.88063025])
Demo: Iris dataset with 4 features
● Data: “iris2b.csv” with dimension 100X2
● Suppose we have the cluster centers only; we can still assign the data to clusters by performing prediction.
Demo: Iris dataset with 4 features
● Command (Prediction):
new_u, new_u0, new_d, new_jm, new_p, new_fpc = fuzz.cluster.cmeans_predict(data, cntr, m=2, error=0.00001, maxiter=30)
Thank you