K-Means and ISODATA Clustering Algorithms For Landcover Classification Using Remote Sensing
K-Means and ISODATA Clustering Algorithms For Landcover Classification Using Remote Sensing
K-Means and ISODATA Clustering Algorithms for Landcover Classification Using Remote Sensing
Abstract-The aim of thisexploration work is to analyze the presentation ofunsupervised classification algorithms ISODATA(Iterative
Self-Organizing Data Analysis Technique Algorithm)andK-Means in remote sensing, to evaluate statistically by iterative techniques to
automatically group pixels of similar spectral features into unique clusters. This investigation used SUPARCO(Space and Upper
Atmosphere Research Commission (Pakistan)) obtained remotely sensed patch of Abbottabad Pakistan. The test patch of Abbottabad is
divided into Five bands i.e. NDVI (Normalized Difference Vegetation Index), green, near infrared, far infrared, and green. The ROIs
(regions of interest) selected for classification of Land Cover data comprises five different types of classes i.e. water bodies, agriculture,
settled area, forest and barren land. In this research of remote sensing the first step was to preprocess Abbottabad test patch by filtering,
to improve performance of classification andneighboring pixels homogeneity. The next step was to assess the accuracy of Two pixel
based unsupervised classifiers i.e. ISODATA and k-means on the said test patch. Finally, the mentioned classifiers performance is
evaluated by varying their different parameters to categorize the effect of the clustering algorithms and their class statisticson whole
classification outcomes.
Keywords: K-Means; ISODATA; Clustering Algorithms
these are agriculture, forest, settled area, barren and Implementation of Unsupervised Classification By
water (Razaet al., 2012).The geographical position of K-Mean And ISODATA Method
Abbottabad region and its land patterns are shown in Currently various clustering algorithms are
“Fig.1 (a) and (b)”. generally deployed in remote sensing. The two well-
known are the K-Means and the ISODATA
This paper illustrates the performance analysis of
unsupervised classification algorithms.
K-Means and ISODATA Clustering Algorithms for
Landcover Classification. The continuing paper is These algorithms are iterative in nature. Firstly
structured into three sections. Section 2 illustrates select the arbitrary starting values which show properties
Methodology. Performance analysis of results and of cluster and effect result of classification.
discussions are elaborated in section 3while section 4
has presented conclusion and future work. Generally in both approaches first step is assignment
of arbitrary initial values to cluster. Secondly classify
each pixel to the nearby cluster. To calculate cluster
mean of all pixels in one cluster is the third step. The
repetition of 2nd and 3rd steps continues until the
"change" between the iteration is small. The "change"
can be considered in 2 ways either by the percentage of
change of pixels from one iteration to another or by
calculating the change of distances for the mean cluster
vector between iterations.
(a)(b)
In addition, for improvement the ISODATA
FOREST AGRICULTUR WATER BARREN SETTELED consists of splitting and merging of clusters.
Fig1. (a)Abbottabad region geography(b)Land use pattern of The criteria for merging the clusters are based on
Abbottabad certain threshold if the distance between the centers
2. METHODOLOGY of 2 clusters is less than that or if the number of
Pattern recognition approaches are commonly pixels in one cluster isfewer than that limit, clusters
deployed to recognize the underlying patterns in would be merged.
remotely sensed data (Websource). In this research the The condition for splitting of clusters into 2 is
analysis started from data acquisition, pre-processing, satisfied if the cluster standard deviation increased
then unsupervised classification by K-mean and than a predefined value and the number of pixels is 2
ISODATA method was carried out, with final times the threshold for the minimum number of
processing in post-classification and accuracy pixels(Tou and Gonzalez, 2012 ).
assessment has been discussed in next section using
The K-Means Algorithm
ENVI 5.0.
Feature extraction is the most important step of
Data Acquisition any recognition system. The purpose of feature
Datasets for RS and clustering can be from a wide extraction is to take the important characteristics of the
range of sources like satellite sensor data, ground based image and classify the overall image using this small set
sensor data, general data of weather, energy systems and of information. The selection of features directly effects
so on. In this work dataset is the test patch of the classification operation. Good features results in a
Abbottabad region KPK Pakistan acquired from satellite higher success rate in the process of recognition and
of SUPARCO. Once the dataset is acquired it is vice versa. In this paper, two types of features have been
preprocessed, so that it is suitable for subsequent sub- extracted.
processes. The goal of k-means is to reduce the variability
within the cluster. The summation of squares distances
Pre-processing
termed as errors, between each pixel and its assigned
In this step using software ENVI 5.0, to obtain true
cluster center is minimized and declared as objective
color image load test patch in RGB color with sequence
function
of green, red and near infrared bands. The test patch
SS ∑∀ x C x 1
has5bandsi.e. red, green, far infrared, near infrared and
Wherein C(x) pixel x is assigned is to the mean of
one added band Normalized Difference Vegetation
the cluster.
Index (NDVI). The true color image is pre-processed by
filtering from convolutions and morphology using Mean Squared Error (MSE) is a measure of the
median filter. The filtered image is divided into 5types within cluster variability and represented as.
of lands i.e. agriculture, forest, settled area, barren land ∑∀
and water bodies using ROI tool. MSE 2
K-Means and ISODATA Clustering Algorithms……….. 317
4. CONCLUSION
In Toolbox As ISODATA does not keep
goto Classification, then
Unsupervised
Perform optional spatial
and spectral subsetting,
a fixed number of classes,
give values in the minimum
In a nutshell it has been concluded that
Classification and then
ISODATA
or/and masking, then
click OK. The ISODATA
and maximum Number Of
Classes, for spliting and
unsupervised classification of land pattern with K-
Parameters dialog
Classification. Input
File dialog is prompted.
appears.
merging classes based on
input thresholds Means and ISODATA by varying the parameter values
for number of iterations, number of classes and change
threshold yields a different degree of accuracy and each
In proposed fields Minimum # Pixels in
justify its performance according to literature.
For splitting, enter
give values for
Maximum
Class to its field is inserted for
merging purpose. If pixels are
the Maximum Class Stdv to
its field. If the standard
In addition for post classification of class statistics,
Iterations and the
fewer than the minimum
number, class will be deleted
deviation of a class is larger
than this threshold then the
class 1(red) has maximum average.
Change Threshold and the pixels placed in the
class is split into two classes.
(0-100%). class(es) nearest to them.
REFERENCES:
Jain., A. K. (2010) “Data clustering: 50 years beyond
Finally press OK. For each iteration of
classifier, status bar cycles from 0 to 100%. K-Means,” Journal of Pattern Recognition, Vol. 31,
Resultant output is added to the Layer
Manager 651-666, Elsevier Science Inc. New York, NY, US.
Fig 3. ISODATA implementation work flow
Kamkhet.T, (2012)“Analysis of thaichote band
characteristics using unsupervised pixel-based
classification,” 33rdAsian Conference on Remote
Sensing (ACRS), Pattaya,Thailand, Vol. 1. 3:395- 401.