Digital Image Classification
Digital Image Classification
Digital Image
Classification
Image Space
The concept here is to display the data samples in relation to one another
in a geometric sense thus providing a picture of the ground scene to the
viewer.
The key use of imagery in multispectral processing is to facilitate the
analyst associate multispectral data points (pixels) with specific locations
(points) in the ground scene.
In the analysis process it is useful in labelling pixels in the data as training
samples.
A digital image is a 2-D array of elements. In each element, the energy
reflected or emitted from the corresponding area on the earth‘s surface is
stored.
The spatial arrangement of the measurements defines the image or image
space.
Depending on the sensor, data are recorded in n-bands and digital image
elements are usually stored as 8-bit DN-values (range: 0-255).
9.4. Principle of Image Classification(3/10)
Spectral Space
The idea here is if response versus wavelength conveys needed
information by which to identify the contents of an individual pixel, this
then would provide a fundamental simplicity that is important from a
processing point of view.
Each pixel would be labelled individually resulting in much better result
compared to a label associated with a neighbourhood of pixels.
Response as a function of wavelength has useful characteristics, since it
provides the analyst with spectral information that is directly interpretable.
Practically, this is not always feasible due to the fact that the earth
surface cover types tend to vary in a characteristic way,
– E.g. spectral response of a maize field is not uniform for all pixels in the field
but varies in a characteristic way about some mean value.
9.4. Principle of Image Classification(5/10)
Feature Space
For one pixel, the values in e.g. two bands can be regarded as components
of a 2-D vector, the feature vector. E.g. a feature vector of (13,55) implies
that 13DN and 55DN values are stored for band 1 and band 2 respectively.
Similarly, this can be extended for a 3band situation.
A graph showing the values of the feature vectors is called feature space or
feature space plot.
NB. Plotting for a four or more dimensional cases is difficult. A practical
solution is to plot all possible combinations of two bands separately, e.g. for
4 bands six combinations namely 1&2; 1&3; 1&4; 2&3; 2&4; 3&4.
Plotting the combinations of the values of all the pixels of one image yields
a large cluster of points referred to as a scatter plot.
9.4. Principle of Image Classification(7/10)
Definition of Clusters(Training)
The operator needs to decide how the pixels based on their DN-Values
are assigned to the classes. The choice will depend on a number of
criteria namely:-
– Purpose of the classification
– Characteristics of the image data and training data
About ClusteringAlgorithm
The operator defines the maximum number of clusters in adata.
On the basis of this, arbitrary mean vectors are located by the computer as the centre points of the
clusters.
Each pixel is then assigned to a cluster by the minimum distance to cluster centroid decision rule.
Once all the pixels have been labelled, recalculation of the cluster centre takes place and the
process is repeated until the proper centres are found and the pixels labelled accordingly.
Iteration stops when the cluster centres do not change anymore. Note at any iteration, any clusters
with less than the specific number of pixels are eliminated.
After clustering, analysis is done of the closeness or separability of the clusters on the basis of
intercluster distance or divergence measure. Merging of clusters is done to reduce the number of
unnecessary subdivisions in the data set. The operator canuse:-
– Maximum number of clusters
– Distance between two cluster centres
– Radius of a cluster
– Minimum number of pixels
As a threshold for cluster elimination.
9.6. Statistical Approaches (5/17)
Hierarchical ClusteringAlgorithm
9.6. Statistical Approaches (6/17)
Box Classifier
9.6. Statistical Approaches (10/17)
Not only considers the cluster centre but also its shape, size and
orientation.
This is achieved by computing a statistical distance based on the mean
values and covariance matrix of the clusters.
The statistical distance is a probability value that tells us the probability of
an observation x belongs to a specific cluster.
A pixel is assigned to the cluster to which it has the highest probability.
The assumption of maximum likelihood is that the statistics of the clusters
have a normal or gaussian distribution.
9.6. Statistical Approaches (14/17)
F F F .....................F 1
i 1 2 N
i1