Machinelearning Paper
Machinelearning Paper
net/publication/325107675
CITATIONS READS
21 8,339
3 authors, including:
Rafega Beham
16 PUBLICATIONS 45 CITATIONS
SEE PROFILE
All content following this page was uploaded by Rafega Beham on 12 May 2018.
boundary energy method. Based on the measured data and For more accuracy and clarity in the image its edges and
extracted features classifiers to assign classes with the contrast needs to be enhanced. Colour map of the image are
segments are built. This paper makes use of Decision trees, used for enhancing edges and contrast.
ANN and SVM. Cone Penetration Testing (CPT)
C. Segmentation
methodology is used for classifying sub-surface soil. Feature
extraction is performed using Boundary Energy Data by Once the enhancement of the image is completed in the
parameterising experts’ perception of the shape of the series previous stage using image pre-processing techniques
data. The overall classification scheme is based on CPT data. segmentation of the image is performed. Popular known
Soil Classification is done based on different algorithms for algorithm K-means Clustering algorithm is employed for
different soil types. In this paper a method to classify series segmentation of the image. It is used as a partition clustering
data where the constraint of contiguity has to be maintained which aims at partitioning a given data set into disjoint
is presented. subsets so that specific clustering criteria are optimized. The
most widely used criterion is the clustering error criterion
III. PROPOSED SYSTEM which for every point computes its squared distance from the
corresponding cluster centre and then takes the sum of these
A. Image Acquisition distances for all points in the data set. The centre of the cluster
Different images of soil samples which are to be classified are is picked and each pixel of the image is assigned to the
captured using color camera and are provided as an input to cluster. Re-computation of cluster centre is done by using the
the system. The features of each type of soil are collected and average of all the pixels. This iteration continues and the next
are stored in a separate database. This database is later used step is to take each point belonging to a given data set and
in the final stage for soil and crop detection. associate it to the nearest centroid.
K-means Algorithm is as follows-
1) Input: K, set of points x1........xn.
2) Place the centroids C1.........Ck at random locations.
3) Repeat until convergence:
For each point xi:
Fig. 1: Images of Soil Samples Find nearest centroid Cj.arg min D(xi,Cj)
Assign the point xi to cluster j
For each cluster j=1.....K:
1
Cj(a) = xi−> Cj ∑ xi(a) for each a = 1 … … d
nj
For each cluster j=1......K:
New centroid Cj= mean of all points xi assign to
cluster j in previous step.
4) Stop when none of the cluster assignments change.
D. Feature Extraction
After the segmentation of the image is performed by k-means
in the above phase our next step is the feature extraction stage.
Fig. 2: Flowchart of the proposed system
This is the foremost step in this methodology. All the features
B. Image Pre-processing that are required for us to classify the soil type and crop
The image acquired from our previous stage is not error free. detection are done in this phase. A number of features like the
The quality of the image is decisive for results of analysis as texture, colour, intensity, saturation, hue, etc are extracted for
it affects both the ability to detect features under analysis and detection of soil type. a filter known as Gabor Filter is
precision of subsequent measurement. In order to get an error implemented for feature extraction. Gabor Filter is a linear
free image pre-processing techniques are applied. This phase filter used for edge detection. Frequency and orientation
is also known as the enhancement of the image since the representations of Gabor filter are similar to those of human
image is enhanced by improving its contrast and removing visual system and they have been found to be particularly
errors to obtain a better quality image for our future appropriate for texture representation and discrimination. A
processes. The image contains errors like noise or artifacts set of frequencies and orientation representations may be
like scratches, lapping tracks, comet tails, etc which needs to helpful for extracting useful features from the image. Also
be eliminated before the further processes. Hence a filter other features like entropy, standard deviance, mean, etc can
called the Smoothing filter is made use to remove the noise be extracted using Gabor filter. The main and important
and artifacts from the image. There are two types of filters: feature of soil that is colour is needed to be extracted. Hence
low pass filter and high pass filters. Smoothing filter is a low a measure called colour moments are used to differentiate
pass filter. It is used to remove high spatial frequency noise images based on their features of colour. These provide a
from a digital image. Smoothing filter employs a moving colour similarity between images which can be compared to
window operator which affects one pixel of the image at a the values of images indexed in the data base for tasks like
time, changing its value by some function of a local region of image retrieval.
pixels. The operator moves over the image to affect all the E. SVM Classification
pixels in the image. Thus with many iterations the smoothing
Support Vector Machine (SVM) algorithm is used for soil
filter gradually enhances the image by removing the errors.
classification. It has successful applications in many fields
like bioinformatics, text, image recognition, etc. SVM is a Used in Agricultural WSN”, Conf: on Information
universally accepted algorithm due to its simple nature. It is Technology and artificial Intelligence, Shanghai, China,
considered as an alternative to neural networks algorithm. pp.432-436.
The working nature is explained as follows: Given a set of [2] Kumar. R, Singh.M.P, Kumar.P, Singh.J.P, “Crop
training examples, each marked for belonging to one of two Selection Method to Maximize Crop Yield Rate using
categories, an SVM training algorithm builds a model that Machine Learning Technique”, (2015), Conf: on
assigns new examples into one category or the other, making ICSTM, pp.138-145.
it a non-probabilistic binary linear classifier. An SVM model [3] B. Bhattacharya, and D.P. Solomatine, "An algorithm for
is a representation of the examples as points in space, mapped clustering and classification of series data with constraint
so that the examples of the separate categories are divided by of contiguity", Proc. 3T"d nt. Conf: on Hybrid and
a clear gap that is as wide as possible. New examples are then Intelligent Systems, Melboume, Australia, 2003, pp.
mapped into that same space and predicted to belong to a 489-498.
category based on which side of the gap they fall on. SVM is [4] Soumya sree, Asst. Prof. A Rafega Beham, (2015),
effective to analyzing the separating planes and to identify the “Automated Diagnosis of Retina Images for Diabetic
largest margin so that the support to the data points will be Patients based on BP and SVM”, proceedings of the
identified. So we want to learn the mapping: X->Y, where ‘x’ International Journal of Computer Science and Mobile
belongs to X is some object and ‘y’ belongs to Y is a class Computing, NHCE, Bangalore, pp.299-306.
label.
IV. CONCLUSION
The results from this study can be used for rapid identification
of soil types when they arrive in railcars at the terminal soil
elevators. Since this classification technique does not require
time consuming image processing routines such as Fourier
descriptors, it can readily be implemented using commercial
imaging libraries with Digital Signal Processing (DSP)
boards for real time operations. The work carried out has
relevance to real world classification of soils and it involves
both image processing and pattern recognition techniques.
SVM involves the sound theory first and then implementation
and experiments. A significant advantage of using SVM that
the solution to an SVM is global and unique, they have simple
geometric interpretations and gives sparse solution. The
computational complexity of SVMs does not depend on the
dimensionality of the input space and they are less prone to
overfitting.
REFERENCES
[1] Xiang Gao, Tancheng Lu, Peng Liu, Qiyong Lu,(2014),
“A Soil Moisture Classification Model Based on SVM