Image Segmentation NO ML
Image Segmentation NO ML
https://doi.org/10.1007/s11277-019-07021-6
Abstract
Nowadays content-based image retrieval (CBIR) framework is drawing in consideration of
numerous analysts because of far-reaching applications found in numerous territories. In
this paper, a new CBIR methodology is proposed and adequacy of any CBIR framework
relies upon the features extracted from a color picture. In this work, firstly find the region
of interest of the image using Sobel and Canny method and later on output is applied on
HSV color space, it is clear to human vision eye. For classification, neural network is used
and categorized the data with class labels. The similarity distance is estimated between the
query image and stored image with different similarity metrics like Manhattan distance,
Euclidean distance, Chebyshev, Hamming distance and Jaccard distance. The experimental
result is estimated on accuracy, precision. The experiment performed on two well-known
databases i.e.: Corel-1k and Corel-5k dataset and new methodology proves the better accu-
racy results up to 87.33% and 68.93% respectively and improves the precision results also
up to 86.36% and 68.47% respectively. In this paper, results are also extended up to 80%.
Keywords CBIR · Sobel edge detection · Canny edge detection · HSV histogram · NN
1 Introduction
CBIR is a strategy to find and furthermore record pictures in the enormous arrangement
of database dependent on its visual substance, for instance, the premise of surfaces, hues,
shapes or spatial designs in its place of applying labels or different illustrative meta-
data catchphrases that may connect with pictures in the database. In conventional CBIR
* Rohit Raja
[email protected]
Sandeep Kumar
[email protected]
Md Rashid Mahmood
[email protected]
1
CSE Department, Sreyas Institute of Engineering and Technology, Hyderabad, India
2
ECE Department, Sreyas Institute of Engineering and Technology, Hyderabad, India
3
ECE Department, Guru Nanak Institutions Technical Campus, Hyderabad, India
13
Vol.:(0123456789)
170 R. Raja et al.
2 Literature Survey
Kaur et al. [5] has presented the efficient approach through images are retrieve by enhanc-
ing the CBIR for feature extraction. By using these features, different classifier used i.e.
ANN, Bayes and Fuzzy Neural Network for classification and achieved good precision
results respectively 76%, 66% and 43%.
Varish et al. [6] has introduced a CBIR methodology based on color features compo-
nent. Author gave the similar importance to all three fundamental colors while evaluating
the images. Processing cost of proposed methodology is low as compared to others while
testing the standard datasets and achieved 84.0% results.
Zhao et al. [7] have presented a novel approach means Multi-Trend Structure Descrip-
tor (MTSD) for feature extraction. With the MTSD algorithm, both Low level features and
local spatial structure information are characterized easily. Authors evaluate this algorithm
on standard datasets and evaluated results demonstrate the satisfactory output i.e. 79.28%
and 62.98 respectively Corel-1k and Corel-5k datasets. Execution time of proposed meth-
odology for feature extraction is also satisfactory approximately 9.1615 s.
Zheng et al. [8] has introduced a fast image retrieval algorithm with a good efficiency.
This algorithm applied on standard datasets Corel-1k and Corel-5k respectively achieved
69.32% and 51.84% results while evaluation.
13
Color Object Detection Based Image Retrieval Using ROI… 171
Mikolajczyk et al. [9] has presented a interest region detector based descriptor approach.
While evaluation, extension of SIFT descriptor used and it shows the better performance
74.67% and 57.92% among the low dimensional descriptor on Corel-1k and Corel-5k data-
sets respectively.
Amjad et al. [10] has presented Convolution Neural Network (CNN) for feature extrac-
tion from pictures in CBIR framework. ED is utilized for the relationship among question
and put away pictures utilizing the removed highlights. Execution of the proposed work
is assessed utilizing exactness. This work demonstrates enhanced outcomes when con-
trasted with the current works and results achieved 95%, 97%, and 88% respectively Corel,
Caltech, and Li datasets.
Aasia et al. [11] has introduced SIFT for feature extraction, which essentially gives us
the key point in a picture. Filter picture feature calculation give a lot of pictures includes
that are not important so we utilize the improvement method BFOA (Bacteria foraging
optimization algorithm) to decrease the multifaceted nature, cost, vitality and Time utiliza-
tion. Finally proposed methodology improves the accuracy on animal, butterfly, facial and
flower images respectively 60%, 76.7%, 97%, and 98%.
Ekta et al. [12] have introduced a structure for picture recovery that applies different
nearby surface descriptors on Log-Gabor channels reaction. Tests have been directed on
standard Wang, VisTex, and OT-Scene databases and respectively recovery execution of
proposed structure percentage improved up to 9.06, 9.20 and 8.23.
Pradnya et al. [13] examined the idea and extent of substance IR framework. It likewise
incorporates the diagram of MPEG-7 edge histogram descriptor (EDH) to remove the sub-
stance from pictures.
Benavides et al. [14] displayed the CBIR method considering as descriptors the mean,
the standard deviation, and the homogeneity of every one of the few picture windows
exposed to examination, that is, every window goes about as a neighborhood picture dis-
trict exposed to the face investigation having a face focal point at its center.
3 Proposed Work
In this paper new methodology has been proposed with a combination of all three features
i.e. texture, color, and shape. The experiment performed on the two standard databases and
achieved better precision results. This algorithm consists of the following steps:
1. Image Acquisition First step of the new algorithm is to consider query image and stored
image as shown in Fig. 2.
2. Pre Processing In this step, resized the query image and stored image with 192 * 128
sizes.
3. ROI Segmentation In the present work for segmentation of region of interest different
edge detector method are used like Sobel and Canny as shown in Fig. 1.
4. Feature Extraction The HSV color space is progressively appropriate for human eye
recognition. The human eye can’t recognize an expansive number of hues in the mean-
time; shading quantization is utilized in this strategy.
13
172 R. Raja et al.
Query Image
Image Collection
Retrieved
Image
Stop
13
Color Object Detection Based Image Retrieval Using ROI… 173
is used for shape features and it identifies the object edges and image information. Convert
the RGB image into HSV color space. Apply the Canny edge detector to extract edges
of an image. Let the H and V edge detector to it and store the resulting matrix as H1 and
V1. Now apply Sobel edge detection on the H2 and V2 matrix. Now combine the edges
obtained the matrix H and V and after that combine with the unmodified S to get HSV
image. Now this HSV image reverts to RGB image. Then find the histogram of R, G, B
matrices of this image. These histograms i.e. HR, HG, and HB will contain 256 bins as
shown in Fig. 3.
The similarity distance is estimated between the query image and stored image with dif-
ferent similarity metrics i.e. Manhattan Distance (MD), Euclidean Distance (ED), Cheby-
shev, Hamming Distance (HD) and Jaccard Distance (JD).
DM = |x1 − x2| + |y1 − y2| (1)
√
DE = (x1 − x2)2 + (y1 − y2)2 (2)
DH = min{D(x, y) ∶ x, y ∈ C, x ≠ y} (4)
5. Classification In this step, read the stored database and query image features. There
are mainly two standard databases: Corel-1k and Corel-5k dataset. In Corel-1k data-
set: Africa, Beach, Monuments, Elephant, Horses, Building, Food, Flower, Mountain,
and Dinosaur etc. and Corel-5k: Churches, Polar Regions, domestic animals, Lions,
Tiger, Costumed People, desert, Models etc. The learning process is done using the NN
method. Training set includes all the images from the image database and query picture
contains the subtitle. At that point contrast input inquiry picture subtitle and each class
mark in the Corel database. The query picture will be classified with a specific class
mark or not without much of stretching. In the event that the information query picture
has a place with any of the class names in the database, presently we can choose just
that coordinated class pictures in the database and perform recovery activities on that
chose a specific piece of the database dependent on the similitude coordinating.
13
174 R. Raja et al.
13
Color Object Detection Based Image Retrieval Using ROI… 175
4 Result Analysis
The experiments were performed on Corel-1k and Corel-5k standard database to evaluate
the proposed methodology. On Corel-1k dataset, every class comprises 1000 pictures of
size 192 × 128 in JPG organize. On Corel-5k dataset, every class comprises 5000 pictures
of size 192 × 128 in JPG design. In proposed methodology, the quantity of coordinated pic-
tures is differing from 1 to 20. Utilizing Sobel and Canny mix, we discover the article
edges which were not recognized appropriately in past work. The proposed methodology
retrieve the similar kind of image and output is measured in the form of accuracy, recall,
execution time and precision.
13
176 R. Raja et al.
13
Color Object Detection Based Image Retrieval Using ROI… 177
Input Image
(RGB)
H S V
Y Cb Cr
Histogram
Hy Hcb Hcr
Feature Extraction
method, Fig. 25 Shows the ROC by Proposed Algorithm on the Corel-1k and Corel-5k
dataset. Figure 26 shows Confusion Matrix by Proposed Algorithm on the Corel-1k data-
sets amd Corel-5k dataset. Figure 27 represpents MTSD Accuracy on Different Dataset
and Figure 28 represents Quantization level of intensity is fixed to 20, Accuracy Com-
parison between MTSD and Proposed method. Average accuracy and precision on Num-
ber of retrieval Images is represented in Table 1. The dimension of the feature vector and
feature extraction time was represented in Table 2. Table 3 represents the average value of
13
178 R. Raja et al.
13
Color Object Detection Based Image Retrieval Using ROI… 179
13
180 R. Raja et al.
13
Color Object Detection Based Image Retrieval Using ROI… 181
Fig. 11 Retrieval of 20 images for african people by proposed algorithm on the Corel-1k dataset
Fig. 12 Retrieval of 20 images for the bus by proposed algorithm on the Corel-1k dataset
13
182 R. Raja et al.
Fig. 13 Retrieval of 20 images for playing cards by proposed algorithm on the Corel-1k dataset
Fig. 14 Retrieval of 20 images for royal cars by proposed algorithm on the Corel-5k dataset
13
Color Object Detection Based Image Retrieval Using ROI… 183
Fig. 15 Retrieval of 20 images for crockery by proposed algorithm on the Corel-5k dataset
Fig. 16 Retrieval of 20 images for agricultural vehicle proposed algorithm on the Corel-5k dataset
13
184 R. Raja et al.
Accuracy Comparision
90
80
70
60
Percentage
50
40 Corel-1k
30
Corel-5k
20
10
0
MTSD Proposed
Database
Precision Comparision
100
90
80
70
Percentage
60
50
Corel-1K
40
30 Corel-5K
20
10
0
MTSD Proposed
Database
200
150
Dimension
100 Time
50
0
MTSD Proposed
13
Color Object Detection Based Image Retrieval Using ROI… 185
Precision Comparision
100
80
Percentage
60
40 Corel-1k
Corel-5k
20
0
MTSD Proposed
Database
Fig. 20 Shows comparision of average precision value using a different method MTSD and proposed
method
Accuracy Comparision
100
90
80
70
Percentage
60
50
40 Corel-1K
30 Corel-5K
20
10
0
MTSD Proposed
Database
MTSD precision & propoed prescision and MTSD accuracy and proposed accuracy when
the quantization level of color is 108. Table 4 the average retrieval precision of Proposed
Method on different similarity metrics. Table 5 the average retrieval precision of proposed
method on different similarity metrics. Table 6 represents average precision on number of
retrieval Image.
13
186 R. Raja et al.
90
80
70
60 L1
Percentage
Euclidean
50
Chebyshev
40 Hamming
Jaccard
30
20
10
0
African Beach Monuments Elephant Flower
Fig. 22 Shows accuracy comparison between various distance method on Coreal-1k dataset
70
60
50
L1
Percentage
Euclidean
40
Chebyshev
30 Hamming
Jaccard
20
10
0
Traffic Signals Playing Cards Flag Agricultural Trees
Vehicle
Fig. 23 Shows accuracy comparison between various distance method on Coreal-5k dataset
13
Color Object Detection Based Image Retrieval Using ROI… 187
60
50
40
30 Corel-1k
20 Corel-5k
10
0
13
188 R. Raja et al.
Fig. 26 Confusion matrix by proposed algorithm on the Corel-1k datasets amd Corel-5k dataset
Corel-5k
50.98% Corel-1k Corel-1k
55% Corel-5k
13
Color Object Detection Based Image Retrieval Using ROI… 189
Table 1 Average accuracy and Dataset MTSD accu- Proposed MTSD pre- Proposed
precision on number of retrieval racy (%) accuracy (%) cision (%) precision
images (%)
Table 3 The table represent the average value of MTSD precision and proposed precision and MTSD accu-
racy and proposed accuracy when the quantization level of color is 108
The quantization level of edge The quantization level of color is 108
orientation is 9
MTSD precision Proposed precision MTSD accuracy Proposed
(%) (%) (%) accuracy
(%)
Table 4 The average retrieval Category Proposed precision of similarity metrics (%) on Corel-
precision of proposed method on 1k dataset
different similarity metrics
L1 Euclidean Chebyshev Hamming Jaccard
Table 5 The average retrieval precision of proposed method on different similarity metrics
Category Proposed precision of similarity metrics (%) on Corel-5k dataset
L1 Euclidean Chebyshev Hamming Jaccard
13
190 R. Raja et al.
5 Conclusion
This paper has displayed that CBIR is utilized for looking and getting to the pictures from
large picture datasets. It recouped pictures based on picture descriptors. For color fea-
tures, we implement the color coding for finding color regions. In this work, firstly find
the region of interest (ROI) of the image using Sobel and Canny method. HSV histogram
is used for color descriptors. For finding edges used edge detector method, it detects an
object from an image and extracts features. This algorithm is better than the previous
algorithm. To detect the highest rank of an image used MD and HD similarity metrics
and it is far improved than other distance metrics. In the present work, the precision and
accuracy have reached up to 79.41% as well as 86.66% for Corel-1k dataset for N = 20 and
69.82% as well as 70.53% for Corel-5k dataset. In future work, the proposed method will
be tested on other database like Caltech database for improve object retrieval.
References
1. Arya, D., & Jha, J. (2016) Global and local descriptor for CBIR and image enhancement using multi-
feature fusion method. International Journal of Research—GRANTHAALAYAH.
2. Agarwal, S., Verma, A. K., & Dixit, N. (2011). Content-based image retrieval using color edge
detection and discrete wavelet transform. In International conference on issues and challenges in
intelligent computing techniques (ICICT), February 7–8, 2011 (pp. 368–372)
3. Zinzuvadia, K. M., Tanawala, B. A., & Brahmbhatt, K. N. (2015). A survey on feature-based image
retrieval using classification and relevance feedback techniques. IJIRCCE, 2, 1253.
4. Dixit, N., Tiwari, S. K., & Sharma, P. (2016). A new algorithm for CBIR using bi-cubic interpolation
with color coding and different level DWT. In 2016 IEEE (ICCUBEA) (pp. 1–6).
5. Kaur, M., & Dhingra, S. (2017). Comparative analysis of image classification techniques using statisti-
cal features in CBIR systems. In I-SMAC 2017 (pp. 265–270). IEEE.
6. Varnish, N., & Pal, A. K. (2015). Content-based image retrieval using statistical features of color
histogram. In 2015 3rd international conference on signal processing, communication and net-
working (ICSCN). IEEE
7. Zhao, M., Zhang, H., & Sun, J. (2016). A novel image retrieval method based on multi-trend structure
descriptor. Journal of Visual Communication and Image Representation, 38, 73–81.
8. Zheng, L., Wang, S., Liu, Z., & Tian, Q. (2015). Fast image retrieval: Query pruning and early ter-
mination. IEEE Transactions on Multimedia, 17(5), 648–659.
9. Mikolajczyk, K., & Schmid, C. (2005). A performance evaluation of local descriptors. IEEE Trans-
actions on Pattern Analysis and Machine Intelligence, 27(10), 1615–1630.
10. Shah, A., Naseeml, R., Sadia, S. I., & Shah, M. A. (2017). Improving CBIR accuracy using convo-
lutional neural network for feature extraction. IEEE 2017 (pp 1–5).
11. Ali, A., & Sharma, S. (2017). Content-based image retrieval using feature extraction with machine
learning. In ICICCS 2017 IEEE (pp 1048–1053).
12. Walia, E., & Verma, V. (2016). Boosting local texture descriptors with Log-Gabor filters response for
improved image retrieval (pp. 1–12). London: Springer. https://doi.org/10.1007/s13735-016-0099-2.
13. Vikhar, P., & Karde, P. (2016). Improved CBIR system using edge histogram descriptor (EHD) and
support vector machine (SVM). IEEE (pp. 1–5).
13
Color Object Detection Based Image Retrieval Using ROI… 191
14. Benavides, C., Villegas, J., Member, IEEE, Román, G., & Avilés, C. (2016). Face classification by
local texture analysis through CBIR and SURF points. IEEE Latin America Transactions, 14(5),
2418–2434.
15. Raja, R., Sinha, T. S., Patra, R. K., & Tiwari, S. (2018). Physiological trait based biometrical authen-
tication of human-face using LGXP and ANN techniques. International Journal of Information and
Computer Security Special Issue on: “Multimedia Information Security Solutions on Social Networks,
10(2/3), 303–320.
16. Raja, R., Sinha, T. S., & Dubey, R. P. (2015). Recognition of human-face from side-view using pro-
gressive switching pattern and soft-computing technique. Association for the advancement of model-
ling and simulation techniques in enterprises, Advance B (Vol. 58(1), pp. 14–34), ISSN:-1240-4543.
17. Kumar, S., Singh, S., & Kumar, J. (2018). Live detection of face using machine learning with multi-
feature method. Wireless Personal Communication Springer Journal (SCI). https://doi.org/10.1007/
s11277-018-5913-0.
18. Kumar, S., Singh, S., & Kumar, J. (2018). Automatic live facial expression detection using genetic
algorithm with haar wavelet features and SVM. Wireless Personal Communication Springer Journal
(SCI). https://doi.org/10.1007/s11277-018-5923-y.
Publisher’s Note Springer Nature remains neutral with regard to jurisdictional claims in published maps and
institutional affiliations.
Dr. Rohit Raja has received Ph.D. in Computer Science and Engineer-
ing form CVRAMAN University in 2016. His main research interest
includes Face recognition and Identification, Digital Image Processing,
Signal Processing and Networking. Presently he is working as Profes-
sor in CSE Department, Sreyas Institute of Engineering and Technol-
ogy, Hyderabad India. He has authored several Journal and Conference
Papers. He has a good Academics and Research experience in various
areas of Computer Science and Engineering. He has filed successfully
8 Patents. He has been received 2 times invitation being a Guest in
IEEE Conferences. He has published 75 research papers in various
International/National Journals (including IEEE, Springer etc.) and
Proceedings of the reputed International/National Conferences (includ-
ing Springer and IEEE). He has been nominated in the board of edi-
tors/reviewers of many peer-reviewed and refereed Journals (including
IEEE, Springer).
13
192 R. Raja et al.
13