Image Processing in Artificial Intellige
Image Processing in Artificial Intellige
ABSTRACT
Article Info Machines can learn to elucidate images the same way our brains do and analyse
Volume 6, Issue 5 those images much more thoroughly than we can. When applied to Image
Page Number: 244-249 Processing, Artificial Intelligence (AI) can propel face recognition and security
Publication Issue : functionality in public places, detecting and recognizing intruders, objects, and
September-October-2020 patterns in live images and videos, etc. Image processing technology focuses on
the development of data extraction methods applied to the statistical
classification of visual imagery. In classical image processing systems, an image
is pre-processed to remove noise (denoising), segmented to produce close object
boundaries, analysed to extract a representative feature, and compared to the
ideal object feature vectors by a classifier to decide the nearest object
Article History classification and its associated level. In this paper, we discuss about digital
Accepted : 10 Sep 2020 image processing and the role of AI in it.
Published : 20 Oct 2020 Keywords : Image Processing, Artificial Intelligence, Digital Image, Data
Extraction
Copyright: © the author(s), publisher and licensee Technoscience Academy. This is an open-access article distributed
under the terms of the Creative Commons Attribution Non-Commercial License, which permits unrestricted non-
commercial use, distribution, and reproduction in any medium, provided the original work is properly cited
244
Shahzeb Hussain et al Int J Sci Res CSE & IT, September-October-2020; 6 (5) : 244-249
• Image sharpening and restoration — Improves 2) Image enhancement – This phase enhances the
and enhances the quality of processed images. quality of the captured image to extract hidden
• Image retrieval — Helps in the searching of information from it for further processing.
images.
• Object measurement — Measures objects in an 3) Image restoration – This also improves the quality
image. of an image by removing possible corruptions in
• Pattern recognition — Distinguishes and classifies order to get a clearer version. This process is based
objects in an image, identifies their positions, and mostly on probability and mathematical models and
understands the scene. can be used to get rid of noise, blur, missing pixels,
watermarks, camera misfocus, and other corruptions
that may affect the training of a neural network.
6) Morphological processing – This method describes Figure 6: Image recognition with a CNN
the shape and structure of the objects in an image. It 8) Representation and description – It is the process
can be used when creating datasets for training AI of visualizing and describing processed images. The
models. In particular, morphological analysis and raw output of an AI system looks like an array of
processing can be applied at the stage where we want numbers and values that represent the data the AI
to describe what we need our AI model to detect or model was trained to produce. A deep neural
recognize. network usually doesn’t produce any output data
representations. With the help of special visualization
tools, these arrays of numbers can be turned into
readable images suitable for further analysis.
The use of Artificial Intelligence and Machine To make it easier to use these methodologies as well
Learning amplifies both the speed of data processing as to implement AI-based image processing
and the quality of the final result. With the aid of AI functionalities in our product, we can use specific
platforms, we can successfully achieve such complex libraries and frameworks. In the next section, we take
tasks as face recognition, object detection, and text a closer look at some of the most popular open-source
recognition. But, in order to get high-quality results, libraries for accomplishing different image processing
we need to pick the right tools and methods [5]. tasks with the help of AI algorithms.
A. OpenCV
II. IMAGE PROCESSING METHODS,
TECHNIQUES, AND TOOLS The Open Source Computer Vision Library (OpenCV)
is a popular library that provides a number of
Most images taken with regular sensors (e.g. camera)
computer and machine learning algorithms and
require pre-processing, as they can contain too much
functions for composing and supporting those
noise. Filtering and edge detection techniques are
algorithms. The library comes with Python, C++, and
two of the most common methods for processing
Java interfaces and supports all popular desktop and
digital images [6].
mobile operating systems. It includes various modules,
• Filtering – This method is used for improving and such as a machine learning module, an image
modifying the input image. With the help of processing module, and an object detection module.
different filters, we can emphasize or remove Using this library, we can acquire, compress, enhance,
certain features in an image, reduce image noise, restore, and extract data from images.
and so on. Popular filtering techniques include
linear filtering, Wiener filtering, and median
B. Visualisation Library
filtering.
• Edge detection – It uses filters for data extraction Visualization Library is C++ middleware for 2D and
and image segmentation. This method helps to 3D applications established on the Open Graphics
find meaningful edges of objects in processed Library. This toolkit allows building portable and
images by detecting discontinuities in brightness. high-performance applications for Windows, Linux,
Roberts edge detection, Sobel edge detection, and and Mac OS systems. It is easy and comfortable to
Canny edge detection are among the most popular work with.
edge detection techniques.
C. VGG Image Annotator
VGG Image Annotator (VIA) is a web-based
application for object annotation. It can be installed
in a web browser and can be used for annotating
detected objects in images, and video records. VIA
doesn’t require additional setup or installation and
can be used with any modern browser.
models for image processing. This will make the E. Google Cloud Vision
development a bit faster and easier. We can also use Cloud Vision is a part of the Google Cloud platform
special platforms and frameworks. Now we will take and provides a set of image processing features. It
a look at some of the most popular ones: offers an API for integrating such features as object
A. TensorFlow localization, image labelling and classification, and
object recognition. Cloud Vision uses pre-trained
Google’s TensorFlow is a popular open-source
machine learning models and create and train custom
framework with support of machine learning and
machine learning models for solving different image
deep learning. Using TensorFlow, we can create and
processing tasks [8].
train custom deep learning models. This framework
also provides a set of libraries that can be used in
image processing and computer vision projects. F. Google Colaboratory (Colab)
PyTorch is an open-source deep learning framework service that can be used for developing deep learning
created by the Facebook AI Research lab (FAIR). This applications from scratch. Colab makes it easier to use
framework has Python, C++, and Java interfaces. It is popular libraries such as Keras, OpenCV, and
used for building computer vision and natural TensorFlow. Colab provides free GPU resources as
[1] Patel, Krishna Kumar, A. Kar, S. N. Jha, and M. Shahzeb Hussain, Prayas Dixit, Md. Shaayan Hussain,
A. Khan. "Machine vision system: a tool for "Image Processing in Artificial Intelligence",
quality inspection of food and agricultural International Journal of Scientific Research in
products." Journal of food science and technology Computer Science, Engineering and Information
49, no. 2 (2012): 123-141. doi: 10.1007/s13197- Technology (IJSRCSEIT), ISSN : 2456-3307, Volume
011-0321-4 6 Issue 5, pp. 244-249, September-October 2020.
[2] Long, Jonathan, Evan Shelhamer, and Trevor Available at
Darrell. "Fully convolutional networks for doi : https://doi.org/10.32628/CSEIT206542
semantic segmentation." In Proceedings of the Journal URL : http://ijsrcseit.com/CSEIT206542
IEEE Conference on Computer Vision and
Pattern Recognition, pp. 3431-3440. 2015. DOI:
10.1109/CVPR.2015.7298965
[3] K. D. Irwin and G. C. Hilton, "Transition-Edge
Sensors," Cryogenic Particle Detection, ed. C.
Enss, Springer, 2005
[4] R. Deriche, Using Canny's criteria to derive a
"recursively implemented optimal edge detector",
Int. J. Computer Vision, Vol. 1, pp. 167-187, April
1987.
[5] Rautaray, Siddharth S., and Anupam Agrawal.
"Vision-based hand gesture recognition for
human-computer interaction: a survey." Artificial
Intelligence Review 43, no. 1 (2015): 1-54. Doi:
10.1007/s10462-012-9356-9
[6] "NDT Resource Center. " http://www.ndt-ed.org
/index_flash.htm.
[7] "Image processing toolbox user's guide." (2005)
The Math Works, Natick, Massachusetts, USA.
[8] Kehoe, Ben, Sachin Patil, Pieter Abbeel, and Ken
Goldberg. "A survey of research on cloud robotics
and automation." IEEE Transactions on
automation science and engineering 12, no. 2
(2015): 398-409. DOI:
10.1109/TASE.2014.2376492