Computer Vision in Retail
The retail field has been
fastest growing field and at the same time is using Computer
Vision for making the user experience more fruitful. Retailers can
use Computer Vision techniques to track customers’ movements
through stores, analyse navigational routes and detect walking
patterns.
Inventory Management is another such application. Through
security camera image analysis, a Computer Vision algorithm can
generate a very accurate estimate of the items available in the
store. Also, it can analyse the use of shelf space to identify
suboptimal configurations and suggest better item placement.
Self-Driving Cars: Computer Vision is the fundamental
technology behind developing autonomous vehicles.
Most leading car manufacturers in the world are
reaping the benefits of investing in artificial intelligence
for developing on-road versions of hands-free
technology.
This involves the process of identifying the objects,
getting navigational routes and also at the same time
environment monitoring.
Medical Imaging*: For the last decades, computer
supported medical imaging application has been a
trustworthy help for physicians. It doesn’t only
create and analyse images, but also becomes an
assistant and helps doctors with thelr interpretation.
The application is used to read and convert 2D scan
images into interactive 3D models that enable
medical professionals to gain a detailed
understanding of a patient’s health condition.
Google Translate App*: All you need to do to read signs in z
foreign language is to point your phone's camera at the words anc
let the Google Translate app tell you what it means in your preferrec
janguage almost instantly. By using optical character recognition tc
see the image and augmented reality to overlay an accurate
translation, this is a convenient tool that uses Computer Vision.A pixel (short for picture element) represents the smallest piece of the screen that can be controllet
individually in terms of colour and intensity.
A bitmapped graphic is stored as an array of dots, or pixels. Each pixel gets assigned a specific colour.
Image Resolution refers to the quality and size of an image in terms of the number of pixels it contains,
typically as ‘width x height’
The amount of information that is stored about a pixel determines its colour depth (pixel value), which
controls how precisely the pixel’s colour can be specified.
A grayscale image is an image in which the only colours are shades of grey.
An RGB image is a coloured image that is formed by the three-colour information (Red, Green, Blue
information) stored for each pixel of a digital image.
RGB value (0,0,0) represents black colour and RGB value (255, 255, 255) represents white colour.An image feature is a measurable piece of data in your image which is unique to this specific object, such
‘as a distinct colour, specific shape or an image segment
4 Computer Vision Tasks refer to the internal processing of image that helps in understanding and describing a
‘scene encapsulated in an image, such as identifying, labelling, locating, segmenting the objects in it.
4 Common computer vision tasks are : classification, localisation, object detection and image segmentation.
4+ Image Classification refers to the act of identifying and classifying a given image as belonging to one of a
set of predefined categories or classes.
*
¢ Localisation refers to identifying the position or location of an object in image, Location performs
classification along with localising the object.
4 Object detection refers to a combined action of localisation and classification, carried out on different
items/things/subjects of interest in an image/video to identify and label them as objects of specific types
and know their count.
*
The purpose of object detection is to find and then classify a different number of objects in an image. It is
commonly used in applications such as image retrieval and automated vehicle parking systems.
4 Allcomputer vision tasks are carried out with the help of algorithms. That is computer vision takes place
through a combination of algorithms like classification algorithms; localisation algorithms, detection
algorithms and segmentation algorithms ete.
4 OpenC¥ (Open Source Computer Vision Library) is an open-source library that includes several hundreds of
computer vision algorithms.
Solution Time
——
1. What is the difference between grayscale and RGB images ?
‘Ans. A grayscale image is an image in which the only colours are shades of gray.
‘An RGB image is a coloured image that is formed by the three-colour information (Red, Green,
Blue information) stored for each pixel of a digital image.
2. What are computer vision tasks ?
Ans. Computer Vision Tasks refer to the internal processing of image that helps in understanding
and describing a scene encapsulated in an image, such as identifying, labelling, locating, segmenting
the objects in it.
3. Briefly list four basic Computer Vision tasks.
‘Ans, Four basic computer vision tasks are :
(Image Classification. It predicts the class of an object in an image.
(ii) Image Localisation. It locates the presence of objects in an image and indicate them with
bounding boxes.
(iii) Object Detection. It locates the presence of objects with a bounding box and classes of the
located objects in an image.
(iv) Image Segmentation. A segmentation returns much more detailed information about the image.
4. What do you understand by image features ? How are these useful in computer vision ?
Ans. An image feature is a measurable piece of data in an image which is unique to this specific
object, such as a distinct colour, specific shape or an image segment.
Using image features, the CV algorithms identify and create classes of objects. Using these classes,
various CV tasks take place.