Chapter 1(Introduction to CV and Image Processing)
Chapter 1(Introduction to CV and Image Processing)
Instructor: Bayisa G.
• Computer vision is a scientific field which deals with how computers can be made
as high level devices which understand digital images and videos.
• In terms of engineering, it is an automate task that the human visual system can do.
• As a scientific discipline, computer vision is related to the theory of artificial system
which can extract information from images.
• Computer vision has methods for acquiring, processing, analyzing and
understanding the digital image.
• The most important task is to extract high dimensional data from the real world
which can produce numerical or symbolic information.
• The image data is used in the form of video sequences which can be seen by a
human
• Computer vision is not a new technology because scientists and experts have been
trying to develop machines that can see and understand visual data for almost six
decades. The evolution of computer vision is classified as follows:
1. 1959: The first experiment with computer vision was initiated in 1959, where
they showed a cat as an array of images. Initially, they found that the system reacts
first to hard edges or lines, and scientifically, this means that image processing
begins with simple shapes such as straight edges.
2. 1960: In 1960, artificial intelligence was added as a field of academic study to
solve human vision problems.
2. 1963: This was another great achievement for scientists when they developed
computers that could transform 2D images into 3-D images.
4. 1974: This year, optical character recognition (OCR) and intelligent character
recognition (ICR) technologies were successfully discovered.
• The OCR has solved the problem of recognizing text printed in any font or
typeface, whereas ICR can decrypt handwritten text. These inventions are one of
the greatest achievements in document and invoice processing, vehicle number
plate recognition, mobile payments, machine translation, etc.
5. 1982: In this year, the algorithm was developed to detect edges, corners, curves,
and other shapes. Further, scientists also developed a network of cells that could
recognize patterns.
6. 2000: In this year, scientists worked on a study of object recognition.
– The goal of Computer Vision is to write computer programs that can interpret
images.
• Why study Computer Vision
ü Images ( and movies) have become ubiquitous in both production and
consumption
ü Therefore applications to manipulate images (movies) are becoming core.
– Practically, every scene which is around us forms an image and this involved
under image processing.
– An image is formed by two-dimensional analog and the digital signal that
contains color information arranged along x and y spatial axis.
– An Image, be it a digital or a still image, all of it is nothing but a binary
representation of some visual information.
– This image is traffic sign, which gives information about the signals and signage
for the rides.
– This is a digital image and can be stored in any digital storage medium
– Image Processing
• The study of the properties of operators that produce images from other
images.
– Pattern Recognition
» Typically refers to the recognition of structures in 2D images(usually without
reference to any underlying 3D information).
– Photogrammetry
» The science of measurement through non-contact sensing, e.g. terrain maps
from satellite images
» Usually is more focused on accuracy issue than interpretation
– Computer Graphics
» Produce the plausible images
Computer Vision:
• In Computer Vision, computers or machines are made to gain high-level understanding from the
input digital images or videos with the purpose of automating tasks that the human visual system
can do. It uses many techniques and Image Processing is just one of them.
Image Processing
• Image Processing is the field of enhancing the images by tuning many parameter and features of
the images. So Image Processing is the subset of Computer Vision. Here, transformations are
applied to an input image and the resultant output image is returned. Some of these transformations
are- sharpening, smoothing, stretching etc.
• Now, as both the fields deal with working in visuals, i.e., images and videos, there seems to be lot
of confusion about the difference about these fields of computer science. In this article we will
discuss the difference between them.
Special Effects
• Shape and motion capture are new techniques used in movies like Avatar to
animate digital characters by recording the movements played by a human
actor.
• In order to do that, we have to find the exact positions of markers on the
actor’s face in a 3D space, and then recreate them on the digital avatar.
3D Urban Modeling
• Taking pictures with a drone over a city can be used to render a 3D model of
the city.
• Computer vision is used to combine all the photos into a single 3D model.
Scene Recognition
• It is possible to recognize the location where a photo was taken.
• For instance, a photo of a landmark can be compared to billions of photos on
google to find the best matches.
• We can then identify the best match and deduce the location of the photo.
Optical Character Recognition(OCR)
• One of the oldest successful applications of computer vision is to recognize
characters and numbers.
• This can be used to read zipcodes, or license plates.
Face Detection
• Face detection has been used for multiple years in cameras to take better pictures
and focus on the faces.
• Smile detection can allow a camera to take pictures automatically when the
subject is smiling.
• Face recognition is more difficult than face detection, but with the scale of today’s
data, companies like Facebook are able to get very good performance.
• Finally, we can also use computer vision for biometrics, using unique iris pattern
recognition or fingerprints.
• Medical Imaging
• Computer vision helps in MRI reconstruction, automatic pathology, diagnosis,
machine aided surgeries and more.
Augmented Reality(AR)/Virtual Reality(VR)
• Object occlusion (dense depth estimation), outside-in tracking, inside-out tracking
for virtual and augmented reality.
• Smartphones
• All the photo filters (including animation filters on social media), QR code
scanners, panorama construction, Computational photography, face detectors,
image detectors (Google Lens, Night Sight) that you use are computer vision
applications.
4/2/2025 CV and Image Processing 19
Application of Computer Vision(CV) …
Internet
• Image search, image captioning, Ariel imaging for maps, video categorization and
more.
Retail: Amazon recently opened to the public the Amazon Go store where shoppers
need not wait in line at the checkout counter to pay for their purchases.
Healthcare
• Gauss Surgical has developed blood monitoring solutions that described to
estimate in real-time blood loss during medical situations.
• This solution, the website reports, maximizes transfusions and recognized
hemorrhage better than the human eye.
• There are two categories of the steps involved in the image processing
1. Methods whose input are images.
2. Methods whose outputs are attributes extracted from those images.
1. Image Acquisition:
• Is the first step in image processing.
• This step is also known as preprocessing in image processing.
• It involves retrieving the image from a source, usually a hardware-based
source..
2. Image Enhancement:
• the process of bringing out and highlighting certain features of interest in an
image that has been obscured.
• This can involve changing the brightness, contrast, etc.
3. Image Restoration
• is the process of improving the appearance of an image.
• However, unlike image enhancement, image restoration is done using certain
mathematical or probabilistic models.
4. Color Image Processing
• includes a number of color modeling techniques in a digital domain.
• This step has gained prominence due to the significant use of digital images
over the internet.
7. Morphological Processing
• is a set of processing operations for morphing images based on their shapes.
8. Segmentation
• is one of the most difficult steps of image processing.
• It involves partitioning an image into its constituent parts or objects.
9. Representation and Description
• After an image is segmented into regions in the segmentation process, each
region is represented and described in a form suitable for further computer
processing.