
CV Pipeline Preprocessing Stage

Dr. Hussien Karam


Team Members:

1. Shaza Hossam
2. Ne`am Mohamed
3. Mariam Abd EL-Twab
4. Dina Mamdoh
Definition of preprocessing:

Image preprocessing is the set of steps taken to format images before they are used
for model training and inference.

More generally, preprocessing is a preliminary processing of data in order to prepare it for the primary processing or for further analysis. The term can be applied to any first or preparatory processing stage when several steps are required to prepare the data for the user. For example, extracting data from a larger set, filtering it for various reasons, and combining sets of data could all be preprocessing steps.

Why preprocessing?

• The acquired data are usually messy and come from different sources. To
feed them to an ML model (or neural network), they need to be
standardized and cleaned up. More often than not, preprocessing is used
to conduct steps that reduce the complexity and increase the accuracy of
the applied algorithm. We can’t write a unique algorithm for every
condition in which an image is taken; thus, when we acquire an image, we
tend to convert it into a form that a general algorithm can handle.
• When it comes to creating a Machine Learning model, data preprocessing
is the first step of the process. Typically, real-world
data is incomplete, inconsistent, inaccurate (it contains errors or outliers), and
often lacks specific attribute values or trends. This is where data
preprocessing comes in: it helps to clean, format, and organize
the raw data, making it ready for Machine Learning models.
• Before raw data can be sent through a machine learning model, it has to
undergo preprocessing, simply because data in the real world are
generally incomplete, noisy, and inconsistent. If such data are fed into the
machine learning model, the results can be unexpected, which is not what
we want. Data preprocessing is a proven method for resolving these issues.

• Data preprocessing is a way of converting data from its raw form to a much
more usable or desired form, i.e., making data more meaningful by
rescaling, standardizing, binarizing, one-hot encoding, and label encoding.

• Preprocessing is required to clean image data for model input. For example,
fully connected layers in convolutional neural networks require that all
images be arrays of the same size.

Goals of image preprocessing

• Assuring the image satisfies a certain level of quality before it is passed to a computer vision function.

• Confirming the image coordinate system is correct (resizing the image).

• Reducing image noise.

• Enhancing contrast.

Preprocessing techniques

Convert color images to grayscale to reduce computational complexity: in certain problems you’ll find it useful to discard unnecessary information from your images to reduce space or computational complexity, for example by converting your color images to grayscale. For many objects, color isn’t necessary to recognize and interpret the image, and grayscale can be good enough for recognizing them. Because color images contain more information than grayscale images, they add unnecessary complexity and take up more space in memory (remember that color images are represented in three channels, so converting to grayscale reduces the number of values that need to be processed).

Figure 1: a grayscale example in which intensity patterns define the object’s shape.

In the example above, you can see how patterns in brightness and darkness of an
object (its intensity) can be used to define the shape and characteristics of many
objects. In other applications, color is important for defining certain objects; for
example, skin cancer detection relies heavily on skin color (red rashes).
The rule of thumb for judging the importance of color in your problem is to look at
the image with your own eyes: if you can identify the object you are
looking for in a grayscale image, then you probably have enough information to feed to
your model; if not, then you definitely need the extra information (color) in your
images. The same rule can be applied to most of the other preprocessing techniques
discussed next.
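As an illustration, here is a minimal grayscale-conversion sketch in Python using OpenCV; the library choice and file names are assumptions for illustration, not part of the original notes:

import cv2

# Read a color image (OpenCV loads it as a BGR array) and convert it to grayscale.
image = cv2.imread("input.jpg")                   # shape: (H, W, 3)
gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)    # shape: (H, W), a single channel
cv2.imwrite("input_gray.jpg", gray)
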

Standardize images: One important constraint that exists in some machine learning algorithms, such as CNNs, is the need to resize the images in your dataset to a unified dimension. This implies that our images must be preprocessed and scaled to identical widths and heights before being fed to the learning algorithm.
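A minimal sketch of this step, assuming OpenCV; the 224x224 target size is an illustrative choice, not something prescribed by the notes:

import cv2

TARGET_SIZE = (224, 224)  # (width, height) expected by the model; illustrative value

def standardize(image):
    """Resize an image to a fixed width and height before feeding it to the network."""
    return cv2.resize(image, TARGET_SIZE, interpolation=cv2.INTER_AREA)
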

Data augmentation: Another common preprocessing technique involves augmenting the existing dataset with perturbed versions of the existing images, such as scaled and rotated copies.
This is done to enlarge your dataset and expose the neural network to a wide
variety of variations of your images, which makes it more likely that your model
recognizes objects when they appear in any form and shape. A classic example is
applying such perturbations to a single butterfly image to produce many augmented copies.
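A minimal augmentation sketch combining random scaling and rotation, assuming OpenCV; the function name and parameter ranges are illustrative:

import random
import cv2

def random_scale_rotate(image, max_angle=20.0, scale_range=(0.9, 1.1)):
    """Return a randomly rotated and scaled copy of an (H, W, C) image array."""
    h, w = image.shape[:2]
    angle = random.uniform(-max_angle, max_angle)
    scale = random.uniform(*scale_range)
    # A single affine matrix encodes both the rotation about the centre and the scaling.
    matrix = cv2.getRotationMatrix2D((w / 2, h / 2), angle, scale)
    return cv2.warpAffine(image, matrix, (w, h))
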

Resize: Changing the size of an image sounds trivial, but there are
considerations to take into account.

Many model architectures call for square input images, but few devices capture
perfectly square images. Altering an image to be square calls for either stretching
its dimensions to fit, or keeping its aspect ratio constant and filling
the newly created “dead space” with new pixels. Moreover, input images may come in
various sizes, and some may be smaller than the desired input size.
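One way to make an image square without distorting it is letterboxing, sketched here with OpenCV; the output size and padding value are assumptions:

import cv2

def resize_square(image, size=224, pad_value=0):
    """Resize to a square canvas while keeping the aspect ratio, padding the dead space."""
    h, w = image.shape[:2]
    scale = size / max(h, w)
    resized = cv2.resize(image, (int(w * scale), int(h * scale)))
    top = (size - resized.shape[0]) // 2
    bottom = size - resized.shape[0] - top
    left = (size - resized.shape[1]) // 2
    right = size - resized.shape[1] - left
    # Fill the newly created "dead space" with a constant border colour.
    return cv2.copyMakeBorder(resized, top, bottom, left, right,
                              cv2.BORDER_CONSTANT, value=pad_value)
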

Random Flips: Randomly mirroring an image about its x- or y-axis forces our
model to recognize that an object need not always be read from left to right or from
top to bottom. Flipping may be illogical for order-dependent contexts, like interpreting
text.

Best tips: for most real-world objects, flipping is a strong way to improve
performance.
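A random horizontal flip is a one-liner; this sketch assumes OpenCV and an illustrative 50% flip probability:

import random
import cv2

def random_flip(image, p=0.5):
    """Mirror the image about its vertical axis (left-right) with probability p."""
    if random.random() < p:
        return cv2.flip(image, 1)  # flipCode=1 mirrors left-right; 0 mirrors top-bottom
    return image
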

Random Rotations: Rotating an image is particularly important when a model may be used in a non-fixed position, like a mobile app. Rotating can be tricky because it, too, generates “dead pixels” on the edges of our images and, for object detection, requires trigonometry to update any bounding boxes.

Best tips: if an object may appear in a variety of different orientations relative to the captured images, rotation is a good option. This would not be true for, say, screenshots, where the image content is always in a fixed position.
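The bounding-box update mentioned above can be done by applying the same rotation matrix to the box corners; here is a sketch assuming OpenCV and NumPy, with an illustrative function name and (x1, y1, x2, y2) box format:

import numpy as np
import cv2

def rotate_with_box(image, box, angle):
    """Rotate an image and an axis-aligned box (x1, y1, x2, y2) by `angle` degrees."""
    h, w = image.shape[:2]
    matrix = cv2.getRotationMatrix2D((w / 2, h / 2), angle, 1.0)
    rotated = cv2.warpAffine(image, matrix, (w, h))
    # Transform the four corners with the same matrix, then fit a new axis-aligned box.
    x1, y1, x2, y2 = box
    corners = np.array([[x1, y1], [x2, y1], [x2, y2], [x1, y2]], dtype=np.float32)
    ones = np.ones((4, 1), dtype=np.float32)
    moved = np.hstack([corners, ones]) @ matrix.T   # shape (4, 2)
    new_box = (moved[:, 0].min(), moved[:, 1].min(),
               moved[:, 0].max(), moved[:, 1].max())
    return rotated, new_box
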

Random Noise: Adding noise to images can take a variety of forms. A common technique is “salt
and pepper noise,” wherein image pixels are randomly converted to completely
black or completely white. While deliberately adding noise to an image may reduce
training performance, this can be the goal if a model is overfitting on the wrong
elements.

Best tips: if a model is severely overfitting on image artifacts, salt and pepper noise
can effectively reduce this.
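A salt-and-pepper sketch in NumPy; the noise fraction is an illustrative parameter:

import numpy as np

def salt_and_pepper(image, amount=0.02):
    """Set a random fraction `amount` of pixels to pure white (salt) or pure black (pepper)."""
    noisy = image.copy()
    h, w = image.shape[:2]
    n = int(amount * h * w)
    ys = np.random.randint(0, h, n)
    xs = np.random.randint(0, w, n)
    noisy[ys[: n // 2], xs[: n // 2]] = 255   # salt
    noisy[ys[n // 2 :], xs[n // 2 :]] = 0     # pepper
    return noisy
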
Contrast: Contrast is the difference in luminance or color that makes an object
distinguishable from other objects within the same field of view.

In a side-by-side comparison of a low-contrast and a high-contrast version of the same scene, the low-contrast image makes it difficult to identify details that are clearly visible in the high-contrast one.

Low-contrast images can result from poor illumination, lack of dynamic range in
the imaging sensor, or even a wrong lens-aperture setting during image acquisition.

When performing contrast enhancement, you must first decide whether you want
to do global or local contrast enhancement. Global means increasing the contrast
of the whole image, while in the local approach we divide the image into small regions and
enhance the contrast of each region independently. Don’t worry, we
will discuss these in detail in the next posts.
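As a sketch of the two options, OpenCV offers global histogram equalization and CLAHE for local, tile-based equalization; the file name and CLAHE parameters are illustrative assumptions:

import cv2

gray = cv2.imread("low_contrast.jpg", cv2.IMREAD_GRAYSCALE)

# Global: one histogram equalization over the entire image.
global_eq = cv2.equalizeHist(gray)

# Local: CLAHE equalizes small tiles independently and limits noise amplification.
clahe = cv2.createCLAHE(clipLimit=2.0, tileGridSize=(8, 8))
local_eq = clahe.apply(gray)
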
