Image Data Preprocessing

Image data preprocessing is essential for preparing raw images for neural networks by ensuring consistency in size and quality. Key steps include splitting data into training, validation, and test sets, converting images to tensors, resizing, normalizing pixel values, and batching for efficient processing. These practices enhance model efficiency, improve performance, and help prevent overfitting, ultimately leading to better predictions.

Uploaded by

Cristian

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

36 views34 pages

Image Data Preprocessing

Uploaded by

Cristian

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 34

IMAGE DATA

PREPROCESSING
BY: GROUP 4
KEY TOPICS:
1. Splitting Data into Training,
Validation, and Test Sets

2. Converting Images to
Tensors and Loading Data
WHAT IS
IMAGE DATA
PREPROCESSING?
What is Image Data
Preprocessing?
Image data preprocessing prepares
raw images so neural networks can
learn from them accurately, by
making sizes and quality consistent.
IMPORTANCE OF IMAGE DATA
PREPROCESSING
ENHANCES MODEL IMPROVES MODEL
EFFICIENCY PERFORMANCE
Preprocessing simplifies image Standardized data helps the
data, reducing the amount of model learn patterns more
computation required, which effectively by removing
makes training faster. inconsistencies and noise.

PREVENTS
OVERFITTING
By using techniques like data
augmentation, preprocessing
increases the dataset’s diversity,
helping the model generalize
better.
KEY STEPS FOR
PREPROCESSING
IMAGE DATA
SPLITTING DATA:
TRAINING,
VALIDATION AND
TEST SETS
SPLITTING DATA: TRAINING,
VALIDATION AND TEST SETS
Splitting the dataset into
training, validation, and test sets
is crucial for assessing a model’s
performance effectively.
What is Training, Validation
and Test Set?

Training Set:
Used to train the model.
Validation Set:
Used to adjust settings and check the model’s
performance during training.
Test Set:
Used only after training to measure how well the
model performs on new data.
EXAMPLE:
If you have 1,000 images, you might split
them as follows:
Training Set: 70% (700 images)
Validation Set: 15% (150 images)
Test Set: 15% (150 images)
Why is it Important?

Splitting data helps prevent

overfitting, ensuring the
model learns patterns rather
than memorizing specific
images.
CONVERTING IMAGES
TO TENSORS AND
LOADING DATA
CONVERTING IMAGES TO
TENSORS AND LOADING DATA

CONVERTING IMAGES TO TENSORS

FORMATS DATA FOR MODELS, WHILE
LOADING ORGANIZES IT INTO BATCHES
FOR TRAINING.
Example of Converting Image to Tensors:
If you have a 1024x1024 pixel RGB image,
converting it to a tensor might produce a
tensor of shape [3, 1024, 1024], where 3 is
the number of color channels (RGB) and
1024x1024 is the image's dimensions.
NOTE: CONVERTING AN IMAGE TO A
TENSOR TRANSFORMS THE PIXEL VALUES
INTO NUMBERS, MAKING IT EASIER FOR
NEURAL NETWORKS TO PROCESS. THE
IMAGE'S DIMENSIONS AND COLORS
REMAIN, BUT THE DATA IS NOW IN A
FORMAT THAT MACHINES CAN WORK WITH.
Example of Loading Data:
If you have an image file, loading data
would involve reading these files into
memory using a library like PIL (Python
Imaging Library) or OpenCV.
Why is it Important?
Loading data reads images, and
converting them to tensors prepares
them for the model to process.
RESIZING
IMAGES
RESIZING IMAGES

RESIZE ALL IMAGES TO A

CONSISTENT SIZE SO THE
MODEL CAN PROCESS
THEM EFFICIENTLY.
EXAMPLE:
An image of 1024x1024 pixels is
resized to 224x224 pixels to
standardize the input.
Why is it Important?
Resizing helps the model handle
images more easily and ensures
that the images are all the same
size for processing.
NORMALIZING
PIXEL VALUES
NORMALIZING PIXEL
VALUES
PIXEL VALUES ARE SCALED TO A
RANGE BETWEEN 0 AND 1 BY
DIVIDING BY 255. THIS ENSURES
ALL VALUES ARE WITHIN A
SIMILAR SCALE.
EXAMPLE:
A pixel value of 246 (the Blue
Channel in RGB) is normalized
to 0.96 (246/255).
NOTE: NORMALIZATION SCALES THE
PIXEL VALUES OF AN IMAGE WITHOUT
CHANGING ITS SIZE OR DIMENSIONS, THIS
CHANGE IN PIXEL VALUES IS NOT
VISUALLY NOTICEABLE IN THE IMAGE.
Why is it Important?
Normalization prevents some pixel
values from dominating the learning
process, making the training more
stable and efficient.
DATA LOADING
WITH BATCHING
DATA LOADING WITH
BATCHING
IMAGES ARE GROUPED INTO
BATCHES, ALLOWING THE MODEL TO
PROCESS SEVERAL IMAGES AT
ONCE.
EXAMPLE:

A batch might contain 32

images that are processed
together before the model
updates.
Why is it Important?
Batching improves memory
efficiency, speeds up training, and
stabilizes the model’s learning by
averaging updates across the
batch.
ADDITIONAL STEPS:
1. Data Augmentation: Adds variety by rotating,
flipping, zooming, etc.
Importance: Prevents overfitting and improves model
generalization.
2. Image Denoising and Smoothing: Reduces noise
with filters like Gaussian blur.
Importance: Helps the model focus on key features,
reducing distractions.
3. Histogram Equalization: Enhances contrast by
adjusting pixel intensity distribution.
Importance: Makes features clearer, improving
feature detection.
ADDITIONAL STEPS:

4. Color Space Conversion: Changes color format

(RGB to grayscale).
Importance: Simplifies data when color isn’t essential,
reducing model complexity.

5. Image Cropping and Centering: Focuses on the

main subject by cropping or centering.
Importance: Directs model’s attention to important
parts of the image.
Conclusion
These preprocessing steps help
ensure that images are properly
formatted for training neural
networks, leading to better model
performance, faster training, and
more accurate predictions.
THAT’S ALL
THANK YOU!

Image Classification With Pytorch: Pre-Processing Images To Use in Machine Learning Models
No ratings yet
Image Classification With Pytorch: Pre-Processing Images To Use in Machine Learning Models
48 pages
Deep Learning Project For Computer Vision With Python 2022
No ratings yet
Deep Learning Project For Computer Vision With Python 2022
297 pages
Diffusion
100% (5)
Diffusion
62 pages
Graphical Representation of Statistical Data
100% (5)
Graphical Representation of Statistical Data
56 pages
Unit 3 - 1 - 1709014556934
No ratings yet
Unit 3 - 1 - 1709014556934
49 pages
Deep Learning Based Computer Vision
No ratings yet
Deep Learning Based Computer Vision
98 pages
Computer Vision With Keras
No ratings yet
Computer Vision With Keras
67 pages
2-Machine Learning & Deep Learning
No ratings yet
2-Machine Learning & Deep Learning
87 pages
Lecture01 &02
No ratings yet
Lecture01 &02
77 pages
Cours 4 - Loading and Preprocessing Data With TensorFlow
No ratings yet
Cours 4 - Loading and Preprocessing Data With TensorFlow
23 pages
Legislature Parliament Architecture of Democracy
No ratings yet
Legislature Parliament Architecture of Democracy
11 pages
03 Pytorch Computer Vision
No ratings yet
03 Pytorch Computer Vision
29 pages
Deep Learning Notes
No ratings yet
Deep Learning Notes
155 pages
Image Processing
No ratings yet
Image Processing
36 pages
Layers in CNN
No ratings yet
Layers in CNN
22 pages
Core ML Survival Guide
No ratings yet
Core ML Survival Guide
505 pages
Unit 1 Notes
No ratings yet
Unit 1 Notes
30 pages
Deep Neural Network
No ratings yet
Deep Neural Network
60 pages
DL Lab-III-II
No ratings yet
DL Lab-III-II
98 pages
Vazquez ImageProcessFundamentals
No ratings yet
Vazquez ImageProcessFundamentals
83 pages
Image Classification Using Backpropagation Algorithm (Presentation)
No ratings yet
Image Classification Using Backpropagation Algorithm (Presentation)
23 pages
Convolutinal Neural Networks
No ratings yet
Convolutinal Neural Networks
43 pages
Artificial Intelligence ME: Manufacturing 6324
No ratings yet
Artificial Intelligence ME: Manufacturing 6324
23 pages
TP3 Mi204 Santos Scardellato
No ratings yet
TP3 Mi204 Santos Scardellato
20 pages
UGRD IT6209 Introduction To Multimedia Final
100% (1)
UGRD IT6209 Introduction To Multimedia Final
28 pages
CV - T3 - Unit-7
No ratings yet
CV - T3 - Unit-7
36 pages
Plant Disease Identification
No ratings yet
Plant Disease Identification
17 pages
Project Guidelines - AIML
No ratings yet
Project Guidelines - AIML
30 pages
Introduction To Convolutional Neural Network (CNN) Using Tensorflow - by Govinda Dumane - Towards Data Science
No ratings yet
Introduction To Convolutional Neural Network (CNN) Using Tensorflow - by Govinda Dumane - Towards Data Science
17 pages
Audio - Visual Aids
No ratings yet
Audio - Visual Aids
49 pages
Create Simple Deep Learning Neural Network For Classification
No ratings yet
Create Simple Deep Learning Neural Network For Classification
11 pages
Machine Learning Basics
No ratings yet
Machine Learning Basics
25 pages
Week-2 - ML Slides
No ratings yet
Week-2 - ML Slides
26 pages
Image Processing - Techniques, Types, & Applications (2023)
No ratings yet
Image Processing - Techniques, Types, & Applications (2023)
32 pages
Automated Image Data Preprocessing With Deep Reinforcement Learning
No ratings yet
Automated Image Data Preprocessing With Deep Reinforcement Learning
9 pages
IIT M CV
No ratings yet
IIT M CV
20 pages
Computer Vision NN Architecture
No ratings yet
Computer Vision NN Architecture
19 pages
Pytorch Waste Classification Using Densenet Jupyter Notebook
No ratings yet
Pytorch Waste Classification Using Densenet Jupyter Notebook
14 pages
BigData Assessment2 26230605
No ratings yet
BigData Assessment2 26230605
14 pages
03 Convolution Neural Networks and Computer Vision With Tensorflow
No ratings yet
03 Convolution Neural Networks and Computer Vision With Tensorflow
21 pages
Computer Vision
No ratings yet
Computer Vision
21 pages
Image Processing Through Machine Learning: By:-Akansh Kumar (En-1)
No ratings yet
Image Processing Through Machine Learning: By:-Akansh Kumar (En-1)
22 pages
3 Good Reasons To Use Layers in Photoshop
No ratings yet
3 Good Reasons To Use Layers in Photoshop
23 pages
Unit 3 Deep Learning
No ratings yet
Unit 3 Deep Learning
15 pages
A. Image Pre-Processing:: Grayscale Conversion Fig.5.f
No ratings yet
A. Image Pre-Processing:: Grayscale Conversion Fig.5.f
6 pages
Lab05 ML
No ratings yet
Lab05 ML
7 pages
DL 3
No ratings yet
DL 3
10 pages
DL Lab-Final
No ratings yet
DL Lab-Final
22 pages
NB4-07 PT II Data Augmentation
No ratings yet
NB4-07 PT II Data Augmentation
6 pages
Here Are Common Image Preprocessing Techniques Used in Machine Learning and Deep Learning
No ratings yet
Here Are Common Image Preprocessing Techniques Used in Machine Learning and Deep Learning
7 pages
MiniProj-3-Colorizing Old B&W Images
No ratings yet
MiniProj-3-Colorizing Old B&W Images
4 pages
Image Processing File
No ratings yet
Image Processing File
7 pages
CV Pipeline Preprocessing Stage: Dr. Hussien Karam
No ratings yet
CV Pipeline Preprocessing Stage: Dr. Hussien Karam
10 pages
Srafvana
No ratings yet
Srafvana
6 pages
Explore The Implementation of CNNs in Python
No ratings yet
Explore The Implementation of CNNs in Python
10 pages
Assignment 3 DL
No ratings yet
Assignment 3 DL
6 pages
Unit 1 Digital Image Fundamentals (DIP)
No ratings yet
Unit 1 Digital Image Fundamentals (DIP)
13 pages
CV Assignment 2 Group02
No ratings yet
CV Assignment 2 Group02
12 pages
Intensity Transformation of Images
No ratings yet
Intensity Transformation of Images
9 pages
Structure of Convolutional Neural Networks - Deep Learning
No ratings yet
Structure of Convolutional Neural Networks - Deep Learning
12 pages
3.2 Preprocessing
No ratings yet
3.2 Preprocessing
10 pages
Data Preprocessing
No ratings yet
Data Preprocessing
2 pages
Canva Test Items
No ratings yet
Canva Test Items
1 page
Image Classification
No ratings yet
Image Classification
18 pages
"I C U N N ": Mage Lassification Sing Eural Etworks
No ratings yet
"I C U N N ": Mage Lassification Sing Eural Etworks
15 pages
Gender Fair Language
No ratings yet
Gender Fair Language
12 pages
Multimedia Lab Record
No ratings yet
Multimedia Lab Record
21 pages
Laboratory 3. Basic Image Segmentation Techniques
No ratings yet
Laboratory 3. Basic Image Segmentation Techniques
10 pages
How To Write A Business Proposal That Drives Action - Grammarly
No ratings yet
How To Write A Business Proposal That Drives Action - Grammarly
12 pages
Shashi 1
No ratings yet
Shashi 1
36 pages
Integrating QT and OpenGL
No ratings yet
Integrating QT and OpenGL
39 pages
Tutorial: Cleaning Up Imported Meshes: Chapter 14 Working With Meshes and Polys
No ratings yet
Tutorial: Cleaning Up Imported Meshes: Chapter 14 Working With Meshes and Polys
11 pages
Computer Graphics and Animation
No ratings yet
Computer Graphics and Animation
54 pages
SST 2 Bed Dubai Skyline View 2.42m
No ratings yet
SST 2 Bed Dubai Skyline View 2.42m
18 pages
Quick Start C4D R12 US
No ratings yet
Quick Start C4D R12 US
126 pages
CH2 - Drawing Algorithms
No ratings yet
CH2 - Drawing Algorithms
89 pages
Statistics Probability
No ratings yet
Statistics Probability
53 pages
Comprehensive Guide To 2D Game Development
No ratings yet
Comprehensive Guide To 2D Game Development
2 pages
Surface Tension
No ratings yet
Surface Tension
41 pages
Illustrating The Nature of Bivariate Data
No ratings yet
Illustrating The Nature of Bivariate Data
40 pages
Women's Ways of Knowing
No ratings yet
Women's Ways of Knowing
15 pages
3DS Max
No ratings yet
3DS Max
11 pages
MS Excel
No ratings yet
MS Excel
21 pages
Women A Sectoral Situationer
No ratings yet
Women A Sectoral Situationer
20 pages
Intermolecular Forces
No ratings yet
Intermolecular Forces
19 pages
Fungsi Menu Pada Aplikasi GIMP
No ratings yet
Fungsi Menu Pada Aplikasi GIMP
39 pages
Computer Graphice Unit 4
No ratings yet
Computer Graphice Unit 4
4 pages
Create A Refreshing Beer Themed Poster Design in Photoshop - Colorburned PDF
No ratings yet
Create A Refreshing Beer Themed Poster Design in Photoshop - Colorburned PDF
35 pages
Advertising Microteaching Example
No ratings yet
Advertising Microteaching Example
12 pages
Log
No ratings yet
Log
30 pages
Tone Mapping: A Powerful Tool For Narrowband Color Combine, by J-P Metsavainio
No ratings yet
Tone Mapping: A Powerful Tool For Narrowband Color Combine, by J-P Metsavainio
19 pages
BCDS061 - Image Analytics
0% (1)
BCDS061 - Image Analytics
2 pages
WINMAG Plus V06 User Guide PDF
No ratings yet
WINMAG Plus V06 User Guide PDF
38 pages
Describe The Different Stages of The Graphics...
No ratings yet
Describe The Different Stages of The Graphics...
2 pages
Reaseacrch Papers 1
No ratings yet
Reaseacrch Papers 1
8 pages
TGP - Kapil Tripathi 1
No ratings yet
TGP - Kapil Tripathi 1
1 page
DATA MINING AND MACHINE LEARNING. PREDICTIVE TECHNIQUES: REGRESSION, GENERALIZED LINEAR MODELS, SUPPORT VECTOR MACHINE AND NEURAL NETWORKS
From Everand
DATA MINING AND MACHINE LEARNING. PREDICTIVE TECHNIQUES: REGRESSION, GENERALIZED LINEAR MODELS, SUPPORT VECTOR MACHINE AND NEURAL NETWORKS
César Pérez López
No ratings yet
IT Specialist: Artificial Intelligence Exam Prep - 500 Questions for Certification Success (0225)
From Everand
IT Specialist: Artificial Intelligence Exam Prep - 500 Questions for Certification Success (0225)
Satou Takahiro
No ratings yet