0% found this document useful (0 votes)

132 views48 pages

Image Classification With Pytorch: Pre-Processing Images To Use in Machine Learning Models

This document discusses image classification using machine learning. It covers representing images as tensors, the need for image pre-processing, and common pre-processing techniques like normalizing inputs, uniform image size, data augmentation, and dimensionality reduction to improve CNN performance. The demo section proposes implementing pre-processing using PyTorch on a cloud VM.

Uploaded by

Rahul Shetty

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

132 views48 pages

Image Classification With Pytorch: Pre-Processing Images To Use in Machine Learning Models

Uploaded by

Rahul Shetty

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 48

Image Classification with PyTorch

PRE-PROCESSING IMAGES TO USE IN MACHINE

LEARNING MODELS

Janani Ravi
CO-FOUNDER, LOONYCORN
www.loonycorn.com
Image classification using machine
Overview learning
Representing images as tensors
Need for image pre-processing
Common image pre-processing
techniques
Prerequisites and Course Outline
Prerequisites

Basic Python programming

Build and training machine learning
models
Worked with PyTorch to build simple
neural networks
Prerequisite Courses

Foundations of PyTorch
Building your first PyTorch solution
Course Outline

Images as features and pre-processing

techniques
Drawbacks of Deep Neural Networks
(DNNs) for image classification
Introducing Convolutional Neural
Networks (CNNs)
Hyperparameter tuning
Pre-trained models
Image Recognition
Image Recognition

Images represented Identify edges, A photo of a

as pixels colors, shapes horse
Images as Tensors
Images as Tensors

Each pixel holds a value based on the type of image

RGB Images

RGB values are for

color images

R, G, B: 0-255
RGB Images

255, 0, 0
RGB Images

0, 255, 0
RGB Images

0, 0, 255

3 values to represent
color, 3 channels
RGB Images

0, 0, 255

These are often scaled to be in

the 0-1 range as neural networks
work better with smaller numbers
Grayscale Images
Grayscale Images

Each pixel represents

only intensity information

0.0 - 1.0
Grayscale Images

0.5
Grayscale Images

0.5

1 value to represent
intensity, 1 channel
Images as Tensors

Single channel and multi-channel images

Images as Tensors

Images can be represented by a 3-D matrix

Images as Tensors

The number of channels specifies the

number of elements in the 3rd dimension
Images as Tensors

(6, 6, 1) (6, 6, 3)
List of Images

Deep learning frameworks usually deal with a list of

images in one 4-D tensor
List of Images

The images should all be the same size

List of Images

(10, 6, 6, 3)

The number of channels

List of Images

(10, 6, 6, 3)

The height and width of each image in the list

List of Images

(10, 6, 6, 3)

The number of images

Need for Image Pre-processing
Image Pre-processing Methods

Uniform Aspect Mean and Perturbed

Uniform Image Size
Ratio Images

Normalized Image Dimensionality

Data Augmentation
Inputs Reduction

Common techniques to improve CNN performance

Image Pre-processing Methods

Uniform Aspect Mean and Perturbed

Uniform Image Size
Ratio Images

Normalized Image Dimensionality

Data Augmentation
Inputs Reduction

Common techniques to improve CNN performance

Most models assume square shape

Uniform Aspect Crop images to be square

Ratio Usually, center of image most important
Makes aspect ratio constant
Image Pre-processing Methods

Uniform Aspect Mean and Perturbed

Uniform Image Size
Ratio Images

Normalized Image Dimensionality

Data Augmentation
Inputs Reduction

Common techniques to improve CNN performance

Fit image size to CNN feature maps
250 x 250 image to 100 x 100 image
Uniform Image Size
Down-scaling factor of 0.4
Up-scaling and down-scaling
Image Pre-processing Methods

Uniform Aspect Mean and Perturbed

Uniform Image Size
Ratio Images

Normalized Image Dimensionality

Data Augmentation
Inputs Reduction

Common techniques to improve CNN performance

Mean image: average pixel across entire
training dataset
Mean and Perturbed
Images Insights often emerge
E.g. faces usually in center of image
Perturbed image: intentionally distort
Mean and Perturbed pixels by varying them from mean image
Images E.g. to prevent CNN from only focusing
on center
Image Pre-processing Methods

Uniform Aspect Mean and Perturbed

Uniform Image Size
Ratio Images

Normalized Image Dimensionality

Data Augmentation
Inputs Reduction

Common techniques to improve CNN performance

“Normalize” each pixel
Subtract mean
Normalized Image
Inputs Divide by standard deviation
Ensures each pixel has similar data
distribution
Converts pixels to N(0,1) distribution
Normalized Image
Then scale to be in [0,1] or [0,255]
Inputs
Helps neural networks converge faster
Image Pre-processing Methods

Uniform Aspect Mean and Perturbed

Uniform Image Size
Ratio Images

Normalized Image Dimensionality

Data Augmentation
Inputs Reduction

Common techniques to improve CNN performance

RGB data has 3 channels
Can reduce to grayscale (just 1 channel)
Dimensionality Reduces dimensionality of all image
Reduction tensors
Reduce the size of the problem so
training completes faster
Image Pre-processing Methods

Uniform Aspect Mean and Perturbed

Uniform Image Size
Ratio Images

Normalized Image Dimensionality

Data Augmentation
Inputs Reduction

Common techniques to improve CNN performance

Perturbed images are a form of data
augmentation

Data Augmentation Scaling, rotation, affine transforms

Makes CNN training more robust
Reduces risk of overfitting
Demo
Set up a deep learning VM on a cloud
platform
Demo
Explore common image pre-processing
techniques
Demo
Implement image pre-processing using
PyTorch
Image classification using machine
Summary learning
Representing images as tensors
Need for image pre-processing
Common image pre-processing
techniques

Artificial Intelligence and Machine Learning in Medical Imaging
No ratings yet
Artificial Intelligence and Machine Learning in Medical Imaging
56 pages
AirSense 10 Service Manual
100% (1)
AirSense 10 Service Manual
91 pages
Programming in C
No ratings yet
Programming in C
689 pages
Computer Vision - Unit 1 Notes
No ratings yet
Computer Vision - Unit 1 Notes
13 pages
Unit 3
No ratings yet
Unit 3
105 pages
Unit 3
No ratings yet
Unit 3
80 pages
Deep Learning Based Computer Vision
No ratings yet
Deep Learning Based Computer Vision
98 pages
Unit 1
No ratings yet
Unit 1
200 pages
Deep Learning Notes
No ratings yet
Deep Learning Notes
155 pages
Lecture 4 - Deep Learning Introduction
No ratings yet
Lecture 4 - Deep Learning Introduction
63 pages
Introduction To Convolutional Neural Network (CNN) Using Tensorflow - by Govinda Dumane - Towards Data Science
No ratings yet
Introduction To Convolutional Neural Network (CNN) Using Tensorflow - by Govinda Dumane - Towards Data Science
17 pages
Hilux Ac
80% (5)
Hilux Ac
5 pages
Lec 8
No ratings yet
Lec 8
60 pages
2023 Bocconi 20600 Lez 1 Intro and Digital Images
No ratings yet
2023 Bocconi 20600 Lez 1 Intro and Digital Images
86 pages
CNN (Neural Network)
No ratings yet
CNN (Neural Network)
32 pages
Unit 3 - 1 - 1709014556934
No ratings yet
Unit 3 - 1 - 1709014556934
49 pages
Image Data Preprocessing
No ratings yet
Image Data Preprocessing
34 pages
Military AI-Week 05-AI in Computer Vision
No ratings yet
Military AI-Week 05-AI in Computer Vision
65 pages
Module 5
No ratings yet
Module 5
72 pages
Image Processing
No ratings yet
Image Processing
36 pages
Deep Learning Models For Digital Image Processing: A Review: R. Archana P. S. Eliahim Jeevaraj
No ratings yet
Deep Learning Models For Digital Image Processing: A Review: R. Archana P. S. Eliahim Jeevaraj
33 pages
Final Presentation
No ratings yet
Final Presentation
30 pages
Lec5 CNN RNN Attention
No ratings yet
Lec5 CNN RNN Attention
71 pages
3.1 - Image Fundamentals
No ratings yet
3.1 - Image Fundamentals
32 pages
Convolutional Neural Networks - Part 1
No ratings yet
Convolutional Neural Networks - Part 1
44 pages
Summary
No ratings yet
Summary
36 pages
CVlecture 5
No ratings yet
CVlecture 5
56 pages
CISM - Delegate Pack
No ratings yet
CISM - Delegate Pack
430 pages
Convolutional Networks 2024
No ratings yet
Convolutional Networks 2024
44 pages
CV - T3 - Unit-7
No ratings yet
CV - T3 - Unit-7
36 pages
03 Pytorch Computer Vision
No ratings yet
03 Pytorch Computer Vision
29 pages
Images, Neural Networks, CNNs
No ratings yet
Images, Neural Networks, CNNs
26 pages
Convolutinal Neural Networks
No ratings yet
Convolutinal Neural Networks
43 pages
Understanding The Drawbacks of Using Deep Neural Networks With Images
No ratings yet
Understanding The Drawbacks of Using Deep Neural Networks With Images
35 pages
01 - Mnist - Ipynb (4) - JupyterLab
No ratings yet
01 - Mnist - Ipynb (4) - JupyterLab
23 pages
CNN 1
No ratings yet
CNN 1
19 pages
Lab 5 - Intro To Convolutional Neural Networks
No ratings yet
Lab 5 - Intro To Convolutional Neural Networks
52 pages
JNTUK R20 UNIT-IV DEEP LEARNING TECHNIQUES-www - Jntumaterials.co - in
No ratings yet
JNTUK R20 UNIT-IV DEEP LEARNING TECHNIQUES-www - Jntumaterials.co - in
26 pages
Machine Learning Re Defining Semiconductor Industry 1598272842
No ratings yet
Machine Learning Re Defining Semiconductor Industry 1598272842
33 pages
Image Recognition Using ML (CNN) For Beginners - by Akhil Haridasan - The Startup - Medium
No ratings yet
Image Recognition Using ML (CNN) For Beginners - by Akhil Haridasan - The Startup - Medium
21 pages
Machine Learning Basics
No ratings yet
Machine Learning Basics
25 pages
Unit 3 Deep Learning
No ratings yet
Unit 3 Deep Learning
15 pages
Week-2 - ML Slides
No ratings yet
Week-2 - ML Slides
26 pages
An Introduction To Convolutional Neural Networks: November 2015
No ratings yet
An Introduction To Convolutional Neural Networks: November 2015
12 pages
A Comprehensive Review On DC Fast Charging Stations For Electric Vehicles Standards Power Conversion Technologies Architectures Energy Management and Cybersecurity
No ratings yet
A Comprehensive Review On DC Fast Charging Stations For Electric Vehicles Standards Power Conversion Technologies Architectures Energy Management and Cybersecurity
39 pages
03 Convolution Neural Networks and Computer Vision With Tensorflow
No ratings yet
03 Convolution Neural Networks and Computer Vision With Tensorflow
21 pages
Explore The Implementation of CNNs in Python
No ratings yet
Explore The Implementation of CNNs in Python
10 pages
Group B Deep Learning Assignment No: 3B: Categories
No ratings yet
Group B Deep Learning Assignment No: 3B: Categories
13 pages
Image Processing File
No ratings yet
Image Processing File
7 pages
Convolutional Neural Networks
No ratings yet
Convolutional Neural Networks
8 pages
Week 09
No ratings yet
Week 09
6 pages
A. Image Pre-Processing:: Grayscale Conversion Fig.5.f
No ratings yet
A. Image Pre-Processing:: Grayscale Conversion Fig.5.f
6 pages
DeekshikaJadyada21 AP24LDS11
No ratings yet
DeekshikaJadyada21 AP24LDS11
5 pages
An Introduction To Convolutional Neural Networks: November 2015
No ratings yet
An Introduction To Convolutional Neural Networks: November 2015
12 pages
Convolutional Neural Network (CNN) : Assignment On
No ratings yet
Convolutional Neural Network (CNN) : Assignment On
8 pages
CV Pipeline Preprocessing Stage: Dr. Hussien Karam
No ratings yet
CV Pipeline Preprocessing Stage: Dr. Hussien Karam
10 pages
Structure of Convolutional Neural Networks - Deep Learning
No ratings yet
Structure of Convolutional Neural Networks - Deep Learning
12 pages
3.2 Preprocessing
No ratings yet
3.2 Preprocessing
10 pages
Data Preprocessing
No ratings yet
Data Preprocessing
2 pages
Visual Image Understanding
No ratings yet
Visual Image Understanding
7 pages
Sagar Paper
No ratings yet
Sagar Paper
4 pages
Identify Web Cam Images Using Neural Networks
No ratings yet
Identify Web Cam Images Using Neural Networks
17 pages
Classify Webcam Images Using Deep Learning
No ratings yet
Classify Webcam Images Using Deep Learning
17 pages
"I C U N N ": Mage Lassification Sing Eural Etworks
No ratings yet
"I C U N N ": Mage Lassification Sing Eural Etworks
15 pages
q4 The Recorder (Flute)
No ratings yet
q4 The Recorder (Flute)
20 pages
EE-330 DSP Lab Manual
No ratings yet
EE-330 DSP Lab Manual
49 pages
Anna University Chennai 600 025: Ocs-352 - Iot Concepts and Applications
No ratings yet
Anna University Chennai 600 025: Ocs-352 - Iot Concepts and Applications
76 pages
P1sec - 5G Stand Alone Security
No ratings yet
P1sec - 5G Stand Alone Security
17 pages
Nokia Scaling User Plane Functions For Home Broadband Over Fixed Wireless Access
No ratings yet
Nokia Scaling User Plane Functions For Home Broadband Over Fixed Wireless Access
6 pages
Introducing Convolutional Neural Networks Slides
No ratings yet
Introducing Convolutional Neural Networks Slides
94 pages
Dse Placement Report
No ratings yet
Dse Placement Report
77 pages
Block DOH With Pfsense
No ratings yet
Block DOH With Pfsense
24 pages
Int J Communication - 2020 - Swain - A Cost Effective LoRa Based Customized Device For Agriculture Field Monitoring and
No ratings yet
Int J Communication - 2020 - Swain - A Cost Effective LoRa Based Customized Device For Agriculture Field Monitoring and
21 pages
LM - Unit-1 2
No ratings yet
LM - Unit-1 2
12 pages
Practical-2 Sem 2
No ratings yet
Practical-2 Sem 2
5 pages
Building Convolutional Neural Networks For Image Classification Slides
No ratings yet
Building Convolutional Neural Networks For Image Classification Slides
57 pages
Design Anaylsis of A Solar Power System For The Faculty of Engineering Rivers State University1
No ratings yet
Design Anaylsis of A Solar Power System For The Faculty of Engineering Rivers State University1
12 pages
Transfer Capacity Definitions
No ratings yet
Transfer Capacity Definitions
13 pages
Rohit Unit 2 ML Notes
No ratings yet
Rohit Unit 2 ML Notes
7 pages
Latest Batch Varification
No ratings yet
Latest Batch Varification
18 pages
Power Electronics Obj Q & A
No ratings yet
Power Electronics Obj Q & A
15 pages
9.0 H9 DC Injection Braking 6551
No ratings yet
9.0 H9 DC Injection Braking 6551
3 pages
Object-Oriented Software Engineering: Lecture # 01 Mehak Fatima
No ratings yet
Object-Oriented Software Engineering: Lecture # 01 Mehak Fatima
20 pages
AIR Adoption Guide LATEST
No ratings yet
AIR Adoption Guide LATEST
7 pages
A3977 Datasheet
No ratings yet
A3977 Datasheet
17 pages
Article Template For Journals
No ratings yet
Article Template For Journals
3 pages
Datasheet Sono4u Silver+
No ratings yet
Datasheet Sono4u Silver+
2 pages
GRE Waived Universities
No ratings yet
GRE Waived Universities
5 pages
Parabolic Transformations Lesson
No ratings yet
Parabolic Transformations Lesson
2 pages
Deep Online Sequential Extreme Learning Machines and Its Application in Pneumonia Detection
No ratings yet
Deep Online Sequential Extreme Learning Machines and Its Application in Pneumonia Detection
6 pages
GCP 16 Apr Ref
No ratings yet
GCP 16 Apr Ref
5 pages
PAUT Equipments For Reactors Connections Ammonia 2 at of Sorfert Plant
No ratings yet
PAUT Equipments For Reactors Connections Ammonia 2 at of Sorfert Plant
1 page
Product Designer - Resume
No ratings yet
Product Designer - Resume
1 page
Detection of Pneumonia Clouds in Chest X-Ray Using Image Processing Approach
No ratings yet
Detection of Pneumonia Clouds in Chest X-Ray Using Image Processing Approach
4 pages
Automatic Detection of Pneumonia On Compressed Sensing Images Using Deep Learning
No ratings yet
Automatic Detection of Pneumonia On Compressed Sensing Images Using Deep Learning
4 pages
Sony Jason Allccarima Muñico: Mechanical Engineering National University of Engineering
No ratings yet
Sony Jason Allccarima Muñico: Mechanical Engineering National University of Engineering
3 pages

Image Classification With Pytorch: Pre-Processing Images To Use in Machine Learning Models

Uploaded by

Image Classification With Pytorch: Pre-Processing Images To Use in Machine Learning Models

Uploaded by

Image Classification with PyTorch

PRE-PROCESSING IMAGES TO USE IN MACHINE

Basic Python programming

Images as features and pre-processing

Images represented Identify edges, A photo of a

Each pixel holds a value based on the type of image

RGB values are for

These are often scaled to be in

Each pixel represents

Single channel and multi-channel images

Images can be represented by a 3-D matrix

The number of channels specifies the

Deep learning frameworks usually deal with a list of

The images should all be the same size

The number of channels

The height and width of each image in the list

The number of images

Uniform Aspect Mean and Perturbed

Normalized Image Dimensionality

Common techniques to improve CNN performance

Uniform Aspect Mean and Perturbed

Normalized Image Dimensionality

Common techniques to improve CNN performance

Uniform Aspect Crop images to be square

Uniform Aspect Mean and Perturbed

Normalized Image Dimensionality

Common techniques to improve CNN performance

Uniform Aspect Mean and Perturbed

Normalized Image Dimensionality

Common techniques to improve CNN performance

Uniform Aspect Mean and Perturbed

Normalized Image Dimensionality

Common techniques to improve CNN performance

Uniform Aspect Mean and Perturbed

Normalized Image Dimensionality

Common techniques to improve CNN performance

Uniform Aspect Mean and Perturbed

Normalized Image Dimensionality

Common techniques to improve CNN performance

Data Augmentation Scaling, rotation, affine transforms

You might also like