Image Feature Extraction
Introduction
If we provide the right data and features, machine learning models can perform adequately and can even be used as a
benchmark solution.
So in this lecture, we will understand the different ways in which we can generate features from images. You can then use these methods in your favorite machine learning algorithms!
1. Method #1 for Feature Extraction from Image Data: Grayscale Pixel Values as Features
2. Method #2 for Feature Extraction from Image Data: Mean Pixel Value of Channels
3. Method #3 for Feature Extraction from Image Data: Extracting Edges
I’ll kick things off with a simple example. Look at the image below:
We have an image of the number 8. Look really closely at the image – you’ll notice that it is made up of small square boxes.
These are called pixels.
There is a caveat, however. We see the images as they are – in their visual form. We can easily differentiate the edges and colors
to identify what is in the picture. Machines, on the other hand, struggle to do this. They store images in the form of numbers.
Have a look at the image below:
Machines store images in the form of a matrix of numbers. The size of this matrix depends on the number of pixels we have in
any given image.
Let’s say the dimensions of an image are 180 x 200 or n x m. These dimensions are basically the number of pixels in the image
(height x width).
These numbers, or the pixel values, denote the intensity or brightness of the pixel. Smaller numbers (closer to zero) represent
black, and larger numbers (closer to 255) denote white. You can see everything we have covered so far by analyzing the image below.
The dimensions of the below image are 22 x 16, which you can verify by counting the number of pixels:
The example we just discussed is that of a black and white image. What about colored images (which are far more prevalent in the real world)? Do you think colored images are also stored in the form of a 2D matrix?
A colored image is typically composed of multiple colors and almost all colors can be generated from three primary colors – red,
green and blue.
Hence, in the case of a colored image, there are three matrices (or channels) – Red, Green, and Blue. Each matrix has values between 0 and 255 representing the intensity of the color for that pixel. Consider the image below to understand this concept:
We have a colored image on the left (as we humans would see it). On the right, we have three matrices for the three color channels – Red, Green, and Blue. The three channels are superimposed to form a colored image.
Note that these are not the original pixel values for the given image as the original matrix would be very large and difficult to
visualize. Also, there are various other formats in which the images are stored. RGB is the most popular one and hence I have
addressed it here.
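Before moving on, it is worth loading the image of the digit 8 and looking at its pixel matrix in Python. Below is a minimal sketch using scikit-image; the filename 'number_8.png' is only a placeholder for wherever that image is stored:

# Minimal sketch: read the digit-8 image as grayscale and inspect its pixel matrix.
# 'number_8.png' is a placeholder filename.
from skimage.io import imread

image = imread('number_8.png', as_gray=True)
print(image.shape)   # height x width of the image
print(image)         # the matrix of pixel intensities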
(28, 28)

The matrix has 784 values in total, and the snippet shown here is only a very small part of the complete matrix.
Let’s now dive into the core idea behind this lecture and explore various methods of using pixel values as features.
Method #1: Grayscale Pixel Values as Features
The simplest way to create features from an image is to use these raw pixel values as separate features.
Consider the same example for our image above (the number ‘8’) – the dimension of the image is 28 x 28.
Can you guess the number of features for this image? The number of features will be the same as the number of pixels! Hence,
that number will be 784.
Now here’s another curious question – how do we arrange these 784 pixels as features? Well, we can simply append every pixel
value one after the other to generate a feature vector. This is illustrated in the image below:
Let us take an image in Python and create these features for that image:
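Here is a minimal sketch, assuming the photograph is stored locally as 'image.jpeg' (a placeholder filename); we read it in grayscale with scikit-image and check its shape:

# Read the image in grayscale; scikit-image scales the pixel values to [0, 1].
from skimage.io import imread

image = imread('image.jpeg', as_gray=True)
print(image.shape)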
(660, 450)

The image shape here is 660 x 450. Hence, the number of features should be 660 x 450 = 297,000. We can generate this feature vector using the reshape function from NumPy, where we specify the flattened dimension:
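A sketch of that step, continuing with the grayscale image loaded above:

import numpy as np

# Flatten the 660 x 450 matrix into a single vector of 297,000 features.
features = np.reshape(image, (660 * 450))
print(features.shape)
print(features)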
(297000,)
array([0.96470588, 0.96470588, 0.96470588, ..., 0.96862745, 0.96470588,
0.96470588])
Method #2: Mean Pixel Value of Channels
While reading the image in the previous section, we had set the parameter ‘as_gray = True’. So we only had one channel in the
image and we could easily append the pixel values. Let us remove the parameter and load the image again:
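A sketch, reusing the same placeholder filename as before:

# Read the image again without as_gray, so all three color channels are kept.
image = imread('image.jpeg')
print(image.shape)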
(660, 450, 3)
This time, the image has the shape (660, 450, 3), where 3 is the number of channels. We can go ahead and create the features as we did previously. The number of features, in this case, will be 660 x 450 x 3 = 891,000.
Instead of using the pixel values from the three channels separately, we can generate a new matrix that has the mean value of
pixels from all three channels.
The image below will give you even more clarity around this idea:
By doing so, the number of features remains the same and we also take into account the pixel values from all three channels of
the image. We will create a new matrix with the same size 660 x 450, where all values are initialized to 0. This matrix will
store the mean pixel values for the three channels:
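A sketch of this initialization:

import numpy as np

# Matrix of zeros with the same height and width as the image;
# it will hold the mean pixel value across the three channels.
feature_matrix = np.zeros((660, 450))
print(feature_matrix.shape)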
(660, 450)
We have a 3D matrix of dimension (660 x 450 x 3) where 660 is the height, 450 is the width and 3 is the number of channels. To
get the average pixel values, we will use a for loop:
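A straightforward (if not the fastest) way to do this is a nested loop over every pixel – a sketch:

# For every pixel location, average the red, green and blue values.
for i in range(image.shape[0]):
    for j in range(image.shape[1]):
        feature_matrix[i][j] = (int(image[i, j, 0]) +
                                int(image[i, j, 1]) +
                                int(image[i, j, 2])) / 3

The same result can be obtained in a single vectorized call with image.mean(axis=2).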
The new matrix will have the same height and width but only 1 channel. Now we can follow the same steps that we did in the
previous section. We append the pixel values one after the other to get a 1D array:
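A sketch of the final flattening step:

# Flatten the averaged 660 x 450 matrix into a 1D feature vector.
features = np.reshape(feature_matrix, (660 * 450))
print(features.shape)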
(297000,)
Method #3: Extracting Edges

Think of three images – a dog, a car and a cat. You would have recognized the objects in an instant. What are the features that you considered while differentiating each of these images? The shape could be one important factor, followed by color, or size. What if the machine could also identify the shape as we do?
A similar idea is to extract edges as features and use them as the input for the model. I want you to think about this for a moment – how can we identify edges in an image? An edge is basically where there is a sharp change in color. Look at the image below:
I have highlighted two edges here. We could identify the edge because there was a change in color from white to brown (in the
right image) and brown to black (in the left). And as we know, an image is represented in the form of numbers. So, we will look
for pixels around which there is a drastic change in the pixel values.
To identify if a pixel is an edge or not, we will simply subtract the values on either side of the pixel. For this example, we have the
highlighted value of 85. We will find the difference between the values 89 and 78. Since this difference is not very large, we can
say that there is no edge around this pixel.
Now consider a pixel where the values on either side differ sharply. Since the difference is large, we can conclude that there is a significant transition at this pixel and hence it is an edge. Now the question is, do we have to do this step manually?
No! There are various kernels that can be used to highlight the edges in an image. The method we just discussed can also be
achieved using the Prewitt kernel (in the x-direction). Given below is the Prewitt kernel:
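In the x-direction, the Prewitt kernel is the following 3 x 3 matrix (written here as a NumPy array):

import numpy as np

# Prewitt kernel in the x-direction: -1 in the left column, +1 in the right.
prewitt_x = np.array([[-1, 0, 1],
                      [-1, 0, 1],
                      [-1, 0, 1]])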
We take the values surrounding the selected pixel and multiply them element-wise with the kernel (here, the Prewitt kernel), then add the resulting values to get a final value. Since the kernel already has -1 in one column and 1 in the other, adding the values is equivalent to taking the difference.
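To make the multiply-and-add step concrete, here is a small worked sketch. The 3 x 3 neighborhood values are illustrative (chosen around the highlighted pixel from the earlier example), not taken from the actual image:

import numpy as np

# Illustrative 3 x 3 neighborhood around the highlighted pixel (value 85).
neighborhood = np.array([[89, 85, 78],
                         [91, 85, 80],
                         [90, 86, 79]])

prewitt_x = np.array([[-1, 0, 1],
                      [-1, 0, 1],
                      [-1, 0, 1]])

# Element-wise multiplication followed by a sum is equivalent to
# subtracting the left column from the right column.
response = (neighborhood * prewitt_x).sum()
print(response)   # -33: small magnitude, so no strong vertical edge here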
There are various other kernels that are popularly used for edge detection as well.
Let’s now go back to generate edge features for the same image:
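One way to do this in Python is with scikit-image's built-in Prewitt filters – a sketch, again using the placeholder filename from earlier:

# Compute horizontal and vertical Prewitt edge maps for the grayscale image.
from skimage.io import imread
from skimage.filters import prewitt_h, prewitt_v

image = imread('image.jpeg', as_gray=True)
edges_prewitt_horizontal = prewitt_h(image)
edges_prewitt_vertical = prewitt_v(image)

# The edge maps have the same shape as the image and can be flattened
# into a feature vector exactly as in the previous sections.
edge_features = edges_prewitt_vertical.reshape(-1)
print(edge_features.shape)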
End Notes
This was a friendly introduction to getting your hands dirty with image data. I feel this is a very important part of a data
scientist’s toolkit given the rapid rise in the number of images being generated these days.
Feature Engineering for Images: A Valuable Introduction to the HOG Feature Descriptor