0% found this document useful (0 votes)

934 views18 pages

Image Classification

An image is considered unstructured data even though it is stored in structured formats like JPEG or PNG. It does not contain relevant semantic information that is meaningful to humans or computer systems. Images can be analyzed to extract structured information and features. The CIFAR-10 dataset contains 60,000 images across 10 classes that are used for machine learning research. A subset of this dataset is used due to memory constraints, with the data split into training and test sets using stratified shuffling. Image preprocessing techniques like normalization, whitening, and dimensionality reduction are applied to prepare the data for classification models.

Uploaded by

Darshna Gupta

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

934 views18 pages

Image Classification

Uploaded by

Darshna Gupta

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 18

Image - Structured or Unstructured?

An image is considered as an unstructured data.

Even though every digital image is stored in structured formats such as jpg, png, gif,
etc., it doesn't contain relevant information, which is of interest to human or computer
system. It can be converted into a structured form through image analysis.

 CIFAR-10 is a widely used dataset for Machine Learning research, which is

created by A. Krizhevsky et al.

 It consists of 60,000 - 32x32 color images in 10 classes (airplane, automobile,

bird, cat, deer, dog, frog, horse, ship, and truck) with 50,000 training images and
10,000 testing images.

 Each class has 6,000 images. The classes in a CIFAR-10 dataset are mutually
exclusive.

At a glance:

 Number of classes: 10
 Size of image: 32 x 32 x 3

Note: In this course, we use only a subsetof the above dataset due to memory
constraints in online cloud platform. We will be explaining the generation of
subset in the upcoming cards

Subset Generation
As explained in dataset description, we use only a subset of CIFAR-10 dataset.

1. The dataset with 50,000 samples is split in the ratio 92:8. This split is done to
take a smaller portion of 50000 samples (i.e the 8% contains only 4000 images).

2. These 4000 samples are used for generating the train and test sets for
classification.

Here, StratifiedShuffleSplit is used to split the dataset. It splits the data by taking
equal number of samples from each class in a random manner.
#Splitting the whole training set into 92:8

seed=7

from sklearn.cross_validation import StratifiedShuffleSplit

data_split = StratifiedShuffleSplit(labels_all,1, test_size=0.08,random_state=seed) #

creating data_split object with 8% test size

for train_index, test_index in data_split:

split_data_92, split_data_8 = data_all[train_index], data_all[test_index]

split_label_92, split_label_8 = labels_all[train_index], labels_all[test_index]

4000 samples are split in the ratio 7:3. (i.e., 2800 for training and 1200 for testing)
using StratifiedShuffleSplit.

#Splitting the training set into 70 and 30

train_test_split = StratifiedShuffleSplit(split_label_8,1, test_size=0.3,random_state

=seed) #test_size=0.3 denotes that 30 % of the dataset is used for testing.

for train_index, test_index in train_test_split:

train_data_70, test_data_30 = split_data_8[train_index], split_data_8[test_index]

train_label_70, test_label_30 = split_label_8[train_index], split_label_8[test_in

dex]
train_data = train_data_70 #assigning to variable train_data

train_labels = train_label_70 #assigning to variable train_labels

test_data = test_data_30

test_labels = test_label_30

You can see the size of the above variables using:

print 'train_data : ', train_data.shape

print 'train_labels : ', train_labels.shape

print 'test_data : ', test_data.shape

print 'test_labels : ', test_labels.shape

Need for Preprocessing

 Using the Data preprocessing step, the raw data is converted into a form
suitable for subsequent analysis. All the steps before data training (model
creation) can be considered as a pre-processing step.

 The quality of an image is greatly influenced by its clarity and the device used to
capture it.

 The captured image may contain noise and irregularities, which can be removed
via preprocessing steps.

Need for Preprocessing

Some of the common preprocessing techniquesinclude:

 Normalization

 Dimensionality reduction (eg. PCA, SVD)

 Feature Extraction (e.g. SIFT, HOG)

 Whitening

 Denoising

 Contrast Stretching

 Background subtraction

 Image Enhancement

 Smoothing

In the following cards, we will describe some of the preprocessing techniques that can
be applied to images.

Normalization
Normalization is the process of converting the pixel intensity values to a normal state.

 It follows a normal distribution.

 A normalized image has mean = 0 and variance = 1

# definition of normalization function

def normalize(data, eps=1e-8):

data -= data.mean(axis=(1, 2, 3), keepdims=True)

std = np.sqrt(data.var(axis=(1, 2, 3), ddof=1, keepdims=True)) # calculating stan

dard deviation

std[std < eps] = 1.

data /= std

return data

# calling the function

train_data = normalize(train_data)

test_data = normalize(test_data)

# prints the shape of train data and test data

print 'train_data: ', train_data.shape

print 'test_data: ', test_data.shape

ZCA Whitening
Normalization is followed by a ZCA whitening process.

The main aim of whitening is to reduce data redundancy, which means the features are
less correlated and have the same variance.

ZCA stands for zero-phase component analysis. ZCA whitened images resemble
the normal image.

Principle Component Analysis (PCA)

 The major function of PCA is to decompose a multivariate dataset into a set of
successive orthogonal components. These orthogonal components explain a
maximum amount of the variance.

 PCA is a dimensionality reduction technique.

The whitened data is given as the input to PCA.

from sklearn.decomposition import PCA

# n_components specify the no.of components to keep

train_data_pca = PCA(n_components=train_data_flat.shape[1]).fit_transform(train_data_
flat)

test_data_pca = PCA(n_components=test_data_flat.shape[1]).fit_transform(test_data_fla
t)

train_data_pca = train_data_pca.T

test_data_pca = test_data_pca.T

To explore more on PCA, refer this link.

Singular Value Decomposition (SVD)

 SVD is a dimensionality reduction techniquethat has been used in several fields
such as image compression, face recognition, and noise filtering.

 In this method, a digital image (generally considered as a matrix) is decomposed

into three other matrices.

 The singular values (less in number) obtained from this refactoring process can
preserve useful features of the original image without utilizing high storage space
in the memory.

Singular Value Decomposition (SVD)

The below code for SVD may not work in the available online cloud playground due to
package issues. So, it is better to try this out in a local Python environment.

from skimage import color

# definition for SVD

def svdFeatures(input_data):

svdArray_input_data=[]

size = input_data.shape[0]

for i in range (0,size):

img=color.rgb2gray(input_data[i])

U, s, V = np.linalg.svd(img, full_matrices=False);

S=[s[i] for i in range(30)]

svdArray_input_data.append(S)

svdMatrix_input_data=np.matrix(svdArray_input_data)

return svdMatrix_input_data

# apply SVD for train and test data

train_data_svd=svdFeatures(train_data)

test_data_svd=svdFeatures(test_data)

Scale-Invariant Feature Transform for Feature

Generation (SIFT)
SIFT is mainly used for images that are less simple and less organized.

Even the photographs of the same material will undergo scale change corresponding to
the distance from the material, focal length etc. This is one of the reasons for not
considering the raw pixel values as useful features for images.

The main aim of using SIFT for feature extraction is to obtain features that are not
sensitive to changes in scale, rotation, image resolution, illumination, etc.

The major steps involved in SIFT algorithm are:

 Scale-space Extrema Detection

 Keypoint Localization

 Orientation Assignment

 Keypoint Descriptor

How does Classifier Work?

The following are the steps involved in building a classification model:

1. Initialize the classifier to be used.

2. Train the classifier - All classifiers in scikit-learn utilizes a fit(X, y) method to fit
the model (training) for the given train data X and train label y.

3. Predict the target - Given an unlabeled observation X, the predict(X) returns the
predicted label y.

4. Evaluate the classifier model - The score(X,y) returns the score for the given test
data X and test label y.
Classification Algorithms
There are various algorithms to solve the classification problems.

Few of them are as follows:

 Support Vector Machine Classifier (SVM)

 Naive Bayes Classifier

 Stochastic Gradient Descent Classifier

Note: The explanation for the algorithms are given in the Machine Learning
Axioms course. Refer this for further details.

In this course, let's see SVM in detail.

Support Vector Machine (SVM)

Support Vector Machine (SVM) is effective in:

 High-dimensional spaces.

 In cases, where, the number of dimensions > the number of samples.

 In cases with a clear margin of separation.

Given below is the code snippet for training in SVM:

from sklearn import svm #Creating a svm classifier model

clf = svm.SVC(gamma=.001,probability=True) #Model training

clf.fit(train_data_flat_t, train_labels) #After being fitted, the model can then be u

sed to predict the output.

Here, train_data_flat_t can be replaced with train_data_pca or train_data_svd for

PCA and SVD respectively.
Support Vector Machine (SVM) (Contd..)
For Prediction :

predicted=clf.predict(test_data_flat_t)

score= clf.score(test_data_flat_t,test_labels) #classification score.

print("score",score)

Similarly, test_data_flat_t can be replaced with test_data_pca or test_data_svd.

Above mentioned conventional classification algorithms could not give significant

accuracy. But, a better performance can be achieved by using deep learning techniques
like Convolutional Neural Networks (CNN).

Convolutional Neural Networks (CNN)

Deep learning has become more important for learning complex algorithms. It is a more
refined form of machine learning, which is based on neural networks that emulate the
brain.

Neural network consists of:

 input layer

 hidden layers

 output layer

Each layer is composed of nodes, where the computation happens.

Neural Network consists of interconnected neurons that passes

messages between each other.

 CNN is a special case of neural networks that consists of multiple convolutional

layers, pooling layers and finally, fully connected layers.
 The improved network structure helps in saving memory and computational
complexity. They are mainly used in pattern and image recognition problems.

Cross Validation
 Cross validation is considered as a model validation technique to evaluate the
performance of a model on unseen data.

 It is a better estimate to evaluate testing accuracy than training accuracy on

unseen data.

Points to remember:

 Cross validation gives high variance if the testing set and training set are not
drawn from the same population.

 Allowing training data to be included in testing data will not give actual
performance results.

In cross validation, the number of samples used for training the model is reduced, and
the results depend upon the choice of the pair of training and testing sets.

You can refer to the various cross validation approaches from here.

Partitioning the Data

It is a methodological mistake to test and train on the same dataset because the
classifier would fail to predict correctly for any unseen data. This could result
in overfitting.

To avoid this problem,

 The data is split into train set, validation set, and test set.

o Training Set: The data used to train the classifier.

o Validation Set: The data used to tune the classifier model parameters i.e.,
to understand how well the model has been trained (as part of training
data).

o Testing Set: The data used to evaluate the performance of the classifier
(unseen data by the classifier).
 This will help us to know the efficiency of our model.

 Since the online platform used in this course doesn't support huge dataset, only a
few samples are taken for training and testing.

Confusion Matrix

The above image is a confusion matrix for a two class classifier.

In the table,
 TP (True Positive) - The number of correct predictions that the occurrence is
positive

 FP (False Positive) - The number of incorrect predictions that the occurrence is

positive

 FN (False Negative) - The number of incorrect predictions that the occurrence is

negative

 TN (True Negative)- The number of correct predictions that the occurrence is

negative

 TOTAL - The total number of occurrence

Confusion Matrix is a technique used to evaluate the performance of a classifier.

 It visually depicts the performance in a tabular form that has two dimensions
namely, actual and predicted sets of data.

 The rows and columns of the table show the count of false positives, false
negatives, true positives, and true negatives.

The first parameter shows true values and the second parameter shows predicted
values.

from sklearn import metrics

conf_matrix=metrics.confusion_matrix(test_labels,predicted)

print("Confusion matrix:",conf_matrix)

In the above code, test_labels are the actual labels and predicted are the predicted
labels.

 Here, the diagonal elements of the confusion matrix shows the number of
correctly classified labels.

Classification Accuracy
Classification accuracy is defined as the percentage of correct predictions.

 To calculate class wise accuracy,

 CA = (correctly predicted images of a class/(Total images of the clas

s)) * 100

Class-wise accuracy is given by:

#To see the accuracy of each class.

accuracy=[]

leng = len(conf_matrix) #finding the length of confusion matrix

for i in range(leng):

#each diagonal element (conf_matrix[i,i]) is divided by the sum of the

elements of that particular row (conf_matrix[i].sum()).

ac=(conf_matrix[i,i]/((conf_matrix[i].sum())+.0000001))*100

accuracy.append(ac)

print accuracy

Overall accuracy is given by, OA = Sum of class-wise accuracy/no of classes

The code is as follows:

summation=0

no_of_classes = 10

for i in range(0,len(accuracy)):

summation+=accuracy[i]

overall_accuracy = summation/no_of_classes

print overall_accuracy

High classification accuracy always indicates a good classifier.

False

TF-IDF is a common methodology used in pre-processing of images.

True

The improvement of the image data that suppresses distortions or enhances

image features is called ____________.
Image preprocessing

Classification where each data is mapped to more than one class is called
____________.
Binary

In Supervised learning, class labels of the training samples are

Known

Choose the correct sequence for classifier building from the following:
initia – train- predict- evaluate
Select the correct option that directly achieves multi-class classification
(without support of binary classifiers).
K nearest

Which algorithm can be used for matching local regions in two images?
SIFT

Pruning is a technique associated with ______________.

Decision tree

Higher value of which of the following hyperparameters is better for decision

tree algorithm?
Can’t say

Which one of the following is not a classification technique?

Stratifiedshufflepoint

The first layer in a CNN is never a Convolutional Layer.

true

UNIT-2 ML notes
No ratings yet
UNIT-2 ML notes
15 pages
Unit 2
No ratings yet
Unit 2
11 pages
Download the full PDF version of Solution Manual for A Friendly Introduction to Numerical Analysis Brian Bradie right away.
100% (11)
Download the full PDF version of Solution Manual for A Friendly Introduction to Numerical Analysis Brian Bradie right away.
47 pages
ImageProcessing11 Morphology
No ratings yet
ImageProcessing11 Morphology
48 pages
HTML Tables and Forms (PDFDrive)
100% (1)
HTML Tables and Forms (PDFDrive)
68 pages
Weather Forecasting Basepaper
100% (1)
Weather Forecasting Basepaper
14 pages
Handout9 Trees Bagging Boosting
100% (1)
Handout9 Trees Bagging Boosting
23 pages
Unit-5 Decision Trees and Ensemble Learning
100% (1)
Unit-5 Decision Trees and Ensemble Learning
162 pages
Scilab Text Books PDF
No ratings yet
Scilab Text Books PDF
286 pages
Solving 2Xn or mX2 Games by Graphical Method
86% (14)
Solving 2Xn or mX2 Games by Graphical Method
11 pages
Data Preprocessing For Python
No ratings yet
Data Preprocessing For Python
3 pages
02 POLYNOMIALS MCQs
No ratings yet
02 POLYNOMIALS MCQs
28 pages
Deep Learning Based Recommendation Systems
No ratings yet
Deep Learning Based Recommendation Systems
47 pages
Digital Image Processing Segmntation Lab With Python
No ratings yet
Digital Image Processing Segmntation Lab With Python
9 pages
CH 6
No ratings yet
CH 6
72 pages
Facets of Data
No ratings yet
Facets of Data
6 pages
Runge-Kutta Methods For Fuzzy Differential Equations
No ratings yet
Runge-Kutta Methods For Fuzzy Differential Equations
9 pages
Introduction To Object Detection
No ratings yet
Introduction To Object Detection
24 pages
Evaluation Metrics in Machine Learning
No ratings yet
Evaluation Metrics in Machine Learning
14 pages
CSE-Machine Learning & Big Data - WSS Source Book
No ratings yet
CSE-Machine Learning & Big Data - WSS Source Book
181 pages
Python-Linear Regression
No ratings yet
Python-Linear Regression
72 pages
Laboratory 3. Basic Image Segmentation Techniques
No ratings yet
Laboratory 3. Basic Image Segmentation Techniques
10 pages
Lecture 03 Gradient Descent
No ratings yet
Lecture 03 Gradient Descent
26 pages
ML L8 Decision Tree
No ratings yet
ML L8 Decision Tree
109 pages
Model Overfitting Introduction To Data Mining, 2 Edition by Tan, Steinbach, Karpatne, Kumar
No ratings yet
Model Overfitting Introduction To Data Mining, 2 Edition by Tan, Steinbach, Karpatne, Kumar
30 pages
Regression Analysis
100% (2)
Regression Analysis
9 pages
I. The Types of Machine Learning
No ratings yet
I. The Types of Machine Learning
8 pages
Chi Merge
No ratings yet
Chi Merge
5 pages
Linear Regression
100% (1)
Linear Regression
51 pages
Chapter 4 - Equation Fitting
No ratings yet
Chapter 4 - Equation Fitting
24 pages
Text
No ratings yet
Text
131 pages
Module 2
No ratings yet
Module 2
20 pages
Diabetes Prediction Using Data Mining
No ratings yet
Diabetes Prediction Using Data Mining
17 pages
8_Bairstow's Method
No ratings yet
8_Bairstow's Method
7 pages
Data Mining
No ratings yet
Data Mining
49 pages
Data Structures & Algorithms - Topic 7 - Asymptotic Notations & Order of Growth
No ratings yet
Data Structures & Algorithms - Topic 7 - Asymptotic Notations & Order of Growth
26 pages
Accelerate Computing Vision and Image Processing Using VPI 1.1 by Rodolfo Lima
No ratings yet
Accelerate Computing Vision and Image Processing Using VPI 1.1 by Rodolfo Lima
23 pages
EAIT-D-22-01206
No ratings yet
EAIT-D-22-01206
21 pages
5-1 Skills Practice Answers
No ratings yet
5-1 Skills Practice Answers
1 page
Medical Image Fusion Method by Deep Learning
No ratings yet
Medical Image Fusion Method by Deep Learning
9 pages
DAA (4th) May 2022
No ratings yet
DAA (4th) May 2022
2 pages
Machine Learning Mini-Project Report
No ratings yet
Machine Learning Mini-Project Report
26 pages
Quadratic Inequalities Alg Questions MME
No ratings yet
Quadratic Inequalities Alg Questions MME
6 pages
IJNME I Published 7 - 09 - Completely Published
No ratings yet
IJNME I Published 7 - 09 - Completely Published
21 pages
Predicting Mode of Transport (ML) : Akalya KS
No ratings yet
Predicting Mode of Transport (ML) : Akalya KS
17 pages
Final Exam - Decision Analytics
No ratings yet
Final Exam - Decision Analytics
10 pages
Understanding LSTM
No ratings yet
Understanding LSTM
34 pages
Image Classification Using Pre-Trained Convolutional Neural Network in COLAB
No ratings yet
Image Classification Using Pre-Trained Convolutional Neural Network in COLAB
6 pages
Deep Learning Using Python + Keras (Chapter 3) - ResNet - CodeProject
No ratings yet
Deep Learning Using Python + Keras (Chapter 3) - ResNet - CodeProject
24 pages
Logistic Regression
No ratings yet
Logistic Regression
41 pages
Distance Based Models
No ratings yet
Distance Based Models
58 pages
Chandigarh Group of Colleges College of Engineering Landran, Mohali
No ratings yet
Chandigarh Group of Colleges College of Engineering Landran, Mohali
47 pages
Lab3 - Lab4
No ratings yet
Lab3 - Lab4
9 pages
BUAD 802 LP Graphical
No ratings yet
BUAD 802 LP Graphical
8 pages
Velammal Engineering College Department of Cse Design and Analysis of Algorithm Unit V Notes Decision Trees (2 Marks)
No ratings yet
Velammal Engineering College Department of Cse Design and Analysis of Algorithm Unit V Notes Decision Trees (2 Marks)
29 pages
RMM Unit-I Introdution To Data Mining
No ratings yet
RMM Unit-I Introdution To Data Mining
129 pages
Top 10 and The Best Supercomputers in The World
No ratings yet
Top 10 and The Best Supercomputers in The World
6 pages
Chap5.4 Hessian
No ratings yet
Chap5.4 Hessian
15 pages
ML0101EN Clus K Means Customer Seg Py v1
100% (1)
ML0101EN Clus K Means Customer Seg Py v1
8 pages
Ensemble Methods Bagging Boosting and Stacking
100% (1)
Ensemble Methods Bagging Boosting and Stacking
19 pages
Machine Learning Guide Line
No ratings yet
Machine Learning Guide Line
10 pages
Computational Method (Course Outline) 57176
No ratings yet
Computational Method (Course Outline) 57176
5 pages
Data Mining-Multimedia Datamining
No ratings yet
Data Mining-Multimedia Datamining
8 pages
Calculation Spreadsheet For Higher Order Polynomials Excel Andrew Thornett 260818@1237
No ratings yet
Calculation Spreadsheet For Higher Order Polynomials Excel Andrew Thornett 260818@1237
3 pages
K-Means Clustering Tutorial - Matlab Code
No ratings yet
K-Means Clustering Tutorial - Matlab Code
6 pages
Motion Detection
No ratings yet
Motion Detection
33 pages
Excel Spreadsheet in Teaching Numerical Methods
No ratings yet
Excel Spreadsheet in Teaching Numerical Methods
8 pages
Introduction To Deep Learning-Session3: Ravi Shukla
No ratings yet
Introduction To Deep Learning-Session3: Ravi Shukla
21 pages
Physics Project - For-Class-12th
No ratings yet
Physics Project - For-Class-12th
10 pages
Image Processing
No ratings yet
Image Processing
39 pages
Improper Integrals: Ü Two Ways To Classify
No ratings yet
Improper Integrals: Ü Two Ways To Classify
7 pages
A Review of Artificial Neural Network (ANN)
No ratings yet
A Review of Artificial Neural Network (ANN)
5 pages
Full Syllabus of Calicut University (2004) Information Technology (IT)
No ratings yet
Full Syllabus of Calicut University (2004) Information Technology (IT)
191 pages
Class 9 Mathematics Gist & Assignment 9
No ratings yet
Class 9 Mathematics Gist & Assignment 9
4 pages
SAS Presentation
No ratings yet
SAS Presentation
49 pages
Sign Language Recognition Using Deep Learning
No ratings yet
Sign Language Recognition Using Deep Learning
6 pages
ML UNIT-2 Notes
No ratings yet
ML UNIT-2 Notes
15 pages
Car Make and Model Recognition Using Ima
No ratings yet
Car Make and Model Recognition Using Ima
8 pages
Polynomials Oct 7
No ratings yet
Polynomials Oct 7
3 pages
Seminar Report Machine Learning
No ratings yet
Seminar Report Machine Learning
20 pages
Artificial Neural Networks Kluniversity Course Handout
No ratings yet
Artificial Neural Networks Kluniversity Course Handout
18 pages
IS PowerMethod
No ratings yet
IS PowerMethod
7 pages
Computer Vision I: Ai Courses by Opencv
No ratings yet
Computer Vision I: Ai Courses by Opencv
9 pages
Convolutional Neural Networks
No ratings yet
Convolutional Neural Networks
5 pages
Machine Learning Notes
No ratings yet
Machine Learning Notes
3 pages
Weka Tutorial
No ratings yet
Weka Tutorial
2 pages
Python Natural Language Processing Cookbook: Over 60 recipes for building powerful NLP solutions using Python and LLM libraries
From Everand
Python Natural Language Processing Cookbook: Over 60 recipes for building powerful NLP solutions using Python and LLM libraries
Zhenya Antić
No ratings yet
Excel 2013/2016: Get Your Hands Dirty
From Everand
Excel 2013/2016: Get Your Hands Dirty
Sam Akrasi
No ratings yet
Kernel Methods: Fundamentals and Applications
From Everand
Kernel Methods: Fundamentals and Applications
Fouad Sabry
No ratings yet
Mastering WebGL: Crafting Advanced 3D Web Experiences: WebGL Wizadry
From Everand
Mastering WebGL: Crafting Advanced 3D Web Experiences: WebGL Wizadry
Kameron Hussain
No ratings yet