

OBJECT CLASSIFICATION
USING CNN

OBJECT CLASSIFICATION USING CNN - STEPS INVOLVED:

• STEP 1: Data Collection and Preparation
  • Gather a labeled dataset containing images of the objects you want to classify.
  • Split the dataset into training, validation, and test sets. Common splits include 70-80% for training, 10-15% for validation, and 10-15% for testing (see the sketch after the dataset descriptions below).

COMMON DATASETS USED:

MNIST (Modified National Institute of Standards and Technology) is a well-known dataset used in Computer Vision, built by Yann LeCun et al. It is composed of images of handwritten digits (0-9), split into a training set of 60,000 images and a test set of 10,000 images, where each image is 28 x 28 pixels in width and height.

The CIFAR-10 dataset consists of 60,000 32 x 32 colour images in 10 classes, with 6,000 images per class. There are 50,000 training images and 10,000 test images.
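A minimal sketch of Step 1, using the Keras-bundled copy of MNIST as the labeled dataset; the 70/15/15 split ratio and the variable names are illustrative, not prescriptive.

# Step 1 sketch: load a labeled dataset and split it into train/validation/test sets.
from sklearn.model_selection import train_test_split
from tensorflow.keras.datasets import mnist

(x, y), _ = mnist.load_data()          # 60,000 training images of handwritten digits
x = x.astype("float32") / 255.0        # scale pixel values to [0, 1]

# 70% for training, then split the remaining 30% evenly into validation and test sets
x_train, x_tmp, y_train, y_tmp = train_test_split(x, y, test_size=0.30, random_state=42)
x_val, x_test, y_val, y_test = train_test_split(x_tmp, y_tmp, test_size=0.50, random_state=42)

print(x_train.shape, x_val.shape, x_test.shape)   # roughly 70% / 15% / 15% of the data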

OBJECT CLASSIFICATION USING CNN - STEPS INVOLVED:

COMMON DATASETS USED:

The ImageNet dataset consists of 1000 object categories, organized according to the WordNet hierarchy.

• STEP 2: Data Augmentation (optional)
  Data augmentation techniques like rotation, scaling, flipping, and cropping can be applied to increase the diversity of the training data and improve the model's generalization. Typical processing types include resizing images, jittering colour, warping images, simulating noise, simulating blur, and cropping images (the sample output images from the original table are omitted here). A sketch follows below.
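A minimal sketch of Step 2 using Keras preprocessing layers (TensorFlow 2.x); the specific transforms and their parameter values are illustrative choices, not the only possible pipeline.

# Step 2 sketch: an on-the-fly augmentation pipeline applied to training images.
import tensorflow as tf
from tensorflow.keras import layers

data_augmentation = tf.keras.Sequential([
    layers.RandomFlip("horizontal"),        # flipping
    layers.RandomRotation(0.1),             # rotation by up to +/-10% of a full turn
    layers.RandomZoom(0.1),                 # scaling
    layers.RandomTranslation(0.1, 0.1),     # shifting, a mild stand-in for cropping
])

# Typically applied as the first layers of the model, or via
# train_ds.map(lambda img, label: (data_augmentation(img, training=True), label)).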

OBJECT CLASSIFICATION USING CNN - STEPS INVOLVED:

• Step 3 – Preprocessing
  • Resize the images to a consistent input size (e.g., 224x224 pixels); consistent preprocessing helps with training stability.

• Step 4 – Build the CNN Model
  • Design the architecture of your CNN model. Common architectures include VGG, ResNet, Inception, or custom designs.
  • Specify the number of convolutional layers, filter sizes, pooling layers, and fully connected layers.
  • Add activation functions like ReLU (Rectified Linear Unit) after convolutional layers (see the sketch after this list).
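A minimal sketch of Steps 3-4: a small custom CNN with a fixed 224x224x3 input. The layer counts, filter sizes, and the 10-class output are illustrative assumptions.

# Steps 3-4 sketch: build a small CNN with convolution, pooling, and dense layers.
from tensorflow.keras import layers, models

model = models.Sequential([
    layers.Input(shape=(224, 224, 3)),                        # consistent input size (Step 3)
    layers.Conv2D(32, (3, 3), padding="same", activation="relu"),
    layers.MaxPooling2D((2, 2)),
    layers.Conv2D(64, (3, 3), padding="same", activation="relu"),
    layers.MaxPooling2D((2, 2)),
    layers.Conv2D(128, (3, 3), padding="same", activation="relu"),
    layers.MaxPooling2D((2, 2)),
    layers.Flatten(),
    layers.Dense(256, activation="relu"),                     # fully connected layer
    layers.Dense(10, activation="softmax"),                   # class scores for 10 classes
])
model.summary()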

• AlexNet was the first convolutional network to use a GPU (Graphics Processing Unit) to boost performance.
• A graphics processing unit (GPU) is a specialized electronic circuit designed to manipulate and alter memory to accelerate the creation of images in a frame buffer intended for output to a display device.
• The AlexNet architecture consists of 5 convolutional layers, 3 max-pooling layers, 3 fully connected layers, and 1 softmax layer.
• Each convolutional layer consists of convolutional filters and a nonlinear activation function (ReLU).
• The pooling layers perform max pooling.
• The input size is fixed due to the presence of fully connected layers.
• The input size is quoted in most places as 224x224x3, but due to the padding involved it works out to 227x227x3.

Full (simplified) AlexNet architecture:

[227x227x3] INPUT
[55x55x96]  CONV1: 96 11x11 filters at stride 4, pad 0
[27x27x96]  MAX POOL1: 3x3 filters at stride 2
[27x27x96]  NORM1: Normalization layer
[27x27x256] CONV2: 256 5x5 filters at stride 1, pad 2
[13x13x256] MAX POOL2: 3x3 filters at stride 2
[13x13x256] NORM2: Normalization layer
[13x13x384] CONV3: 384 3x3 filters at stride 1, pad 1
[13x13x384] CONV4: 384 3x3 filters at stride 1, pad 1
[13x13x256] CONV5: 256 3x3 filters at stride 1, pad 1
[6x6x256]   MAX POOL3: 3x3 filters at stride 2
[4096]      FC6: 4096 neurons
[4096]      FC7: 4096 neurons
[1000]      FC8: 1000 neurons (class scores)

Details/Retrospectives:
- first use of ReLU
- used Norm layers (not common anymore)
- data augmentation
- dropout 0.5
- batch size 128
- SGD Momentum 0.9
- Learning rate 1e-2
- L2 weight decay 5e-4
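A Keras sketch of the simplified layer listing above. The original network used local response normalization; BatchNormalization is substituted here as a stand-in (an assumption), and the 1000-way softmax matches the ImageNet setting.

# AlexNet-style sketch: layer shapes in the comments follow the listing above.
from tensorflow.keras import layers, models

alexnet = models.Sequential([
    layers.Input(shape=(227, 227, 3)),
    layers.Conv2D(96, (11, 11), strides=4, activation="relu"),       # CONV1 -> 55x55x96
    layers.MaxPooling2D((3, 3), strides=2),                          # POOL1 -> 27x27x96
    layers.BatchNormalization(),                                     # NORM1 (stand-in for LRN)
    layers.Conv2D(256, (5, 5), padding="same", activation="relu"),   # CONV2 -> 27x27x256
    layers.MaxPooling2D((3, 3), strides=2),                          # POOL2 -> 13x13x256
    layers.BatchNormalization(),                                     # NORM2 (stand-in for LRN)
    layers.Conv2D(384, (3, 3), padding="same", activation="relu"),   # CONV3 -> 13x13x384
    layers.Conv2D(384, (3, 3), padding="same", activation="relu"),   # CONV4 -> 13x13x384
    layers.Conv2D(256, (3, 3), padding="same", activation="relu"),   # CONV5 -> 13x13x256
    layers.MaxPooling2D((3, 3), strides=2),                          # POOL3 -> 6x6x256
    layers.Flatten(),
    layers.Dense(4096, activation="relu"),                           # FC6
    layers.Dropout(0.5),                                             # dropout 0.5
    layers.Dense(4096, activation="relu"),                           # FC7
    layers.Dropout(0.5),
    layers.Dense(1000, activation="softmax"),                        # FC8: class scores
])
alexnet.summary()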

ResNet – Residual Network (34, 50, 101, 152 layers)

"Deep Residual Learning for Image Recognition"
Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun
Microsoft Research

• Very deep network
• 152 layers
• Won 1st place in the ILSVRC 2015 classification task.

Stacking CNNs deeper

• The authors introduced a deep residual learning framework.
• They hypothesize that it is easier to optimize the residual mapping than to optimize the original (plain) network.
• The proposed hypothesis performed better.
• Layers are used to fit a residual mapping rather than fitting the underlying mapping directly.
• A deeper network should perform better, yet plain deep networks do not perform as well as shallower ones.
• The problem is due to the difficulty of optimizing the learning in very deep plain networks.

Residual Block
• H(x) is the underlying mapping.
• F(x) + x can be realized by feedforward neural networks with "shortcut connections", known as identity mappings.
• The shortcut allows the gradient to be directly backpropagated to earlier layers.
• It does not create any extra parameters or computation.
• Training is achieved by backpropagation (see the sketch after the equations below).

Fitting the residual:
F(x) := H(x) − x
H(x) = F(x) + x
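A Keras sketch of a single residual block: F(x) is two convolutional layers and the shortcut adds x back, so the block outputs H(x) = F(x) + x. The filter count and input shape are illustrative assumptions.

# Residual block sketch: identity shortcut added to the output of two conv layers.
from tensorflow.keras import layers, Input, Model

def residual_block(x, filters=64):
    shortcut = x                                             # identity mapping (the shortcut)
    y = layers.Conv2D(filters, (3, 3), padding="same", activation="relu")(x)
    y = layers.Conv2D(filters, (3, 3), padding="same")(y)    # F(x)
    y = layers.Add()([y, shortcut])                          # F(x) + x
    return layers.Activation("relu")(y)                      # H(x)

inputs = Input(shape=(56, 56, 64))
outputs = residual_block(inputs)
Model(inputs, outputs).summary()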

Full ResNet architecture (figure omitted)

GoogLeNet
• 22 layers
• Inception module
• 5 million parameters (12x fewer than AlexNet)

Inception module
• Design a good local network topology (a network within a network) and then stack these modules on top of each other.
• Naïve Inception module (figure omitted)
• 9 Inception modules are used in the whole architecture (see the sketch after this list).
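A sketch of a naive Inception module: parallel 1x1, 3x3, and 5x5 convolutions plus 3x3 max pooling, concatenated along the channel axis. The filter counts and input shape are illustrative; GoogLeNet's actual modules also add 1x1 bottleneck convolutions.

# Naive Inception module sketch: four parallel branches concatenated depth-wise.
from tensorflow.keras import layers, Input, Model

inputs = Input(shape=(28, 28, 192))                                       # example feature map
b1 = layers.Conv2D(64, (1, 1), padding="same", activation="relu")(inputs)
b2 = layers.Conv2D(128, (3, 3), padding="same", activation="relu")(inputs)
b3 = layers.Conv2D(32, (5, 5), padding="same", activation="relu")(inputs)
b4 = layers.MaxPooling2D((3, 3), strides=1, padding="same")(inputs)
outputs = layers.Concatenate()([b1, b2, b3, b4])                          # stack branch outputs

naive_inception = Model(inputs, outputs)
naive_inception.summary()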

Activation functions – Rectified Linear Unit (ReLU)

A rectified linear unit (ReLU) is an activation function that introduces the property of non-linearity to a deep learning model and helps to mitigate the vanishing gradients issue. It returns the positive part of its argument and is one of the most popular activation functions in deep learning.

Mathematically: f(x) = max(0, x)

Advantages
1. ReLU is computationally efficient and easy to implement.
2. It helps to avoid the vanishing gradient problem, which can occur when using other activation functions.
3. ReLU has been shown to be effective in deep learning models, achieving state-of-the-art results in many applications.

Disadvantages
1. When the input is negative, the output is always zero, which can lead to the "dead neuron" problem where the neuron stops learning and does not contribute to the model's performance.
2. ReLU is not a smooth function, which can cause some optimization algorithms to fail.

Activation functions – Leaky Rectified Linear Unit (Leaky ReLU)

Leaky ReLU (Rectified Linear Unit) is a variation of the ReLU activation function that overcomes the "dying ReLU" problem, where the neuron can become inactive during training and not recover. The Leaky ReLU allows a small, non-zero gradient when the input is negative, which helps to prevent the neuron from dying.

Mathematically: f(x) = x if x > 0, and f(x) = αx if x ≤ 0, where α is a small negative-slope coefficient (e.g., 0.01).

Advantages
1. It avoids the "dying ReLU" problem, where the gradient of the neuron can become zero during training and the neuron stops updating. The small negative slope allows the neuron to have a non-zero gradient, even for negative inputs.
2. It combats bias shift, since neurons are allowed to pass small negative signals to the output.

Disadvantages
1. The negative slope is a hyperparameter that needs to be tuned, which can add complexity to the model.
2. While Leaky ReLU can help prevent the gradient from vanishing for negative inputs, it can still cause the gradient to vanish for very large positive inputs. This can make it difficult to train deep networks with many layers.

Activation functions – Exponential Linear Unit (ELU)

ELU is a smooth and continuous function that allows negative values.

Mathematically: f(x) = x if x > 0, and f(x) = α(e^x − 1) if x ≤ 0.

Advantages
1. It can help to reduce the bias shift and avoid overfitting in neural networks.
2. It has been shown to outperform other activation functions like ReLU and its variants in cases such as regression (where the output should take negative values) and with imbalanced data (ELU can help prevent the vanishing gradient problem when some inputs have very large positive or negative values).
3. It is a smooth and continuous function, which can aid the convergence of gradient-based optimization algorithms.
4. ELU can help to avoid the dead neuron problem that can occur with the ReLU activation function.

Disadvantages
1. The exponential used in the function can be computationally expensive.
2. The value of alpha needs to be carefully chosen to balance the advantages of the function.
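A NumPy sketch of the three activations discussed above; the alpha values are illustrative defaults.

# Activation function sketch: ReLU, Leaky ReLU, and ELU applied to sample inputs.
import numpy as np

def relu(x):
    return np.maximum(0.0, x)                              # 0 for x < 0, x otherwise

def leaky_relu(x, alpha=0.01):
    return np.where(x > 0, x, alpha * x)                   # small negative slope for x < 0

def elu(x, alpha=1.0):
    return np.where(x > 0, x, alpha * (np.exp(x) - 1.0))   # smooth curve for x < 0

x = np.array([-2.0, -0.5, 0.0, 0.5, 2.0])
print(relu(x), leaky_relu(x), elu(x), sep="\n")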

• Step 5 – Compile the Model
  • Choose an appropriate loss function, such as a regression loss or a classification loss (see the sketch after this slide).

Loss Function

• After the activations have been applied, the model's output is compared with the required output.
• The loss function is defined as the measurement of the difference, or error, between the actual values and the expected values at the current position:

  Loss function = Actual output − Desired output

• The average over all losses constitutes the cost.
• Loss functions are divided into two categories:
  1. Regression loss
  2. Classification loss – binary and multi-class classification
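A minimal sketch of Step 5, assuming a Keras classification model named model (for example, the one built in Step 4) and integer class labels; the optimizer and metric are illustrative choices.

# Step 5 sketch: compile the model with a classification loss.
model.compile(
    optimizer="adam",
    loss="sparse_categorical_crossentropy",   # classification loss for integer labels
    metrics=["accuracy"],
)
# For a regression problem, a regression loss such as "mse" or "mae" would be used instead.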

Loss Function – Regression Loss

1. Mean Squared Error (MSE)
Squared Error loss for each training example, also known as L2 loss, is the square of the difference between the actual and the predicted values.

2. Mean Squared Logarithmic Error (MSLE)
It measures the ratio between actual and predicted values using their logarithms. It is a good choice for predicting continuous data.

3. Mean Absolute Error (MAE)
Absolute Error for each training example is the distance between the predicted and the actual values, irrespective of the sign. Absolute Error is also known as the L1 loss.

Loss Function – Binary Classification Loss Functions

1. Binary Cross-Entropy Loss
Entropy indicates disorder or uncertainty; it is measured for a random variable X with probability distribution p(X). Cross-entropy is the default loss function to use for binary classification problems. It is intended for use with binary classification where the target values are in the set {0, 1}.

2. Hinge Loss
An alternative to cross-entropy for binary classification problems is the hinge loss function, primarily developed for use with Support Vector Machine (SVM) models. It is intended for use with binary classification where the target values are in the set {-1, 1}.

3. Squared Hinge Loss
It is an extension of hinge loss, mainly used for categorical prediction or yes/no decision problems. A sketch of these losses follows below.
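A NumPy sketch of the losses above for a single batch of values; the targets and predictions are made up purely for illustration.

# Loss function sketch: regression losses and binary classification losses by hand.
import numpy as np

y_true = np.array([1.0, 0.0, 1.0, 1.0])      # actual values / binary targets in {0, 1}
y_pred = np.array([0.9, 0.2, 0.6, 0.8])      # predicted values / probabilities

mse  = np.mean((y_true - y_pred) ** 2)                          # Mean Squared Error (L2)
msle = np.mean((np.log1p(y_true) - np.log1p(y_pred)) ** 2)      # Mean Squared Logarithmic Error
mae  = np.mean(np.abs(y_true - y_pred))                         # Mean Absolute Error (L1)
bce  = -np.mean(y_true * np.log(y_pred)                         # Binary Cross-Entropy
                + (1 - y_true) * np.log(1 - y_pred))

t = 2 * y_true - 1                                              # hinge targets in {-1, 1}
s = 2 * y_pred - 1                                              # illustrative raw scores
hinge = np.mean(np.maximum(0.0, 1.0 - t * s))                   # Hinge Loss
sq_hinge = np.mean(np.maximum(0.0, 1.0 - t * s) ** 2)           # Squared Hinge Loss

print(mse, msle, mae, bce, hinge, sq_hinge)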

Loss Function – Multi-Class Classification Loss Functions

• Multi-class classification covers those predictive modeling problems where examples are assigned one of more than two classes.

• 1. Multiclass Cross-Entropy Loss
  • Cross-entropy is the default loss function to use for multi-class classification problems.
  • It is the best loss function for text classification.

• 2. Sparse Multiclass Cross-Entropy Loss
  • Sparse cross-entropy performs the same error calculation as cross-entropy and is mainly used to handle large amounts of data.

• Step 6 – Training and Evaluation
  • Train the model on the data in batches and evaluate it using metrics like precision, recall, F1-score, and the confusion matrix to assess its performance (see the sketch after this list).
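A minimal sketch of Step 6, assuming a compiled Keras model and a data split are already available as model, x_train/y_train, x_val/y_val, and x_test/y_test; the batch size, epoch count, and use of scikit-learn metrics are illustrative choices.

# Step 6 sketch: train in batches, then evaluate with precision, recall, F1, and a confusion matrix.
import numpy as np
from sklearn.metrics import classification_report, confusion_matrix

history = model.fit(
    x_train, y_train,
    validation_data=(x_val, y_val),
    batch_size=64,
    epochs=10,
)

y_prob = model.predict(x_test)
y_pred = np.argmax(y_prob, axis=1)                # predicted class for each test image
print(classification_report(y_test, y_pred))      # precision, recall, F1-score per class
print(confusion_matrix(y_test, y_pred))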
