Lec7 8+CNN 2
Presented by:
Dr. Mona Hussein Alnaggar
2024-2025
1st term
Lecture 7,8
Agenda
• Convolutional Neural Network (CNN)
Backpropagation
• Backpropagation is done by fine-tuning the weights of the connections between ANN units based on the error rate obtained.
• This process continues until the artificial neural network can correctly recognize an object in an image with the minimal possible error rate.
Backpropagation cont.
Feedforward vs Backpropagation
• The data is fed into the model and the output of each layer is computed in turn; this forward pass is called feedforward. We then calculate the error using an error (loss) function; common error functions are cross-entropy, squared error, etc. The error function measures how well the network is performing. After that, we backpropagate through the model by calculating the derivatives of the error with respect to the weights. This step is called backpropagation, and it is used to minimize the loss.
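To make the feedforward and backpropagation steps concrete, below is a minimal sketch (not from the slides) of one forward pass and one backward pass for a single sigmoid neuron trained with a squared-error loss in NumPy; the data values, loss choice, and learning rate are illustrative assumptions.

```python
# Minimal sketch: feedforward then backpropagation for one sigmoid neuron.
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# toy data: one sample with 2 features, target 1.0 (illustrative values)
x = np.array([0.5, -1.0])
y = 1.0

rng = np.random.default_rng(0)
W = rng.normal(size=2)        # randomly initialized weights
b = 0.0                       # bias
lr = 0.1                      # learning rate

# feedforward: weighted sum + activation, then the error function
z = W @ x + b
y_hat = sigmoid(z)
loss = 0.5 * (y_hat - y) ** 2          # squared-error loss

# backpropagation: chain rule gives dLoss/dW and dLoss/db
dloss_dyhat = y_hat - y
dyhat_dz = y_hat * (1 - y_hat)         # derivative of the sigmoid
delta = dloss_dyhat * dyhat_dz
dW = delta * x
db = delta

# update: step opposite to the gradient to reduce the loss
W -= lr * dW
b -= lr * db
```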
Gradient Descent
• Gradient descent is used to optimize the weights and biases based on the cost function.
• The cost function evaluates the difference between the actual and predicted outputs.
• Gradient descent is an optimization algorithm used to find the values of the parameters (coefficients) of a function (f) that minimize a cost function.
• In other words, gradient descent is an iterative algorithm that
helps to find the optimal solution to a given problem.
Gradient Descent cont.
1. Initialize the parameters (weights and biases) with starting values.
2. Calculate the gradient of the cost function with respect to the parameters.
3. Update the parameters by taking a small step in the opposite direction of the gradient.
4. Repeat steps 2 and 3 until the algorithm reaches a local or global minimum, where the gradient is zero (see the sketch below).
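A minimal sketch (illustrative, not from the slides) of these steps, minimizing the toy cost function f(w) = (w - 3)^2:

```python
# Gradient descent on a 1-D toy cost function.
def cost(w):
    return (w - 3.0) ** 2

def gradient(w):
    return 2.0 * (w - 3.0)      # derivative of the cost w.r.t. the parameter

w = 0.0          # step 1: initialize the parameter
lr = 0.1         # learning rate (step size)

for step in range(100):
    g = gradient(w)             # step 2: compute the gradient
    w = w - lr * g              # step 3: move opposite to the gradient
    if abs(g) < 1e-6:           # step 4: stop when the gradient is ~zero
        break

print(w)  # converges close to 3.0, the minimum of the cost
```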
Gradient Descent:
In gradient descent, we consider all the data points when calculating the loss and its derivative.
Stochastic gradient descent:
In stochastic gradient descent, we use a single randomly chosen point when calculating the loss and its derivative.
Different Variants of Gradient Descent
• There are several variants of gradient descent that differ in the way the step size or learning rate is chosen and the way the updates are made. Here are
some popular variants:
1. Batch Gradient Descent: In batch gradient descent, the entire training dataset is used to compute the gradient and update the model parameters (weights and biases) at each iteration. It is effective for convex or relatively smooth error manifolds because it moves directly toward an optimal solution by taking a large step in the direction of the negative gradient of the cost function. However, because the gradient is computed over the whole training set at every iteration, it can be slow for large datasets, resulting in longer training times and higher computational costs, although it may lead to a more accurate model.
2. Stochastic Gradient Descent (SGD): In SGD, only one training example is used to compute the gradient and update the parameters at each iteration.
This can be faster than batch gradient descent but may lead to more noise in the updates.
3. Mini-batch Gradient Descent: In mini-batch gradient descent, a small batch of training examples is used to compute the gradient and update the parameters at each iteration. This can be a good compromise between batch gradient descent and stochastic gradient descent, as it can be faster than batch gradient descent and less noisy than stochastic gradient descent, as illustrated in the sketch below.
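The sketch below (illustrative assumptions, not from the slides) contrasts how the three variants choose the data used for one parameter update, on a simple linear-regression cost in NumPy:

```python
# Batch vs. stochastic vs. mini-batch updates on a toy 1-D regression.
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 1))                     # 100 samples, 1 feature
y = 3.0 * X[:, 0] + rng.normal(scale=0.1, size=100)

w, lr = 0.0, 0.05

def grad(w, X_sub, y_sub):
    # gradient of 0.5 * mean((X w - y)^2) with respect to w
    err = X_sub[:, 0] * w - y_sub
    return np.mean(err * X_sub[:, 0])

for step in range(200):
    # 1. Batch: use the whole dataset for each update
    # w -= lr * grad(w, X, y)

    # 2. Stochastic: use one randomly chosen sample per update
    # i = rng.integers(len(X)); w -= lr * grad(w, X[i:i+1], y[i:i+1])

    # 3. Mini-batch: use a small random batch per update (active here)
    idx = rng.choice(len(X), size=16, replace=False)
    w -= lr * grad(w, X[idx], y[idx])

print(w)   # approaches the true slope of 3.0
```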
Advantages of gradient descent and its variants:
1. Widely used: Gradient descent and its variants are widely used in machine learning and
optimization problems because they are effective and easy to implement.
2. Convergence: Gradient descent and its variants can converge to a global minimum or a good local
minimum of the cost function, depending on the problem and the variant used.
3. Scalability: Many variants of gradient descent can be parallelized and are scalable to large datasets
and high-dimensional models.
4. Flexibility: Different variants of gradient descent offer a range of trade-offs between accuracy and
speed and can be adjusted to optimize the performance of a specific problem.
Disadvantages of gradient descent and its variants:
1. Choice of learning rate: The choice of learning rate is crucial for the convergence of gradient descent and its variants. Choosing a learning rate that is too large can lead to oscillations or overshooting, while choosing a learning rate that is too small can lead to slow convergence or getting stuck in local minima.
2. Sensitivity to initialization: Gradient descent and its variants can be sensitive to the initialization of the model’s parameters, which
can affect the convergence and the quality of the solution.
3. Time-consuming: Gradient descent and its variants can be time-consuming, especially when dealing with large datasets and high-
dimensional models. The convergence speed can also vary depending on the variant used and the specific problem.
4. Local optima: Gradient descent and its variants can converge to a local minimum instead of the global minimum of the cost
function, especially in non-convex problems. This can affect the quality of the solution, and techniques like random initialization
and multiple restarts may be used to mitigate this issue.
Convolutional Neural Network
CNN
Convolutional Neural Network (CNN)
• A Convolutional Neural Network (CNN) is a type of Deep Learning neural network architecture
commonly used in Computer Vision.
• Computer vision is a field of Artificial Intelligence that enables a computer to understand and
interpret the image or visual data.
• When it comes to Machine Learning, Artificial Neural Networks perform really well. Neural Networks are used on various kinds of data such as images, audio, and text.
• Different types of Neural Networks are used for different purposes: for example, for predicting a sequence of words we use Recurrent Neural Networks (more precisely, an LSTM), and similarly, for image classification we use Convolutional Neural Networks.
Convolution
Neural Network
• A Convolutional Neural Network (CNN) is an extended version of the artificial neural network (ANN) that is predominantly used to extract features from grid-like (matrix) data.
• For example, visual datasets like
images or videos where data
patterns play an extensive role.
• CNN architecture consists of
multiple layers like the input layer,
Convolutional layer, Pooling layer,
and fully connected layers.
Types of layers:
• In a regular Neural Network, there are three types of layers:
1. Input Layers: It’s the layer in which we give input to our model. The number of neurons in this layer is equal
to the total number of features in our data (number of pixels in the case of an image).
2. Hidden Layer: The input from the input layer is then fed into the hidden layers. There can be many hidden layers depending upon our model and data size. Each hidden layer can have a different number of neurons, generally greater than the number of features. The output of each layer is computed by matrix multiplication of the output of the previous layer with the learnable weights of that layer, followed by the addition of learnable biases and an activation function, which makes the network nonlinear (see the sketch after this list).
3. Output Layer: The output from the hidden layer is then fed into a logistic function like sigmoid or softmax
which converts the output of each class into the probability score of each class.
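A minimal sketch (illustrative layer sizes, not from the slides) of this forward pass in NumPy: matrix multiply, add bias, apply an activation, then a softmax output layer that turns scores into class probabilities.

```python
# Forward pass through one hidden layer and a softmax output layer.
import numpy as np

def relu(z):
    return np.maximum(0.0, z)

def softmax(z):
    e = np.exp(z - np.max(z))          # subtract max for numerical stability
    return e / e.sum()

rng = np.random.default_rng(0)
x = rng.normal(size=4)                 # input layer: 4 features

W1 = rng.normal(size=(8, 4))           # hidden layer: 8 neurons
b1 = np.zeros(8)
W2 = rng.normal(size=(3, 8))           # output layer: 3 classes
b2 = np.zeros(3)

h = relu(W1 @ x + b1)                  # hidden layer output (nonlinear)
probs = softmax(W2 @ h + b2)           # probability score for each class
print(probs, probs.sum())              # probabilities sum to 1
```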
CNN Concepts
• Pooling layers are used to reduce the dimensions of the feature maps. Thus, it reduces
the number of parameters to learn, and the amount of computation performed in the
network.
• The pooling layer summarizes the features present in a region of the feature map
generated by a convolution layer. So, further operations are performed on summarized
features instead of precisely positioned features generated by the convolution layer. This
makes the model more robust to variations in the position of the features in the input
image.
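A minimal sketch (illustrative values, not from the slides) of 2x2 max pooling with stride 2 on a single-channel feature map, using NumPy:

```python
# Max pooling: summarize each 2x2 region of the feature map by its maximum.
import numpy as np

def max_pool2d(fmap, size=2, stride=2):
    h, w = fmap.shape
    out_h = (h - size) // stride + 1
    out_w = (w - size) // stride + 1
    out = np.zeros((out_h, out_w))
    for i in range(out_h):
        for j in range(out_w):
            window = fmap[i * stride:i * stride + size,
                          j * stride:j * stride + size]
            out[i, j] = window.max()   # keep only the strongest activation
    return out

fmap = np.array([[1, 3, 2, 1],
                 [4, 6, 5, 2],
                 [7, 2, 9, 1],
                 [3, 1, 4, 8]], dtype=float)
print(max_pool2d(fmap))               # 2x2 output summarizing the 4x4 map
```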
Pooling layer
Strides
• When the filter is applied, it is shifted over the input matrix. The number of pixels the filter moves across the input matrix at each step is known as the stride. When the stride is 1, we move the filter 1 pixel at a time; when the stride is 2, we move the filter 2 pixels at a time, and so on.
• Strides are essential because they control how the filter convolves against the input, i.e., strides regulate which features could be missed while scanning the image. They denote the number of steps we move in each convolution, as in the sketch below.
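A minimal sketch (illustrative, not from the slides) showing how the stride changes how many positions a width-3 filter visits on a 1-D input of length 7:

```python
# Stride 1 vs. stride 2 for a simple 1-D convolution.
import numpy as np

x = np.arange(7, dtype=float)          # toy 1-D input
k = np.array([1.0, 0.0, -1.0])         # toy filter of width 3

def conv1d(x, k, stride):
    positions = range(0, len(x) - len(k) + 1, stride)
    return np.array([np.dot(x[p:p + len(k)], k) for p in positions])

print(conv1d(x, k, stride=1))          # 5 outputs: filter moves 1 step at a time
print(conv1d(x, k, stride=2))          # 3 outputs: filter moves 2 steps at a time
```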
Padding
• Padding plays a vital role in building a CNN.
• After the convolution operation, the original size of the image is shrunk.
• Also, in the image classification task there are multiple convolution layers, so our original image is shrunk after every step, which we don't want.
• Secondly, when the kernel moves over the original image, it passes over the middle pixels more times than the edge pixels, so the borders are covered fewer times and contribute less to the output.
• To overcome this problem, a new concept was introduced named padding: an additional border of pixels added around the image so that the convolution preserves the size of the original picture. For example, the sketch below shows the output-size arithmetic with and without padding.
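A minimal sketch (illustrative values, not from the slides) of the standard output-size arithmetic for a convolution, out = (W - F + 2P) / S + 1, where W is the input width, F the filter width, P the padding, and S the stride:

```python
# Output size of a convolution for different padding and stride choices.
def conv_output_size(w, f, p, s):
    return (w - f + 2 * p) // s + 1

# 34x34 image, 3x3 filter
print(conv_output_size(34, 3, p=0, s=1))   # 32 -> image shrinks without padding
print(conv_output_size(34, 3, p=1, s=1))   # 34 -> padding of 1 keeps the size
print(conv_output_size(34, 3, p=0, s=2))   # 16 -> a larger stride shrinks it more
```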
Max Pooling
CNN Layers
Try to build a CNN to recognize handwritten digits from the MNIST dataset (a minimal sketch follows).
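A minimal sketch of a small CNN for MNIST digit recognition. It assumes TensorFlow/Keras is available; the layer sizes, filter counts, and number of epochs are illustrative choices, not prescribed by the lecture.

```python
# Small CNN for MNIST: convolution + pooling layers, then fully connected layers.
import tensorflow as tf
from tensorflow.keras import layers, models

# load MNIST: 28x28 grayscale images of digits 0-9
(x_train, y_train), (x_test, y_test) = tf.keras.datasets.mnist.load_data()
x_train = x_train[..., None] / 255.0   # add channel dimension, scale to [0, 1]
x_test = x_test[..., None] / 255.0

model = models.Sequential([
    layers.Input(shape=(28, 28, 1)),
    layers.Conv2D(32, (3, 3), activation="relu"),   # convolution layer
    layers.MaxPooling2D((2, 2)),                    # pooling layer: downsample
    layers.Conv2D(64, (3, 3), activation="relu"),
    layers.MaxPooling2D((2, 2)),
    layers.Flatten(),                               # flatten feature maps
    layers.Dense(64, activation="relu"),            # fully connected layer
    layers.Dense(10, activation="softmax"),         # probability per digit class
])

model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
model.fit(x_train, y_train, epochs=3, validation_data=(x_test, y_test))
```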
• Convolution layers consist of a set of learnable filters (or kernels) having small widths and heights and the same depth as that of the input volume (e.g., 3 if the input is an RGB image).
• For example, if we have to run convolution on an image with dimensions 34x34x3, the possible size of the filters is a×a×3, where 'a' can be anything like 3, 5, or 7, but smaller than the image dimension.
• During the forward pass, we slide each filter across the whole input volume step by step, where each step is called the stride (which can have a value of 2, 3, or even 4 for high-dimensional images), and compute the dot product between the kernel weights and the patch of the input volume.
• As we slide our filters we get a 2-D output (feature map) for each filter; stacking these together, we get an output volume with a depth equal to the number of filters, as in the sketch below. The network learns all the filters.
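A minimal sketch (illustrative sizes, not from the slides) of this forward pass in NumPy: each 3x3x3 filter produces one 2-D feature map, and stacking the maps gives an output depth equal to the number of filters.

```python
# Forward pass of a convolution layer: slide each filter, take dot products.
import numpy as np

rng = np.random.default_rng(0)
image = rng.normal(size=(34, 34, 3))       # H x W x depth (RGB)
filters = rng.normal(size=(8, 3, 3, 3))    # 8 filters of size 3x3x3
stride = 1

out_h = (34 - 3) // stride + 1             # 32
out_w = (34 - 3) // stride + 1             # 32
output = np.zeros((out_h, out_w, len(filters)))

for f in range(len(filters)):              # one feature map per filter
    for i in range(out_h):
        for j in range(out_w):
            patch = image[i * stride:i * stride + 3,
                          j * stride:j * stride + 3, :]
            output[i, j, f] = np.sum(patch * filters[f])   # dot product

print(output.shape)                        # (32, 32, 8): depth = number of filters
```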
Advantages of Convolutional Neural
Networks (CNNs):
1. Good at detecting patterns and features in images, videos, and audio signals.
2. Robust to translation, rotation, and scaling (invariance to these transformations).
3. End-to-end training; no need for manual feature extraction.
4. Can handle large amounts of data and achieve high accuracy.
Disadvantages of Convolutional Neural
Networks (CNNs):
1. Computationally expensive to train and require a lot of memory.
2. Can be prone to overfitting if not enough data or proper regularization is used.
3. Require large amounts of labeled data.
4. Interpretability is limited; it is hard to understand what the network has learned.
Q1. What are the fundamentals of deep
learning?
• A. The fundamentals of deep learning include:
1. Neural Networks: Deep learning relies on artificial neural networks, which are composed of interconnected
layers of artificial neurons.
2. Deep Layers: Deep learning models have multiple hidden layers, enabling them to learn hierarchical
representations of data.
3. Training with Backpropagation: Deep learning models are trained using backpropagation, which adjusts the
model’s weights based on the error calculated during forward and backward passes.
4. Activation Functions: Activation functions introduce non-linearity into the network, allowing it to learn
complex patterns.
5. Large Datasets: Deep learning models require large labeled datasets to effectively learn and generalize from
the data.
Q2. What are the fundamentals of neural networks?
• A. The fundamentals of neural networks include:
1. Neurons: Neural networks consist of interconnected artificial neurons that mimic the behavior of biological
neurons.
2. Weights and Biases: Neurons have associated weights and biases that determine the strength of their
connections and their activation thresholds.
3. Activation Function: Each neuron applies an activation function to its input, introducing non-linearity and
enabling complex computations.
4. Layers: Neurons are organized into layers, including input, hidden, and output layers, which process and
transform data.
5. Backpropagation: Neural networks are trained using backpropagation, adjusting weights based on error
gradients to improve performance.