9 CNN
9 CNN
(CNN)—Deep Learning
Dr. Ouiem Bchir
https://medium.com/
Convolutional Neural Network (CNN)—
Deep Learning
https://medium.com/
https://medium.com/
https://medium.com/
Figure 5: 3 x 3 Output matrix
Convolution of an image with different filters can perform
operations such as edge detection, blur and sharpen by applying
filters.
https://medium.com/
https://medium.com/
Figure 6 : Stride of 2 pixels
Padding
https://medium.com/
https://medium.com/
Pooling Layer
https://medium.com/
https://medium.com/
https://medium.com/
CNN Architectures—LeNet, AlexNet, VGG,
GoogLeNet and ResNet
https://medium.com/
ImageNet
https://medium.com/
https://medium.com/
https://medium.com/
Figure 3 : LeNet-5 Architecture
https://medium.com/
https://medium.com/
https://medium.com/
GoogLeNet
https://medium.com/
https://medium.com/
ResNet (2015)
In Residual network, it directly copy the input matrix to
the second transformation output and sum the output in
final ReLU function.
https://medium.com/
Hyperparameters
https://medium.com/
Hyperparameters
The learning rate hyperparameter controls the amount of
modification of the CNN model as a result of an estimated
error when the weights are updated. A too small value yields a
long training time, while a too large value yields missing the
optimal weights and learning sub-optimal ones.
https://medium.com/
Hyperparameters
https://medium.com/
Limitation
https://medium.com/
Limitation
The major reason why you cannot proceed with this
problem by building a standard convolutional network
followed by a fully connected layer is that, the length of
the output layer is variable — not constant, this is
because the number of occurrences of the objects of
interest is not fixed.
https://medium.com/
Limitation
The problem with this approach is that the objects of
interest might have different spatial locations within the
image and different aspect ratios.
https://medium.com/