PNAL9 CNNs
M. Ali Akcayol
Gazi University
Department of Computer Engineering
Content
Convolutional neural networks
Structure of the CNNs
Convolution
Stride and padding
Pooling
Fully connected layer
Softmax
Hyperparameters
Applications
2
Convolutional neural networks
A convolutional neural network (CNN) is a special type of artificial neural network.
CNNs are deep learning architectures that are widely used, especially for image problems.
Like classical neural networks, a CNN consists of neurons with learnable weights and biases.
Each neuron takes inputs, combines them, and produces an output, usually through a non-linear function.
CNN architectures assume that the inputs are images, which allows image-specific properties to be encoded into the architecture.
3
Convolutional neural networks
Neurons in CNNs are arranged in three dimensions.
5
Structure of the CNNs
CNN uses convolution and pooling operators.
A CNN has three basic types of layers:
Convolutional layer
Pooling layer
Fully-connected layer
Multiple convolution + pooling stages can be applied consecutively.
These are followed by one or more fully connected layers.
In multi-class classification problems, there is a softmax layer at the output.
6
Structure of the CNNs
The fully-connected layer flattens the three-dimensional input into one dimension and produces the class scores.
The softmax layer calculates a probability distribution over the output classes.
7
Structure of the CNNs
Example
The CIFAR-10* dataset has 60,000 32x32 color images in 10 classes (6,000 images per class).
It is typically split into 50,000 images for training and 10,000 for testing.
*CIFAR-100 (Canadian Institute For Advanced Research) has 100 classes with 600 32x32 images per class (60,000 images in total). 8
Structure of the CNNs
Example
[Input-Conv-ReLU-Pool-FC] layers can be used for the CIFAR-10
dataset.
The input layer takes 32x32x3 (red, green, blue) image pixels.
The convolution layer computes dot products between the selected filter and local regions of the input.
If 12 different filters are used, the output of the convolution layer is 32x32x12 (each filter spans all three color channels and produces one feature map).
The ReLU (Rectified Linear Unit) layer applies the max(0, x) activation function element-wise and produces a 32x32x12 output.
9
Structure of the CNNs
Example
The pool layer performs a downsampling operation and the
output size can be, for example, 16x16x12.
The fully connected layer computes the class scores as a 1x1x10 output.
More successful results can be obtained by stacking different numbers of CONV + ReLU + POOL layers consecutively, depending on the problem.
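A minimal sketch of this Input-Conv-ReLU-Pool-FC pipeline in PyTorch (the framework, the 3x3 filter size, and the padding choice are assumptions; the slides fix only the 32x32x3 input, the 12 filters, and the 10 output classes):

```python
import torch
import torch.nn as nn

model = nn.Sequential(
    # Conv: 3 input channels (RGB), 12 filters; padding keeps the 32x32 size
    nn.Conv2d(in_channels=3, out_channels=12, kernel_size=3, padding=1),
    nn.ReLU(),                      # element-wise max(0, x) -> 32x32x12
    nn.MaxPool2d(kernel_size=2),    # downsampling -> 16x16x12
    nn.Flatten(),                   # 16 * 16 * 12 = 3072 values
    nn.Linear(16 * 16 * 12, 10),    # fully connected -> 10 class scores
)

x = torch.randn(1, 3, 32, 32)       # one CIFAR-10-sized RGB image
print(model(x).shape)               # torch.Size([1, 10])
```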
10
Structure of the CNNs
Example
An example application for the CIFAR-10 dataset can be found at http://cs231n.stanford.edu/
11
Content
Convolutional neural networks
Structure of the CNNs
Convolution
Stride and padding
Pooling
Fully connected layer
Softmax
Hyperparameters
Applications
12
Convolution
The main building block of a CNN is the convolution layer.
Convolution is a mathematical operation that combines two functions (here, the input and the filter) to produce a third.
A convolution filter (kernel) is applied to the input to create a feature map.
13
Convolution
In the example, the input is 5x5 and the filter is 3x3.
The convolution is performed by sliding the filter over the input matrix.
At each position, the element-wise products of the filter and the overlapping input values are summed to produce one element of the feature map.
In the figure, convolution is performed in 2D with a 3x3 filter.
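A small NumPy sketch of this sliding-window convolution on a 5x5 input with a 3x3 filter (the values are made up for illustration; with no padding and stride 1 the feature map is 3x3):

```python
import numpy as np

x = np.arange(25).reshape(5, 5)          # 5x5 input matrix
w = np.array([[1, 0, 1],
              [0, 1, 0],
              [1, 0, 1]])                # 3x3 filter (kernel)

feature_map = np.zeros((3, 3))
for i in range(3):
    for j in range(3):
        # element-wise product of the filter and the overlapping region, summed
        feature_map[i, j] = np.sum(x[i:i+3, j:j+3] * w)

print(feature_map)
```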
14
Convolution
In real applications the image is represented in 3D (height, width and depth).
Depth corresponds to the color channels of the image.
For RGB, the depth is taken as 3.
Different convolution operations with different filters can be
performed on one input.
The output feature map of each filter is different.
By stacking all of the feature maps, the overall output feature map is obtained.
15
Convolution
In the figure, a 32x32x3 image and a 5x5x3 filter are used.
At each position, the element-wise products over the three 5x5x1 channel slices are summed into a single 1x1x1 value.
With padding, the resulting feature map is 32x32x1.
If 10 different filters are used, the output of the convolution layer is 32x32x10.
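A sketch of these numbers in PyTorch (an assumed framework): a 32x32x3 image convolved with 10 filters of size 5x5x3, using zero padding of 2 so the spatial size stays 32x32, as the slide implies.

```python
import torch
import torch.nn as nn

# 10 filters of size 5x5 over 3 input channels; padding=2 keeps 32x32
conv = nn.Conv2d(in_channels=3, out_channels=10, kernel_size=5, padding=2)
image = torch.randn(1, 3, 32, 32)   # one 32x32 RGB image (batch size 1)
print(conv(image).shape)            # torch.Size([1, 10, 32, 32])
```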
16
Convolution
The feature map is obtained by sliding the filter over the entire input matrix.
17
Convolution
The result of the convolution operator is given as an input to
the activation function.
The activation function is chosen depending on the problem.
18
Content
Convolutional neural networks
Structure of the CNNs
Convolution
Stride and padding
Pooling
Fully connected layer
Softmax
Hyperparameters
Applications
19
Stride and padding
Stride determines the movement size of the convolution filter at
each step (default = 1).
As the movement step size increases, the size of the feature
map to be obtained becomes smaller.
20
Stride and padding
Padding is used to obtain a feature map of the same size as the input.
Cells with a value of 0 are added around the border of the input matrix as padding.
21
Stride and padding
Example: Input = 5x5x3, Padding = 1, Stride = 2
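The spatial output size of a convolution follows the standard formula O = (W - F + 2P) / S + 1. A small sketch for this slide's numbers (the 3x3 filter size is an assumption, since the slide does not state it):

```python
def conv_output_size(input_size, filter_size, padding, stride):
    # O = (W - F + 2P) / S + 1
    return (input_size - filter_size + 2 * padding) // stride + 1

# Slide example: 5x5 input, padding = 1, stride = 2, assumed 3x3 filter
print(conv_output_size(5, 3, 1, 2))  # -> 3, i.e. a 3x3 feature map per filter
```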
22
Content
Convolutional neural networks
Structure of the CNNs
Convolution
Stride and padding
Pooling
Fully connected layer
Softmax
Hyperparameters
Applications
23
Pooling
Pooling is applied after the convolution process and performs
dimension reduction.
The pooling layer downsamples by reducing the height and width of the feature map (the depth remains the same).
Max pooling is the most widely used method.
Window size and stride values are specified depending on the
problem.
24
Pooling
Typically, the window size and stride are chosen so that each spatial dimension of the input feature map is halved (e.g., a 2x2 window with stride 2).
After such pooling, the height and width of the feature map are reduced by half.
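A minimal NumPy sketch of this common setting, 2x2 max pooling with stride 2 (the 4x4 feature map values are made up for illustration):

```python
import numpy as np

fmap = np.array([[1, 3, 2, 4],
                 [5, 6, 1, 2],
                 [7, 2, 9, 0],
                 [1, 4, 3, 8]], dtype=float)        # 4x4 feature map

# Group into non-overlapping 2x2 blocks and take the max of each block
pooled = fmap.reshape(2, 2, 2, 2).max(axis=(1, 3))  # -> 2x2
print(pooled)   # [[6. 4.]
                #  [7. 9.]]
```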
25
Content
Convolutional neural networks
Structure of the CNNs
Convolution
Stride and padding
Pooling
Fully connected layer
Softmax
Hyperparameters
Applications
26
Fully connected layer
After the pooling layer, a fully connected ANN is placed.
Pooling layer output is taken in 3D and reduced to 1D at the
fully connected ANN
ANN obtaines a 1D output vector which is size equals to number
of classes.
27
Content
Convolutional neural networks
Structure of the CNNs
Convolution
Stride and padding
Pooling
Fully connected layer
Softmax
Hyperparameters
Applications
28
Softmax
The softmax function is used in classification problems.
The softmax layer calculates a probability distribution over the output classes.
29
Softmax
Softmax gives the probability distribution over the classes to which the input may belong.
30
Softmax
Usually, the number of output neurons equals the number of class labels.
The output label with the highest probability is assigned to the given input image.
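A small NumPy sketch of the softmax computation: each output score z_i is mapped to exp(z_i) / sum_j exp(z_j), so the outputs form a probability distribution over the classes (the score values below are hypothetical):

```python
import numpy as np

def softmax(z):
    e = np.exp(z - np.max(z))   # subtract the max for numerical stability
    return e / e.sum()

scores = np.array([2.0, 1.0, 0.1])   # hypothetical output-layer scores
probs = softmax(scores)
print(probs, probs.sum())            # probabilities sum to 1
print(np.argmax(probs))              # index of the predicted class (0 here)
```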
31
Content
Convolutional neural networks
Structure of the CNNs
Convolution
Stride and padding
Pooling
Fully connected layer
Softmax
Hyperparameters
Applications
32
Hyperparameters
Hyper parameters are not learned directly, but determine the
properties of the model.
The following hyper parameters are used in CNN:
Filter size: Usually 3x3 is used, but may be larger depending
on the problem.
Number of filters: The more filters are used, the more
powerful the model is obtained. However, a large number of
parameters increase the risk of overfitting.
Stride: Usually 1 is chosen for stride, but a different value
can be chosen depending on the problem.
Padding: Usually taken as padding 1, but may not be used
depending on the problem.
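A sketch of how these hyperparameters map onto a convolution layer in PyTorch (the framework and the specific values are assumptions, not prescribed by the slides):

```python
import torch.nn as nn

conv = nn.Conv2d(
    in_channels=3,     # depth of the input (RGB)
    out_channels=32,   # number of filters (example value)
    kernel_size=3,     # filter size: 3x3
    stride=1,          # stride
    padding=1,         # padding: keeps the spatial size with a 3x3 filter
)
```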
33
Content
Convolutional neural networks
Structure of the CNNs
Convolution
Stride and padding
Pooling
Fully connected layer
Softmax
Hyperparameters
Applications
34
Applications
CNN is a successfully applied model for image related
problems.
CNN has been successfully implemented in recommendation
systems, NLP and many other areas.
CNN automatically detects important features in the input
data.
CNN model can classify images better and faster than
human.
CNN model can identify objects very fast and with high
accuracy.
35
Applications
Image Classification
Image classification involves assigning a label to an entire
image or photograph.
This problem is also referred to as “object classification” or
“image recognition”.
Some examples of image classification include:
Labeling an x-ray as cancer or not (binary classification).
Classifying a handwritten digit (multiclass classification).
Assigning a name to a photograph of a face (multiclass
classification).
36
Applications
Image Classification
A popular example of image classification used as a benchmark
problem is the MNIST dataset.
37
Applications
Image Classification
A popular real-world version of classifying photos of digits is the Street View House Numbers (SVHN) dataset.
38
Applications
Image Classification
There are many image classification tasks that involve
photographs of objects.
Two popular examples include the CIFAR-10 and CIFAR-100
datasets.
The Large Scale Visual Recognition Challenge is an annual competition in which teams compete for the best performance using the ImageNet database.
There have been significant achievements in image
recognition/classification applications.
39
Applications
Image Classification With Localization
Image classification with localization involves assigning a class
label and showing the location of the object by a bounding box.
This is a more challenging version of image classification.
Some examples of image classification with localization include:
Labeling an x-ray as cancer or not and drawing a box around
the cancerous region.
Classifying photographs of animals and drawing a box around
the animal in each scene.
A classical dataset for image classification with localization is the
PASCAL Visual Object Classes dataset.
40
Applications
Image Classification With Localization
This task may sometimes be referred to as “object detection.”
The ILSVRC2016 dataset for image classification with localization comprises 150,000 photographs across 1,000 object categories.
41
Applications
Object Detection
Object detection extends image classification with localization to images that may contain multiple objects.
This is a more challenging task than simple image classification
or image classification with localization.
Often, techniques developed for image classification with
localization are used and demonstrated for object detection.
Some examples of object detection include:
Drawing a bounding box and labeling each object in a street
scene.
Drawing a bounding box and labeling each object in an indoor
photograph.
Drawing a bounding box and labeling each object in a
landscape.
42
Applications
Object Detection
The PASCAL Visual Object Classes dataset is a common dataset
for object detection.
Another is Microsoft's Common Objects in Context (COCO) dataset.
43
Applications
Image Colorization
Image colorization involves converting a grayscale image to a
full color image.
This task can be thought of as a type of photo filter or
transform that may not have an objective evaluation.
Examples include colorizing old black and white photographs
and movies.
Datasets often involve using existing photo datasets and
creating grayscale versions of photos.
44
Applications
Image Colorization
Image colorization is used especially for historical photographs and other old grayscale photos.
45
Applications
Image Reconstruction
Image reconstruction is the task of filling in missing or corrupt
parts of an image.
This task can be thought of as a type of photo filter or
transform that may not have an objective evaluation.
Examples include reconstructing old, damaged black and white
photographs and movies.
Datasets often involve using existing photo datasets and
creating corrupted versions of photos.
The models must learn to perform the repair from pairs of original photos and their corrupted versions.
46
Applications
Image Reconstruction
Image reconstruction and image inpainting are the tasks of filling in missing or corrupt parts of an image.
47
Applications
Image Super-Resolution
Image super-resolution is the task of generating a new version
of an image with a higher resolution and detail than the original
image.
Often models developed for image restoration and inpainting
can be used for image super-resolution.
Datasets often involve using existing photo datasets and creating down-scaled versions of the photos.
The CNN models must learn to create super-resolution versions using the training dataset.
48
Applications
Image Super-Resolution
Image super-resolution generates a new version of the input image with higher resolution and more detail than the original.
49
Applications
Image Synthesis
Image synthesis is the task of generating targeted modifications
of existing images or entirely new images.
This is a very broad area that is rapidly advancing.
It may include small modifications of images and video (e.g., image-to-image translation), such as:
Changing the style of an object in a scene.
Adding an object to a scene.
Adding a face to a scene.
50
Applications
Image Synthesis
In the figure, an image of zebras has been modified into an image of horses.
The patterns and colors of a horse are transferred onto the zebras.
51
Applications
Image Synthesis
It may also include generating entirely new images, such as:
Generating faces.
Generating bathrooms.
Generating clothes.
52
Applications
Recognition of multiple objects
53
Applications
Recognition of multiple overlapping objects
54
Applications
Real time object recognition (CNN)
https://www.youtube.com/watch?v=WZmSMkK9VuA
55
Applications
Real time object recognition (CNN)
https://youtu.be/70Kv8Rr72ag
56
Applications
Image colorization (CNN)
https://youtu.be/ys5nMO4Q0iY
57
Applications
Self-driving car
https://youtu.be/hLaEV72elj0
58
Applications
Robotics
https://youtu.be/tf7IEVTDjng
59
Applications
Robotics
https://www.youtube.com/watch?v=kgaO45SyaO4
60
Homework
61