
CII4Q3 - Computer Vision-EAR - Week-11-Intro To Deep Learning v1.0

The document provides an overview of deep learning and artificial neural networks. It discusses key concepts like artificial neurons, activation functions, feedforward and backpropagation in neural networks. It also covers limitations of early neural networks and how deep learning addresses them by learning multiple levels of representation. Additionally, it introduces convolutional neural networks, their basic operations like convolution and pooling, and components like dropout and batch normalization. Finally, it discusses the evolution of deep learning models from AlexNet to ResNet and SE Net.



CII4Q3 COMPUTER VISION

Introduction to Deep Learning


ARTIFICIAL NEURAL NETWORK

14
ARTIFICIAL NEURAL NETWORK

• Network of interconnected neurons.


• Each neuron is a mathematical function
• High number of layers in neural networks → deep learning

15
ARTIFICIAL NEURAL NETWORK

• Fundamental operation that occurs within each neuron → a linear function: a weighted sum of the inputs plus a bias
• The result of the linear function is then passed through an activation function
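Below is a minimal sketch of this linear-plus-activation computation for a single neuron; the input, weight, and bias values are illustrative placeholders, not taken from the slides:

```python
import numpy as np

def neuron(x, w, b, activation):
    """One artificial neuron: weighted sum of inputs plus bias,
    passed through an activation function."""
    z = np.dot(w, x) + b          # linear function
    return activation(z)          # non-linear activation

# Illustrative values (placeholders)
x = np.array([0.5, -1.2, 3.0])    # inputs
w = np.array([0.4, 0.1, -0.7])    # weights
b = 0.2                           # bias
out = neuron(x, w, b, lambda z: max(0.0, z))   # ReLU activation
```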
16
ACTIVATION FUNCTION

• Perceptron (step function)

• Sigmoid

• ReLU (Rectified Linear Units)

17
NEURAL NETWORK: A NEURON

• A neuron is a computational unit in the neural network; neurons exchange messages with each other.

Possible activation functions:

• Step function / threshold function
• Sigmoid function (a.k.a. logistic function)
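A minimal sketch of these activation functions, using their standard definitions:

```python
import numpy as np

def step(z, threshold=0.0):
    """Step/threshold function, as used by the classic perceptron."""
    return np.where(z >= threshold, 1.0, 0.0)

def sigmoid(z):
    """Sigmoid (logistic) function: squashes z into the range (0, 1)."""
    return 1.0 / (1.0 + np.exp(-z))

def relu(z):
    """ReLU (Rectified Linear Unit): zero for negative inputs, identity otherwise."""
    return np.maximum(0.0, z)
```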

18
FEED FORWARD/BACKPROPAGATION
NEURAL NETWORK

Feed-forward algorithm:

• Activate the neurons from the bottom (input) to the top (output).

Backpropagation:
• Randomly initialize the parameters
• Calculate the total error at the top (output) of the network
• Then calculate the contributions to the error, 𝛿𝑛, at each step going backwards (see the sketch below).
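A minimal sketch of one feed-forward pass and one backpropagation step for a tiny two-layer network with sigmoid activations and squared-error loss; the layer sizes, learning rate, and training example are illustrative placeholders:

```python
import numpy as np

rng = np.random.default_rng(0)

# Randomly initialize the parameters
W1, b1 = rng.normal(size=(4, 3)), np.zeros(4)   # hidden layer
W2, b2 = rng.normal(size=(1, 4)), np.zeros(1)   # output layer

sigmoid = lambda z: 1.0 / (1.0 + np.exp(-z))

x, y = np.array([0.5, -1.0, 2.0]), np.array([1.0])   # one training example (placeholder)
lr = 0.1

# Feed forward: activate neurons from the bottom (input) to the top (output)
h = sigmoid(W1 @ x + b1)
y_hat = sigmoid(W2 @ h + b2)

# Total error at the top, then error contributions (deltas) going backwards
delta_out = (y_hat - y) * y_hat * (1 - y_hat)    # output-layer delta
delta_hid = (W2.T @ delta_out) * h * (1 - h)     # hidden-layer deltas

# Gradient-descent update of each layer's parameters
W2 -= lr * np.outer(delta_out, h);  b2 -= lr * delta_out
W1 -= lr * np.outer(delta_hid, x);  b1 -= lr * delta_hid
```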

19
LIMITATIONS OF NEURAL NETWORKS

Random initialization + densely connected networks lead to:

• High cost
• Each neuron in the neural network can be viewed as a logistic regression.
• Training the entire neural network amounts to training all of the interconnected logistic regressions.
• Difficulty training as the number of hidden layers increases
• Recall that logistic regression is trained by gradient descent.
• In backpropagation, the gradient becomes progressively more diluted; below the top layers, the correction signal 𝛿𝑛 is minimal.
• Getting stuck in local optima
• The objective function of a neural network is usually not convex.
• Random initialization does not guarantee starting in the proximity of the global optimum.
→ Solution:
• Deep learning: learning multiple levels of representation

20
LET’S GO INTO THE MATH....

21
COMPONENTS OF MACHINE LEARNING

• Learning algorithm
• Initialized with a set of default parameters 𝜃1 to 𝜃𝑛
• The data
• Iterate over the dataset; at each row, the attributes 𝑋1 to 𝑋𝑛 are fed into the learning algorithm → it outputs a prediction of the target variable based on the current set of parameters
• Loss function
• Used to compute how close our prediction is to the actual value of the target as contained in our dataset
• Aggregated across all examples
• Optimization algorithm → gradient descent
• Updates the parameters of the learning algorithm in a direction that reduces the aggregated loss (see the sketch after this list)
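A schematic sketch tying the four components together, using a simple linear model; the data arrays, learning rate, and iteration count are placeholders:

```python
import numpy as np

# The data: attributes X and target y (placeholder values)
X = np.array([[1.0, 2.0], [2.0, 0.5], [3.0, 1.5]])
y = np.array([5.0, 4.0, 7.5])

# Learning algorithm: a linear model with default parameters theta
theta = np.zeros(X.shape[1])

def predict(X, theta):
    return X @ theta

def loss(pred, y):
    """Loss function: mean squared error, aggregated across all examples."""
    return np.mean((pred - y) ** 2)

# Optimization algorithm: gradient descent
lr = 0.01
for _ in range(1000):
    pred = predict(X, theta)
    grad = 2 * X.T @ (pred - y) / len(y)   # gradient of the aggregated loss
    theta -= lr * grad                     # update in the loss-reducing direction
```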

22
STRUCTURE OF DATA

• Each individual piece of data → variable-value pairs


• Variables → features
• Variables
• Continuous: real-valued numbers (price, age, length, area, temperature, etc.)
• Categorical: discrete variables that cannot be expressed as real-valued numbers (gender, race, color, state, etc.)

23
LOSS FUNCTION: REGRESSION

• Given a set of parameters, a loss function helps us to evaluate how well our learning algorithm is
performing on the training data using our current parameters.

• Prediction, given parameters 𝜃1, 𝜃2, and 𝜃3

• Simple loss function

• Average loss over all examples: MSE (Mean Squared Error)
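For reference, the per-example squared-error loss and its average (the MSE) take the standard textbook form below; the slide's own equations are only available as images, so this is the generic definition rather than a transcription:

```latex
L_i = \left(\hat{y}_i - y_i\right)^2, \qquad
\mathrm{MSE} = \frac{1}{N}\sum_{i=1}^{N}\left(\hat{y}_i - y_i\right)^2
```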

24
LOSS FUNCTION: CLASSIFICATION
• Classification → return scores for all the classes available in our dataset.
• Softmax: take these scores and return probabilities between 0 and 1
• Given a set of scores (S)

Example:
P: probability vector
e: base of the natural logarithm ≈ 2.71828
𝑆𝑖: score of each class
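A minimal softmax sketch, using the standard formula P_i = e^{S_i} / Σ_j e^{S_j}; the three class scores below are illustrative, not the slide's example:

```python
import numpy as np

def softmax(scores):
    """Convert raw class scores S into probabilities between 0 and 1."""
    exp_s = np.exp(scores - scores.max())   # subtract max for numerical stability
    return exp_s / exp_s.sum()

scores = np.array([2.0, 1.0, 0.1])   # placeholder scores for three classes
probs = softmax(scores)              # ~[0.659, 0.242, 0.099], sums to 1
```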

25
LOSS FUNCTION: CLASSIFICATION

Softmax cross entropy loss (negative log likelihood loss)

• Softmax cross entropy loss → the negative of the log of the softmax score of the correct class, summed over examples

j: the index of the correct class

• Example: scores → softmax → softmax cross entropy loss

• The loss is very low when we are making the right prediction
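A minimal sketch of this loss; the scores are placeholders, and the printed values only illustrate that a correct prediction gives a low loss and a wrong one a high loss:

```python
import numpy as np

def softmax(scores):
    exp_s = np.exp(scores - scores.max())
    return exp_s / exp_s.sum()

def cross_entropy_loss(scores, j):
    """Negative log of the softmax probability of the correct class j."""
    return -np.log(softmax(scores)[j])

scores = np.array([2.0, 1.0, 0.1])         # placeholder scores
print(cross_entropy_loss(scores, j=0))     # correct class has the top score -> low loss (~0.42)
print(cross_entropy_loss(scores, j=2))     # correct class has a low score  -> high loss (~2.32)
```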

26
LOSS FUNCTION: PLUS REGULARIZATION

• Prevents overfitting
• Regularization → based on the observation that models usually overfit when the values of the parameters are too large
• Parameter sets with large values tend to result in low loss on the training set but fail to yield correspondingly good scores on the test set
• Penalizing large weights

• Weight decay: controls the strength of the regularizer
• L2 regularizer
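A minimal sketch of an L2-regularized loss; the parameter vector, data loss, and weight-decay value are placeholders:

```python
import numpy as np

def l2_regularized_loss(data_loss, theta, weight_decay=1e-4):
    """Total loss = data loss + weight_decay * sum of squared parameters.
    weight_decay (lambda) controls the strength of the regularizer."""
    return data_loss + weight_decay * np.sum(theta ** 2)

theta = np.array([3.0, -2.5, 0.7])          # placeholder parameters
total = l2_regularized_loss(0.42, theta)    # placeholder data loss
```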
27
OPTIMIZATION: GRADIENT DESCENT

• Finding the right set of parameters → finding parameters that yield the lowest error on the training set.

• Gradient of the loss with respect to the parameters: usually calculated through backpropagation

• Learning rate
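One gradient-descent update step, written out; the parameter and gradient values are placeholders standing in for the quantities described above:

```python
import numpy as np

theta = np.array([0.5, -0.3, 1.2])     # current parameters (placeholder)
grad = np.array([0.1, -0.2, 0.05])     # dL/dtheta from backpropagation (placeholder)
learning_rate = 0.01

theta = theta - learning_rate * grad   # move against the gradient to reduce the loss
```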
28
STOCHASTIC GRADIENT DESCENT:
MINIBATCH GRADIENT DESCENT

• Operates over one batch of the dataset at a time

• Common batch sizes: 32, 64, 128

• Other modifications of ordinary gradient descent: Gradient Descent with Momentum, Adagrad, AdaDelta, RMSProp, AdamOptimizer
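A minimal minibatch iterator sketch; the batch size is one of the common values above, and `compute_gradient` in the commented usage is a hypothetical helper, not a function defined in the slides:

```python
import numpy as np

def minibatches(X, y, batch_size=32):
    """Yield shuffled minibatches of the dataset, one batch at a time."""
    idx = np.random.permutation(len(X))
    for start in range(0, len(X), batch_size):
        batch = idx[start:start + batch_size]
        yield X[batch], y[batch]

# Usage sketch:
# for X_batch, y_batch in minibatches(X, y, batch_size=64):
#     grad = compute_gradient(X_batch, y_batch, theta)   # hypothetical helper
#     theta -= learning_rate * grad
```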

29
CONVOLUTIONAL NEURAL NETWORK

CNN: feed-forward networks; locally connected.

Works by detecting specific feature patterns across the entire image.

30
CONVOLUTIONAL NEURAL NETWORK

31
BASIC OPERATION

The more feature detectors a CNN has, the better it can classify images.

Feature detector in a CNN: kernel or filter

Each filter → detects a specific pattern

Common filter sizes: 5×5, 3×3, 2×2, 1×1

CNN → locally connected

At each dot product (convolution), a unit is only connected to a local region of the image
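A naive sketch of the convolution operation described above (valid convolution, stride 1); the input image is random placeholder data and the filter is a simple illustrative vertical-edge detector:

```python
import numpy as np

def conv2d(image, kernel):
    """Slide the filter over the image and take a dot product with each
    local region (local connectivity), producing a feature map."""
    H, W = image.shape
    kH, kW = kernel.shape
    out = np.zeros((H - kH + 1, W - kW + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            out[i, j] = np.sum(image[i:i + kH, j:j + kW] * kernel)
    return out

image = np.random.rand(6, 6)                                     # placeholder image
edge_filter = np.array([[1, 0, -1], [1, 0, -1], [1, 0, -1]])     # vertical-edge detector
feature_map = conv2d(image, edge_filter)                         # shape (4, 4)
```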

32
Stride, Padding

Size of output feature map
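The slide's own figure is an image, so as a reference here is the standard formula for the output feature-map size, given input size W, filter size F, padding P, and stride S:

```python
def output_size(W, F, P, S):
    """Output width/height = (W - F + 2P) / S + 1 (floored when it does not divide evenly)."""
    return (W - F + 2 * P) // S + 1

output_size(W=32, F=5, P=0, S=1)   # -> 28
output_size(W=32, F=3, P=1, S=2)   # -> 16
```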


33
POOLING

• Reduces the dimensions of the image

• Helps make CNNs invariant to the exact position of features in the image, by picking the most important feature in a given pooling region

Max pooling
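A minimal max-pooling sketch; the pooling size and stride of 2 are the common defaults, not values given on the slide:

```python
import numpy as np

def max_pool(feature_map, size=2, stride=2):
    """Keep only the largest activation in each pooling region,
    halving the spatial dimensions when size = stride = 2."""
    H, W = feature_map.shape
    out_h, out_w = (H - size) // stride + 1, (W - size) // stride + 1
    out = np.zeros((out_h, out_w))
    for i in range(out_h):
        for j in range(out_w):
            region = feature_map[i * stride:i * stride + size,
                                 j * stride:j * stride + size]
            out[i, j] = region.max()
    return out

pooled = max_pool(np.random.rand(8, 8))   # shape (4, 4)
```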

34
35
COMPONENTS OF CNN

• Dropout: reduces overfitting

• Switches off some activations by setting them to zero (a minimal sketch follows this list)

• Batch Normalization (Ioffe & Szegedy, 2015)

• Addresses the problem of vanishing gradients

• Normalizes each batch of feature maps to have zero mean (and unit variance)

• Data Augmentation

• Randomly apply flipping, shifting, rotation, scaling, or whitening to our images
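A minimal inverted-dropout sketch; the keep/drop probability of 0.5 matches the value mentioned later for AlexNet, and the scaling of the surviving activations is the standard trick rather than something stated on the slide:

```python
import numpy as np

def dropout(activations, p=0.5, training=True):
    """Randomly switch off activations with probability p by setting them to zero;
    scale the survivors by 1/(1-p) so the expected activation stays the same."""
    if not training:
        return activations
    mask = (np.random.rand(*activations.shape) >= p) / (1.0 - p)
    return activations * mask
```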

36
THE DEEP LEARNING REVOLUTION

• Deep ConvNets for Object Recognition


• Semantic Segmentation
• Object Detection

37
ARCHITECTURE

• AlexNet (2012)
• VGGNet (2014)
• Inception
• Residual Networks
• Evolution of ResNet
• SE NET

42
ALEXNET
• Winner of the ILSVRC (ImageNet Large Scale Visual Recognition Competition) 2012
• It was the first time a Convolutional Neural Network significantly outperformed other methods on a large dataset (ImageNet 2012) by a large margin.
• AlexNet was composed of five convolutional layers followed by three fully connected (dense) layers.
• Their most important contribution was the training process:
• They used data augmentation to artificially increase the training dataset.
• cuda-convnet: an incredibly efficient implementation of the convolution operation that effectively parallelized the training process across two GPUs. In those days, there were no deep learning libraries.

43
• 8 layers

• ReLU is introduced

• Overlapping pooling: stride is smaller than the kernel size

• Data augmentation: image translation and mirroring, altering the intensity using PCA

• Dropout: probability of 0.5

• Not used anymore → batch normalization


44
VGGNET

• Invented by the Visual Geometry Group

• Runner-up of the ILSVRC (ImageNet Large Scale Visual Recognition Competition) 2014
• The first year in which deep learning models achieved an error rate under 10%
• Using smaller filter sizes → fewer parameters

45
REFERENCES

• Introduction to Deep Computer Vision, 2018, John Olafenwa & Moses Olafenwa
• https://medium.com/@dataturks/deep-learning-and-computer-vision-from-basic-implementation-to-efficient-methods-3ca994d50e90
• Deep Learning, 2016, Ian Goodfellow, Yoshua Bengio, & Aaron Courville, MIT Press
• Deep Learning, NYU, https://atcold.github.io/pytorch-Deep-Learning/

50
