0% found this document useful (0 votes)

119 views

Introduction To Deep Learning: Internet of Things Group

This document provides an introduction to deep learning and neural networks. It discusses how neural networks are used to solve computer vision problems and are powering technologies like Tesla's autopilot system. It also covers the history of neural networks, how they are trained using gradient descent and backpropagation, and common neural network layers like convolutional layers, fully connected layers, and activation layers.

Uploaded by

Charles Nicollas

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

119 views

Introduction To Deep Learning: Internet of Things Group

Uploaded by

Charles Nicollas

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 50

Introduction to Deep Learning

Anna Petrovicheva
IOTG Computer Vision

Internet of Things Group 1

Agenda

1. Neural Networks overview

2. Math engine
3. Neural Network layers
4. Solving Computer Vision problems
5. How to train a network

Internet of Things Group 2

Deep Learning systems in real world

Image credit: DeepMind, Prisma, Yayvo, Google Translate, Redmond Pie, TechRepublic, Brit

Internet of Things Group *Other names and brands may be claimed as the property of others 3
Tesla autopilot

Image credit: Autopilot Full Self Driving Demonstration Nov 18 2016 Realtime Speed

Internet of Things Group *Other names and brands may be claimed as the property of others 4
Brief history
● 1965: first idea
● AI winter
● 1998: LeNet-5
● 2000’s: “The biggest issue of this paper, is that it relies on neural networks”
● 2012: groundbreaking results in ImageNet contest
○ Old algorithms
○ Big dataset
○ Compute power

● 2012-now: wide adoption

Internet of Things Group 5

Artificial Neural Network
parameter

neuron
w1
v1

input w2
output
v2 vnew
w3 dog

layer

Internet of Things Group 6

Training
Start: parameters are random

cat

Goal: find good parameters W = (w1, w2, … , wm)

Internet of Things Group 7

Finding parameters
prediction
error
● W = (w1, w2, … , wm) - point in
multidimensional space
○ Modern nets: 10s - 100s million
parameters

● Use W in network → get

corresponding prediction error
w1
○ Wstart: high prediction error
○ Woptimal: low prediction error
Wstart
w2 Woptimal ● Goal: get from Wstart to Woptimal

Internet of Things Group 8

Gradient descent
prediction
error

W1 = Wstart + α * F’(Wstart)
α - learning rate
Too small: long training
Too large: training diverges
w1

W1 Wstart
w2 Woptimal

Internet of Things Group 9

Gradient descent
prediction
error

W1 = Wstart + α * F’(Wstart)
W2 = W1 + α * F’(W1)
W3 = W2 + α * F’(W2)

w1 W4 = W3 + α * F’(W3)
W5 = W4 + α * F’(W4)
Wstart
W1
w2 Woptimal

Internet of Things Group 10

Non-convex task
prediction
error ● May stuck in local minima
● Solution depends on initial
point
State-of-the-art opinion:
● Local minima are not
w1 biggest problem
● “Like person driving a car
in a really confusing city”
Woptimal Wlocal
w2

Internet of Things Group 11

Stochastic gradient descent
Gradient descent:
▪ Take all data points (= all dataset)
▪ Compute parameter derivative in all points
▪ Make a step in this direction

Dataset is too big

▪ Too much time to compute
▪ Does not fit in operating memory

Stochastic Gradient Descent:

▪ Use random subset of data (new each iteration)

Internet of Things Group 12

Backpropagation algorithm
Forward pass w
1
w ● Cost function estimates
2
w prediction error
cat
3
● Layers compute derivative with
respect to parameters
● Parameter derivative is sent to
Stochastic Gradient Descent
Backward pass ● SGD outputs parameter update
w’1
for the next iteration
w’2
● Next iteration - new
SGD error e
parameters, new data from
w’3
dataset

Parameter
update ΔW

Internet of Things Group 13

Neural network layers

Internet of Things Group

Convolutional layer
1 0 1 ● Local connectivity

0 1 0 ● Convolves channels too

1 0 1 ● Each convolutional layer has many
different filters
● Each filter detects specific feature
○ Borders, colors

● General data transform tool

● Can have bias b

Image credit: Visualizing Neural Networks In Virtual Space

Internet of Things Group 15

Convolutional layer

Can represent any image operation

Goal: find suitable parameters

Takes 95% computations in network
Image credit: OpenCV documentation

Internet of Things Group 16

Convolutional layer filters

AlexNet 1st convolution filters

● Detect lines
● Detect color patterns
Further layers:
● Growing level of abstraction
○ “Face neuron”

Image credit: CS231n: Convolutional Neural Networks for Visual Recognition

Internet of Things Group 17

Fully connected layer

v1 w11 b1
w21
fc1

● 95 % of parameters in network
● “Classic” layer
● Usually used before the final
bm
classificator
wnm fcm

Internet of Things Group 18

Activation layer

● Applied after all convolution and fully

connected layers
● Analogous to biological neuron
mechanism
○ Neuron firing rate

Internet of Things Group 19

Activation layers
● Original idea: Heaviside step function Heaviside step
function
○ Fire / not fire
○ Non-differentiable -> cannot use
backpropagation

● Approximation: sigmoid / tanh tanh

sigmoid
○ Approximate step function
○ Differentiable
○ Saturate and kill gradients
● Used almost everywhere: Rectified Linear Unit ReLU
○ Accelerates convergence in training
○ Does not saturate

Internet of Things Group 20

Pooling layer
● Types:
○ Average pooling
0 -1 0 2 ○ Max pooling
max
1 1 -1 1 pooling 1 2
● Reduces data dimensionality
1 0 3 0 2 3
○ Less parameters
-1 2 0 1 ○ Less computations
○ Controls overfitting

Internet of Things Group 21

Typical feed-forward neural network

No cycles VGG16 topology

Activation after each convolution / FC

Pooling after several convolution blocks

Image credit: Feature Evaluation of Deep Convolutional Neural Networks for Object Recognition and Detection

Internet of Things Group 22

Solving Computer Vision with Deep
Learning

Internet of Things Group

Image classification

classification dog cat bird

backbone
head 0.7 0.2 0.1

● Predicts category of image

● Backbone extracts features

● Classification head outputs probabilities of each category

Internet of Things Group 24

Softmax layer + cross-entropy loss

Softmax layer Cross-entropy loss

label dog cat bird

ground truth 1 0 0 Cross-entropy loss

algorithm 1 0.2 0.6 0.2 - ((ln(0.2) * 1) + (ln(0.6) * 0) + (ln(0.2) * 0)) = 1.6

algorithm 2 0.5 0.4 0.1 - ((ln(0.5) * 1) + (ln(0.4) * 0) + (ln(0.1) * 0)) = 0.69

algorithm 3 0.8 0.1 0.1 - ((ln(0.8) * 1) + (ln(0.1) * 0) + (ln(0.1) * 0)) = 0.22

Internet of Things Group 25

ImageNet

Greatest driver of Deep Learning and

image classification
1 million images
1000 classes
▪ 120 dog breeds

ImageNet 2017 is the last one

Internet of Things Group 26

● Before 2012:
non-Deep
Learning
methods
● 2012: AlexNet
● 2014: VGG,
GoogLeNet
● 2015: ResNet

Internet of Things Group 27

ResNet topology
Won ImageNet 2015 image classification contest
Key advantage: residual connection
▪ Better convergence in parameter space

Outperformes human accuracy in image classification

▪ Andrej Karpathy blog

ResNet-like topologies are state-of-the-art

▪ Top accuracy in many Computer Vision tasks

Very deep
▪ 50 / 101 / 152 -convolution modifications
Image credit: Deep Residual Learning for Image Recognition

Internet of Things Group 28

Typical Deep Learning algorithm for Computer
Vision
Requirement: big datasets for the task exist
Typical solution

task-specific
input backbone output
layers

Backbone: AlexNet, VGG, GoogLeNet, ResNet and other

▪ Without softmax head
▪ Extracts representative features
▪ Pretrained on ImageNet

Internet of Things Group 29

Object detection

detection elephant

backbone
head tree
tree

VGG Faster R-CNN

Inception R-FCN

ResNet SSD
Image credit: Savanna

Internet of Things Group 30

Object detection

Image credit: YOLO v2

Internet of Things Group 31
Semantic segmentation

● Generate mask of objects of each

class on image
○ Road
○ Pedestrian
○ ...

● Each pixel classification

● Datasets
○ General case
○ Road scenarios

Image credit: DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs

Internet of Things Group 32

Semantic segmentation

Image credit: Feature Space Optimization for Semantic Video Segmentation - CityScapes Demo 02

Internet of Things Group 33

Instance segmentation
Mask for each object + category
of object
▪ Semantic segmentation

▪ Object detection

State-of-the-art: Mask-R-CNN

Internet of Things Group 34

Generative Adversarial Networks
● Generator network generates
sample
● Discriminator network tries to
distinguish real samples from
generated
○ Bank-counterfeiter task

● Trained GAN:
○ Good generator of new objects
○ Good estimator of object quality

● Any task can be interpreted as GAN

Image credit: Stability of Generative Adversarial Networks

Internet of Things Group 35

GAN for image generation

September 2016 March 2017

Image credit: BEGAN: Boundary Equilibrium Generative Adversarial Networks

Internet of Things Group 36

GAN for image generation

Image credit: BEGAN: Boundary Equilibrium Generative Adversarial Networks

Internet of Things Group 37

GAN for image generation from caption

Image credit: StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generative Adversarial Networks

Internet of Things Group 38

GAN for Super Resolution

Original image Bicubic interpolation SRGAN

Image credit: Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network

Internet of Things Group 39

GAN for image to image translation

Image credit: Image-to-Image Translation with Conditional Adversarial Nets

Internet of Things Group 40

GAN for image to image translation

Image credit: CycleGAN

Internet of Things Group 41

How to train a network

Internet of Things Group

Understand state-of-the-art

● Google Scholar, Arxiv papers

● Datasets, benchmarks
● Existing implementations, open repositories

Internet of Things Group 43

Prepare dataset
● Neural Networks demand big datasets
○ ImageNet: 1.4 million images
○ MS COCO: 300 thousand images
● Data augmentation
○ Cropping
○ Flipping
○ Brightness / contrast

Internet of Things Group 44

Prepare dataset
Small amount of real-life data: add train-val split

overfitting generalization

Train Train-Val Validation Test

high error high error high error high error

Bigger model More data Get more data similar

Train longer More regularization to test More validation data
Other architecture Other architecture Other architecture

Andrew Ng. Nuts and Bolts of Applying Deep Learning

Internet of Things Group 45
Iterative experiments
● Overfit 1 sample
● Put all results in table
● Variability:
▪ Backbone
▪ Task-specific layers and loss
▪ Data augmentation
▪ Optimization parameters
– Learning rate value and policy
– Regularization

Image credit: Speed/accuracy trade-offs for modern convolutional object detectors

Internet of Things Group 46

Accuracy evaluation

● Compare with state-of-the-art

● Analyze accuracy dynamics while training
1.0 1.0

train
0.9 0.9
val
0.8 0.8

accuracy
accuracy

0.7 0.7

0.6 0.6

0.5 0.5

iterations iterations

Typical good training Overfitting

Internet of Things Group 47

Choose accuracy metric

● Single accuracy metric

○ Comparable results

Example:

Accuracy Performance
● Accuracy: optimizing metric
Model 1 98 % 2 seconds
● Time: satisficing metric
Model 2 93 % 0.5 second

Internet of Things Group 48

General tips
● Neural Networks can solve vision problems human can solve in 1 second
● Open source repositories do not work out of the box
● Find your way to learn about new DL research

Papers submitted to Arxiv categories cs.AI, cs.LG, cs.CV, cs.CL, cs.NE, stat.ML over time

Image credit: Andrej Karpathy’s blog @ Medium

Internet of Things Group 49

Internet of Things Group

AI Powered Decision Making in Banks
100% (2)
AI Powered Decision Making in Banks
17 pages
FT04_Haghighat_Independent_2023
No ratings yet
FT04_Haghighat_Independent_2023
40 pages
DL_Unit3_1 (1)
No ratings yet
DL_Unit3_1 (1)
67 pages
anthony
No ratings yet
anthony
33 pages
2017 MSSC Verhelst eDNNP-1
No ratings yet
2017 MSSC Verhelst eDNNP-1
11 pages
Convolutional Neural Networks in Python _ DataCamp
No ratings yet
Convolutional Neural Networks in Python _ DataCamp
22 pages
An Overview of Convolutional Neural Network Architectures For Deep Learning
No ratings yet
An Overview of Convolutional Neural Network Architectures For Deep Learning
22 pages
Aidl 2023s DL 08 CNN Architectures
No ratings yet
Aidl 2023s DL 08 CNN Architectures
51 pages
7 CNN
No ratings yet
7 CNN
66 pages
Deep Residual Learning
No ratings yet
Deep Residual Learning
80 pages
2015WS HS SpikingVision
No ratings yet
2015WS HS SpikingVision
23 pages
7 Applications of Convolutional Neural Networks - FWS
No ratings yet
7 Applications of Convolutional Neural Networks - FWS
3 pages
Unit 5a - Machine Vision
No ratings yet
Unit 5a - Machine Vision
55 pages
Deep Learning Hardware
No ratings yet
Deep Learning Hardware
82 pages
Deep Learning For Iot: Tausif Diwan, Jitendra V. Tembhurne, Tapan Kumar Jain, and Pooja Jain
No ratings yet
Deep Learning For Iot: Tausif Diwan, Jitendra V. Tembhurne, Tapan Kumar Jain, and Pooja Jain
17 pages
Intro CNN PDF
No ratings yet
Intro CNN PDF
31 pages
CNN
No ratings yet
CNN
31 pages
Lec14-CNNRNNModels
No ratings yet
Lec14-CNNRNNModels
64 pages
FT04 Haghighat Intel 2022
No ratings yet
FT04 Haghighat Intel 2022
30 pages
Convolutional Neural Networks Notes
No ratings yet
Convolutional Neural Networks Notes
29 pages
W11 Lecture ITS69204 Image Recognition (1)
No ratings yet
W11 Lecture ITS69204 Image Recognition (1)
44 pages
CNN 2
No ratings yet
CNN 2
47 pages
DL Inference FPGA Class1
No ratings yet
DL Inference FPGA Class1
56 pages
Chapter 5 Deep Learning
No ratings yet
Chapter 5 Deep Learning
35 pages
Ch10 Deep Learning
No ratings yet
Ch10 Deep Learning
104 pages
CV Ss16 0609 Deep Learning
No ratings yet
CV Ss16 0609 Deep Learning
91 pages
CV Mot
No ratings yet
CV Mot
69 pages
CNN Eem305
100% (1)
CNN Eem305
7 pages
4b Image Processing
No ratings yet
4b Image Processing
63 pages
Module 05
No ratings yet
Module 05
10 pages
Introduction+to+Neural+Networks+ +Lecture+Slides+Part+1
No ratings yet
Introduction+to+Neural+Networks+ +Lecture+Slides+Part+1
36 pages
(IJCST-V10I5P12) :mrs J Sarada, P Priya Bharathi
No ratings yet
(IJCST-V10I5P12) :mrs J Sarada, P Priya Bharathi
6 pages
Ch-3 Convolutional Neural Networks (CNNs)
No ratings yet
Ch-3 Convolutional Neural Networks (CNNs)
11 pages
Hot Chips Overview
No ratings yet
Hot Chips Overview
47 pages
CNNs 1697477106
No ratings yet
CNNs 1697477106
42 pages
Deep Convolutional Neural Networks For Image Classification: Many Slides From Rob Fergus (NYU and Facebook)
No ratings yet
Deep Convolutional Neural Networks For Image Classification: Many Slides From Rob Fergus (NYU and Facebook)
55 pages
Tutorial On DNN 1 of 9 Background of DNNs
No ratings yet
Tutorial On DNN 1 of 9 Background of DNNs
65 pages
CERN Deep Learning and Vision
No ratings yet
CERN Deep Learning and Vision
72 pages
Alexnet Tugce Kyunghee
No ratings yet
Alexnet Tugce Kyunghee
35 pages
BMM 2018 - Deep Learning Tutorial
No ratings yet
BMM 2018 - Deep Learning Tutorial
47 pages
Convolutional Neural Networks: CS 535 Deep Learning, Winter 2020 Fuxin Li
No ratings yet
Convolutional Neural Networks: CS 535 Deep Learning, Winter 2020 Fuxin Li
44 pages
Cv Ppt Mt101
No ratings yet
Cv Ppt Mt101
16 pages
10. Image Processing With Deep Learning
No ratings yet
10. Image Processing With Deep Learning
39 pages
Presented By, Shobha C.Hiremath (01FE17MCS019)
No ratings yet
Presented By, Shobha C.Hiremath (01FE17MCS019)
25 pages
Literature Review On Image Classification Architecture
No ratings yet
Literature Review On Image Classification Architecture
14 pages
Comprehensive Notes on Advanced CNN Concepts & Vision Tasks
No ratings yet
Comprehensive Notes on Advanced CNN Concepts & Vision Tasks
5 pages
Seminar Report cnn1
No ratings yet
Seminar Report cnn1
23 pages
APznzaZp-lBWxNLLbeHcgbcoyZ_3DYbxlSM3oatGJxvuM32Ge0YSP4A4GRRh3dZikfZziyGN-oBjW2j9zLTwT48zDwGzPKihuA0VwchMHE...Lhqna8IM6dHYiHSkpaJoZeHLFyHNb7VMugavUyXsIhrWs3tttRZITDc1OxHMZG9Bk22SGDwI5j3XBrN3BdtLgo5Tvi0ES8ANurYUFF9oS_4V93oKKeKsnvGQ==
No ratings yet
APznzaZp-lBWxNLLbeHcgbcoyZ_3DYbxlSM3oatGJxvuM32Ge0YSP4A4GRRh3dZikfZziyGN-oBjW2j9zLTwT48zDwGzPKihuA0VwchMHE...Lhqna8IM6dHYiHSkpaJoZeHLFyHNb7VMugavUyXsIhrWs3tttRZITDc1OxHMZG9Bk22SGDwI5j3XBrN3BdtLgo5Tvi0ES8ANurYUFF9oS_4V93oKKeKsnvGQ==
58 pages
Deep Learning Notes (1) 2
No ratings yet
Deep Learning Notes (1) 2
54 pages
Super VIP Cheatsheet - Deep Learning
No ratings yet
Super VIP Cheatsheet - Deep Learning
47 pages
MLPS2021 03 CNN-RNN-LSTM
No ratings yet
MLPS2021 03 CNN-RNN-LSTM
71 pages
sdl unit 2 3 4
No ratings yet
sdl unit 2 3 4
12 pages
DL 4
No ratings yet
DL 4
5 pages
Deep Learning Unit2
No ratings yet
Deep Learning Unit2
43 pages
4a Convolutional Neural Networks
No ratings yet
4a Convolutional Neural Networks
56 pages
Guddu jha_organized
No ratings yet
Guddu jha_organized
3 pages
Final
No ratings yet
Final
30 pages
L10 - Intro - To - Deep - Learning
No ratings yet
L10 - Intro - To - Deep - Learning
75 pages
C2 W1 Merged
No ratings yet
C2 W1 Merged
286 pages
Deeplearning - Ai Deeplearning - Ai
No ratings yet
Deeplearning - Ai Deeplearning - Ai
78 pages
CompTIA Network+: Untangling Ethernet, Herding Packets, and Conquering Connectivity Chaos
From Everand
CompTIA Network+: Untangling Ethernet, Herding Packets, and Conquering Connectivity Chaos
Scott Markham
No ratings yet
Liangqu Long, Xiangming Zeng Beginning Deep Learning With TensorFlow
100% (1)
Liangqu Long, Xiangming Zeng Beginning Deep Learning With TensorFlow
727 pages
Medical Heart Disease
No ratings yet
Medical Heart Disease
8 pages
PTDLKT
No ratings yet
PTDLKT
11 pages
Final Exam Review: Nishant Mehta
No ratings yet
Final Exam Review: Nishant Mehta
32 pages
NN-BNU2
No ratings yet
NN-BNU2
47 pages
Data Science and Machine Learning For Non-Programmers - Using SAS Enterprise Miner-Chapman and Hall - CRC (2024)
No ratings yet
Data Science and Machine Learning For Non-Programmers - Using SAS Enterprise Miner-Chapman and Hall - CRC (2024)
590 pages
Syllabus - CS 231N PDF
No ratings yet
Syllabus - CS 231N PDF
1 page
Heart Disease Prediction (Review-1)
No ratings yet
Heart Disease Prediction (Review-1)
10 pages
Tensorflow and Keras Apis: 0.1 Computer Vision: Neural Networks and Deep Learning
No ratings yet
Tensorflow and Keras Apis: 0.1 Computer Vision: Neural Networks and Deep Learning
32 pages
DBSCAN
No ratings yet
DBSCAN
3 pages
Session Catalog - Automation Fair 2023
No ratings yet
Session Catalog - Automation Fair 2023
4 pages
Bias Variance Trade Off
No ratings yet
Bias Variance Trade Off
14 pages
behavsci-14-00677
No ratings yet
behavsci-14-00677
28 pages
Feature Engineering
No ratings yet
Feature Engineering
9 pages
Aplicación de La Inteligencia Artificial en La Industria Alimentaria - Una Guía
No ratings yet
Aplicación de La Inteligencia Artificial en La Industria Alimentaria - Una Guía
42 pages
Effectiveness of Normalization Pre-Processing of Big Data To The Machine Learning Performance
No ratings yet
Effectiveness of Normalization Pre-Processing of Big Data To The Machine Learning Performance
6 pages
MmAP Multi-Modal Alignment Prompt For Cross-Domain Multi-Task Learning
No ratings yet
MmAP Multi-Modal Alignment Prompt For Cross-Domain Multi-Task Learning
9 pages
Cbse - Department of Skill Education Curriculum For Session 2024-2025
No ratings yet
Cbse - Department of Skill Education Curriculum For Session 2024-2025
12 pages
FRAC_Draft2.1
No ratings yet
FRAC_Draft2.1
87 pages
Data Science Interview Questions
100% (1)
Data Science Interview Questions
300 pages
Practical Machine Learning With R - Tutorials and Case - Carsten Lange - 2024 - CRC Press LLC - 9781003367147 - Anna's Archive
No ratings yet
Practical Machine Learning With R - Tutorials and Case - Carsten Lange - 2024 - CRC Press LLC - 9781003367147 - Anna's Archive
369 pages
A Comparative Study of Supervised Machine Learning Algorithms For Stock Market Trend Prediction
No ratings yet
A Comparative Study of Supervised Machine Learning Algorithms For Stock Market Trend Prediction
5 pages
Reliability and Statistical Computing Modeling Methods and Applications 1st Edition Hoang Pham (Editor) - The ebook in PDF/DOCX format is available for instant download
100% (1)
Reliability and Statistical Computing Modeling Methods and Applications 1st Edition Hoang Pham (Editor) - The ebook in PDF/DOCX format is available for instant download
57 pages
CAPSTONE THESIS Format
No ratings yet
CAPSTONE THESIS Format
29 pages
Glossary of Artificial Intelligence
No ratings yet
Glossary of Artificial Intelligence
62 pages
Artificial Intelligence Financial Services
No ratings yet
Artificial Intelligence Financial Services
27 pages
Lesson 02 - AI in Finance
No ratings yet
Lesson 02 - AI in Finance
51 pages
Implementation of Deep Neural Network Using VLSI B
No ratings yet
Implementation of Deep Neural Network Using VLSI B
8 pages
2023 Banglalp-1 2
No ratings yet
2023 Banglalp-1 2
11 pages