Lect11 Neural Nets2

Computer Vision

Neural Network:

(Before) Linear score function: f = Wx

(Now) 2-layer Neural Network: f = W2 max(0, W1 x)

[Figure: x (3072) --W1--> h (100) --W2--> s (10)]
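A minimal NumPy sketch of the two score functions above, with the dimensions from the slide (3072-dim input, 100 hidden units, 10 class scores); the random weights are placeholders:

    import numpy as np

    x = np.random.randn(3072)          # flattened 32x32x3 input image
    W = np.random.randn(10, 3072)      # (before) single linear layer
    W1 = np.random.randn(100, 3072)    # (now) first layer of the 2-layer net
    W2 = np.random.randn(10, 100)      # second layer

    s_linear = W.dot(x)                # f = Wx
    h = np.maximum(0, W1.dot(x))       # hidden layer h with ReLU nonlinearity
    s = W2.dot(h)                      # f = W2 max(0, W1 x): 10 class scores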

2
3
Classification

4
Preview [From recent Yann LeCun slides]

5
ImageNet
(slide from Kaiming He’s recent presentation)
7
Working with CNNs in practice:
• Data augmentation
• Transfer learning
• Autoencoders

8
Data Augmentation

9
Classification

10
Data Augmentation

[Figure: load image and label ("cat") -> CNN -> compute loss]

11
Data Augmentation

[Figure: load image and label ("cat") -> transform image -> CNN -> compute loss]
12
Data Augmentation

- Change the pixels without changing the label
- Train on transformed data
- VERY widely used

[Figure: the image vs. what the computer sees]

13
Data Augmentation
1. Horizontal flips
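A sketch of a random horizontal flip in NumPy (image as an H x W x 3 array); flipping with probability 0.5 is the usual convention and an assumption here, not stated on the slide:

    import numpy as np

    def random_horizontal_flip(img, p=0.5):
        # Flip left-right with probability p; the label stays the same.
        if np.random.rand() < p:
            return img[:, ::-1, :]
        return img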

14
Data Augmentation
2. Random crops/scales
Training: sample random crops / scales

15
Data Augmentation
2. Random crops/scales
Training: sample random crops / scales
ResNet:
1. Pick random L in range [256, 480]
2. Resize training image, short side = L
3. Sample random 224 x 224 patch

16
Data Augmentation
2. Random crops/scales
Training: sample random crops / scales
ResNet:
1. Pick random L in range [256, 480]
2. Resize training image, short side = L
3. Sample random 224 x 224 patch

Testing: average a fixed set of crops

17
Data Augmentation
2. Random crops/scales
Training: sample random crops / scales
ResNet:
1. Pick random L in range [256, 480]
2. Resize training image, short side = L
3. Sample random 224 x 224 patch

Testing: average a fixed set of crops


ResNet:
1. Resize image at 5 scales: {224, 256, 384, 480, 640}
2. For each size, use 10 224 x 224 crops: 4 corners + center, + flips
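A sketch of the ResNet-style training crop listed above, assuming Pillow and NumPy; the function name and library choice are mine, not from the slide:

    import numpy as np
    from PIL import Image

    def resnet_train_crop(img):
        # 1. Pick random L in [256, 480]
        L = np.random.randint(256, 481)
        # 2. Resize so the short side equals L
        w, h = img.size
        scale = L / min(w, h)
        img = img.resize((int(round(w * scale)), int(round(h * scale))), Image.BILINEAR)
        # 3. Sample a random 224 x 224 patch
        w, h = img.size
        left = np.random.randint(0, w - 224 + 1)
        top = np.random.randint(0, h - 224 + 1)
        return img.crop((left, top, left + 224, top + 224))

At test time the same idea runs deterministically: resize to each of the listed scales, take the ten fixed 224 x 224 crops per scale, and average the resulting class scores.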
18
Data Augmentation
3. Color jitter
Simple:
Randomly jitter contrast

19
Data Augmentation
3. Color jitter

Simple:
Randomly jitter contrast

Complex:
1. Apply PCA to all [R, G, B] pixels in training set
2. Sample a "color offset" along principal component directions
3. Add offset to all pixels of a training image

(As seen in [Krizhevsky et al. 2012], ResNet, etc)
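A NumPy sketch of the PCA color jitter described above; the 0.1 standard deviation for the sampled offsets follows Krizhevsky et al. and is an assumption here, and images are assumed to be float arrays in [0, 1]:

    import numpy as np

    def fit_color_pca(images):
        # images: N x H x W x 3. PCA over all [R, G, B] pixels in the training set.
        pixels = images.reshape(-1, 3)
        cov = np.cov(pixels, rowvar=False)        # 3x3 covariance of the color channels
        eigvals, eigvecs = np.linalg.eigh(cov)    # principal component directions
        return eigvals, eigvecs

    def pca_color_jitter(img, eigvals, eigvecs, sigma=0.1):
        # Sample a "color offset" along the principal directions and add it to all pixels.
        alphas = np.random.normal(0.0, sigma, size=3)
        offset = eigvecs @ (alphas * eigvals)
        return np.clip(img + offset, 0.0, 1.0)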
20
Data Augmentation
4. Get creative!

Random mix/combinations of:
- translation
- rotation
- stretching
- shearing
- lens distortions, … (go crazy)

21
Data Augmentation: Takeaway

• Simple to implement, use it


• Especially useful for small datasets
• Fits into framework of noise / marginalization

22
Transfer Learning

“You need a lot of data if you want to train/use CNNs”

23
Transfer Learning

“You need a lot of data if you want to train/use CNNs”

24
Transfer Learning with CNNs
1. Train on Imagenet

25
Transfer Learning with CNNs

1. Train on Imagenet
2. Small dataset: feature extractor

[Figure: freeze these (the earlier layers), train this (the new top layer)]
26
Transfer Learning with CNNs

1. Train on Imagenet
2. Small dataset: feature extractor
   [Figure: freeze these (the earlier layers), train this (the new top layer)]
3. Medium dataset: finetuning
   more data = retrain more of the network (or all of it)
   [Figure: freeze these (the earliest layers), train this (a larger portion of the top of the network)]
27
Transfer Learning with CNNs

1. Train on Imagenet
2. Small dataset: feature extractor
   [Figure: freeze these (the earlier layers), train this (the new top layer)]
3. Medium dataset: finetuning
   more data = retrain more of the network (or all of it)
   [Figure: freeze these (the earliest layers), train this (a larger portion of the top of the network)]

   tip: use only ~1/10th of the original learning rate when finetuning the top layer, and ~1/100th on intermediate layers
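A sketch of this recipe in PyTorch/torchvision (not one of the frameworks compared later in the deck); the model, the 10-class output, and the exact learning-rate values are placeholders:

    import torch.nn as nn
    import torch.optim as optim
    import torchvision.models as models

    # 1. Start from a network trained on Imagenet
    # (newer torchvision uses the weights= argument instead of pretrained=True)
    model = models.resnet18(pretrained=True)

    # 2. Small dataset: treat the CNN as a feature extractor
    for p in model.parameters():
        p.requires_grad = False                      # freeze these
    model.fc = nn.Linear(model.fc.in_features, 10)   # new top layer: train this
    optimizer = optim.SGD(model.fc.parameters(), lr=1e-2, momentum=0.9)

    # 3. Medium dataset: finetune more of the network with reduced learning rates
    for p in model.layer4.parameters():
        p.requires_grad = True
    optimizer = optim.SGD([
        {"params": model.fc.parameters(), "lr": 1e-2},      # ~1/10th of the original lr
        {"params": model.layer4.parameters(), "lr": 1e-3},  # ~1/100th on intermediate layers
    ], momentum=0.9)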
28
CNN Features off-the-shelf: an Astounding Baseline for Recognition [Razavian et al., 2014]

DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition [Donahue*, Jia*, et al., 2013]

29
[Figure: a CNN whose early layers are more generic and later layers more specific]

- very little data + very similar dataset: ?
- very little data + very different dataset: ?
- quite a lot of data + very similar dataset: ?
- quite a lot of data + very different dataset: ?

30
[Figure: a CNN whose early layers are more generic and later layers more specific]

- very little data + very similar dataset: Use Linear Classifier on top layer
- very little data + very different dataset: ?
- quite a lot of data + very similar dataset: Finetune a few layers
- quite a lot of data + very different dataset: ?

31
[Figure: a CNN whose early layers are more generic and later layers more specific]

- very little data + very similar dataset: Use Linear Classifier on top layer
- very little data + very different dataset: You're in trouble… Try linear classifier from different stages
- quite a lot of data + very similar dataset: Finetune a few layers
- quite a lot of data + very different dataset: Finetune a larger number of layers

32
Overview

                           Caffe         Torch                          Theano          TensorFlow
Language                   C++, Python   Lua                            Python          Python
Pretrained models          Yes ++        Yes ++                         Yes (Lasagne)   Inception
Multi-GPU: data parallel   Yes           Yes (cunn.DataParallelTable)   Yes (platoon)   Yes
Multi-GPU: model parallel  No            Yes (fbcunn.ModelParallel)     Experimental    Yes (best)
Readable source code       Yes (C++)     Yes (Lua)                      No              No
Good at RNN                No            Mediocre                       Yes             Yes (best)

33
Supervised vs Unsupervised

Supervised Learning
Data: (x, y)
x is data, y is label

Goal: Learn a function to map x -> y

Examples: Classification, regression, object detection, semantic segmentation, image captioning, etc.

34
Supervised vs Unsupervised

Supervised Learning
Data: (x, y)
x is data, y is label
Goal: Learn a function to map x -> y
Examples: Classification, regression, object detection, etc.

Unsupervised Learning
Data: x
Just data, no labels!
Goal: Learn some structure of the data
Examples: Clustering, dimensionality reduction, feature learning, etc.

35
Unsupervised Learning
• Autoencoders
• Traditional: feature learning

36
Autoencoders

[Figure: input data x -> Encoder -> features z]

38
Autoencoders

Encoder:
Originally: Linear + nonlinearity (sigmoid)
Later: Deep, fully-connected
Later: ReLU CNN

[Figure: input data x -> Encoder -> features z]

39
Autoencoders

Encoder:
Originally: Linear + nonlinearity (sigmoid)
Later: Deep, fully-connected
Later: ReLU CNN

z is usually smaller than x (dimensionality reduction)

[Figure: input data x -> Encoder -> features z]

40
Autoencoders

[Figure: input data x -> Encoder -> features z -> Decoder -> reconstructed input data x̂]

41
Autoencoders

Encoder / decoder:
Originally: Linear + nonlinearity (sigmoid)
Later: Deep, fully-connected
Later: ReLU CNN (upconv)

Example: Encoder: 4-layer conv, Decoder: 4-layer upconv

[Figure: input data x -> Encoder -> features z -> Decoder -> reconstructed input data x̂]

42
Autoencoders

Encoder / decoder:
Originally: Linear + nonlinearity (sigmoid)
Later: Deep, fully-connected
Later: ReLU CNN (upconv)

Train for reconstruction with no labels!

Encoder and decoder sometimes share weights.
Example: dim(x) = D, dim(z) = H, we: H x D, wd: D x H = we^T

[Figure: input data x -> Encoder -> features z -> Decoder -> reconstructed input data x̂]

43
Autoencoders

Loss function (often L2), e.g. ||x̂ - x||^2

Train for reconstruction with no labels!

[Figure: input data x -> Encoder -> features z -> Decoder -> reconstructed input data x̂]
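A minimal PyTorch sketch of this setup: a one-hidden-layer encoder/decoder trained with an L2 reconstruction loss and no labels; the dimensions and optimizer settings are placeholders:

    import torch
    import torch.nn as nn

    D, H = 3072, 100                              # dim(x) = D, dim(z) = H

    encoder = nn.Sequential(nn.Linear(D, H), nn.ReLU())
    decoder = nn.Linear(H, D)                     # weight tying (wd = we^T) would instead reuse the encoder weights
    opt = torch.optim.Adam(list(encoder.parameters()) + list(decoder.parameters()), lr=1e-3)

    x = torch.randn(64, D)                        # a batch of unlabeled inputs
    z = encoder(x)                                # features z
    x_hat = decoder(z)                            # reconstructed input
    loss = ((x_hat - x) ** 2).mean()              # L2 reconstruction loss
    opt.zero_grad()
    loss.backward()
    opt.step()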

44
Autoencoders

After training, throw away the decoder!

[Figure: input data x -> Encoder -> features z; the decoder and reconstruction are discarded]

45
Autoencoders

Use the encoder to initialize a supervised model: attach a classifier on top of features z, train for the final task (sometimes with small data) using a loss function (softmax, etc.), and fine-tune the encoder jointly with the classifier.

[Figure: input data x -> Encoder -> features z -> Classifier -> predicted label y (bird, plane, dog, deer, truck, ...)]
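A sketch of that step, continuing in PyTorch: keep the (pretrained) encoder, attach a small classifier on features z, and train both with a softmax / cross-entropy loss; the 10 classes and learning rate are placeholders:

    import torch
    import torch.nn as nn

    D, H = 3072, 100
    encoder = nn.Sequential(nn.Linear(D, H), nn.ReLU())   # in practice: the encoder trained for reconstruction
    classifier = nn.Linear(H, 10)                          # new classifier on top of features z
    opt = torch.optim.Adam(list(encoder.parameters()) + list(classifier.parameters()), lr=1e-4)

    x = torch.randn(64, D)                                 # a (possibly small) labeled batch
    y = torch.randint(0, 10, (64,))
    logits = classifier(encoder(x))
    loss = nn.functional.cross_entropy(logits, y)          # softmax loss on the final task
    opt.zero_grad()
    loss.backward()
    opt.step()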

46
Autoencoders

Autoencoders can reconstruct data, and can learn features to initialize a supervised model.

[Figure: input data x -> Encoder -> features z -> Decoder -> reconstructed input data x̂]

47
