
Fully Convolutional Neural Network

Fully convolutional networks (FCNs) are end-to-end neural networks that take input of any size and produce pixelwise outputs for tasks like semantic segmentation. FCNs modify classification networks to be fully convolutional by converting fully connected layers to convolutional layers. This allows processing entire images with variable sizes efficiently in one forward pass. FCNs achieve state-of-the-art results on semantic segmentation benchmarks while being much faster than prior methods. Code and pre-trained models for FCNs are available on various datasets.


Fully Convolutional Networks

Jon Long and Evan Shelhamer


CVPR15 Caffe Tutorial
pixels in, pixels out
- monocular depth estimation (Liu et al. 2015)
- semantic segmentation
- boundary prediction (Xie & Tu 2015)

end-to-end learning, inference in < 1/5 second
a classification network

“tabby cat”

becoming fully convolutional
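The conversion is exact: a fully connected layer over a fixed-size feature map computes the same numbers as a convolution whose kernels are the reshaped FC weights, and once expressed as convolution the net accepts any input size. A minimal NumPy sketch of this equivalence (toy sizes and variable names are ours, not the authors' code):

```python
import numpy as np

# A fully connected layer over a C x H x W feature map is equivalent to
# an H x W convolution whose kernels are the reshaped FC weights.
C, H, W, K = 256, 7, 7, 10          # toy sizes; real nets use e.g. 4096-d fc layers
rng = np.random.default_rng(0)
fc_weights = rng.standard_normal((K, C * H * W))
feat = rng.standard_normal((C, H, W))

# FC layer: flatten the feature map, then matrix-multiply.
fc_out = fc_weights @ feat.ravel()

# Equivalent convolution: reshape weights to K kernels of shape C x H x W
# and correlate each kernel with the (same-sized) feature map.
kernels = fc_weights.reshape(K, C, H, W)
conv_out = np.array([(k * feat).sum() for k in kernels])

assert np.allclose(fc_out, conv_out)
```

On a larger input, the same kernels simply slide, producing a spatial map of class scores instead of a single vector.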
upsampling output

end-to-end, pixels-to-pixels network
- conv, pool, nonlinearity
- upsampling
- pixelwise output + loss
spectrum of deep features
combine where (local, shallow) with what (global, deep)

fuse features into deep jet

(cf. Hariharan et al. CVPR15 “hypercolumn”)


skip layers

skip to fuse layers: interp + sum

end-to-end, joint learning of semantics and location

dense output by skip layer refinement
[figure: input image; stride 32 (no skips), stride 16 (1 skip), and stride 8 (2 skips) predictions; ground truth]
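A single interp + sum fusion step can be sketched as below, with toy score maps and nearest-neighbor upsampling standing in for the learned, bilinear-initialized deconvolution the actual networks use:

```python
import numpy as np

def upsample2x(scores):
    """Nearest-neighbor 2x upsampling; real FCNs use a learned
    (bilinear-initialized) deconvolution instead."""
    return scores.repeat(2, axis=-2).repeat(2, axis=-1)

coarse = np.zeros((21, 8, 8))    # stride-32 class scores (toy sizes, 21 classes)
fine = np.ones((21, 16, 16))     # stride-16 class scores from a shallower layer

# Skip fusion: upsample the coarse prediction to the finer stride and sum.
fused = upsample2x(coarse) + fine
assert fused.shape == (21, 16, 16)
```

Repeating this once more against stride-8 features gives the 2-skip model in the figure.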


training + testing
- train on whole images at a time, without patch sampling
- reshape the network to take input of any size
- forward time is ~150 ms for a 500 × 500 × 21 output
[figure: FCN vs. SDS* predictions, with ground truth and input]

Relative to prior state-of-the-art SDS:
- 20% improvement in mean IoU
- 286× faster

*Simultaneous Detection and Segmentation, Hariharan et al. ECCV14
models + code
fully convolutional networks are fast, end-to-end models for pixelwise problems

- code in Caffe branch (merged soon): caffe.berkeleyvision.org
- models for PASCAL VOC, NYUDv2, SIFT Flow, and PASCAL-Context in the Model Zoo

fcn.berkeleyvision.org
github.com/BVLC/caffe
models
- PASCAL VOC: standard object segmentation
- NYUDv2: multi-modal RGB + depth scene segmentation
- SIFT Flow: multi-task semantic + geometric segmentation
- PASCAL-Context: object + scene segmentation
inference

inference script (gist)


solving

solving script (gist)


Reshape
- Decide shape on-the-fly in C++ / Python / MATLAB
- DataLayer automatically reshapes for batch size == 1
- Essentially free (only reallocates when necessary)
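As a toy illustration of why on-the-fly reshape is essentially free, here is a minimal buffer class (our own sketch, not Caffe's actual Blob implementation) that only reallocates its backing store when the requested size grows:

```python
import numpy as np

class Blob:
    """Toy blob: reshape changes the logical shape, and memory is
    reallocated only when the new element count exceeds capacity."""
    def __init__(self):
        self._buf = np.empty(0, dtype=np.float32)
        self.shape = (0,)

    def reshape(self, *shape):
        n = int(np.prod(shape))
        if n > self._buf.size:            # reallocate only on growth
            self._buf = np.empty(n, dtype=np.float32)
        self.shape = shape

    @property
    def data(self):
        return self._buf[:int(np.prod(self.shape))].reshape(self.shape)

b = Blob()
b.reshape(1, 3, 500, 500)                 # first call allocates
big = b._buf
b.reshape(1, 3, 250, 250)                 # shrinking: no reallocation
assert b._buf is big
```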
Helpful Layers
- Losses can take spatial predictions + truths
- Deconvolution / “backward convolution” can compute interpolation
- Crop: maps coordinates between layers
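Using Deconvolution for interpolation hinges on initializing its kernel with bilinear weights. A common sketch of such a kernel generator (the function name is ours):

```python
import numpy as np

def bilinear_kernel(size):
    """2D bilinear upsampling kernel of shape (size, size), suitable for
    initializing a deconvolution filter (kernel 2f, stride f gives f-x
    upsampling)."""
    factor = (size + 1) // 2
    center = factor - 1 if size % 2 == 1 else factor - 0.5
    og = np.ogrid[:size, :size]
    return ((1 - abs(og[0] - center) / factor) *
            (1 - abs(og[1] - center) / factor))

k = bilinear_kernel(4)           # kernel size 4, stride 2 → 2x upsampling
assert k.shape == (4, 4)
assert np.isclose(k.sum(), 4.0)  # a 2x bilinear kernel sums to factor**2
```

Each class channel gets its own copy of this kernel; left fixed it performs plain bilinear interpolation, and it can also be learned end-to-end.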
FCN for Pose Estimation
Georgia Gkioxari
UC Berkeley

Input data: image, keypoints, labels
- Define an area around each keypoint as its positive neighborhood, with radius r.
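The positive-neighborhood labeling can be sketched as follows (a hypothetical helper of our own, assuming pixels within radius r of a keypoint are positive for that keypoint's class):

```python
import numpy as np

def keypoint_labels(h, w, keypoints, r):
    """keypoints: list of (x, y) pixel coordinates.
    Returns (len(keypoints), h, w) binary label maps, one per keypoint:
    a pixel is positive iff it lies within radius r of that keypoint."""
    ys, xs = np.mgrid[:h, :w]
    return np.stack([((xs - x) ** 2 + (ys - y) ** 2 <= r ** 2).astype(np.float32)
                     for x, y in keypoints])

# radius = 0.1 * im.shape[0], as in the slide's Details section
labels = keypoint_labels(100, 100, [(50, 40)], r=0.1 * 100)
assert labels.shape == (1, 100, 100)
assert labels[0, 40, 50] == 1.0   # the keypoint itself is positive
```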
Heat Map Predictions from FCN
[figure: test image with heat maps for Right Ankle, Right Knee, Right Hip, Right Wrist, Right Elbow, Right Shoulder]

Two modes in the shoulder heat map because there are two right shoulders in the image!
Heat Maps to Keypoints
PCK @ 0.2 on the LSP test set (FCN baseline mean PCK ≈ 69%; state of the art ≈ 72%)

- Ankle 56.5
- Knee 60.0
- Hip 56.6
- Wrist 62.9
- Elbow 71.8
- Shoulder 78.8
- Head 93.6
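The PCK metric can be computed as below. This is a hedged sketch: the exact normalization scale (torso vs. whole-person size) is a benchmark convention not stated on the slide, so it is left as a parameter.

```python
import numpy as np

def pck(pred, gt, scale, alpha=0.2):
    """Percentage of Correct Keypoints: a prediction counts as correct
    if it lies within alpha * scale of the ground-truth keypoint.
    pred, gt: (N, 2) arrays of (x, y); returns the fraction correct."""
    dist = np.linalg.norm(pred - gt, axis=1)
    return float((dist <= alpha * scale).mean())

gt = np.array([[10.0, 10.0], [50.0, 50.0]])
pred = np.array([[12.0, 10.0], [80.0, 50.0]])
assert pck(pred, gt, scale=100.0) == 0.5   # first within 20 px, second not
```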
Details
Architecture:
● FCN, stride 32; no data augmentation
● radius = 0.1 * im.shape[0] (no cross-validation)

Runtime on a K40:
● 0.7 sec/iteration for training (15 hrs for 80K iterations)
● 0.25 sec/image for inference over all keypoints
conclusion
fully convolutional networks are fast, end-to-end models for pixelwise problems

- code in Caffe branch (merged soon): caffe.berkeleyvision.org
- models for PASCAL VOC, NYUDv2, SIFT Flow, and PASCAL-Context in the Model Zoo

fcn.berkeleyvision.org
github.com/BVLC/caffe
