0% found this document useful (0 votes)

113 views

Deep Learning Models

This document summarizes a presentation on deep learning models given by Byoung-Hee Kim at the Biointelligence Lab of Seoul National University in 2012. The presentation covers the history of neural networks, including perceptrons from the 1960s and backpropagation networks from 1985. It discusses recent advances in deep learning since 2006, including unsupervised feature learning techniques like sparse coding. Applications of deep learning to tasks like digit recognition, image classification, audio recognition and motion generation are also summarized.

Uploaded by

neneds

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

113 views

Deep Learning Models

Uploaded by

neneds

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 70

Deep Learning Models

2012-05-03
Byoung-Hee Kim
Biointelligence Lab, CSE,
Seoul National University
NOTE: most slides are from talks of Geoffrey Hinton, Andrew Ng, and Yoshua Bengio.

(C) 2012, SNU Biointelligence Lab, http://bi.snu.ac.kr/

Input

output

target

Two!

(C) 2012, SNU Biointelligence Lab, http://bi.snu.ac.kr/

Artificial Neural Networks

(C) 2012, SNU Biointelligence Lab, http://bi.snu.ac.kr/

Historical background:
First generation neural networks
Perceptrons (~1960)
used a layer of handcoded features and tried
to recognize objects by
learning how to weight
these features.
There was a neat
learning algorithm for
adjusting the weights.
But perceptrons are
fundamentally limited in
what they can learn to
do.

(C) 2012, SNU Biointelligence Lab, http://bi.snu.ac.kr/

Bomb

Toy

output units e.g.

class labels

non-adaptive
hand-coded
features

input units
e.g. pixels

Sketch of a typical
perceptron from the 1960s
10

Second generation neural networks (~1985)

Back-propagate
error signal to
get derivatives
for learning

Compare outputs with

correct answer to get
error signal

outputs

hidden
layers

input vector
(C) 2012, SNU Biointelligence Lab, http://bi.snu.ac.kr/

But, finding any model with deep architecture was not successful till 2006
(C) 2012, SNU Biointelligence Lab, http://bi.snu.ac.kr/

http://www.iro.umontreal.ca/~pift6266/H10/notes/deepintro.html
(C) 2012, SNU Biointelligence Lab, http://bi.snu.ac.kr/

(C) 2012, SNU Biointelligence Lab, http://bi.snu.ac.kr/

Agenda
Computer Perception

Unsupervised feature learning

Various deep learning models

Application cases of deep learning models

Written digit recognition/generation (MNIST dataset)

Image classification
Audio recognition
Language modeling
Motion generation

References
Appendix

(C) 2012, SNU Biointelligence Lab, http://bi.snu.ac.kr/

Brain-like Cognitive Computing & Deep Learning

It is well know that the brain has a
hierarchical structure
Researchers try to build models that
simulate and/or act like the brain
Learning deep structures from data,
or the deep learning is a new frontier
in Artificial Intelligence research

Researchers try to find analogies between the

characteristics of the brain and their deep
models

(C) 2012, SNU Biointelligence Lab, http://bi.snu.ac.kr/

Feature Learning
pixel 1

Learning
algorithm

Input

pixel 2

Input space

Motorbikes
Non-Motorbikes

pixel 1
(C) 2012, SNU Biointelligence Lab, http://bi.snu.ac.kr/

Feature Learning
handle

wheel

Feature
Extractor

Learning
algorithm

Input
Feature space

pixel 2

handle

Input space

Motorbikes
Non-Motorbikes

pixel 1
(C) 2012, SNU Biointelligence Lab, http://bi.snu.ac.kr/

wheel

How is computer perception done?

Object
detection
Image

Low-level
vision features

Recognition

Audio
classification

Audio

Low-level
audio features

Speaker
identification

Helicopter
control
Helicopter
(C) 2012, SNU Biointelligence Lab, http://bi.snu.ac.kr/

Low-level state
features

Action
22

Learning representations

Sensor

Feature
Representation

(C) 2012, SNU Biointelligence Lab, http://bi.snu.ac.kr/

Learning
algorithm

Computer vision features

SIFT

HoG

Textons

(C) 2012, SNU Biointelligence Lab, http://bi.snu.ac.kr/

Spin image

RIFT

GLOH

Audio features

MFCC

Spectrogram

Flux

ZCR

(C) 2012, SNU Biointelligence Lab, http://bi.snu.ac.kr/

Rolloff
25

Problems of hand-tuned features

Needs expert knowledge

Sub-optimal
Time-consuming and expensive
Does not generalize to other domains

Can we automatically learn good feature representations?

Sensor representation in the brain

Seeing with your tongue

Human echolocation (sonar)

Auditory cortex
learns to see.
Auditory
Cortex
(C) 2012, SNU Biointelligence Lab, http://bi.snu.ac.kr/

[BrainPort; Martinez et al; Roe et al.]

Unsupervised Feature Learning

Find a better way to represent images than pixels

The goal of Unsupervised Feature Learning

Unlabeled images
Learning
algorithm

Feature representation
(C) 2012, SNU Biointelligence Lab, http://bi.snu.ac.kr/

(C) 2012, SNU Biointelligence Lab, http://bi.snu.ac.kr/

Stochastic binary units

(Bernoulli variables)
1

These have a state

of 1 or 0.

p(si 1)
The probability of
turning on is
determined by the
weighted input
from other units
(plus a bias)

p( si 1)

0
0

bi s j w ji
j
1

1 exp(bi s j w ji )
j

(C) 2012, SNU Biointelligence Lab, http://bi.snu.ac.kr/

Binary
Stochastic
Neuron

(C) 2012, SNU Biointelligence Lab, http://bi.snu.ac.kr/

A model of digit recognition

The top two layers form an
associative memory whose
energy landscape models the low
dimensional manifolds of the
digits.
The energy valleys have names

2000 top-level neurons

10 label
neurons

The model learns to generate

combinations of labels and images.

To perform recognition we start with a

neutral state of the label units and do
an up-pass from the image followed
by a few iterations of the top-level
associative memory.
(C) 2012, SNU Biointelligence Lab, http://bi.snu.ac.kr/

500 neurons

28 x 28
pixel
image
49

Generation & Recognition of Digits by DBN

Deep belief network that learns to generate
handwritten digits

http://www.cs.toronto.edu/~hinton/digits.html

(C) 2012, SNU Biointelligence Lab, http://bi.snu.ac.kr/

First stage of visual processing in brain: V1

The first stage of visual processing in the brain (V1) does
edge detection.

Schematic of simple cell

Actual simple cell

Gabor functions.
[Images from DeAngelis, Ohzawa & Freeman, 1995]

Sparse coding illustration

Learned bases (f1 , , f64): Edges

Natural Images
50

100

150

200

250

100

300

150
350

200
400

250

300

100

450

500
50

100

150

200

350

250

300

350

400

450
150

500

200

400

250

450
300

500
50

100

150

350
200

250

300

350

100

150

400

450

500

400

450

500
50

200

250

300

350

400

450

500

Test example
0.8 *
x

0.8 *

+ 0.3 *
f36

+ 0.3 *

+ 0.5 *
f42

+ 0.5 *

f63

[0, 0, , 0, 0.8, 0, , 0, 0.3, 0, , 0, 0.5, ]

Compact & easily
= [a1, , a64] (feature representation)
interpretable

Supervised learning

Cars

Testing:
What is this?

Motorcycles

Semi-supervised learning

Unlabeled images (all cars/motorcycles)

Testing:
What is this?

Car

Motorcycle

Self-taught learning

Unlabeled images (random internet images)

Testing:
What is this?

Car

Motorcycle

Self-taught learning

Sparse codin
g, LCC, etc.

f1, f2, , fk

Use learned f1, f2, , fk to represent training/test sets.

Using f1, f2, , fk
Car

Motorcycle

a1, a2, , ak

Convolutional DBN for Images

Convolutional DBN on face images

object models

object parts
(combination
of edges)

edges

pixels

Learning of object parts

Examples of learned object parts from object categories
Faces

Cars

Elephants

Chairs

Training on multiple objects

Trained on 4 classes (cars, faces, motorbikes, airplanes).
Second layer: Shared-features and object-specific features.
Third layer: More specific features.

Plot of H(class|neuron active)

Hierarchical probabilistic inference

Generating posterior samples from faces by filling in experiments
(cf. Lee and Mumford, 2003). Combine bottom-up and top-down inference.

Input images

Samples from
feedforward
Inference
(control)
Samples from
Full posterior
inference

An application to modeling motion capture data

(Taylor, Roweis & Hinton, 2007)

Human motion can be captured by placing

reflective markers on the joints and then
using lots of infrared cameras to track the
3-D positions of the markers.
Given a skeletal model, the 3-D positions of
the markers can be converted into the joint
angles plus 6 parameters that describe the
3-D position and the roll, pitch and yaw of
the pelvis.

We only represent changes in yaw because physics

doesnt care about its value and we want to avoid
circular variables.

Video lecture: http://videolectures.net/gesturerecognition2011_taylor_tutorial/

Motion Generation by Conditional RBM

(C) 2012, SNU Biointelligence Lab, http://bi.snu.ac.kr/

Motion Generation by Conditional RBM

(C) 2012, SNU Biointelligence Lab, http://bi.snu.ac.kr/

Motion Generation by Conditional RBM

(C) 2012, SNU Biointelligence Lab, http://bi.snu.ac.kr/

Motion Generation by Conditional RBM

(C) 2012, SNU Biointelligence Lab, http://bi.snu.ac.kr/

Hintons Talk in Google:

http://www.youtube.com/watch?v=VdIURAu1
-aU

Andrew Ngs Talk in Bay Area Vision

Meeting: Unsupervised Feature
Learning and Deep Learning

http://www.youtube.com/watch?v=ZmNOAtZI
gIk&feature=relmfu

(C) 2012, SNU Biointelligence Lab, http://bi.snu.ac.kr/

References
General Info on Deep Learning

http://deeplearning.net/

Review

Y. Bengio, Learning deep architectures for AI,

Foundations and Trends in Machine Learning,
2(1):1-127, 2009.
I. Arel, D.C. Rose, and T.P. Karnowski, Deep
machine learning A new frontier in Artificial
Intelligence Research, Computational
Intelligence Magazine, 14:12-18, 2010.

(C) 2012, SNU Biointelligence Lab, http://bi.snu.ac.kr/

References
Tutorials & Workshops

Deep Learning and Unsupervised Feature

Learning workshop NIPS 2010:
http://deeplearningworkshopnips2010.wordpr
ess.com/schedule/acceptedpapers/
Workshop on Learning Feature Hierarchies ICML 2009:
http://www.cs.toronto.edu/~rsalakhu/deeplea
rning/index.html

(C) 2012, SNU Biointelligence Lab, http://bi.snu.ac.kr/

Nexus Full
80% (10)
Nexus Full
365 pages
Scan To BIM - Presentation
No ratings yet
Scan To BIM - Presentation
61 pages
A Survey of Evolution of Image Captioning PDF
No ratings yet
A Survey of Evolution of Image Captioning PDF
18 pages
ML Performance Improvement Cheatsheet
No ratings yet
ML Performance Improvement Cheatsheet
11 pages
Deep Learning
100% (3)
Deep Learning
32 pages
Artificial Intelligence AI
No ratings yet
Artificial Intelligence AI
21 pages
Deep Learning
No ratings yet
Deep Learning
18 pages
Modul Machine Learning
No ratings yet
Modul Machine Learning
20 pages
Lesson 5 Deep Neural Net Optimization Tuning Interpretability
100% (1)
Lesson 5 Deep Neural Net Optimization Tuning Interpretability
105 pages
Artificial Intelligence Vs Machine Learning Vs Deep Learning
No ratings yet
Artificial Intelligence Vs Machine Learning Vs Deep Learning
38 pages
D 02 Large Language Models
No ratings yet
D 02 Large Language Models
58 pages
1 - Intro To Machine Learning
100% (1)
1 - Intro To Machine Learning
20 pages
Lecture 5
No ratings yet
Lecture 5
114 pages
Artificial Neural Network
No ratings yet
Artificial Neural Network
38 pages
AdaBoost Classifier in Python (Article) - DataCamp
100% (1)
AdaBoost Classifier in Python (Article) - DataCamp
9 pages
Introduction To Deep Learning
No ratings yet
Introduction To Deep Learning
151 pages
Introduction To Machine Learning PDF
100% (1)
Introduction To Machine Learning PDF
17 pages
Deep Learning Tutorial Complete (v3)
No ratings yet
Deep Learning Tutorial Complete (v3)
109 pages
Download Complete Data Mining for Business Intelligence Concepts Techniques and Applications in Microsoft Office Excel r with XLMiner r 2nd ed Edition Patel PDF for All Chapters
100% (18)
Download Complete Data Mining for Business Intelligence Concepts Techniques and Applications in Microsoft Office Excel r with XLMiner r 2nd ed Edition Patel PDF for All Chapters
60 pages
Ollama - Your Shortcut To Supercharged Applications - Bridge The Gap With LLMs - by Kanishk Khatter - Medium
No ratings yet
Ollama - Your Shortcut To Supercharged Applications - Bridge The Gap With LLMs - by Kanishk Khatter - Medium
16 pages
Neural Networks and Deep Learning
No ratings yet
Neural Networks and Deep Learning
50 pages
CSC445: Neural Networks
No ratings yet
CSC445: Neural Networks
51 pages
Transformer Architecture
No ratings yet
Transformer Architecture
18 pages
Deep Learning
No ratings yet
Deep Learning
169 pages
Machine Learning Module-3
No ratings yet
Machine Learning Module-3
23 pages
Deep Learning Cours
No ratings yet
Deep Learning Cours
165 pages
Udacity Deep Learning Notes
No ratings yet
Udacity Deep Learning Notes
46 pages
Machine Learning (10.17.2018)
No ratings yet
Machine Learning (10.17.2018)
45 pages
Introduction To Neural Networks
No ratings yet
Introduction To Neural Networks
51 pages
Deep Learning Book
100% (5)
Deep Learning Book
42 pages
The Dawn of LMMS: Preliminary Explorations With Gpt-4V (Ision)
No ratings yet
The Dawn of LMMS: Preliminary Explorations With Gpt-4V (Ision)
166 pages
Intro To Machine Learning With TensorFlow Nanodegree Program Syllabus
No ratings yet
Intro To Machine Learning With TensorFlow Nanodegree Program Syllabus
15 pages
Deep Learning - DL-2
100% (1)
Deep Learning - DL-2
44 pages
Regularization_for_Neural_Networks_1718966083
No ratings yet
Regularization_for_Neural_Networks_1718966083
9 pages
Feature Engineering
No ratings yet
Feature Engineering
13 pages
Data Science Full Roadmap
No ratings yet
Data Science Full Roadmap
2 pages
Ensemble Learning: Wisdom of The Crowd
100% (1)
Ensemble Learning: Wisdom of The Crowd
12 pages
Download Full Deep Learning 1st Edition Dulani Meedeniya PDF All Chapters
100% (2)
Download Full Deep Learning 1st Edition Dulani Meedeniya PDF All Chapters
50 pages
Proceedings of International Conference On Computer Vision-And Image Processing CVIP 2016 Volume II
No ratings yet
Proceedings of International Conference On Computer Vision-And Image Processing CVIP 2016 Volume II
556 pages
New Advances in Machine Learning
No ratings yet
New Advances in Machine Learning
374 pages
DEEP_LEARNING_UNIT_1[1]
No ratings yet
DEEP_LEARNING_UNIT_1[1]
24 pages
Full download Neural Networks A Visual Introduction for Beginners Michael Taylor pdf docx
100% (1)
Full download Neural Networks A Visual Introduction for Beginners Michael Taylor pdf docx
65 pages
An Introduction To Supervised Learning With Scikit-Learn: Machine Learning: The Problem Setting
No ratings yet
An Introduction To Supervised Learning With Scikit-Learn: Machine Learning: The Problem Setting
4 pages
9 Distance Measures in Data Science
No ratings yet
9 Distance Measures in Data Science
9 pages
Deep Learning Nanodegree Syllabus 8-15
No ratings yet
Deep Learning Nanodegree Syllabus 8-15
15 pages
What Is Convolutional Neural Network
No ratings yet
What Is Convolutional Neural Network
16 pages
Career Plans For Next 2 Years
No ratings yet
Career Plans For Next 2 Years
11 pages
Slide 7 - Neural Networks
No ratings yet
Slide 7 - Neural Networks
64 pages
chapter 4 Neural Network
No ratings yet
chapter 4 Neural Network
46 pages
Unit 5 Neural Network
No ratings yet
Unit 5 Neural Network
31 pages
GANppt
100% (1)
GANppt
34 pages
Convolutional Neural Networks For Visual Recognition
No ratings yet
Convolutional Neural Networks For Visual Recognition
45 pages
The COMPLETE TRUTH About AI Agents (2024)
No ratings yet
The COMPLETE TRUTH About AI Agents (2024)
32 pages
Bias and Variance
No ratings yet
Bias and Variance
6 pages
A Tour of TensorFlow
No ratings yet
A Tour of TensorFlow
16 pages
Nn4ir PDF
No ratings yet
Nn4ir PDF
290 pages
TensorFlow Tutorial For Beginners (Article) - DataCamp PDF
No ratings yet
TensorFlow Tutorial For Beginners (Article) - DataCamp PDF
60 pages
Hybrid Neural Networks: Fundamentals and Applications for Interacting Biological Neural Networks with Artificial Neuronal Models
From Everand
Hybrid Neural Networks: Fundamentals and Applications for Interacting Biological Neural Networks with Artificial Neuronal Models
Fouad Sabry
No ratings yet
Deep Learning Models
No ratings yet
Deep Learning Models
70 pages
Deep Learning and Applications: Pham The Bao Ptbao@sgu - Edu.vn
No ratings yet
Deep Learning and Applications: Pham The Bao Ptbao@sgu - Edu.vn
43 pages
Deep Learning 2 July 2014
No ratings yet
Deep Learning 2 July 2014
75 pages
Snap Form
No ratings yet
Snap Form
4 pages
Sara Westin Resume Weebly
100% (1)
Sara Westin Resume Weebly
2 pages
Assmann - Collective Memory and Cultural Identity
No ratings yet
Assmann - Collective Memory and Cultural Identity
10 pages
Post SSC First Year Diploma 2021 English Notification
No ratings yet
Post SSC First Year Diploma 2021 English Notification
4 pages
TASK SHEET - Preliminary Pages of CBLM
88% (8)
TASK SHEET - Preliminary Pages of CBLM
2 pages
5th Grade Science Lesson Powerpoint
No ratings yet
5th Grade Science Lesson Powerpoint
13 pages
ICE Advert (Jan-Jun) 2025 Intake (1)
No ratings yet
ICE Advert (Jan-Jun) 2025 Intake (1)
3 pages
Progress Report: Teacher - Student's Name
No ratings yet
Progress Report: Teacher - Student's Name
4 pages
Research Poster
No ratings yet
Research Poster
1 page
MEXICO Management Style
No ratings yet
MEXICO Management Style
6 pages
MARK SCHEME For The May/June 2006 Question Paper: University of Cambridge International Examinations
No ratings yet
MARK SCHEME For The May/June 2006 Question Paper: University of Cambridge International Examinations
5 pages
Unit 5
No ratings yet
Unit 5
18 pages
A Critical Study On Business Strategies of Infosys LTD.: Ihthisham
No ratings yet
A Critical Study On Business Strategies of Infosys LTD.: Ihthisham
10 pages
Pune Education 2014
No ratings yet
Pune Education 2014
97 pages
Prof. Dr. Rolf D. Schmid (Auth.) - Biotechnology in Japan - A Comprehensive Guide-Springer-Verlag Berlin Heidelberg (1991) PDF
No ratings yet
Prof. Dr. Rolf D. Schmid (Auth.) - Biotechnology in Japan - A Comprehensive Guide-Springer-Verlag Berlin Heidelberg (1991) PDF
785 pages
How To Cite Examples in Ielts Task Two
No ratings yet
How To Cite Examples in Ielts Task Two
1 page
CHN Sample Exam 1
No ratings yet
CHN Sample Exam 1
6 pages
The Effects of Social Media On The Under
No ratings yet
The Effects of Social Media On The Under
16 pages
Mathematics
No ratings yet
Mathematics
37 pages
Notification NNSB Clerk Posts
No ratings yet
Notification NNSB Clerk Posts
1 page
A Divinational Art by Sheer Curiosity
No ratings yet
A Divinational Art by Sheer Curiosity
5 pages
Identifying The Genre of A Material Viewed
No ratings yet
Identifying The Genre of A Material Viewed
10 pages
School Oral Health Program
100% (2)
School Oral Health Program
82 pages
Baguio Central University: - 40 Appendix A Sample Letter To The Dean
No ratings yet
Baguio Central University: - 40 Appendix A Sample Letter To The Dean
5 pages
Presentation On Teamwork by Ahsan Kabir BSCS1 #20
No ratings yet
Presentation On Teamwork by Ahsan Kabir BSCS1 #20
19 pages
Psychoeducational Group Session Summary 1UPARSON
No ratings yet
Psychoeducational Group Session Summary 1UPARSON
5 pages
Analysis of New Honda City Customer Profile & Satisfaction Level - 2017
No ratings yet
Analysis of New Honda City Customer Profile & Satisfaction Level - 2017
77 pages
Tổng hợp đề Writing Task 2 - 2023
No ratings yet
Tổng hợp đề Writing Task 2 - 2023
13 pages
HOTS Questions For Math PDF
100% (4)
HOTS Questions For Math PDF
2 pages