
1

INTRODUCTION TO DEEP LEARNING


WITH TENSORFLOW

Fabien Baradel
PhD Candidate

fabienbaradel.github.io
@fabienbaradel
[email protected]
DEEP LEARNING? 2

TENSORFLOW?
TENSORFLOW BACKGROUND 3

• 6 months internship at Xerox Research Center Europe:


- « Unsupervised Domain Adaptation »
- Image recognition & sentiment text classification
- Machine Learning for Services team
- Grenoble, France

• PhD Candidate at LIRIS - INSA Lyon since October 2016:


- « Deep Learning for human understanding: gestures, poses, activities »
- Imagine team
- Supervisors: Christian Wolf & Julien Mille
- Working with videos
DEEP LEARNING
FOR HUMAN UNDERSTANDING 4

• Action recognition => Classification


• Sequence Learning
• Supervised Learning
DEEP LEARNING
FOR HUMAN UNDERSTANDING 5

• Microsoft Kinect - Xbox


DEEP LEARNING
FOR HUMAN UNDERSTANDING 6

Microsoft Kinect v2:
• 3D joint locations
• 25 joints

Video = sequence of frames
Skeleton alone is not enough (e.g. looking at a book vs. looking at a smartphone)
SELF-DRIVING CARS 7

Tesla

• https://www.youtube.com/watch?v=CxanE_W46ts
MATERIALS 8

Virtual Machine: USB key


• Python 2.7
• Tensorflow
• Ubuntu 16.04
• Datasets

Seminar slides & Code corrections:


https://fabienbaradel.github.io
9
Introduction
• Basics of Tensorflow
• Machine Learning: analytic solution vs.
gradient descent
Supervised Learning (image recognition)
• Neural networks reminder
• Convolution Networks
• Going deeper with ConvNets
Unsupervised Learning
• Autoencoder
• Generative Adversarial Network
Sequence modelling
• RNN, LSTM
• Word2vec
Reinforcement Learning
• Deep Q-learning
• Frozen Lake
10

INTRODUCTION
WHAT IS TENSORFLOW? 11
• A Python library
• pip install tensorflow
• Developed by Google
• Open-source
• Library for numerical computation using data flow graphs
• CPU and GPU
• Research & Industry
PRINCIPLE 12
« HELLO WORLD » 13

• INTRODUCTION EXERCISES

• Difference between constant/variable


and placeholder

• Constant = a fixed Variable

• With placeholder you need to feed


data to your graph during your
session

• Tensorflow workflow:
• Draw your graph
• Feed data
• … and optimize
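
A minimal sketch of the constant/Variable/placeholder distinction, assuming TensorFlow 1.x as used in the seminar (the variable names and values are illustrative):

```python
import tensorflow as tf

# Constants: values baked into the graph.
a = tf.constant(2.0)
b = tf.constant(3.0)

# A placeholder: an empty node that must be fed with data at session time.
x = tf.placeholder(tf.float32)

# Drawing the graph: nothing is computed yet.
y = a * x + b

with tf.Session() as sess:
    # Feed data into the placeholder and run the graph.
    print(sess.run(y, feed_dict={x: 4.0}))  # prints 11.0
```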
« BASIC MATHS OPERATIONS » 14

• Open « math_ops.py »
• Same thing with integers
• Mathematical operations done using the Tensorflow library only (no numpy or other libraries)
• Draw the schema of the code

WITH CONSTANTS WITH PLACEHOLDERS
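
A sketch of the same basic operations done with TensorFlow ops only, once with constants and once with placeholders (TensorFlow 1.x assumed; the operand values are illustrative):

```python
import tensorflow as tf

# With constants: the integer operands are fixed in the graph.
a = tf.constant(7, dtype=tf.int32)
b = tf.constant(3, dtype=tf.int32)
add_c = tf.add(a, b)
mul_c = tf.multiply(a, b)

# With placeholders: the operands are fed at session time.
x = tf.placeholder(tf.int32)
y = tf.placeholder(tf.int32)
add_p = tf.add(x, y)
mul_p = tf.multiply(x, y)

with tf.Session() as sess:
    print(sess.run([add_c, mul_c]))                          # [10, 21]
    print(sess.run([add_p, mul_p], feed_dict={x: 7, y: 3}))  # [10, 21]
```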


ANALYTIC SOLUTION IN ML 15

Linear regression: $y = X\beta + \epsilon$

Least Squares solution:

$\hat{\beta} = \arg\min_{\beta} \|X\beta - y\|^2$

$\hat{\beta} = (X^T X)^{-1} X^T y$

Could solve the same problem by solving the optimization problem using gradient descent
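
A sketch of the closed-form least-squares solution computed with TensorFlow ops (TensorFlow 1.x assumed; the toy data and the true coefficients are made up for illustration):

```python
import numpy as np
import tensorflow as tf

# Toy data: y = X*beta + noise, with an arbitrary beta = [2.0, -1.0]
X_np = np.random.randn(100, 2).astype(np.float32)
beta_true = np.array([[2.0], [-1.0]], dtype=np.float32)
y_np = X_np.dot(beta_true) + 0.1 * np.random.randn(100, 1).astype(np.float32)

X = tf.constant(X_np)
y = tf.constant(y_np)

# Closed form: beta_hat = (X^T X)^{-1} X^T y
XtX = tf.matmul(X, X, transpose_a=True)
Xty = tf.matmul(X, y, transpose_a=True)
beta_hat = tf.matmul(tf.matrix_inverse(XtX), Xty)

with tf.Session() as sess:
    print(sess.run(beta_hat))  # close to beta_true
```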
GRADIENT DESCENT 16

• Goal: minimizing a function

• Random initialization of the parameters
• At time t, the gradient gives the slope of the function
• Iterative process
• Update the parameters in the opposite direction of the gradient, scaled by a learning rate
• Repeat until convergence
BATCH STOCHASTIC GRADIENT DESCENT 17

Linear regression: $y = X\beta + \epsilon$

Minimize a loss function:
$J(\beta) = \sum_{i=1}^{N} (X_i\beta - y_i)^2$
$\hat{\beta} = \arg\min_{\beta} J(\beta)$

• Initialize $\hat{\beta}_0$ randomly
• Choose a learning rate $\eta$
• for t in range(training_step):
• Compute the loss
$J(\hat{\beta}_t) = \sum_{i=1}^{N} (X_i\hat{\beta}_t - y_i)^2$
• Update the parameters
$\hat{\beta}_{t+1} = \hat{\beta}_t - \eta \nabla J(\hat{\beta}_t)$

https://jalammar.github.io/visual-interactive-guide-basics-neural-networks/#train-your-dragon
MINI-BATCH SGD 18

SGD = stochastic gradient descent

• Initialize $\hat{\beta}_0$ randomly
• Choose a learning rate $\eta$
• Choose a batch size n
• for t in range(training_step):
• Pick a random sample $S_t^n$ of size n from the training data
• Compute the loss function
$J(\hat{\beta}_t) = \sum_{i \in S_t^n} (X_i\hat{\beta}_t - y_i)^2$
• Update the parameters
$\hat{\beta}_{t+1} = \hat{\beta}_t - \eta \nabla J(\hat{\beta}_t)$

• Initialization is important!
• Learning rate too!
• Need a validation set to avoid overfitting
• Neural nets are always trained with mini-batch SGD!
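
As a pointer for the SGD/linear_regression_exo.py exercise, a minimal mini-batch SGD sketch in TensorFlow 1.x (the toy data, batch size, learning rate and step count are illustrative assumptions, not the exercise's actual values):

```python
import numpy as np
import tensorflow as tf

# Toy data: y = X*beta + noise, with an arbitrary true beta
X_np = np.random.randn(1000, 2).astype(np.float32)
beta_true = np.array([[2.0], [-1.0]], dtype=np.float32)
y_np = X_np.dot(beta_true) + 0.1 * np.random.randn(1000, 1).astype(np.float32)

X = tf.placeholder(tf.float32, [None, 2])
y = tf.placeholder(tf.float32, [None, 1])
beta = tf.Variable(tf.random_normal([2, 1]))          # random initialization of beta_0

y_hat = tf.matmul(X, beta)
loss = tf.reduce_sum(tf.square(y_hat - y))            # J(beta) on the mini-batch
train_op = tf.train.GradientDescentOptimizer(0.001).minimize(loss)

batch_size = 32
with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    for t in range(500):
        idx = np.random.choice(len(X_np), batch_size)  # pick a random mini-batch S_t
        sess.run(train_op, feed_dict={X: X_np[idx], y: y_np[idx]})
    print(sess.run(beta))                              # should be close to beta_true
```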
EXERCISES 19

• Go to the Github repo and complete the codes:


✴ SGD/linear_regression_exo.py
✴ SGD/binary_classif_exo.py

import ipdb; ipdb.set_trace()

http://playground.tensorflow.org/
https://wookayin.github.io/TensorflowKR-2016-talk-debugging/
20

NEURAL NETWORKS
MNIST DATASET 21

• Handwritten digits
• 60,000 training images and 10,000 test images
• 28x28 grayscale images
• matrix of size 28x28 with values between 0 and 255
• data preprocessing = rescaling to [0,1]
MULTINOMIAL LOGISTIC REGRESSION ON MNIST:
CREATE THE GRAPH 22

[Figure: the 28x28 image is vectorized into a 784-dimensional input and multiplied by the learnable parameter $\hat{W}$ to produce 10 logits; a softmax turns the logits into 10 class predictions, which are compared to the label.]

Compute the cross-entropy:
$J(\hat{W}) = - y \times \log(\hat{y})$

WHY LOG?
http://colah.github.io/posts/2015-09-Visual-Information/
MULTINOMIAL LOGISTIC REGRESSION ON MNIST:
CREATE THE GRAPH 23

[Figure: the same graph at step t=0 — a mini-batch of images is vectorized, multiplied by $\hat{W}_t$, passed through the softmax, and the predictions are compared to the labels.]

Compute the cross-entropy:
$J(\hat{W}_t) = - \sum_{i \in S_t} y_i \times \log(\hat{y}_i)$

$\hat{W}_t$ is updated by SGD:
$\hat{W}_{t+1} = \hat{W}_t - \eta \nabla J(\hat{W}_t)$

MULTINOMIAL LOGISTIC REGRESSION ON MNIST:
FEED DATA 24

[Figure: the same graph at step t=1, with a new mini-batch and the updated parameter $\hat{W}_t$; the cross-entropy and the SGD update are recomputed as above.]

MULTINOMIAL LOGISTIC REGRESSION ON MNIST:
FEED DATA 25

[Figure: the same graph at step t=2.]
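
A sketch of this graph in TensorFlow 1.x, following the classic MNIST softmax tutorial (the data path "MNIST_data/", the learning rate, the batch size and the number of steps are assumptions for illustration):

```python
import tensorflow as tf
from tensorflow.examples.tutorials.mnist import input_data

mnist = input_data.read_data_sets("MNIST_data/", one_hot=True)

x = tf.placeholder(tf.float32, [None, 784])      # vectorized 28x28 images
y = tf.placeholder(tf.float32, [None, 10])       # one-hot labels

W = tf.Variable(tf.zeros([784, 10]))             # learnable parameter
b = tf.Variable(tf.zeros([10]))

logits = tf.matmul(x, W) + b
y_hat = tf.nn.softmax(logits)

# Cross-entropy J(W) = -sum y * log(y_hat), averaged over the mini-batch
cross_entropy = tf.reduce_mean(-tf.reduce_sum(y * tf.log(y_hat), axis=1))
train_op = tf.train.GradientDescentOptimizer(0.5).minimize(cross_entropy)

with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    for t in range(1000):
        batch_x, batch_y = mnist.train.next_batch(100)   # feed mini-batches step by step
        sess.run(train_op, feed_dict={x: batch_x, y: batch_y})
    correct = tf.equal(tf.argmax(y_hat, 1), tf.argmax(y, 1))
    accuracy = tf.reduce_mean(tf.cast(correct, tf.float32))
    print(sess.run(accuracy, feed_dict={x: mnist.test.images, y: mnist.test.labels}))
```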
NEURAL NETWORKS 26

[Figure: an input is fed to an inference function with parameters $\theta$, producing an output vector $(\hat{y}_1, \hat{y}_2, \ldots, \hat{y}_{10})$ that is compared to the one-hot label through an error function.]

• Minimize your error on a training set
• Find the best inference function parameters
• Difference between neural nets and deep nets: only in the inference function

$\hat{y} = f(\theta, x)$
$J(\theta) = \mathrm{error}(\hat{y}, y)$ given $\theta$
$\hat{\theta} = \arg\min_{\theta} J(\theta)$

And train it using mini-batch SGD!
NEURAL NETWORKS IN TENSORFLOW:
GENERAL GRAPH 27

[Figure: the input and the label are placeholders; the inference function with parameters $\theta$ produces the output vector $(\hat{y}_1, \ldots, \hat{y}_n)$, which is compared to the label by the error function.]

inference function => $\hat{y} = f(\theta, x)$
loss function => $J(\theta) = \mathrm{error}(\hat{y}, y)$ given $\theta$
optimization problem => $\hat{\theta} = \arg\min_{\theta} J(\theta)$

And train it using mini-batch SGD in a Tensorflow session!
NEURAL NETWORKS IN TENSORFLOW:
GENERAL TRAINING 28

[Figure: at step t=0, a mini-batch of inputs and one-hot labels is fed into the two placeholders; the inference function with parameters $\hat{\theta}_0$ produces the output vectors, which are compared to the labels by the error function.]

feed mini-batch of data step by step

And train it using mini-batch SGD in a Tensorflow session!

NEURAL NETWORKS IN TENSORFLOW 29

[Figure: the same graph at step t=1, with updated parameters $\hat{\theta}_1$ and a new mini-batch.]

feed mini-batch of data step by step

And train it using mini-batch SGD in a Tensorflow session!

NEURAL NETWORKS IN TENSORFLOW 30

[Figure: the same graph at step t=2, with parameters $\hat{\theta}_2$.]

feed mini-batch of data step by step

And train it using mini-batch SGD in a Tensorflow session!
BACKPROPAGATION 31

• Forward Activation: predict the output
• Compute the loss
• Backward Error: correct the parameters

[Figure, built up over slides 31-35: the input X goes through $f_\theta$ in a forward pass to produce $\hat{y}$; the error between $\hat{y}$ and y is computed; the error is then backpropagated over the network using the derivative functions to update the parameters.]

https://medium.com/@karpathy/yes-you-should-understand-backprop-e2f06eab496b#.l8cz02hlu
EXERCISES 36

• Go here: https://github.com/fabienbaradel/Tensorflow-tutorials
• And do the softmax and multilayer perceptron exercises
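
For orientation before the multilayer perceptron exercise, a sketch of a one-hidden-layer inference function in TensorFlow 1.x (the hidden size of 256, the ReLU non-linearity and the learning rate are illustrative choices, not taken from the slides):

```python
import tensorflow as tf

x = tf.placeholder(tf.float32, [None, 784])
y = tf.placeholder(tf.float32, [None, 10])

# Hidden layer: the inference function is the only thing that changes vs. softmax regression
W1 = tf.Variable(tf.truncated_normal([784, 256], stddev=0.1))
b1 = tf.Variable(tf.zeros([256]))
h1 = tf.nn.relu(tf.matmul(x, W1) + b1)

# Output layer
W2 = tf.Variable(tf.truncated_normal([256, 10], stddev=0.1))
b2 = tf.Variable(tf.zeros([10]))
logits = tf.matmul(h1, W2) + b2

# Numerically stable cross-entropy computed on the logits
loss = tf.reduce_mean(tf.nn.softmax_cross_entropy_with_logits(labels=y, logits=logits))
train_op = tf.train.GradientDescentOptimizer(0.1).minimize(loss)
```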
37

CONVOLUTIONAL NETWORKS
CONVNET 38

« Convolutional neural networks »

• Created by Yann LeCun (90’s)

• Well-known since the 2000s

• Big acceleration with GPUs

• Computer vision

• NLP

• Artificial Intelligence

• Convolution & Pooling

ConvNets are usually evaluated on ImageNet (5 million images, 1000 classes)


CONVOLUTION 39
CONVOLUTION 40

• Finding information in subpart of the image

• Local spatial correlation

• Mimic the biological process

• Less parameters than fully-connected layer

Example: convolution on a 5x5 matrix (1 filter of size 3x3 and stride=1)

Input: (5,5,1) Output: (nb_filter,3,3)

http://dl.heeere.com/convolution3/
POOLING 41

• Sampling over a matrix

• Dimension reduction

• Reduce number of parameters of further layers

• No learnable parameters!

Example: pooling over a 20x20 matrix (filter=10x10 and stride=10)


CONVNETS 42

AlexNet (2012)

80.1 %

GoogleNet Inception v3 (2015)

93.4 %
CONVNETS 43
EXERCISES 44

• Complete the exercises:


✴ One Conv + Max Pool
✴ LeNet

HAVE A LOOK AT TF.SLIM TO MAKE YOUR LIFE EASIER
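
A sketch of one convolution plus max pooling for the exercise, in TensorFlow 1.x, first with the low-level ops and then with the tf.slim shorthand (the filter count, kernel size and pooling size are illustrative choices):

```python
import tensorflow as tf

x = tf.placeholder(tf.float32, [None, 28, 28, 1])   # MNIST images kept as 3D tensors

# One convolution (32 filters of size 3x3, stride 1) followed by 2x2 max pooling
W = tf.Variable(tf.truncated_normal([3, 3, 1, 32], stddev=0.1))
b = tf.Variable(tf.zeros([32]))
conv = tf.nn.relu(tf.nn.conv2d(x, W, strides=[1, 1, 1, 1], padding='SAME') + b)
pool = tf.nn.max_pool(conv, ksize=[1, 2, 2, 1], strides=[1, 2, 2, 1], padding='SAME')

# The same two layers with tf.slim (TF 1.x contrib), much shorter:
import tensorflow.contrib.slim as slim
net = slim.conv2d(x, 32, [3, 3])
net = slim.max_pool2d(net, [2, 2])
```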
45

WHY DOES CONVOLUTION WORK?


FEATURE MAPS 46

Layer 1: ~ Gabor filters


FEATURE MAPS 47
FEATURE MAPS 48
FILTERS 49

https://www.youtube.com/watch?v=AgkfIQ4IGaM
FINE-TUNING 50

FROZEN FINETUNED

• Filters after the first convolutional layer are generic (Gabor filters)
• The deeper you go in the network, the more task-specific your filters become
FINE-TUNING 51

Pretrained Inception v3 on ImageNet

DIMENSION REDUCTION: each image is mapped to a feature vector $v = (v_1, \ldots, v_{2048})$
EXERCISE 52

• Wanna win $150’000? YES YOU CAN!


• Go to the Github repo and do the
« Classification from DeepFeatures » exercise
• And submit your .csv on Kaggle (and cross your fingers)

WHAT IS YOUR SCORE?


Want to add convolution?

Reshape your vector to a 3D matrix…
e.g. 2048 = 16*16*8
53

AUTOENCODER
NEURAL NETWORK LEARNING 54

Supervised learning: X → f → Y
‣ y are given!

Unsupervised learning: X → f → Z → g → X
‣ y is no longer needed
AUTOENCODER 55

X → f (encoder) → Z (latent) → g (decoder) → X

• Learning a compact data representation


• Encode input to smaller latent space
• Decode from the latent space to the input
• Predict input from input
• Loss function = mean square error
• f and g are neural networks
• SGD as usual
AUTOENCODER 56

X → f (encoder) → Z (latent) → g (decoder) → X

If f and g are linear with no hidden layer,
=> the solution is an approximation of PCA
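
A sketch of the autoencoder exercise in TensorFlow 1.x (the latent size of 32 and the ReLU/sigmoid non-linearities are illustrative choices; the slides only require f and g to be neural networks and the loss to be the mean square error):

```python
import tensorflow as tf

x = tf.placeholder(tf.float32, [None, 784])

# Encoder f: 784 -> 32 (latent space smaller than the input)
W_enc = tf.Variable(tf.truncated_normal([784, 32], stddev=0.1))
b_enc = tf.Variable(tf.zeros([32]))
z = tf.nn.relu(tf.matmul(x, W_enc) + b_enc)

# Decoder g: 32 -> 784 (reconstruct the input from the latent code)
W_dec = tf.Variable(tf.truncated_normal([32, 784], stddev=0.1))
b_dec = tf.Variable(tf.zeros([784]))
x_hat = tf.nn.sigmoid(tf.matmul(z, W_dec) + b_dec)

# Loss = mean square error between the input and its reconstruction
loss = tf.reduce_mean(tf.square(x_hat - x))
train_op = tf.train.GradientDescentOptimizer(0.5).minimize(loss)
```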
57

GENERATIVE MODELS
GENERATIVE MOMENT MATCHING NETWORKS 58

[Figure: an autoencoder X → f (encoder) → Z (latent) → g (decoder) → X′, trained with an RMSE reconstruction loss.]

GENERATIVE MOMENT MATCHING NETWORKS 59

[Figure: a generated latent code Z_generated is sampled from N(0,1).]

GENERATIVE MOMENT MATCHING NETWORKS 60

[Figure: the MMD (maximum mean discrepancy) is computed between the latent codes Z = f(X) produced by the encoder and the generated codes sampled from N(0,1).]

GENERATIVE MOMENT MATCHING NETWORKS 61

[Figure: generated latent codes sampled from N(0,1) are passed through the decoder g to produce generated samples X_generated.]

GENERATIVE MOMENT MATCHING NETWORKS 62

[Figure: the full pipeline — encode X to the latent Z with f, sample Z_generated from N(0,1), and decode with g to obtain X_generated.]
GENERATIVE ADVERSARIAL NETWORKS 63

Intuition

[Figure: a Generator produces fake money.]

GENERATIVE ADVERSARIAL NETWORKS 64

Intuition

[Figure: the Generator produces fake money; the Discriminator receives both fake and real money and must decide for each: FAKE OR REAL?]
GENERATIVE ADVERSARIAL NETWORKS 65

noise Z → Generator G → fake image → Discriminator D → Y: real or not?

• G and D are neural networks
• Find a G that minimizes the accuracy of the best D
• Alternate the optimization of G and D (D is also fed real images)

http://blog.aylien.com/introduction-generative-adversarial-networks-code-tensorflow/
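
A sketch of the alternate optimization in TensorFlow 1.x (the layer sizes, the noise dimension of 100 and the use of Adam are assumptions for illustration, not taken from the slides):

```python
import tensorflow as tf

def generator(z):
    with tf.variable_scope("G"):
        h = tf.layers.dense(z, 128, activation=tf.nn.relu)
        return tf.layers.dense(h, 784, activation=tf.nn.sigmoid)   # fake image

def discriminator(x, reuse=False):
    with tf.variable_scope("D", reuse=reuse):
        h = tf.layers.dense(x, 128, activation=tf.nn.relu)
        return tf.layers.dense(h, 1)                               # logit: real or fake?

z = tf.placeholder(tf.float32, [None, 100])
x_real = tf.placeholder(tf.float32, [None, 784])

x_fake = generator(z)
d_real = discriminator(x_real)
d_fake = discriminator(x_fake, reuse=True)

# D: classify real as 1 and fake as 0. G: fool D into calling fakes real.
d_loss = tf.reduce_mean(
    tf.nn.sigmoid_cross_entropy_with_logits(logits=d_real, labels=tf.ones_like(d_real)) +
    tf.nn.sigmoid_cross_entropy_with_logits(logits=d_fake, labels=tf.zeros_like(d_fake)))
g_loss = tf.reduce_mean(
    tf.nn.sigmoid_cross_entropy_with_logits(logits=d_fake, labels=tf.ones_like(d_fake)))

d_vars = tf.get_collection(tf.GraphKeys.TRAINABLE_VARIABLES, scope="D")
g_vars = tf.get_collection(tf.GraphKeys.TRAINABLE_VARIABLES, scope="G")

# Alternate optimization: run d_step and g_step in turn inside the training loop
d_step = tf.train.AdamOptimizer(1e-4).minimize(d_loss, var_list=d_vars)
g_step = tf.train.AdamOptimizer(1e-4).minimize(g_loss, var_list=g_vars)
```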
GAN: EXAMPLES 66
GAN: EXAMPLES 67
GAN: EXAMPLES 68

Ongoing topic…
EXERCISE 69

• Complete the exercises:


✴ « Autoencoder_exo »
✴ « Conv-Deconv Autoencoder_exo »
✴ And GMMN if you are fast enough!
70

SEQUENCE MODELING
WHAT ABOUT SEQUENCE? 71

Image = static: (almost) solved
Video = sequence of images: not solved at all…
WHAT ABOUT SEQUENCE? 72

Sequence to sequence:
Machine Translation
RECURRENT NEURAL NETWORK 73

• Imagine X as a time series : (x1, x2, …, xn)


• h is the hidden state of the RNN
• Initialized at (1,1,…,1) at t=0
• And h is modified after each timestep

http://colah.github.io/posts/2015-08-Understanding-LSTMs/
http://karpathy.github.io/2015/05/21/rnn-effectiveness/
RNN AND CLASSIFICATION 74

H_0 Initialize randomly

RNN
RNN AND CLASSIFICATION 75

H_0

X_1 RNN H_1


RNN AND CLASSIFICATION 76

H_0

X_1 H_1

X_2 RNN H_2


RNN AND CLASSIFICATION 77

H_0

X_1 RNN H_1

H_N-1

H_N
X_n RNN
RNN AND CLASSIFICATION 78

H_0

X_1 RNN H_1

H_N-1

H_N classif
X_n RNN Y
WORD2VEC 79

How to represent a word as a vector? TF-IDF?

=> Learning word embeddings

Italy = (5.12, 7.21, ..., 0.78) $\in \mathbb{R}^{100}$

Beautiful word2vec relationships:

king − man + woman = queen
Tokyo − Japan + France = Paris
best − good + strong = strongest

And of course some mistakes:

England − London + Baghdad = Mosul (instead of Iraq)
RNN ON MNIST 82

[Figure, built up over slides 82-86: a 28x28 MNIST image is read row by row.]

[28,28] = sequence of 28 vectors of size 28
RNN ON MNIST 87

• Complete the exo « rnn_exo »
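
A sketch of an RNN classifier on MNIST in TensorFlow 1.x, reading each image as a sequence of 28 row-vectors of size 28 (the LSTM cell, the hidden size of 128 and the Adam optimizer are illustrative choices; the exercise may use a different cell or setup):

```python
import tensorflow as tf

# Each 28x28 image is read as a sequence of 28 row-vectors of size 28
x = tf.placeholder(tf.float32, [None, 28, 28])
y = tf.placeholder(tf.float32, [None, 10])

cell = tf.contrib.rnn.BasicLSTMCell(num_units=128)
outputs, state = tf.nn.dynamic_rnn(cell, x, dtype=tf.float32)

# Classify from the last hidden state H_N
last = outputs[:, -1, :]
W = tf.Variable(tf.truncated_normal([128, 10], stddev=0.1))
b = tf.Variable(tf.zeros([10]))
logits = tf.matmul(last, W) + b

loss = tf.reduce_mean(tf.nn.softmax_cross_entropy_with_logits(labels=y, logits=logits))
train_op = tf.train.AdamOptimizer(1e-3).minimize(loss)
```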


MACHINE LEARNING 88
REINFORCEMENT LEARNING 89

RL = Reinforcement Learning
RL: FEW EXAMPLES 90

https://www.youtube.com/watch?v=V1eYniJ0Rnk
RL IN A FINITE STATE SPACE 91

[Figure: at time t, the agent in state $s_t$ takes action $a_t$, receives reward $r_{t+1}$ and moves to state $s_{t+1}$.]

$s_t, s_{t+1} \in S$
$a_t \in A(s_t)$
$t = 0, 1, 2, \ldots$
FROZEN LAKE EXAMPLE 92

S F F F        S = starting point, safe
F H F H        F = frozen surface, safe
F F F H        H = hole, fall to your doom
H F F G        G = goal, where the frisbee is located

• Possible actions:
- Up
- Down
- Left
- Right

Ice is slippery: you won’t always move in the direction you intend
RL: DEFINITIONS 93

The agent learns to assign values to state-action pairs

Discounted return:
$R_t = r_{t+1} + \gamma r_{t+2} + \gamma^2 r_{t+3} + \ldots + \gamma^{T-t-1} r_T$
where $\gamma \in [0, 1]$ is the discount rate

Action-value function for policy $\pi$:
$Q^{\pi}(s, a) = E_{\pi}\{R_t \mid s_t = s, a_t = a\}$

« How good is an action for the future, given a certain state? »
FORMULATION OF THE Q-FUNCTION 94

Q-values arranged as a matrix (rows = states, columns = actions Up, Down, Left, Right):

$Q^{\pi} = \begin{pmatrix} Q(0,\mathrm{Up}) & Q(0,\mathrm{Down}) & Q(0,\mathrm{Left}) & Q(0,\mathrm{Right}) \\ & & \ldots & \\ Q(s_t,\mathrm{Up}) & Q(s_t,\mathrm{Down}) & Q(s_t,\mathrm{Left}) & Q(s_t,\mathrm{Right}) \\ & & \ldots & \\ Q(15,\mathrm{Up}) & Q(15,\mathrm{Down}) & Q(15,\mathrm{Left}) & Q(15,\mathrm{Right}) \end{pmatrix}$

Optimal value function unrolled recursively:

$Q^*(s_t, a_t) = E_{s_{t+1}}\{r_{t+1} + \gamma \max_{a_{t+1}} Q^*(s_{t+1}, a_{t+1})\}$

where $Q^*$ is the optimal action-value function, $r_{t+1}$ the immediate reward, $\gamma$ the discount factor, and $\max_{a_{t+1}} Q^*(s_{t+1}, a_{t+1})$ the best possible value of the next state.

Express the action-value function by a neural network with parameters $\theta$:

$Q(s, a, \theta) \approx Q^*(s, a)$

But what is our target vector to compute the loss function???
DEEP Q-LEARNING 95

Q value function:

$Q_{\theta_t} = \begin{pmatrix} Q(0,\mathrm{Up},\theta_t) & Q(0,\mathrm{Down},\theta_t) & Q(0,\mathrm{Left},\theta_t) & Q(0,\mathrm{Right},\theta_t) \\ Q(1,\mathrm{Up},\theta_t) & Q(1,\mathrm{Down},\theta_t) & Q(1,\mathrm{Left},\theta_t) & Q(1,\mathrm{Right},\theta_t) \\ \ldots & \ldots & \ldots & \ldots \\ Q(15,\mathrm{Up},\theta_t) & Q(15,\mathrm{Down},\theta_t) & Q(15,\mathrm{Left},\theta_t) & Q(15,\mathrm{Right},\theta_t) \end{pmatrix}$

Loss function:

$J(\theta_t) = \sum \big( Q(s_t, a_t, \theta_t) - (r_{t+1} + \gamma \max_{a_{t+1}} Q(s_{t+1}, a_{t+1}, \theta_t)) \big)^2$

where $Q(s_t, a_t, \theta_t)$ is the value at this state, and the Q-target $r_{t+1} + \gamma \max_{a_{t+1}} Q(s_{t+1}, a_{t+1}, \theta_t)$ is the immediate reward plus the best possible value at the next state.

Train it using SGD!
NN Q-FUNCTION FOR FROZEN LAKE 96

[Figure: the state S (from 0 to 15) goes through a linear layer W producing the four Q-values $Q_{up}$, $Q_{down}$, $Q_{left}$, $Q_{right}$; the action A is chosen by argmax.]
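
A sketch of this Q-network trained with the Q-target loss, assuming the OpenAI Gym FrozenLake-v0 environment is available (the one-hot encoding of the state, the exploration rate, learning rate, discount factor and episode count are illustrative; the exercise's own environment and helpers may differ):

```python
import numpy as np
import tensorflow as tf
import gym   # assumes OpenAI Gym is installed

env = gym.make('FrozenLake-v0')

# Q-network: one-hot state (16) -> Q-values for the 4 actions (a single linear layer)
s = tf.placeholder(tf.float32, [1, 16])
W = tf.Variable(tf.random_uniform([16, 4], 0, 0.01))
Q = tf.matmul(s, W)
a_best = tf.argmax(Q, 1)

# Q-target placeholder and squared loss J(theta)
Q_target = tf.placeholder(tf.float32, [1, 4])
loss = tf.reduce_sum(tf.square(Q_target - Q))
train_op = tf.train.GradientDescentOptimizer(0.1).minimize(loss)

gamma = 0.99
with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    for episode in range(2000):
        state = env.reset()
        done = False
        while not done:
            one_hot = np.eye(16)[state:state + 1]
            action, q = sess.run([a_best, Q], feed_dict={s: one_hot})
            if np.random.rand() < 0.1:                   # epsilon-greedy exploration
                action[0] = env.action_space.sample()
            next_state, reward, done, _ = env.step(action[0])
            # Bellman target: r + gamma * max_a' Q(s', a')
            q_next = sess.run(Q, feed_dict={s: np.eye(16)[next_state:next_state + 1]})
            target = q
            target[0, action[0]] = reward + gamma * np.max(q_next)
            sess.run(train_op, feed_dict={s: one_hot, Q_target: target})
            state = next_state
```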
DEEP Q-FUNCTION FOR REAL GAMES 97

ConvNets … here we go again!


EXERCISE 98

• Complete the exo « q_learning_frozen_lake_exo »

Or go back to the fish classification if you want ;)


99

WHAT ABOUT YOUR FIRST EXPERIENCE WITH


TENSORFLOW?
