"Hello World" of Deep Learning
"Hello World" of Deep Learning
of deep learning
Keras is an interface for Theano. Theano itself is very flexible, but it takes some effort to learn; Keras is much easier to pick up.
If you want to learn Theano:
http://speech.ee.ntu.edu.tw/~tlkagk/courses/MLDS_2015_2/Lecture/Theano%20DNN.ecm.mp4/index.html
http://speech.ee.ntu.edu.tw/~tlkagk/courses/MLDS_2015_2/Lecture/RNN%20training%20(v6).ecm.mp4/index.html
Notes on using Keras
Example Application
• Handwriting digit recognition: the machine reads a 28 x 28 image of a digit and outputs which digit it is, e.g. "1"
• Input layer: 28 x 28 = 784 pixels
• Hidden layer 1: 500 neurons
• Hidden layer 2: 500 neurons
  (available activations include softplus, softsign, relu, tanh, hard_sigmoid, linear)
• Output layer: softmax over y1, y2, ..., y10, one per digit
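A minimal sketch of this network in Keras. It assumes a recent Keras API; the Theano-era Keras used in this lecture spelled some arguments differently (e.g. output_dim, nb_epoch). The activation is one of the choices listed above.

  from keras.models import Sequential
  from keras.layers import Dense

  model = Sequential()
  # Hidden layer 1: 784 pixels in, 500 neurons out
  model.add(Dense(500, input_dim=28 * 28, activation='relu'))
  # Hidden layer 2: 500 neurons
  model.add(Dense(500, activation='relu'))
  # Output layer: softmax over the 10 digit classes
  model.add(Dense(10, activation='softmax'))

  model.compile(loss='categorical_crossentropy',
                optimizer='adam',
                metrics=['accuracy'])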
Keras
• The same structure in code: 28 x 28 = 784 inputs, two hidden layers of 500 neurons, 10 softmax outputs
• Using the trained model:
  • case 1: evaluate the model on a labeled test set
  • case 2: predict the classes of new inputs
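A sketch of the two cases, after the model has been trained with model.fit (shown in the mini-batch section below). Here x_test and y_test are placeholders for the test images and labels:

  # case 1: evaluation -- returns [loss, accuracy] for the compiled metrics
  score = model.evaluate(x_test, y_test)
  print('Total loss on test set:', score[0])
  print('Accuracy on test set:', score[1])

  # case 2: prediction -- one row of 10 softmax scores per input image
  result = model.predict(x_test)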
Keras
• Using GPU to speed up training
• Way 1 (on the command line):
  THEANO_FLAGS=device=gpu0 python YourCode.py
• Way 2 (in your code, before theano is imported, since the flags are read at import time):
  import os
  os.environ["THEANO_FLAGS"] = "device=gpu0"
Live Demo
We do not really minimize total loss!
Mini-batch
• Randomly initialize the network parameters
• Pick the 1st batch (e.g. $x^1$, $x^{31}$, ...): each example $x^n$ passes through the NN to give $y^n$, which is compared with the target $\hat{y}^n$ to give the loss $C^n$; compute the batch loss
  $L' = C^1 + C^{31} + \cdots$
  and update the parameters once
• Pick the 2nd batch (e.g. $x^2$, $x^{16}$, ...): compute
  $L'' = C^2 + C^{16} + \cdots$
  and update the parameters once
• ...
• Until all mini-batches have been picked: that is one epoch
• Repeat the whole process (e.g. repeat 20 times)
• Here there are 100 examples in a mini-batch
• Batch size = 1 is stochastic gradient descent
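In Keras, the batch size and the number of epochs are set in the call to fit. A sketch with the values from this slide (argument names per the modern API; the Theano-era version used nb_epoch):

  # 100 examples per mini-batch, repeat for 20 epochs
  # (one epoch = one pass through all mini-batches)
  model.fit(x_train, y_train, batch_size=100, epochs=20)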
Speed
• Smaller batch size means more updates in one epoch
• E.g. 50000 examples (GTX 980 on MNIST):
  • batch size = 1: 50000 updates in one epoch, 166s per epoch
  • batch size = 10: 5000 updates in one epoch, 17s per epoch
• So with batch size 1 or 10, the parameters are updated about the same number of times in the same period: 1 epoch at batch size 1 takes about as long as 10 epochs at batch size 10
• Batch size = 10 is more stable
Speed - Matrix Operation
• The forward pass of the whole network, from input x to output y, is a chain of matrix operations:
  $y = \sigma\left(W^L \cdots \sigma\left(W^2 \, \sigma\left(W^1 x + b^1\right) + b^2\right) \cdots + b^L\right)$
Speed - Matrix Operation
• Why is mini-batch faster than stochastic gradient descent?
• Stochastic gradient descent computes the first layer one example at a time: $z^1 = W^1 x$, then $z^1 = W^1 x'$, ...
• Mini-batch stacks the examples into one matrix and computes them together: $\left[z^1 \; z^{1\prime}\right] = W^1 \left[x \; x'\right]$
• Practically, which one is faster? The single matrix product is, because GPUs parallelize large matrix operations
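An illustration of the point above (a sketch, not the lecture's code): computing the first layer for 100 examples as one matrix product instead of a per-example loop.

  import numpy as np

  W = np.random.randn(500, 784)        # first-layer weights
  batch = np.random.randn(784, 100)    # 100 examples, one per column

  # Stochastic style: one example at a time
  z_loop = [W @ batch[:, i] for i in range(batch.shape[1])]

  # Mini-batch style: a single matrix-matrix product
  z_batch = W @ batch                  # shape (500, 100)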
Performance
• Larger batch size yields more efficient computation
• However, it can yield worse performance
• Shuffle the training examples for each epoch, so the mini-batches differ from epoch to epoch (e.g. $x^{31}$ is grouped with different examples in epoch 1 and epoch 2)
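Keras does this shuffling for you: shuffle=True is the default in fit, shown explicitly here.

  # Re-shuffle the training data before each epoch (the Keras default)
  model.fit(x_train, y_train, batch_size=100, epochs=20, shuffle=True)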
Analysis
• For each neuron in the first layer, ask: which input x1, ..., xN gives the neuron its largest output?
• Arrange the weights according to the pixels they connect to (red: positive, blue: negative)
• The neurons in the first layer usually detect parts of the digits
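A sketch of this visualization: reshape each first-layer weight vector back to 28 x 28 and plot it. It assumes the model defined earlier; matplotlib is not part of the lecture's code.

  import matplotlib.pyplot as plt

  W1 = model.layers[0].get_weights()[0]           # kernel, shape (784, 500)
  for i in range(9):                              # first 9 neurons
      w = W1[:, i].reshape(28, 28)                # back to pixel layout
      m = abs(w).max()
      plt.subplot(3, 3, i + 1)
      plt.imshow(w, cmap='bwr', vmin=-m, vmax=m)  # red: positive, blue: negative
      plt.axis('off')
  plt.show()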
Try another task
• Document classification: features such as whether "stock" or "president" appears in a document let the machine sort it into categories like politics (政治), economy (經濟), sports (體育), or finance (財經)
http://top-breaking-news.com/
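A sketch of this setup with the same kind of network (all names and sizes here are illustrative, not from the lecture): binary bag-of-words features in, one softmax output per category.

  from keras.models import Sequential
  from keras.layers import Dense

  vocab_size = 10000   # assumed vocabulary size; input n is 1 if word n appears
  num_classes = 4      # e.g. politics, economy, sports, finance

  doc_model = Sequential()
  doc_model.add(Dense(500, input_dim=vocab_size, activation='relu'))
  doc_model.add(Dense(num_classes, activation='softmax'))
  doc_model.compile(loss='categorical_crossentropy', optimizer='adam')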
Live Demo