0% found this document useful (0 votes)

233 views24 pages

Deep Learning Cookbook

This document provides an overview of deep learning and discusses: - The evolution of AI from traditional machine learning to modern deep learning approaches using massive data sets. - How deep learning uses unsupervised training on generic code to recognize patterns in data without feature engineering. - Some successful applications of deep learning like AlphaGo, facial recognition, and autonomous vehicles. - The types of neural networks like convolutional and recurrent networks used in deep learning and some key terminology. - Why deep learning has become popular for applications involving vision, speech, text, recommendations and other areas. - How an individual customer's AI needs may evolve from exploring options to experimenting to scaling up and optimizing models. - Some of the

Uploaded by

Rashmikant Singh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

233 views24 pages

Deep Learning Cookbook

Uploaded by

Rashmikant Singh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 24

Deep Learning – a cookbook view

or “Comparative Analysis of Different Deep Learning Solutions “

The evolution of artificial intelligence Massive unstructured big data

Deep Learning
– Unsupervised training
– Generic code
– Pattern recognition

Systems can
– Observe
– Test
– Refine

Massive structured data sets Successes

– AlphaGO First Computer GO program to
Small data sets Machine learning beat a human
– Deep Blue Beating World Chess – Deep Face Facial verification
Early artificial intelligence Champion Kasparov
– Libratus AI Poker App
– ENIAC Heralded the Giant Brain“;
– DARPA Challenge Autonomous – Digital virtual assistants Siri
used for WW II ballistics
vehicle drove 132 miles – Google Self-driving cars
– Industrial robots
Statistical and mathematical models Predictive models defined by
Advanced Analytics and Heuristic
applied to solve problems machines based neural networks

1940 – 1980 1990 – 2000s Today

2
Traditional machine learning
Requires feature engineering Artificial
Intelligence

Machine
Learning

Machine learning Deep

Training Data Feature engineering
algorithm Learning

Training
Prediction
HPC

Learned model
Data Feature extraction Prediction
(prediction function)

3
Deep learning
Efficient data representations, no more feature engineering

Deep learning
Training Data
algorithm

Training
Prediction (inference)

Learned model
Data (transformation and Prediction
prediction function)

4
Types of artificial neural networks
Topology to fit data characteristics
Convolutional: Fully connected: Recurrent:
Images Speech, text, sensor Speech, text, sensor

Hidden Hidden Hidden Hidden Hidden

Input Output Input
Layer 1 Layer 2 Layer 1 Layer 2 Output Input
Layer 1
Output

5
Terminology

Flower
Epoch Batch Predictions

Model House
Errors
Iteration True labels
Training data

Worker 1 Worker 2 Worker 1 Worker 2

Worker 1 Strong scaling Weak scaling

6
Why deep learning?
Applications

Vision Speech Text Other

‒ Search & information ‒ Interactive voice ‒ Search and ranking ‒ Recommendation
extraction response (IVR) systems ‒ Sentiment analysis engines
‒ Security/Video ‒ Voice interfaces (Mobile, ‒ Machine translation ‒ Advertising
surveillance Cars, Gaming, Home) ‒ Question answering ‒ Fraud detection
‒ Self-driving cars ‒ Security (speaker ‒ AI challenges
‒ Medical imaging identification) ‒ Drug discovery
‒ Robotics ‒ Health care ‒ Sensor data analysis
‒ People with disabilities ‒ Diagnostic support

7
Applications break down

Images Image analysis

Video Video surveillance Detection

Look for a known object/pattern

Speech Speech recognition Generation

Generate content

Classification
Text Sentiment analysis Assign a label from a predefined set of
labels

Sensor Predictive maintenance Anomaly detection

Look for abnormal, unknown patterns

Other Fraud detection

8
How an individual customer’s AI evolves

Explore Experiment Scale up and Optimize

How can AI help me? How can I get started? How can I scale and optimize?

Do things better Boundary constraints Provisioning for inference

– Product development (regulations, etc.)
– Customer experience Infrastructure scale up
– Productivity Data – Training
– Employee experience Data model? Location? – Inference
How to create a model? – On-prem / cloud / hybrid
Do new things – Homegrown solution or open source?
– New disruptions – Simple ML or scalable DL? Data management
– Between edge and core
Design – Security
How to design and deploy the PoC? – Updates
– On-prem, cloud? – Regulations
– How to think about inference – Tracing

Performance
What is the best config to run?
How to tune the model to improve
accuracy?
9
Key IT challenges are constraining deep learning adoption
Limited knowledge, resources and capabilities
How to get started? How to go to production? How to optimize?

“I need simple, infrastructure and “I could use more expert advice and “I need help integrating the latest
software capabilities to rapidly and tailored solutions for migrating and technologies into my deep learning
efficiently support deep learning integrating apps in a production environment to accelerate
app development.” environment.” actionable insights.”

Immature, sub-optimal Inability to scale Lack of technology

foundation and integrate integration capabilities

Content under embargo until Oct 10, 2017 10

What about AI consumers ?

Do it yourself How do I do it ? I know better

Current wave of AI / Could benefit from better data Super-Experts – current

Machine Learning is core to science, machine learning, but wave is woefully inadequate
their business. All in-house it is not historically their core-
competency

Google, Baidu, Facebook, Banks, advertisers, Government – DoD, DoE,

Microsoft, Apple, etc. healthcare, manufacturing, NSA, NASA, etc.
food, automotive, etc.

Not ready for an ASIC. Don’t know what Begging for higher performance ASICs.
they need exactly. Many still developing Know exactly what they want to do.
on CPUs. Can’t use solutions that can’t Strong technology pull.
be verified or understood
Where to start ?
Recommend DL stack by vertical application

Verticals Voice interfaces Social media Manufacturing Oil & gas Connected cars

Data type Speech Images Video Sensor data

Data Small Moderate Large

Typical layers Convolutional Fully-connected Recurrent … Neural Network sits here

Frameworks TensorFlow Caffe 2 CNTK Torch …

Infrastructure x86 GPUs FPGAs TPU ? …

12
Neural Network : Popular Networks

Model size Model size GFLOPs

Network
(# params) (MB) (forward pass)
AlexNet 60,965,224 233 MB 0.7
GoogleNet 6,998,552 27 MB 1.6
VGG-16 138,357,544 528 MB 15.5
VGG-19 143,667,240 548 MB 19.6
ResNet50 25,610,269 98 MB 3.9
ResNet101 44,654,608 170 MB 7.6
ResNet152 60,344,387 230 MB 11.3

13
Today’s scale
Model size, data size, compute requirements

Application Model Training data FLOPs per epoch

Vision 1.7 * 109 14*106 images 6*1.7*109*14*106
~6.8 GB ~2.5 TB (256x256) ~1.4*1017
~10 TB (512x512)
Speech 60 * 106 100K hours of audio 6*60*106*34*109
~240 MB ~34*109 frames ~1.2*1019
~50 TB
Text 6.5 * 106 856*106 words 6*6.5*106*856*106
~260 MB ~3.3*1016

Signals 1.2 * 106 3106 frames 61.231063106

~4.8 MB 6.5*1013
Today’s hardware
Model size, data size, compute requirements

Application Model Training data FLOPs per epoch

Vision 1.7 * 109 14*106 images 6*1.7*109*14*106
~6.8 GB ~2.5 TB (256x256) ~1.4*1017
~10 TB (512x512)

1 epoch per hour:

~39 TFLOPS
Today’s hardware:
Google TPU2: 180 TFLOPS Tensor ops (FP16 ??)
NVIDIA Tesla V100: 15 TFLOPS SP (30 TFLOPS FP16 , 120 TFLOPS Tensor ops), 12 GB memory
NVIDIA Tesla P100: 10.6 TFLOPS SP, 16 GB memory
NVIDIA Tesla K40: 4.29 TFLOPS SP, 12 GB memory
NVIDIA Tesla K80: 5.6 TFLOPS SP (8.74 TFLOPS SP with GPU boost), 24 GB memory
INTEL Xeon Phi: 2.4 TFLOPS SP

Superdome X: ~21 TFLOPS SP, 24 TB memory

So what to recommend?

Software

Hardware

16
Building performance models

Alex Net
TensorFlow Hardware

GoogleNet Scalable, automated real-time

Worker 1 Worker 2 intelligence
Caffe 2 Strong scaling
VGG-16, VGG -19
Tensor RT
ResNet 50, 101,152 Worker 1 Worker 2
Populated with 8 GPUs
BVLC Caffe Weak scaling

Eng Acoustic Model

17
TensorFlow – Weak Scaling – Training – Different models perfromance
in Tensor Flow . Scaling up to 8 GPUs
Speedup for up to 8 GPUs
8

0
1 2 4 8
DeepMNIST EngAcousticModel GoogleNet ResNet101 ResNet152
ResNet50 SensorNet VGG16 VGG19
18
TensorFlow - Inference ( Inferences per Second) - Different Models
witth different Batch
DeepMNIST
numbers GoogleNet
4500
350000 4000
300000 3500
250000 3000
2500
200000
2000
150000 1500
100000 1000
500
50000
HOW TO ANALYZE ALL THE DIFFERENT NUMBERS .
0
0
1 2 4 8 16 32 64 128 256 512 1024 2048 4096 8192
1 2 4 8 16 32 64 128 256 512 1024 2048 4096 8192
1 2 4 8
1 2 4 8

AS WE ADD MORE
ResNet50 OPTIONS and MORE TECHNOLOGIES
VGG19 IT
WOULD BE IMPOSSIBLE TO USE
1800
1000
1600
900
1400
800
1200
700
1000 600
800 500
600 400
400 300
200 200
0 100
1 2 4 8 16 32 64 128 256 512 1024 2048 4096 8192 0
1 2 4 8 16 32 64 128 256 512 1024 2048 4096 8192
1 2 4 8
1 2 4 8

19
HPE demystifies deep learning for faster intelligence across all organizations
New IT expertise, blueprints and technologies to get started, scale, integrate and optimize

Get started rapidly: Scale and Integrate: Deliver Optimize Environment:

Develop deep learning models attractive returns Enhance competitive advantage

IT expertise and solutions Proven blueprints and services Technology integration capabilities
to “get started” with deep learning models for “scalable” production deployments to maximize performance

Expertise Proven Blueprints Integration capabilities

− Rapid technology selection guides − Reference Architectures − Enhanced global Centers of Excellence
− State of the art training − Innovation labs for best practices − Next gen technology integration
Solutions Services
− Integrated purpose-built solutions − Deploy, integrate and support
− Out of the box solutions − Flexible, on-demand capacity

20
Get
Started

Select ideal technology configurations

with HPE Deep Learning Cookbook

“Book of recipes” for deep Expert advice to Availability of

learning workloads get you started complete toolset

− Comprehensive tool set based on extensive − Informed decision making - optimal − Deep Learning Benchmarking Suite:
benchmarking hardware and software configurations available on GitHub Dec 2018
− Includes 11 workloads with 8 DL − Eliminates the “guesswork” - validated − Deep Learning Performance
frameworks and 8 HPE hardware systems methodology and data Analysis Tool: planned to be released in
− Estimates workload performance and − Improves efficiency - detects bottlenecks the beginning of 2018.
recommends an optimal HW/SW stack for in deep learning workloads − Reference configurations: available
that workload soon on HPE.com website

21
Deep Learning Cookbook helps to pick the right HW/SW stack

Knowledgebase
Benchmarking Suite Reporting tool
Performance results • Performance results
• Benchmarking scripts
• 11 reference models • Performance prediction for arbitrary
• Reference models
• 8 frameworks ANNs
• Performance metrics • 8 hardware systems
• Scalability prediction
• Optimal HW/SW configuration
for a given workload
Performance and
scalability models
Reference configurations
• Machine learning (SVR) • Image classification
to predict performance
• Others to come
of core operations
will be available externally • Analytical communication
models
internal assets
• Analytical models for overall
performance
22
23
Thank you
Natalia Vassilieva Sorin Cheran
[email protected] [email protected]

Sergey Serebryakov Bruno Monnet

[email protected] [email protected]

Azure Machine Learning Guide
100% (1)
Azure Machine Learning Guide
1,748 pages
Intro To Deep Learning
No ratings yet
Intro To Deep Learning
39 pages
Deep Learning Models
No ratings yet
Deep Learning Models
70 pages
Ai101guide 190430154655 PDF
67% (3)
Ai101guide 190430154655 PDF
34 pages
Pre-Competency Checklist: Central Bicol State University of Agriculture-Pasacao Campus
No ratings yet
Pre-Competency Checklist: Central Bicol State University of Agriculture-Pasacao Campus
12 pages
Group E Deep Learning Final
No ratings yet
Group E Deep Learning Final
31 pages
PDF Deep Learning with JavaScript: Neural networks in TensorFlow.js 1st Edition Shanqing Cai download
100% (2)
PDF Deep Learning with JavaScript: Neural networks in TensorFlow.js 1st Edition Shanqing Cai download
65 pages
Bernd Klein Python Data Analysis Letter
No ratings yet
Bernd Klein Python Data Analysis Letter
514 pages
Probability and Stats For Data Science PDF
100% (1)
Probability and Stats For Data Science PDF
237 pages
Machine Learning Notes
No ratings yet
Machine Learning Notes
52 pages
SKVA Document
No ratings yet
SKVA Document
2 pages
What Is Convolutional Neural Network
No ratings yet
What Is Convolutional Neural Network
16 pages
Machine Learning
100% (1)
Machine Learning
6 pages
Cl.7 U.3 L.2
No ratings yet
Cl.7 U.3 L.2
3 pages
Machine Learning Textbook
No ratings yet
Machine Learning Textbook
191 pages
Factors Affecting Modular Learning of Grade 12 SHS of PNHS
100% (1)
Factors Affecting Modular Learning of Grade 12 SHS of PNHS
43 pages
LECTURE: DR - Hj.Farida Repelita Wati Kembaren, M.Hum.: Intermediate Speaking Influences of Public Speaking
No ratings yet
LECTURE: DR - Hj.Farida Repelita Wati Kembaren, M.Hum.: Intermediate Speaking Influences of Public Speaking
10 pages
Deep Learning Andrew NG
100% (3)
Deep Learning Andrew NG
173 pages
Sample Outline Azure Machine Learning Engineering
No ratings yet
Sample Outline Azure Machine Learning Engineering
17 pages
Udacity Machine Learning Analysis Supervised Learning
100% (1)
Udacity Machine Learning Analysis Supervised Learning
504 pages
Statistics in Details
100% (2)
Statistics in Details
283 pages
Learning Episode 8
No ratings yet
Learning Episode 8
16 pages
StatisticsMachineLearningPythonDraft PDF
100% (1)
StatisticsMachineLearningPythonDraft PDF
323 pages
Deploy Machine Learning Models
100% (1)
Deploy Machine Learning Models
45 pages
Deep Learning Interview Questions - Deep Learning Questions
No ratings yet
Deep Learning Interview Questions - Deep Learning Questions
21 pages
771 A18 Lec4
100% (1)
771 A18 Lec4
128 pages
Course Outline - EnGR 301
No ratings yet
Course Outline - EnGR 301
7 pages
LSTM
No ratings yet
LSTM
42 pages
MACHINELEARING UNIT 1material
100% (1)
MACHINELEARING UNIT 1material
64 pages
List of Deep Learning and NLP Resources
No ratings yet
List of Deep Learning and NLP Resources
69 pages
Atiqah 47-58
No ratings yet
Atiqah 47-58
12 pages
CT2 Answer Key
No ratings yet
CT2 Answer Key
5 pages
Study Guide English 1B Week 1-3
No ratings yet
Study Guide English 1B Week 1-3
12 pages
What Objective Tests
No ratings yet
What Objective Tests
3 pages
Template For Lesson Design
No ratings yet
Template For Lesson Design
11 pages
3-7 year
No ratings yet
3-7 year
2 pages
Machine Learning Basic Principles
No ratings yet
Machine Learning Basic Principles
124 pages
Greg Bowe Resume 08
No ratings yet
Greg Bowe Resume 08
2 pages
Reflection EDUC 121 Revised
No ratings yet
Reflection EDUC 121 Revised
2 pages
Machine Learning
100% (2)
Machine Learning
211 pages
Pandas
100% (1)
Pandas
1,131 pages
R Deep Learning Essentials - Sample Chapter
100% (3)
R Deep Learning Essentials - Sample Chapter
24 pages
A Novel Adoption of LSTM in Customer Touchpoint Prediction Problems Presentation 1
100% (1)
A Novel Adoption of LSTM in Customer Touchpoint Prediction Problems Presentation 1
73 pages
Statquest Gentle Introduction To Rna Seq
100% (1)
Statquest Gentle Introduction To Rna Seq
188 pages
Lac Plan 2024 2025kinder to 3
No ratings yet
Lac Plan 2024 2025kinder to 3
3 pages
1 - Intro To Machine Learning
100% (1)
1 - Intro To Machine Learning
20 pages
Immediate download (eBook PDF) Machine Learning A Probabilistic Perspective by Kevin P. Murphy ebooks 2024
100% (8)
Immediate download (eBook PDF) Machine Learning A Probabilistic Perspective by Kevin P. Murphy ebooks 2024
46 pages
Title of The Session Session 1: Session Guide Writing Duration, Date & Venue Target Participants and Profile Objectives Terminal
No ratings yet
Title of The Session Session 1: Session Guide Writing Duration, Date & Venue Target Participants and Profile Objectives Terminal
19 pages
MA124 Syllabus
No ratings yet
MA124 Syllabus
5 pages
Deep Learning Nanodegree Syllabus 8-15
No ratings yet
Deep Learning Nanodegree Syllabus 8-15
15 pages
Silk Roud Sauran
No ratings yet
Silk Roud Sauran
4 pages
Deep Learning
No ratings yet
Deep Learning
18 pages
Machine Learning Guide: Meher Krishna Patel
No ratings yet
Machine Learning Guide: Meher Krishna Patel
121 pages
Artificial Intelligence and Machine Learning in Business
No ratings yet
Artificial Intelligence and Machine Learning in Business
5 pages
Full download Neural Networks A Visual Introduction for Beginners Michael Taylor pdf docx
100% (1)
Full download Neural Networks A Visual Introduction for Beginners Michael Taylor pdf docx
65 pages
Introduction To Machine Learning PDF
100% (1)
Introduction To Machine Learning PDF
17 pages
Face Detection & Emotion Recognition
No ratings yet
Face Detection & Emotion Recognition
26 pages
The Honest Woodcutter Lesson Plan
100% (1)
The Honest Woodcutter Lesson Plan
13 pages
Capstone Proposal - Grad 2025 1
No ratings yet
Capstone Proposal - Grad 2025 1
3 pages
Deep Reinforcement Learning
No ratings yet
Deep Reinforcement Learning
47 pages
Machine Learning Is Fun 1565131730
No ratings yet
Machine Learning Is Fun 1565131730
48 pages
MINESEC Anglais 1èreACDTI Probat 2021
No ratings yet
MINESEC Anglais 1èreACDTI Probat 2021
3 pages
What Is Machine Learning
No ratings yet
What Is Machine Learning
9 pages
Mcmaster University Case Study
No ratings yet
Mcmaster University Case Study
3 pages
DLL - Mapeh 3 - Q4 - W7
No ratings yet
DLL - Mapeh 3 - Q4 - W7
4 pages
Getting Started With MLOPs 21 Page Tutorial
No ratings yet
Getting Started With MLOPs 21 Page Tutorial
21 pages
Social Skills Lesson
No ratings yet
Social Skills Lesson
3 pages
"Hello World" of Deep Learning
No ratings yet
"Hello World" of Deep Learning
26 pages
Cause and Effect of Absenteeis To Academic Performances of Students
100% (2)
Cause and Effect of Absenteeis To Academic Performances of Students
31 pages
1 - Machine Learning (Start)
No ratings yet
1 - Machine Learning (Start)
32 pages
RNN LSTM Example Implementations With Keras TensorFlow
No ratings yet
RNN LSTM Example Implementations With Keras TensorFlow
20 pages
Keras
50% (2)
Keras
2 pages
Deep Learning Lecture 0 Introduction Alexander Tkachenko
No ratings yet
Deep Learning Lecture 0 Introduction Alexander Tkachenko
31 pages
Neural Networks and Deep Learning - Deep Learning Explained To Your Granny - A Visual Introduction For Beginners Who Want To Make Their Own Deep Learning Neural Network (Machine Learning)
100% (5)
Neural Networks and Deep Learning - Deep Learning Explained To Your Granny - A Visual Introduction For Beginners Who Want To Make Their Own Deep Learning Neural Network (Machine Learning)
84 pages
Machine Learning Handouts
No ratings yet
Machine Learning Handouts
110 pages
Convolutional Neural Network
No ratings yet
Convolutional Neural Network
7 pages
Name: Aminah NIM: 18178001 Class: A
No ratings yet
Name: Aminah NIM: 18178001 Class: A
2 pages
JUNE 9, 2022 Four Causes of Family Conflict: Semi-Detailed Lesson Plan in English 10
No ratings yet
JUNE 9, 2022 Four Causes of Family Conflict: Semi-Detailed Lesson Plan in English 10
2 pages
LED TV, Laptop, Powerpoint Presentation, Manila Paper, Colored Papers and
No ratings yet
LED TV, Laptop, Powerpoint Presentation, Manila Paper, Colored Papers and
3 pages
Geographic Coordinate Conversion
No ratings yet
Geographic Coordinate Conversion
11 pages
Tensorflow Presentation
No ratings yet
Tensorflow Presentation
13 pages
Read & Download (PDF Kindle)
No ratings yet
Read & Download (PDF Kindle)
5 pages
Business Requirements Document /: Project Name Module Name
No ratings yet
Business Requirements Document /: Project Name Module Name
11 pages
What Is A Support Vector Machine?: Primer
No ratings yet
What Is A Support Vector Machine?: Primer
3 pages
TensorFlow Developer Certification Guide
From Everand
TensorFlow Developer Certification Guide
Patrick J
No ratings yet
Talend Open Studio Cookbook
From Everand
Talend Open Studio Cookbook
Rick Barton
2/5 (1)
Apache Spark 2.x Cookbook
From Everand
Apache Spark 2.x Cookbook
Rishi Yadav
No ratings yet
Google Cloud Data Engineer 100+ Practice Exam Questions With Well Explained Answers
From Everand
Google Cloud Data Engineer 100+ Practice Exam Questions With Well Explained Answers
vivian njoroge
No ratings yet
The Datadog Handbook: A Guide to Monitoring, Metrics, and Tracing
From Everand
The Datadog Handbook: A Guide to Monitoring, Metrics, and Tracing
Robert Johnson
No ratings yet
Google Cloud Dataproc The Ultimate Step-By-Step Guide
From Everand
Google Cloud Dataproc The Ultimate Step-By-Step Guide
Gerardus Blokdyk
No ratings yet