Deep Learning Cookbook
Deep Learning Cookbook
Deep Learning
– Unsupervised training
– Generic code
– Pattern recognition
Systems can
– Observe
– Test
– Refine
Machine
Learning
Training
Prediction
HPC
Learned model
Data Feature extraction Prediction
(prediction function)
3
Deep learning
Efficient data representations, no more feature engineering
Deep learning
Training Data
algorithm
Training
Prediction (inference)
Learned model
Data (transformation and Prediction
prediction function)
4
Types of artificial neural networks
Topology to fit data characteristics
Convolutional: Fully connected: Recurrent:
Images Speech, text, sensor Speech, text, sensor
5
Terminology
Flower
Epoch Batch Predictions
Model House
Errors
Iteration True labels
Training data
6
Why deep learning?
Applications
7
Applications break down
Classification
Text Sentiment analysis Assign a label from a predefined set of
labels
8
How an individual customer’s AI evolves
Performance
What is the best config to run?
How to tune the model to improve
accuracy?
9
Key IT challenges are constraining deep learning adoption
Limited knowledge, resources and capabilities
How to get started? How to go to production? How to optimize?
“I need simple, infrastructure and “I could use more expert advice and “I need help integrating the latest
software capabilities to rapidly and tailored solutions for migrating and technologies into my deep learning
efficiently support deep learning integrating apps in a production environment to accelerate
app development.” environment.” actionable insights.”
Not ready for an ASIC. Don’t know what Begging for higher performance ASICs.
they need exactly. Many still developing Know exactly what they want to do.
on CPUs. Can’t use solutions that can’t Strong technology pull.
be verified or understood
Where to start ?
Recommend DL stack by vertical application
Verticals Voice interfaces Social media Manufacturing Oil & gas Connected cars
12
Neural Network : Popular Networks
13
Today’s scale
Model size, data size, compute requirements
Software
Hardware
16
Building performance models
Alex Net
TensorFlow Hardware
17
TensorFlow – Weak Scaling – Training – Different models perfromance
in Tensor Flow . Scaling up to 8 GPUs
Speedup for up to 8 GPUs
8
0
1 2 4 8
DeepMNIST EngAcousticModel GoogleNet ResNet101 ResNet152
ResNet50 SensorNet VGG16 VGG19
18
TensorFlow - Inference ( Inferences per Second) - Different Models
witth different Batch
DeepMNIST
numbers GoogleNet
4500
350000 4000
300000 3500
250000 3000
2500
200000
2000
150000 1500
100000 1000
500
50000
HOW TO ANALYZE ALL THE DIFFERENT NUMBERS .
0
0
1 2 4 8 16 32 64 128 256 512 1024 2048 4096 8192
1 2 4 8 16 32 64 128 256 512 1024 2048 4096 8192
1 2 4 8
1 2 4 8
AS WE ADD MORE
ResNet50 OPTIONS and MORE TECHNOLOGIES
VGG19 IT
WOULD BE IMPOSSIBLE TO USE
1800
1000
1600
900
1400
800
1200
700
1000 600
800 500
600 400
400 300
200 200
0 100
1 2 4 8 16 32 64 128 256 512 1024 2048 4096 8192 0
1 2 4 8 16 32 64 128 256 512 1024 2048 4096 8192
1 2 4 8
1 2 4 8
19
HPE demystifies deep learning for faster intelligence across all organizations
New IT expertise, blueprints and technologies to get started, scale, integrate and optimize
IT expertise and solutions Proven blueprints and services Technology integration capabilities
to “get started” with deep learning models for “scalable” production deployments to maximize performance
20
Get
Started
− Comprehensive tool set based on extensive − Informed decision making - optimal − Deep Learning Benchmarking Suite:
benchmarking hardware and software configurations available on GitHub Dec 2018
− Includes 11 workloads with 8 DL − Eliminates the “guesswork” - validated − Deep Learning Performance
frameworks and 8 HPE hardware systems methodology and data Analysis Tool: planned to be released in
− Estimates workload performance and − Improves efficiency - detects bottlenecks the beginning of 2018.
recommends an optimal HW/SW stack for in deep learning workloads − Reference configurations: available
that workload soon on HPE.com website
21
Deep Learning Cookbook helps to pick the right HW/SW stack
Knowledgebase
Benchmarking Suite Reporting tool
Performance results • Performance results
• Benchmarking scripts
• 11 reference models • Performance prediction for arbitrary
• Reference models
• 8 frameworks ANNs
• Performance metrics • 8 hardware systems
• Scalability prediction
• Optimal HW/SW configuration
for a given workload
Performance and
scalability models
Reference configurations
• Machine learning (SVR) • Image classification
to predict performance
• Others to come
of core operations
will be available externally • Analytical communication
models
internal assets
• Analytical models for overall
performance
22
23
Thank you
Natalia Vassilieva Sorin Cheran
[email protected] [email protected]
24