
Building Models with Keras
UNIT-II
Keras
• A Python package (Python 2.7-3.6)
• Sits on top of TensorFlow or Theano (Theano development has stopped)
• High-level neural network API
• Runs seamlessly on CPU and GPU
• Open source with a user manual (https://keras.io/)
• Fewer lines of code required to build/run a model
TensorFlow
• Inherits the data flow graph concept from Theano
• A Python (3.5-3.7) package / C++ library
• Runs on CPU or NVIDIA CUDA GPU
• End-to-end platform for machine/deep learning
• Multi-platform (desktop, web via TF.js, mobile via TF Lite)
• Open source with a user manual (https://www.tensorflow.org/)
• More lines of code required to build/run a model
NVIDIA CUDA Toolkit
• C/C++ library
• A parallel computing platform for NVIDIA GPUs
• Most deep learning researchers rely on GPU-accelerated computing/applications
• Not open source (https://developer.nvidia.com/cuda-zone)
• CPU vs GPU comparison: TensorFlow training a CNN model on CIFAR-10 images
Anaconda3 Installation
• Download (https://www.anaconda.com/distribution/)
• Installation (https://docs.anaconda.com/anaconda/install/)
• Restart required
TensorFlow/Keras Installation
• Start the Anaconda Navigator
  • Windows: Start -> All Programs -> Anaconda3 -> Anaconda Navigator
  • Linux: type "anaconda-navigator" in the Linux terminal
• Install TensorFlow and Keras
  • Environments -> choose All
  • type "tensorflow"
  • CPU based: tensorflow (choose 1.14) and keras (2.2.4), then apply
  • GPU based:
    • CUDA Compute Capability >= 3.0, preferably >= 3.7
    • tensorflow-gpu (choose 1.14) and keras-gpu (2.2.4), then apply
Installation Confirmed
• TensorFlow test code:

import tensorflow as tf

# build a tiny graph and run it in a session (TF 1.x-style API)
sess = tf.compat.v1.Session()
a = tf.compat.v1.constant(1)
b = tf.compat.v1.constant(2)
print(sess.run(a + b))

• Expect to get the answer 3
Installation Confirmed
• Keras requires a backend setting for Windows users:
  • https://keras.io/backend/
• Setting in keras.json:
  "backend": "tensorflow"
• Keras test code:

import keras

• Expect to see:
Using TensorFlow backend
Keras Models
• Two main types of models are available
  • The Sequential model (easy to learn, high-level API)
    • A linear stack of layers
    • You need to specify the input shape it should expect (input dimension)
    • https://keras.io/getting-started/sequential-model-guide/
  • The Model class used with the functional API (similar to TensorFlow 2.0)
    • https://keras.io/models/about-keras-models/
    • https://keras.io/getting-started/functional-api-guide/
Keras Sequential Model
• Define a sequential model:

model = Sequential()
model.add(Dense(32, input_dim=784))
model.add(Activation('relu'))
model.add(Dense(10))
model.add(Activation('softmax'))

• Compilation:

model.compile(optimizer='rmsprop',
              loss='categorical_crossentropy',
              metrics=['accuracy'])

• Training:

model.fit(data, one_hot_labels, epochs=10, batch_size=32)

• Prediction:

Y = model.predict(X)
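A minimal end-to-end sketch of the same model on random stand-in data (the array shapes and the choice of categorical_crossentropy for the 10-way softmax output are illustrative assumptions, not from the original slide):

import numpy as np
from keras.models import Sequential
from keras.layers import Dense, Activation

# random stand-in data: 1000 samples with 784 features, 10 one-hot classes
data = np.random.random((1000, 784))
one_hot_labels = np.eye(10)[np.random.randint(0, 10, size=1000)]

model = Sequential()
model.add(Dense(32, input_dim=784))   # first layer declares the input dimension
model.add(Activation('relu'))
model.add(Dense(10))
model.add(Activation('softmax'))

model.compile(optimizer='rmsprop',
              loss='categorical_crossentropy',
              metrics=['accuracy'])
model.fit(data, one_hot_labels, epochs=10, batch_size=32)
Y = model.predict(data)               # class probabilities, shape (1000, 10)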
Keras: Layers
• Input:
  input_img = Input(shape=(rows, cols, channels))
• Dense:
  x = Dense(num_of_units, activation='activation_function')
• Conv2D:
  x = Conv2D(num_of_filters, kernel_size, strides,
             activation='activation_function', padding='type_of_padding')
• MaxPool2D:
  x = MaxPool2D(pool_size)
• Flatten:
  x = Flatten()
• Dropout:
  x = Dropout(value_of_dropout)
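A short sketch wiring the layers above into a model with the functional API (the 28x28x1 input shape and filter counts are illustrative assumptions):

from keras.models import Model
from keras.layers import Input, Conv2D, MaxPool2D, Flatten, Dropout, Dense

input_img = Input(shape=(28, 28, 1))
x = Conv2D(32, (3, 3), activation='relu', padding='same')(input_img)
x = MaxPool2D((2, 2))(x)
x = Flatten()(x)
x = Dropout(0.5)(x)
x = Dense(10, activation='softmax')(x)
model = Model(inputs=input_img, outputs=x)   # maps input_img to the output tensor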
Keras: Optimizers
• SGD
• RMSProp
• AdaGrad
• Adam
• …
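Each optimizer can be passed to compile() either as a string (default settings) or as a configured object. A sketch, assuming model is an already-built Keras model (in Keras 2.2.4 the learning-rate argument is lr):

from keras.optimizers import SGD, RMSprop, Adam

opt = SGD(lr=0.01, momentum=0.9)   # an explicit object gives control of the hyperparameters
model.compile(optimizer=opt, loss='categorical_crossentropy', metrics=['accuracy'])
# equivalently, with default settings: model.compile(optimizer='adam', ...)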
Keras: Activation Functions
• Sigmoid
• Tanh
• ReLU
• LeakyReLU
• ELU
• Softmax
• …
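Sigmoid, tanh, relu, and softmax can be given as activation strings, but the parameterized activations (LeakyReLU, PReLU, ELU) are also available as layers. A sketch with an assumed layer width:

from keras.models import Sequential
from keras.layers import Dense, LeakyReLU

model = Sequential()
model.add(Dense(64, input_dim=100))   # no activation set here
model.add(LeakyReLU(alpha=0.3))       # applied as a separate layer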
Keras: Cost Functions
• Mean Squared Error ('mse')
• Binary Cross-Entropy ('binary_crossentropy')
• Kullback-Leibler Divergence ('kullback_leibler_divergence')
• …
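A rough guide to matching the loss string to the task, assuming model is already defined (the pairings below are conventional choices, not from the slide):

model.compile(optimizer='adam', loss='mse')                  # regression
model.compile(optimizer='adam', loss='binary_crossentropy')  # binary / multi-label targets
model.compile(optimizer='adam', loss='kullback_leibler_divergence')  # comparing distributions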
Keras: Defining the architecture
There are two ways to define the architecture: the Sequential model and the functional API (the original slides showed side-by-side code for each; a sketch follows).
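A hedged sketch of the same two-layer network defined both ways (the 100-dimensional input and layer sizes are illustrative):

from keras.models import Sequential, Model
from keras.layers import Dense, Input

# 1) Sequential: a linear stack of layers
seq = Sequential()
seq.add(Dense(64, activation='relu', input_dim=100))
seq.add(Dense(10, activation='softmax'))

# 2) Functional API: explicitly connect tensors
inp = Input(shape=(100,))
h = Dense(64, activation='relu')(inp)
out = Dense(10, activation='softmax')(h)
func = Model(inputs=inp, outputs=out)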
Exercises
• Exercise 1:
  Define the network architecture following the LeNet-5 model (a sketch is given below).
• Exercise 2:
  Evaluate the network performance in terms of accuracy with respect to changes in:
  1. Learning rate: 0.1 and 0.001
  2. Activation functions: ReLU and Sigmoid
  3. Dropout values: 0.25, 0.5, and 0.75
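A minimal sketch of one common LeNet-5 variant for Exercise 1, assuming 32x32 grayscale inputs such as padded MNIST (the classic network uses tanh activations and average pooling; max pooling is a frequent modern substitution):

from keras.models import Sequential
from keras.layers import Conv2D, AveragePooling2D, Flatten, Dense

model = Sequential()
model.add(Conv2D(6, (5, 5), activation='tanh', input_shape=(32, 32, 1)))  # C1
model.add(AveragePooling2D((2, 2)))                                       # S2
model.add(Conv2D(16, (5, 5), activation='tanh'))                          # C3
model.add(AveragePooling2D((2, 2)))                                       # S4
model.add(Flatten())
model.add(Dense(120, activation='tanh'))                                  # C5
model.add(Dense(84, activation='tanh'))                                   # F6
model.add(Dense(10, activation='softmax'))                                # output
model.compile(optimizer='sgd', loss='categorical_crossentropy', metrics=['accuracy'])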
Layers: activation options and weight penalties
• linear
• sigmoid
• tanh
• relu
• PReLU
• LeakyReLU
• SReLU
• L1 weight penalty
• L2 weight penalty
Convolution arithmetic animations: https://github.com/vdumoulin/conv_arithmetic
• border_mode = 'valid', no strides: subsample=(1,1)
• border_mode = 'same', no strides: subsample=(1,1)
• border_mode = 'valid', 2x2 strides: subsample=(2,2)
• border_mode = 'same', 2x2 strides: subsample=(2,2)
(border_mode and subsample are Keras 1.x argument names; in Keras 2 they are padding and strides.)
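A small sketch showing how padding and strides change the output shape (filter counts and input shape are illustrative):

from keras.models import Sequential
from keras.layers import Conv2D

m = Sequential()
m.add(Conv2D(1, (3, 3), padding='valid', strides=(1, 1),
             input_shape=(5, 5, 1)))                       # 5x5 -> 3x3 (no padding)
m.add(Conv2D(1, (3, 3), padding='same', strides=(2, 2)))   # 3x3 -> 2x2 (padded, stride 2)
m.summary()                                                # prints each layer's output shape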
Initializations
• For a discussion of weight initializations, see the Keras documentation; a usage sketch follows.
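A short sketch of setting a layer's weight initializer explicitly (the layer size, Glorot-uniform choice, and seed are illustrative):

from keras.layers import Dense
from keras import initializers

layer = Dense(64,
              kernel_initializer=initializers.glorot_uniform(seed=42),
              bias_initializer='zeros')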
Optimizers
• SGD
• RMSProp
• Adam
• Tune the learning rate!
Metrics and history
• Use "metrics" in compile() to specify what you want recorded in the training history
• It is up to you to save it!
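A sketch of capturing and saving the history, assuming a compiled model and training arrays x_train/y_train (in Keras 2.2.4 the accuracy key is 'acc'):

import json

history = model.fit(x_train, y_train, epochs=10, batch_size=32,
                    validation_split=0.2)
print(history.history['acc'])        # per-epoch training accuracy
print(history.history['val_loss'])   # per-epoch validation loss

# fit() does not persist the history; save it yourself
with open('history.json', 'w') as f:
    json.dump({k: [float(v) for v in vals]      # cast numpy floats for JSON
               for k, vals in history.history.items()}, f)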
Saving and loading weights
Saving and loading a model
Loading a pre-trained model: first and second approaches
(The original slides showed code for each case; a hedged sketch follows.)
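A minimal sketch covering the cases above, assuming model is a compiled Keras model and using illustrative file names:

from keras.models import load_model

# weights only: the architecture must be rebuilt in code before loading
model.save_weights('weights.h5')
model.load_weights('weights.h5')

# whole model: architecture + weights + optimizer state in one HDF5 file
model.save('model.h5')
restored = load_model('model.h5')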
Challenges in Tuning Hyperparameters
• Solving a problem with deep learning often follows a pipeline that includes feature engineering, model selection, training by tuning hyperparameters, and validation.
• Hyperparameters (HPs) can be divided into two categories:
  • Training-related: learning rate, batch size, dropout rate, and epoch count
  • Model-design-related: model structure, regularization, and activation functions
(Source: N. Shawki, "On Automating Hyperparameter Optimization for Deep Learning Applications")


• Due to the number of hyperparameters involved, it is nearly impossible to explore all possible combinations.

• Autotuning is an active research area that uses automated search techniques to find an optimal solution.

• A few popular autotuning algorithms are Grid Search, Random Search, Bayesian Optimization, and Gradient-based Optimization.

• Keras Tuner uses random search for finding a generalized solution.

https://keras.io/keras_tuner/

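A minimal random-search sketch with Keras Tuner (note: the modern keras_tuner package targets tf.keras rather than the standalone Keras 2.2.4 used earlier; the search ranges, the data arrays x_train/y_train, and the trial count are illustrative assumptions):

import keras_tuner as kt
from tensorflow import keras

def build_model(hp):
    # the tuner calls this once per trial with a fresh set of hyperparameters
    model = keras.Sequential()
    model.add(keras.layers.Flatten(input_shape=(28, 28)))
    model.add(keras.layers.Dense(hp.Int('units', min_value=32, max_value=256, step=32),
                                 activation='relu'))
    model.add(keras.layers.Dropout(hp.Choice('dropout', [0.25, 0.5])))
    model.add(keras.layers.Dense(10, activation='softmax'))
    model.compile(optimizer=keras.optimizers.Adam(hp.Choice('lr', [1e-2, 1e-3])),
                  loss='sparse_categorical_crossentropy',
                  metrics=['accuracy'])
    return model

tuner = kt.RandomSearch(build_model, objective='val_accuracy', max_trials=10)
tuner.search(x_train, y_train, epochs=5, validation_split=0.2)
best_model = tuner.get_best_models(num_models=1)[0]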


Hyperparameter Tuning
• The process of selecting hyperparameters is a complex optimization problem.

• Grid search of the hyperparameter space is a popular method that is simple to implement and parallelize, and provides insight into the search space.

• Ongoing research suggests that automated random search optimization is a more efficient alternative that often yields models as good as or better than manual methods, due to its ability to search larger configuration spaces.
• The problem of hyperparameter tuning ($\lambda^{*}$) can be expressed as:

$\lambda^{*} = \operatorname*{argmin}_{\lambda \in \Lambda} \Psi(\lambda) = \operatorname*{argmin}_{\lambda \in \Lambda} \mathbb{E}_{x \sim G}\left[ L(x; A_{\lambda}) \right]$

where $\lambda$ = the hyperparameters, $A$ = the learning algorithm, $\Lambda$ = the search space, $\Psi$ = the hyperparameter response function, $L$ = the loss function, and $G$ = the ground-truth distribution.


DGX A100 OVERVIEW
Presenters: Charlie Boyle, Chris Lamb, Rajeev Jayavant



SOLVING THE INFLEXIBILITY OF AI INFRASTRUCTURE
Not Optimized, Complex to Manage, Difficult to Scale Predictably
(Original diagram: separate TRAINING, ANALYTICS, and INFERENCE clusters)
• Inflexible infrastructure silos that were never meant for the pace of AI
• Constrained workload placement due to system-level characteristics
• Non-uniform performance across the data center
• Unable to adapt to dynamic workload demands
• Constrained capacity planning
https://www.youtube.com/watch?v=MY7jZGZw9vA
https://www.youtube.com/watch?v=ZevjEbu8N3E


DGX A100: THE UNIVERSAL AI SYSTEM
• One system for every AI workload: performance meets utility – analytics, AI training, and inference all in one
• Integrated access to unmatched AI expertise: fast-track AI transformation with DGXpert know-how and experience
• Game-changing performance for innovators: fastest time-to-solution with the world's first 5 PFLOPS AI system, built on NVIDIA A100
• Unmatched data center scalability: build leadership-class infrastructure that scales to keep ahead of demand


ONE SYSTEM FOR ALL AI INFRASTRUCTURE
AI Infrastructure Re-Imagined, Optimized, and Ready for Enterprise AI-at-Scale
• Flexible AI infrastructure that adapts to the pace of the enterprise
• One universal building block for the AI data center
• Uniform, consistent performance across the data center
• Any workload on any node – any time
• Limitless capacity planning with predictably great performance at scale
Analytics  Training  Inference: any job | any size | any node | anytime


GAME-CHANGING PERFORMANCE FOR INNOVATORS
• 9x Mellanox ConnectX-6 VPI HDR InfiniBand/200Gb Ethernet
  • 450GB/sec bi-directional bandwidth
• Dual 64-core AMD CPUs and 1TB system memory
  • 3.2x more cores to power the most intensive AI jobs
• 8x NVIDIA A100 GPUs with 320GB total GPU memory
  • 12 NVLinks/GPU, 600GB/sec GPU-to-GPU bi-directional bandwidth
• 6x second-generation NVSwitches
  • 4.8TB/sec bi-directional bandwidth, 2x more than the previous-generation NVSwitch
• 15TB Gen4 NVMe SSDs
  • 25GB/sec peak bandwidth, 2x faster than Gen3 NVMe SSDs


DGX A100 PERFORMANCE
(Original slide: three bar charts)
• Training (NLP: BERT-Large): 216 sequences/s on V100 (FP32) vs. 1289 sequences/s on DGX A100 (TF32) – 6x
• Inference (peak compute): 58 TOPS on a CPU server vs. 10 PetaOPS on DGX A100 – 172x
• Analytics (PageRank): 3000x CPU servers vs. a cluster of 4x DGX A100 – 688B graph edges/s
Footnotes: BERT pre-training throughput using PyTorch, (2/3) Phase 1 (seq len 128) and (1/3) Phase 2 (seq len 512); V100: DGX-1 with 8x V100 using FP32; DGX A100: 8x A100 using TF32. CPU server: 2x Intel Platinum 8280 using INT8; DGX A100 inference: 8x A100 using INT8 with structural sparsity. PageRank: published Common Crawl data set, 128B edges, 2.6TB graph.
NEW FEATURES
DGX A100: NEW A100 GPUS AND 2X FASTER NVSWITCH
5 PetaFLOPS AI Performance
• Eight new A100 Tensor Core GPUs with 320GB total HBM2
• Twelve NVLinks per GPU, 2x more than V100
  • 600GB/s bi-directional bandwidth between any GPU pair
  • ~10x PCIe Gen4 bandwidth with next-gen NVLink
• All GPUs fully connected with six next-gen NVSwitches
  • 4.8TB/s bi-directional bandwidth
  • In one second we could transfer 426 hours of HD video, or download HD video to 80K smartphones simultaneously


CONSOLIDATING DIFFERENT WORKLOADS ON DGX A100
One Platform for Training, Inference and Data Analytics
• 4x A100s: DL training
• 2x A100s: data analytics
• 2x A100s: inferencing in MIG mode
(Original diagram: MIG instances 1-7 and 8-14, each running a TensorRT (TRT) inference workload)


UNMATCHED SCALABILITY WITH MELLANOX NETWORKING
Highest Network Throughput for Data and Clustering
• Cluster networking: eight Mellanox single-port ConnectX-6
  • Supporting HDR/HDR100/EDR InfiniBand default or 200GigE
  • 450GB/sec total peak bandwidth
• Data/storage networking: one Mellanox dual-port ConnectX-6
  • Supporting 200/100/50/40/25/10Gb Ethernet default or HDR/HDR100/EDR InfiniBand
  • One optional dual-port CX-6 available as an add-on
• All I/O is now PCIe Gen4, a 2x performance increase over Gen3
• Scale up multiple DGX A100 nodes with the Mellanox Quantum switch, the world's smartest network switch
THE WORLD'S MOST SECURE AI SYSTEM FOR ENTERPRISE
Built-In Security: Multi-layered Defense for AI Infrastructure
DGX A100 delivers the most robust security posture for your AI enterprise:
• Secure boot
• Self-Encrypted Drives (SED) to protect data at rest
• Secure firmware update of the GPU board, CPU board, and BMC
INTRODUCING: NVIDIA DGXpert
With Every DGX System – Your Trusted Navigator in AI Transformation
• 14,000+ AI-fluent experts
• DESIGN | PLAN | BUILD | TEST | DEPLOY | OPERATE | MONITOR
• With you every step of the way – included with every DGX system
DGX: DELIVERING AI FOR BUSINESS
Backed by 1000's of Data Scientists, Engineers & SATURNV
• Plan: system sizing, network design, secure AI guidance
• Deploy: HPL system testing, cluster tools setup, system runbook services
• Optimize: DLI for new features, app code reviews, technology upgrades
• AI workflow lifecycle management: data ingestion, data analytics, management
• Software stack: MAGLEV AI workflow SW, DL SW optimized with TF32 Tensor Core acceleration, HPC SW, data analytics optimized with RAPIDS, data management
• Highest-performance systems


NVIDIA DGX SUPERPOD WITH DGX A100
Unmatched data center scalability – deployed in under 3 weeks
• Leadership-class AI infrastructure
• The blueprint for AI power and scale using DGX A100
• Infused with the expertise of NVIDIA's AI practitioners
• Designed to solve the previously unsolvable
• Configurations start at 20 systems
NVIDIA DGX SuperPOD deployed in SATURNV:
• 1,120 A100 GPUs
• 140 DGX A100 systems
• 170 Mellanox 200G HDR switches
• 4 PB of high-performance storage
• 700 PFLOPS of power to train the previously unsolvable
MORE THAN A SERVER – NVIDIA'S COMMITMENT TO DELIVERING AI SUCCESS
Backed by a Global Team of DGXperts
• 14,000+ "AI-fluent" practitioners with a decade of experience
• Backed by SATURNV – the world's largest DGX proving ground
• Fully optimized: full-stack solution, optimized at every layer – data, algorithms, models + compute, storage, networking, and more
• Field-proven: thousands of deployed AI systems and customers
• Unmatched A100 | Universal AI Data Center Platform | AI-fluent Talent | World's First at Scale