Lecture 8 – Computational Graphs, PyTorch and TensorFlow

The document discusses computation graphs and deep learning frameworks like PyTorch and TensorFlow. It defines a computation graph as a directed acyclic graph with nodes for variables and operations. Computation graphs allow automatic calculation of gradients using backpropagation. Frameworks like PyTorch and TensorFlow implement computation graphs with dynamic "define by run" graphs or static "define and run" graphs. This allows automatic calculation of gradients for training neural networks.


Lecture 8 – Computational Graphs; PyTorch and TensorFlow

DD2424

April 11, 2019

DD2424 - Lecture 8 1
Outline

• First Part
  • Computation Graphs
  • TensorFlow
  • PyTorch
  • Notes

• Second Part

DD2424 - Lecture 8 2
Frameworks

DD2424 - Lecture 8 3
Frameworks

DD2424 - Lecture 8 4
O’Reilly Poll: Most popular framework for machine learning

[ Source: https://www.techrepublic.com/google-amp/article/most-popular-programming-language-frameworks-and-tools-for-machine-learning/ ]

DD2424 - Lecture 8 5
What are computation graphs?

DD2424 - Lecture 8 6
Computation Graph

• DAG (directed acyclic graph)

• Nodes
  • Variables
  • Mathematical Operations

• Edges
  • Feeding input

[Figure: var → op → var]

DD2424 - Lecture 8 7
Computation Graph

• c = a + b

[Figure: inputs a and b feed an addition node that outputs c = a + b]

DD2424 - Lecture 8 8
Computation Graph

• c = a + b * 2

[Figure: b feeds the node z = b * 2; then a and z feed the node c = a + z]

DD2424 - Lecture 8 9
Computation Graph

• Tensors: Multi-dimensional arrays


• a = Wx + b

[Figure: W and x feed the node z = Wx; then z and b feed the node a = z + b]

DD2424 - Lecture 8 10
Computation Graph

• A feed-forward neural network

[Figure: inputs W1, x, b1 feed the chain z1 = W1 x → a1 = z1 + b1 → s1 = σ(a1)]

DD2424 - Lecture 8 11
Computation Graph

• A multi-layer feed-forward neural network

[Figure: z1 = W1 x → a1 = z1 + b1 → s1 = σ(a1) → z2 = W2 s1 → a2 = z2 + b2 → s2 = σ(a2), with inputs x, W1, b1, W2, b2]

DD2424 - Lecture 8 12
Python (NumPy)

[Code screenshot: the graph z = Wx, a = z + b written directly in NumPy]
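The code screenshot from this slide is not preserved in the transcript. A minimal NumPy sketch of the same two-node graph, with shapes chosen only for illustration:

import numpy as np

# toy shapes, chosen only for illustration
W = np.random.randn(3, 4)   # weight matrix
x = np.random.randn(4)      # input vector
b = np.random.randn(3)      # bias vector

z = W.dot(x)                # node 1: z = W x
a = z + b                   # node 2: a = z + b
print(a.shape)              # (3,)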
DD2424 - Lecture 8 13
PyTorch

[Code screenshots: the same graph z = Wx, a = z + b written in NumPy and in PyTorch, side by side]
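The screenshots are likewise missing; a minimal PyTorch sketch of the same computation (the tensor API is deliberately close to NumPy), again with illustrative shapes:

import torch

W = torch.randn(3, 4)       # weight matrix
x = torch.randn(4)          # input vector
b = torch.randn(3)          # bias vector

z = torch.matmul(W, x)      # node 1: z = W x
a = z + b                   # node 2: a = z + b
print(a.shape)              # torch.Size([3])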
DD2424 - Lecture 8 14
PyTorch

[Code screenshots: NumPy vs. PyTorch comparison]

Not always!

DD2424 - Lecture 8 15
PyTorch-NumPy

• Converting a Torch Tensor to a NumPy array and vice versa is a breeze.

DD2424 - Lecture 8 16
PyTorch-NumPy

• Converting a Torch Tensor to a NumPy array and vice versa is a breeze.

Shared Memory
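A minimal sketch of the conversion in both directions. On the CPU the Torch tensor and the NumPy array share the same underlying memory, so an in-place change to one is visible in the other:

import numpy as np
import torch

t = torch.ones(3)
n = t.numpy()               # Torch tensor -> NumPy array (shares memory on CPU)
t.add_(1)                   # in-place change to the tensor ...
print(n)                    # ... is visible in the array: [2. 2. 2.]

a = np.ones(3)
t2 = torch.from_numpy(a)    # NumPy array -> Torch tensor (also shares memory)
np.add(a, 1, out=a)
print(t2)                   # tensor([2., 2., 2.], dtype=torch.float64)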

DD2424 - Lecture 8 17
PyTorch-NumPy

• Converting a Torch Tensor to a NumPy array and vice versa is a breeze.

DD2424 - Lecture 8 18
“Define by Run” Computation Graphs

This kind of computation graph is called “define by run”: the graph is built on the fly as the operations execute.

Also referred to as “dynamic”.
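An illustrative sketch (not from the slides): in PyTorch the graph is built while the Python code executes, so ordinary control flow can give a different graph on every run, depending on the data:

import torch

x = torch.randn(3, requires_grad=True)

# The graph is recorded as this code runs: which branch is taken,
# and hence which ops end up in the graph, depends on the data.
if x.sum() > 0:
    y = (x * 2).sum()
else:
    y = (x ** 2).sum()

y.backward()     # backprop through whatever graph was actually built
print(x.grad)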

DD2424 - Lecture 8 19
“Define and Run” Computation Graphs

• First define the graph structure

• Then run it by feeding in the (input) variables.

Define graph G: inputs W1, x, b1 feed the chain z1 = W1 x → a1 = z1 + b1 → s1 = σ(a1)

Run the graph G:
• Run G with x1, W1, b1
• Run G with x2, W2, b2
• …

Also known as “static graphs” (see the TensorFlow sketch below)
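A minimal TensorFlow 1.x sketch of this pattern (an assumption about what the lecture's code looked like, not a reproduction of it): the graph for s1 = σ(W1 x + b1) is defined once with placeholders and then run repeatedly with different inputs:

import numpy as np
import tensorflow as tf   # TensorFlow 1.x API

# Define graph G once
x  = tf.placeholder(tf.float32, shape=[4, 1])
W1 = tf.placeholder(tf.float32, shape=[3, 4])
b1 = tf.placeholder(tf.float32, shape=[3, 1])

z1 = tf.matmul(W1, x)     # z1 = W1 x
a1 = z1 + b1              # a1 = z1 + b1
s1 = tf.sigmoid(a1)       # s1 = sigma(a1)

# Run G many times, feeding in different values each time
with tf.Session() as sess:
    for _ in range(2):
        out = sess.run(s1, feed_dict={
            x:  np.random.randn(4, 1).astype(np.float32),
            W1: np.random.randn(3, 4).astype(np.float32),
            b1: np.random.randn(3, 1).astype(np.float32)})
        print(out)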

DD2424 - Lecture 8 20
Define the graph, then run the graph many times

DD2424 - Lecture 8 21
TensorFlow
Data loop

• Dynamic Graph • Static Graph

DD2424 - Lecture 8 22
Why computation graphs at all?!

DD2424 - Lecture 8 23
Why computation graphs?

• In lecture 3, you’ve learnt how to do backprop using the chain rule

DD2424 - Lecture 8 24
Why computation graphs?

• Is it feasible to do this by hand for a large network?

DD2424 - Lecture 8 25
Why computation graphs?

• Automatic chain rule

• Automatic back-prop using implemented operations
  • Each operation has its gradient already implemented
  • If you want to use a novel operation, you have to provide its gradient w.r.t. its inputs and its learnable parameters (if any); see the sketch below
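As an illustration of supplying the gradient for a novel operation, here is a hedged PyTorch sketch using torch.autograd.Function; the "Cube" op is invented purely for this example (in TF 1.x the analogous mechanism is a custom op with a registered gradient):

import torch

class Cube(torch.autograd.Function):
    # An invented "novel" op: f(x) = x^3, with a hand-written gradient.

    @staticmethod
    def forward(ctx, x):
        ctx.save_for_backward(x)          # keep what backward will need
        return x ** 3

    @staticmethod
    def backward(ctx, grad_output):
        (x,) = ctx.saved_tensors
        return grad_output * 3 * x ** 2   # dL/dx = dL/df * df/dx

x = torch.randn(4, requires_grad=True)
y = Cube.apply(x).sum()
y.backward()
print(torch.allclose(x.grad, 3 * x.detach() ** 2))   # True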

DD2424 - Lecture 8 26
Let’s look at examples in PyTorch and TensorFlow

DD2424 - Lecture 8 27
Computation Graph

• A feed-forward neural network

[Figure: inputs W1, x, b1 feed the chain z1 = W1 x → a1 = z1 + b1 → s1 = σ(a1)]

DD2424 - Lecture 8 28
Computation Graph

• A feed-forward neural network with squared 𝐿2 loss

[Figure: inputs W1, x, b1, y feed the chain z1 = W1 x → a1 = z1 + b1 → s1 = σ(a1) → l = |s1 − y|^2]

DD2424 - Lecture 8 29
Backprop in Computation Graph

• Learnable parameters

[Figure: the graph z1 = W1 x → a1 = z1 + b1 → s1 = σ(a1) → l = |s1 − y|^2, with the learnable parameters W1 and b1 highlighted]

DD2424 - Lecture 8 30
Backprop in Computation Graph

[Figure: the graph z1 = W1 x → a1 = z1 + b1 → s1 = σ(a1) → l = |s1 − y|^2, annotated with ∂l/∂W1 at W1 and ∂l/∂b1 at b1]

DD2424 - Lecture 8 31
Backprop in Computation Graph

∂l/∂W1 = (∂l/∂s1)(∂s1/∂a1)(∂a1/∂z1)(∂z1/∂W1)        ∂l/∂b1 = (∂l/∂s1)(∂s1/∂a1)(∂a1/∂b1)

[Figure: the graph z1 = W1 x → a1 = z1 + b1 → s1 = σ(a1) → l = |s1 − y|^2, with the local gradients ∂z1/∂W1, ∂a1/∂z1, ∂s1/∂a1, ∂l/∂s1 and ∂a1/∂b1 written on the corresponding edges]

DD2424 - Lecture 8 32
Backprop in Computation Graph

A deep learning framework provides automatic calculation of the gradients of its output variables w.r.t. its input variables.

[Figure: the same graph with the local gradients ∂z1/∂W1, ∂a1/∂z1, ∂s1/∂a1, ∂l/∂s1 and ∂a1/∂b1 on the edges; the framework chains them together automatically into ∂l/∂W1 and ∂l/∂b1]

DD2424 - Lecture 8 33
Backprop in Computation Graph

• Addition Node
  • Forward pass: a = b + c
  • Backward pass: ∂a/∂b = 1 and ∂a/∂c = 1

DD2424 - Lecture 8 34
Backprop in Computation Graph

• Max Node
  • Forward pass: a = max(b, c)
  • Backward pass:
    • If b < c: ∂a/∂b = 0 and ∂a/∂c = 1
    • If b > c: ∂a/∂b = 1 and ∂a/∂c = 0

DD2424 - Lecture 8 35
Variables and Ops

• Ops
  • Intermediate or final nodes

• Variables
  • Intrinsic parameters of the model
  • Input to the model

[Figure: the graph z1 = W1 x → a1 = z1 + b1 → s1 = σ(a1) → l = |s1 − y|^2, with the variables W1, x, b1, y as inputs and the remaining nodes as ops]

DD2424 - Lecture 8 36
Variables and Ops

• Ops
  • Intermediate or final nodes

• Variables
  • Intrinsic parameters of the model
  • Input to the model

[Figure: the graph z1 = W1 x → a1 = z1 + b1 → s1 = σ(a1) → l = |s1 − y|^2, with the variables W1, x, b1, y as inputs and the remaining nodes as ops]

DD2424 - Lecture 8 37
Variables and Ops

• Variables
  • Intrinsic parameters of the model
  • Input to the model

• TensorFlow
  • Variables (for the model parameters)
  • Placeholders (for the inputs)

• PyTorch
  • Variables

[Figure: the graph z1 = W1 x → a1 = z1 + b1 → s1 = σ(a1) → l = |s1 − y|^2 with inputs W1, x, b1, y]

DD2424 - Lecture 8 38
PyTorch Autograd

• package: torch.autograd

[Figure: a Variable wraps the data Tensor, the gradient w.r.t. this variable, and the Function that created this variable]

DD2424 - Lecture 8 39
PyTorch Autograd

DD2424 - Lecture 8 40
PyTorch Autograd

DD2424 - Lecture 8 41
PyTorch Autograd

DD2424 - Lecture 8 42
PyTorch Autograd

DD2424 - Lecture 8 43
PyTorch Autograd

• Calculate gradients using the backward() method of a Variable

• var.backward()
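A minimal sketch of the pattern for the network above (in recent PyTorch versions the Variable wrapper is merged into Tensor, so requires_grad=True plays the same role):

import torch

W1 = torch.randn(3, 4, requires_grad=True)
b1 = torch.randn(3, requires_grad=True)
x  = torch.randn(4)
y  = torch.randn(3)

s1 = torch.sigmoid(torch.matmul(W1, x) + b1)   # forward pass
l  = torch.sum((s1 - y) ** 2)                  # squared L2 loss

l.backward()            # backprop from the scalar loss
print(W1.grad.shape)    # gradients accumulate in .grad
print(b1.grad.shape)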

DD2424 - Lecture 8 44
TensorFlow gradients

• Add gradient nodes to the graph where necessary using

  tf.gradients(ys, xs, grad_ys)

• And evaluate them in a session
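A hedged TF 1.x sketch of this workflow, with illustrative shapes: placeholders hold the data and the current parameter values, tf.gradients adds the gradient nodes, and the parameter update (next slide) is done outside the graph with the evaluated gradients:

import numpy as np
import tensorflow as tf   # TensorFlow 1.x API

x  = tf.placeholder(tf.float32, [4, 1])
y  = tf.placeholder(tf.float32, [3, 1])
W1 = tf.placeholder(tf.float32, [3, 4])
b1 = tf.placeholder(tf.float32, [3, 1])

s1   = tf.sigmoid(tf.matmul(W1, x) + b1)
loss = tf.reduce_sum(tf.square(s1 - y))

# Add gradient nodes for dloss/dW1 and dloss/db1 to the graph
grad_W1, grad_b1 = tf.gradients(loss, [W1, b1])

W1_val = np.random.randn(3, 4).astype(np.float32)
b1_val = np.random.randn(3, 1).astype(np.float32)
lr = 0.1

with tf.Session() as sess:
    for _ in range(3):
        feed = {x:  np.random.randn(4, 1).astype(np.float32),
                y:  np.random.randn(3, 1).astype(np.float32),
                W1: W1_val, b1: b1_val}
        gW, gb = sess.run([grad_W1, grad_b1], feed_dict=feed)
        W1_val -= lr * gW     # "then update the parameters", here in NumPy
        b1_val -= lr * gb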

DD2424 - Lecture 8 45
TensorFlow gradients

• Then update the parameters

DD2424 - Lecture 8 46
TensorFlow gradient

• Use tf.Variable instead

DD2424 - Lecture 8 47
How to use GPU?

DD2424 - Lecture 8 48
PyTorch GPU

Turn variables into “GPU” variables by the following command:

• var = var.cuda(#)

DD2424 - Lecture 8 49
PyTorch GPU

Turn back variables into “CPU” variables by the following command:

• var = var.cpu()
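A minimal sketch combining the two commands, guarded so it also runs on a CPU-only machine (newer PyTorch code typically uses .to(device) instead):

import torch

var = torch.randn(3, 4)

if torch.cuda.is_available():
    var = var.cuda(0)       # move to GPU 0
    print(var.device)       # cuda:0

var = var.cpu()             # move back to the CPU
print(var.device)           # cpu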

DD2424 - Lecture 8 50
TensorFlow GPU

• In TF, variables and operations can sit on a specific device

• tf.device('/gpu:0')
• tf.device('/gpu:1')
• …
• tf.device('/cpu:0')
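A short TF 1.x sketch of device placement (note the device strings are quoted):

import tensorflow as tf   # TensorFlow 1.x API

with tf.device('/gpu:0'):              # ops created here are pinned to GPU 0
    a = tf.random_normal([2, 3])
    b = tf.random_normal([3, 2])
    c = tf.matmul(a, b)

with tf.device('/cpu:0'):              # and this op to the CPU
    d = tf.reduce_sum(c)

with tf.Session(config=tf.ConfigProto(allow_soft_placement=True,
                                      log_device_placement=True)) as sess:
    print(sess.run(d))                 # soft placement falls back to CPU if no GPU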

DD2424 - Lecture 8 51
TensorFlow GPU

• In TF, variables and operations can sit on a specific device


tf.Session(config=tf.ConfigProto(log_device_placement=True))

MatMul: (MatMul): /job:localhost/replica:0/task:0/device:GPU:0


2018-04-10 12:59:09.508497: I tensorflow/core/common_runtime/placer.cc:874] MatMul: (MatMul)/job:localhost/replica:0/task:0/device:GPU:0
add: (Add): /job:localhost/replica:0/task:0/device:GPU:0
2018-04-10 12:59:09.508513: I tensorflow/core/common_runtime/placer.cc:874] add: (Add)/job:localhost/replica:0/task:0/device:GPU:0
Maximum: (Maximum): /job:localhost/replica:0/task:0/device:GPU:0
2018-04-10 12:59:09.508525: I tensorflow/core/common_runtime/placer.cc:874] Maximum: (Maximum)/job:localhost/replica:0/task:0/device:GPU:0
Maximum/y: (Const): /job:localhost/replica:0/task:0/device:GPU:0
2018-04-10 12:59:09.508537: I tensorflow/core/common_runtime/placer.cc:874] Maximum/y: (Const)/job:localhost/replica:0/task:0/device:GPU:0
Placeholder_2: (Placeholder): /job:localhost/replica:0/task:0/device:GPU:0
2018-04-10 12:59:09.508548: I tensorflow/core/common_runtime/placer.cc:874] Placeholder_2: (Placeholder)/job:localhost/replica:0/task:0/device:GPU:0
Placeholder_1: (Placeholder): /job:localhost/replica:0/task:0/device:GPU:0
2018-04-10 12:59:09.508558: I tensorflow/core/common_runtime/placer.cc:874] Placeholder_1: (Placeholder)/job:localhost/replica:0/task:0/device:GPU:0
Placeholder: (Placeholder): /job:localhost/replica:0/task:0/device:GPU:0
2018-04-10 12:59:09.508567: I tensorflow/core/common_runtime/placer.cc:874] Placeholder: (Placeholder)/job:localhost/replica:0/task:0/device:GPU:0

DD2424 - Lecture 8 52
TensorFlow GPU

• Some TF operations do not have a CUDA implementation

tf.Session(config=tf.ConfigProto(
allow_soft_placement=True, log_device_placement=True))

DD2424 - Lecture 8 53
How to implement complicated models in practice?

DD2424 - Lecture 8 54
PT High-Level Library

• PyTorch package called nn and class called Module
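A minimal sketch of the nn.Module pattern; the two-layer network and its sizes are invented for illustration:

import torch
import torch.nn as nn

class TwoLayerNet(nn.Module):
    def __init__(self, d_in, d_hidden, d_out):
        super(TwoLayerNet, self).__init__()
        self.fc1 = nn.Linear(d_in, d_hidden)    # W1 x + b1
        self.fc2 = nn.Linear(d_hidden, d_out)   # W2 s1 + b2

    def forward(self, x):
        s1 = torch.sigmoid(self.fc1(x))
        return self.fc2(s1)

net = TwoLayerNet(4, 8, 3)
out = net(torch.randn(2, 4))    # a batch of 2 inputs
print(out.shape)                # torch.Size([2, 3])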

DD2424 - Lecture 8 55
TF High-Level Libraries

• Keras: highest abstraction (see the sketch below)

• SLIM: best pre-trained models
• TFLearn
• Sonnet
• Pretty Tensor
• …
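A minimal Keras sketch of the "highest abstraction" point; the layer sizes and random data are purely illustrative:

import numpy as np
import tensorflow as tf

# A small fully connected classifier, defined and trained in a few lines
model = tf.keras.Sequential([
    tf.keras.layers.Dense(8, activation='sigmoid', input_shape=(4,)),
    tf.keras.layers.Dense(3, activation='softmax'),
])
model.compile(optimizer='sgd', loss='sparse_categorical_crossentropy')

x = np.random.randn(32, 4)
y = np.random.randint(0, 3, size=32)
model.fit(x, y, epochs=1, batch_size=8)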

DD2424 - Lecture 8 56
Data, storage, and loading

!!!Important!!!

• Always monitor CPU/GPU usage (Linux: nvidia-smi, top)

• Make storage more efficient (TFRecords, etc.)

• Make the reading pipeline more efficient (parallel readers, prefetching, etc.); see the sketch below
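A hedged sketch of an efficient tf.data input pipeline with parallel preprocessing and prefetching; the preprocess function is invented, and the PyTorch counterpart would be a DataLoader with num_workers > 0:

import tensorflow as tf

def preprocess(x):
    # invented placeholder for decoding / augmentation work
    return tf.cast(x, tf.float32) / 255.0

dataset = (tf.data.Dataset.range(1000)
           .map(preprocess, num_parallel_calls=4)  # parallel readers/decoders
           .batch(32)
           .prefetch(1))                           # overlap loading with training
# In TF 1.x you would then create an iterator, e.g. dataset.make_one_shot_iterator()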

DD2424 - Lecture 8 57
Use Visualization

• Always monitor the loss function on the training and validation sets visually
• Monitor all other important scalars, such as the learning rate, regularization loss,
  layer activation summaries, how full your data queues are, and …

• If you have an imbalanced classification problem, visualize the CE loss separately
  for each class

• If you work with images, visualize samples from the batch from time to time; if you do
  data augmentation, visualize the original sample as well as the augmented one

• TensorBoard for TF (see the sketch below)
• TensorBoardX, matplotlib, seaborn, … for PT
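A minimal TF 1.x sketch of logging a scalar to TensorBoard (a tensorboardX SummaryWriter gives PyTorch users essentially the same API); the tag, log directory, and loss values are illustrative:

import tensorflow as tf   # TensorFlow 1.x API

loss = tf.placeholder(tf.float32, [])
tf.summary.scalar('train_loss', loss)        # scalar curve shown in TensorBoard
merged = tf.summary.merge_all()

with tf.Session() as sess:
    writer = tf.summary.FileWriter('./logs', sess.graph)
    for step in range(10):
        summary = sess.run(merged, feed_dict={loss: 1.0 / (step + 1)})
        writer.add_summary(summary, step)
    writer.close()
# then inspect with: tensorboard --logdir ./logs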
DD2424 - Lecture 8 58
Use Visualization

You can have the configuration shown as a text file in TensorBoard!

DD2424 - Lecture 8 59
Which one is better? PyTorch or TensorFlow?

DD2424 - Lecture 8 60
pros and cons

• PyTorch: easier for prototyping


• PyTorch: much easier to implement flexible graphs
• PyTorch: different structures in each iteration (dependent on data). This is possible with TF too, but is a pain.
• PyTorch: manipulating weight and gradients
• PyTorch: code-level debugging (breakpoints, imperative, tracing your own code instead of TF kernels)
• PyTorch: probably better abstractions for dataset, variable, parallelism, etc. but TF has many high-level wrappers with better abstractions
• Tie?!: faster run-time (NHWC vs. NCHW)
• TF: TensorBoard
• TF: research-level debugging (TensorBoard)
• TF: windows
• TF: distributed training (PyTorch has it now too, but seems not as developed as the TF version)
• TF: easier with distributing the code over multiple devices (GPUs/CPU) (maybe not anymore)
• TF: online community is noticeably larger
• TF: data readers
• TF: supposedly more optimizations of the graph (done by the engine)
• TF: documentation and tutorials
• TF: more models available
• TF: Serialization, code and portability (saving and loading models across platforms, or checkpoints)
• TF: Deployment: Server, Mobile, etc. (TensorFlow Serving, TensorFlow Lite)
• TF: Richer API (e.g. FFT)
• TF: Automatic shape inference
• TF has a MOOC: https://eu.udacity.com/course/deep-learning--ud730

DD2424 - Lecture 8 61
TensorFlow Eager execution

• Eager Execution
• Dynamic!

• tf.enable_eager_execution() (see the sketch below)

• Considerably Slower (being worked on)

• https://www.tensorflow.org/guide/eager
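A minimal sketch (TF 1.x only; in TF 2.x eager execution is on by default and this call is gone):

import tensorflow as tf   # TensorFlow 1.x

tf.enable_eager_execution()      # must be called once, at program startup

# Ops now run immediately and return concrete values, no Session needed
a = tf.constant([[1.0, 2.0], [3.0, 4.0]])
b = tf.matmul(a, a)
print(b.numpy())                 # [[ 7. 10.] [15. 22.]]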

DD2424 - Lecture 8 62
Caffe(2)

• Portability is seamless (e.g. mobile apps)

• Simplest framework for fine-tuning or feature extraction

• Used to be fastest (Caffe)

DD2424 - Lecture 8 63
Summary

• Don’t take the following statements too seriously! It depends on many factors
  • If you want to use pretrained classic deep networks (AlexNet, VGG, ResNet, …) for feature extraction and/or fine-tuning → use Caffe and/or Caffe2
  • If you have a mobile application in mind → use Caffe/Caffe2 or TensorFlow
  • If you want something more pythonic → use PyTorch
  • If you are familiar with Matlab and don’t need much flexibility or advanced layers → use MatConvNet
  • If you don’t need so much flexibility and still want to use Python → use Keras
  • If you are working on NLP applications or complicated RNNs → use PyTorch
  • If you want large community help and sustainable learning of a framework → use TensorFlow
  • If you want to work on bleeding-edge papers → see what framework has the original and/or cleanest implementation (most likely TensorFlow)
  • If you want to prototype many different novel setups → use PyTorch or TF Eager

DD2424 - Lecture 8 64
