

PyTorch 101
Deep Learning PhD Course
2017/2018

Marco Ciccone
Dipartimento di Informatica Elettronica e Bioingegneria
Politecnico di Milano

What is PyTorch?
It is a Python-based scientific computing package targeted at two audiences:

- A replacement for NumPy that uses the power of GPUs

- A deep learning research platform that provides maximum flexibility and speed

import torch
x = torch.Tensor(5, 3)  # construct a 5x3 matrix, uninitialized
print(x)

Multiple syntaxes
Syntax 1

y = torch.rand(5, 3)
print(x + y)
print(torch.add(x, y))

Addition: providing an output tensor as argument

result = torch.Tensor(5, 3)
torch.add(x, y, out=result)
print(result)

Syntax 2: In-place

y.add_(x)  # adds x to y
print(y)

NOTE: all in-place operations have the suffix _



NumPy Bridge
Converting a Torch Tensor to a NumPy array and vice versa is a breeze.

Torch Tensor => NumPy array

import torch

a = torch.ones(5)
print(a)
b = a.numpy()
print(b)

NumPy array => Torch Tensor

import torch
import numpy as np

a = np.ones(5)
b = torch.from_numpy(a)
np.add(a, 1, out=a)
print(a)
print(b)

NOTE: The Torch Tensor and NumPy array will share their underlying memory locations,
and changing one will change the other.

CUDA Tensors
# let us run this cell only if CUDA is available
if torch.cuda.is_available():
    x = x.cuda()
    y = y.cuda()
    x + y
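The same pattern applies to whole models: calling .cuda() on an nn.Module (introduced later in these slides) moves all of its parameters to GPU memory. A minimal sketch, assuming a model net and an input Variable already exist:

# a sketch, not from the slides: model and inputs must live on the same device
if torch.cuda.is_available():
    net = net.cuda()        # moves every parameter of the module to the GPU
    input = input.cuda()    # move the input as well
    output = net(input)     # the forward pass now runs on the GPU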

Autograd (Automatic Differentiation)


The autograd package provides automatic differentiation for all operations on Tensors. It is a define-by-run
framework, which means that your backprop is defined by how your code is run, and that every single
iteration can be different.

autograd.Variable is the central class of the package.

It wraps a Tensor, and supports nearly all of the operations defined on it.

Once you finish your computation you can call .backward() and have all the gradients computed
automatically.

You can access the raw tensor through the .data attribute, while the gradient w.r.t. this variable is
accumulated into .grad.

PyTorch Variables have the same API as PyTorch tensors: (almost) any operation you can do on a
Tensor you can also do on a Variable; the difference is that autograd allows you to automatically
compute gradients.

Autograd Example
import torch
from torch.autograd import Variable

x = Variable(torch.ones(2, 2), requires_grad=True)


print(x)

y = x + 2
print(y)
print(y.grad_fn)

z = y * y * 3
out = z.mean()
print(z, out)
out.backward()
print(x.grad)

Try it on Jupyter!
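For reference, out = (1/4) * sum_i 3 * (x_i + 2)^2, so d(out)/dx_i = 1.5 * (x_i + 2) = 4.5 when x_i = 1; x.grad should therefore print a 2x2 tensor filled with 4.5.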

Static vs Dynamic graph


Again we define a computational graph, and use automatic differentiation to
compute gradients.

- TF: Static graph
  - The computational graph is defined once and then executed over and over again, possibly feeding different input data to the graph.
  - The graph is optimized upfront, before execution.
  - Loops require specific operations (tf.scan).

- PyTorch: Dynamic graph
  - Each forward pass defines a new computational graph.
  - Easy control flow (imperative mode makes loops easier), as in the sketch below.
  - Easy to perform different operations for different data points.
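A minimal sketch of a dynamic graph (the variable names are illustrative, not from the slides): the number of loop iterations, and therefore the shape of the graph, depends on the data of this particular run.

import torch
from torch.autograd import Variable

x = Variable(torch.randn(3), requires_grad=True)
y = x * 2
# ordinary Python control flow: autograd simply records whatever this run does
while y.data.norm() < 1000:
    y = y * 2

y.sum().backward()
print(x.grad)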

torch.nn package
Neural network module.

Convenient way of encapsulating parameters, with helpers for moving them to GPU,
exporting, loading, etc…
>>> Container example

model = torch.nn.Sequential(
torch.nn.Linear(D_in, H),
torch.nn.ReLU(),
torch.nn.Linear(H, D_out),
)
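As a quick sanity check, the container can be called like a function on a mini-batch. Here D_in, H, D_out and the batch size are assumed values, not fixed by the slides:

import torch
from torch.autograd import Variable

D_in, H, D_out = 64, 100, 10
model = torch.nn.Sequential(
    torch.nn.Linear(D_in, H),
    torch.nn.ReLU(),
    torch.nn.Linear(H, D_out),
)

x = Variable(torch.randn(32, D_in))   # mini-batch of 32 samples
y = model(x)                          # forward pass
print(y.size())                       # torch.Size([32, 10])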

Custom module

import torch
from torch.autograd import Variable
import torch.nn as nn
import torch.nn.functional as F


class Net(nn.Module):

    def __init__(self):
        super(Net, self).__init__()
        # 1 input image channel, 6 output channels,
        # 5x5 square convolution kernel
        self.conv1 = nn.Conv2d(1, 6, 5)
        self.conv2 = nn.Conv2d(6, 16, 5)
        # an affine operation: y = Wx + b
        self.fc1 = nn.Linear(16 * 5 * 5, 120)
        self.fc2 = nn.Linear(120, 84)
        self.fc3 = nn.Linear(84, 10)

    def forward(self, x):
        # Max pooling over a (2, 2) window
        x = F.max_pool2d(F.relu(self.conv1(x)), (2, 2))
        # If the size is a square you can only specify a single number
        x = F.max_pool2d(F.relu(self.conv2(x)), 2)
        x = x.view(-1, self.num_flat_features(x))
        x = F.relu(self.fc1(x))
        x = F.relu(self.fc2(x))
        x = self.fc3(x)
        return x

    def num_flat_features(self, x):
        # all dimensions except the batch dimension
        size = x.size()[1:]
        num_features = 1
        for s in size:
            num_features *= s
        return num_features

net = Net()
print(net)

>>>>>

Net(
(conv1): Conv2d (1, 6, kernel_size=(5, 5), stride=(1, 1))
(conv2): Conv2d (6, 16, kernel_size=(5, 5), stride=(1, 1))
(fc1): Linear(in_features=400, out_features=120)
(fc2): Linear(in_features=120, out_features=84)
(fc3): Linear(in_features=84, out_features=10)
)

The learnable parameters of a model are returned by net.parameters()

params = list(net.parameters())
print(len(params))
print(params[0].size()) # conv1's .weight
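For this network, len(params) is 10 (a weight and a bias for each of the five layers), and params[0].size() is torch.Size([6, 1, 5, 5]): conv1's weight has 6 output channels, 1 input channel and a 5x5 kernel.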

Mini-batches in torch.nn
torch.nn only supports mini-batches

The entire torch.nn package only supports inputs that are a mini-batch of samples, and not a
single sample.

For example, nn.Conv2d will take in a 4D Tensor of nSamples x nChannels x Height x Width.

If you have a single sample, just use input.unsqueeze(0) to add a fake batch dimension.
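For example (a sketch assuming a single 32x32 grayscale image, the input size the Net above expects):

image = torch.randn(1, 32, 32)         # C x H x W: a single sample
batch = Variable(image.unsqueeze(0))   # 1 x 1 x 32 x 32: fake batch dimension added
out = net(batch)                       # net is the custom module defined above
print(out.size())                      # torch.Size([1, 10])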

Loss function
output = net(input)
target = Variable(torch.arange(1, 11)) # a dummy target, for example
criterion = nn.MSELoss()

loss = criterion(output, target)


print(loss)
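Here input and net come from the previous slides; a dummy input of the right size for this network would be, for example:

input = Variable(torch.randn(1, 1, 32, 32))   # 1 sample, 1 channel, 32x32 pixels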

Now, if you follow loss in the backward direction using its .grad_fn attribute, you will see a graph of computations that looks like this:

input -> conv2d -> relu -> maxpool2d -> conv2d -> relu -> maxpool2d
-> view -> linear -> relu -> linear -> relu -> linear
-> MSELoss
-> loss

So, when we call loss.backward(), the whole graph is differentiated w.r.t. the loss, and all Variables in the graph will have their
.grad Variable accumulated with the gradient.

BackProp
To backpropagate the error, all we have to do is call loss.backward().

You need to clear the existing gradients first, otherwise new gradients will be accumulated on top of the existing ones.

Now we shall call loss.backward(), and have a look at conv1's bias gradients before and after the backward.

net.zero_grad() # zeroes the gradient buffers of all parameters

print('conv1.bias.grad before backward')


print(net.conv1.bias.grad)

loss.backward()

print('conv1.bias.grad after backward')


print(net.conv1.bias.grad)

Gradients after backward


conv1.bias.grad before backward
Variable containing:
0
0
0
0
0
0
[torch.FloatTensor of size 6]

conv1.bias.grad after backward


Variable containing:
1.00000e-02 *
7.4571
-0.4714
-5.5774
-6.2058
6.6810
3.1632
[torch.FloatTensor of size 6]

Update the weights


The simplest update rule used in practice is Stochastic Gradient Descent (SGD):

weight = weight - learning_rate * gradient

We can implement this using simple Python code:

learning_rate = 0.01
for f in net.parameters():
    f.data.sub_(f.grad.data * learning_rate)

Optimizers
However, as you train neural networks, you will want to use various update rules such as SGD, Nesterov-SGD, Adam, RMSProp, etc. To enable this, PyTorch provides a small package, torch.optim, that implements all these methods. Using it is very simple:

import torch.optim as optim

# create your optimizer


optimizer = optim.SGD(net.parameters(), lr=0.01)

# in your training loop:


optimizer.zero_grad() # zero the gradient buffers
output = net(input)
loss = criterion(output, target)
loss.backward()
optimizer.step() # Does the update
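Putting the pieces together, a minimal end-to-end sketch (the dummy input, target and number of iterations are assumptions for illustration; net is the custom module defined earlier):

import torch
import torch.nn as nn
import torch.optim as optim
from torch.autograd import Variable

input = Variable(torch.randn(1, 1, 32, 32))           # dummy 32x32 grayscale image
target = Variable(torch.arange(1, 11)).view(1, -1)    # dummy regression target
criterion = nn.MSELoss()
optimizer = optim.SGD(net.parameters(), lr=0.01)

for step in range(100):
    optimizer.zero_grad()              # clear gradient buffers from the previous step
    output = net(input)                # forward pass rebuilds the graph each iteration
    loss = criterion(output, target)   # scalar loss
    loss.backward()                    # backprop fills .grad of every parameter
    optimizer.step()                   # apply the SGD update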

That was easier!


Let’s open Jupyter again!

Acknowledgements
Slides based on http://pytorch.org/tutorials/
