0% found this document useful (0 votes)

66 views48 pages

Pytorch Tutorial 1

Uploaded by

Da HUANG

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

66 views48 pages

Pytorch Tutorial 1

Uploaded by

Da HUANG

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 48

Machine Learning

Pytorch Tutorial
TA : 曾元（Yuan Tseng）
2022.02.18
Outline
● Background: Prerequisites & What is Pytorch?
● Training & Testing Neural Networks in Pytorch
● Dataset & Dataloader
● Tensors
● torch.nn: Models, Loss Functions
● torch.optim: Optimization
● Save/load models
Prerequisites
● We assume you are already familiar with…
1. Python3
■ if-else, loop, function, file IO, class, ...
■ refs: link1, link2, link3
2. Deep Learning Basics
■ Prof. Lee’s 1st & 2nd lecture videos from last year
■ ref: link1, link2

Some knowledge of NumPy will also be useful!

What is PyTorch?
● An machine learning framework in Python.
● Two main features:
○ N-dimensional Tensor computation (like NumPy) on GPUs
○ Automatic diﬀerentiation for training deep neural networks
Training Neural Networks

Define Neural Optimization

Loss Function
Network Algorithm

Training

More info about the training process in last year's lecture video.
Training & Testing Neural Networks

Training Validation Testing

Guide for training/validation/testing can be found here.

Training & Testing Neural Networks - in Pytorch
Step 1.
torch.utils.data.Dataset &
Load Data torch.utils.data.DataLoader

Training Validation Testing

Dataset & Dataloader
● Dataset: stores data samples and expected values
● Dataloader: groups data in batches, enables multiprocessing

● dataset = MyDataset(file)
● dataloader = DataLoader(dataset, batch_size, shuffle=True)

Training: True
Testing: False

More info about batches and shuﬄing here.

Dataset & Dataloader
from torch.utils.data import Dataset, DataLoader

class MyDataset(Dataset):
def __init__(self, file):
self.data = ... Read data & preprocess

def getitem(self, index):

return self.data[index] Returns one sample at a time

def __len__(self):
return len(self.data) Returns the size of the dataset
Dataset & Dataloader
dataset = MyDataset(file)

dataloader = DataLoader(dataset, batch_size=5, shuffle=False)

DataLoader
__getitem__(0) 0
__getitem__(1) 1
Dataset __getitem__(2) 2 batch_size
__getitem__(3) 3
__getitem__(4) 4
mini-batch
Tensors
● High-dimensional matrices (arrays)

1-D tensor 2-D tensor 3-D tensor

e.g. audio e.g. black&white e.g. RGB images
images
Tensors – Shape of Tensors
● Check with .shape

4
3

5
3
5 5
(5, ) (3, 5) (4, 5, 3)

dim 0 dim 0 dim 1 dim 0 dim 1 dim 2

Note: dim in PyTorch == axis in NumPy

Tensors – Creating Tensors
● Directly from data (list or numpy.ndarray) tensor([[1., -1.],
x = torch.tensor([[1, -1], [-1, 1]]) [-1., 1.]])

x = torch.from_numpy(np.array([[1, -1], [-1, 1]]))

● Tensor of constant zeros & ones tensor([[0., 0.],

[0., 0.]])
x = torch.zeros([2, 2])

x = torch.ones([1, 2, 5]) tensor([[[1., 1., 1., 1., 1.],

shape [1., 1., 1., 1., 1.]]])
Tensors – Common Operations
Common arithmetic functions are supported, such as:

● Addition ● Summation

z = x + y y = x.sum()

● Subtraction ● Mean

z = x - y y = x.mean()

● Power

y = x.pow(2)
Tensors – Common Operations
● Transpose: transpose two speciﬁed dimensions

>>> x = torch.zeros([2, 3])

2
>>> x.shape
3
torch.Size([2, 3])

>>> x = x.transpose(0, 1)

>>> x.shape 3

torch.Size([3, 2])
2
Tensors – Common Operations
● Squeeze: remove the speciﬁed dimension with length = 1

>>> x = torch.zeros([1, 2, 3])

>>> x.shape 1
3
2
torch.Size([1, 2, 3])

>>> x = x.squeeze(0)
(dim = 0)
>>> x.shape 2

torch.Size([2, 3]) 3
Tensors – Common Operations
● Unsqueeze: expand a new dimension

>>> x = torch.zeros([2, 3]) 2

>>> x.shape
3
torch.Size([2, 3])

>>> x = x.unsqueeze(1) (dim = 1)

>>> x.shape 2

torch.Size([2, 1, 3]) 3
1
Tensors – Common Operations
x 2
3
1

● Cat: concatenate multiple tensors

y 2
>>> x = torch.zeros([2, 1, 3])
3
3
>>> y = torch.zeros([2, 3, 3])

>>> z = torch.zeros([2, 2, 3]) z

>>> w = torch.cat([x, y, z], dim=1) 3

>>> w.shape
w
torch.Size([2, 6, 3]) 2
3
6
more operators: https://pytorch.org/docs/stable/tensors.html
Tensors – Data Type
● Using diﬀerent data types for model and data will cause errors.

Data type dtype tensor

32-bit ﬂoating point torch.float torch.FloatTensor

64-bit integer (signed) torch.long torch.LongTensor

see oﬃcial documentation for more information on data types.

Tensors – PyTorch v.s. NumPy
● Similar attributes

PyTorch NumPy
x.shape x.shape
x.dtype x.dtype

see oﬃcial documentation for more information on data types.

ref: https://github.com/wkentaro/pytorch-for-numpy-users
Tensors – PyTorch v.s. NumPy
● Many functions have the same names as well

PyTorch NumPy
x.reshape / x.view x.reshape
x.squeeze() x.squeeze()
x.unsqueeze(1) np.expand_dims(x, 1)

ref: https://github.com/wkentaro/pytorch-for-numpy-users
Tensors – Device
● Tensors & modules will be computed with CPU by default

Use .to() to move tensors to appropriate devices.

● CPU
x = x.to(‘cpu’)
● GPU
x = x.to(‘cuda’)
Tensors – Device (GPU)
● Check if your computer has NVIDIA GPU

torch.cuda.is_available()

● Multiple GPUs: specify ‘cuda:0’, ‘cuda:1’, ‘cuda:2’, ...

● Why use GPUs?

○ Parallel computing with more cores for arithmetic calculations
○ See What is a GPU and do you need one in deep learning?
Tensors – Gradient Calculation
1 >>> x = torch.tensor([[1., 0.], [-1., 1.]], requires_grad=True)

2 >>> z = x.pow(2).sum()

3 >>> z.backward()

4 >>> x.grad
1 2
tensor([[ 2., 0.],

[-2., 2.]])
3 4

See here to learn about gradient calculation.

Training & Testing Neural Networks – in Pytorch
Step 2.
torch.nn.Module
Load Data
Define Neural
Network

Loss Function Training Validation Testing

Optimization
Algorithm
torch.nn – Network Layers
● Linear Layer (Fully-connected Layer)

nn.Linear(in_features, out_features)

Input Tensor Output Tensor

nn.Linear(32, 64)
* x 32 * x 64

can be any shape (but last dimension must be 32)

e.g. (10, 32), (10, 5, 32), (1, 1, 3, 32), ...
torch.nn – Network Layers
● Linear Layer (Fully-connected Layer)

ref: last year's lecture video

torch.nn – Neural Network Layers
● Linear Layer (Fully-connected Layer)

y1
x1

y2
x2

32 y3 64 W x x + b = y
x3 (64x32)
...

...

x32
y64
torch.nn – Network Parameters
● Linear Layer (Fully-connected Layer)

>>> layer = torch.nn.Linear(32, 64)

>>> layer.weight.shape

torch.Size([64, 32]) W x x + b = y
(64x32)
>>> layer.bias.shape

torch.Size([64])
torch.nn – Non-Linear Activation Functions
● Sigmoid Activation

nn.Sigmoid()

● ReLU Activation

nn.ReLU()

See here to learn about why we need activation functions.

torch.nn – Build your own neural network
import torch.nn as nn

class MyModel(nn.Module):
def __init__(self):
super(MyModel, self).__init__()
self.net = nn.Sequential(
nn.Linear(10, 32), Initialize your model & deﬁne layers
nn.Sigmoid(),
nn.Linear(32, 1)
)

def forward(self, x):

Compute output of your NN
return self.net(x)
torch.nn – Build your own neural network
import torch.nn as nn import torch.nn as nn

class MyModel(nn.Module): class MyModel(nn.Module):

def __init__(self): def __init__(self):
super(MyModel, self).__init__() super(MyModel, self).__init__()
self.net = nn.Sequential( self.layer1 = nn.Linear(10, 32)
nn.Linear(10, 32), self.layer2 = nn.Sigmoid(),
nn.Sigmoid(), = self.layer3 = nn.Linear(32,1)
nn.Linear(32, 1)
) def forward(self, x):
out = self.layer1(x)
def forward(self, x): out = self.layer2(out)
return self.net(x) out = self.layer3(out)
return out
Training & Testing Neural Networks – in Pytorch
Step 3.
torch.nn.MSELoss
torch.nn.CrossEntropyLoss etc.
Load Data
Define Neural
Network

Loss Function Training Validation Testing

Optimization
Algorithm
torch.nn – Loss Functions
● Mean Squared Error (for regression tasks)

criterion = nn.MSELoss()

● Cross Entropy (for classiﬁcation tasks)

criterion = nn.CrossEntropyLoss()

● loss = criterion(model_output, expected_value)

Training & Testing Neural Networks – in Pytorch
Step 4.
torch.optim
Load Data
Define Neural
Network

Loss Function Training Validation Testing

Optimization
Algorithm
torch.optim
● Gradient-based optimization algorithms that adjust network
parameters to reduce error. (See Adaptive Learning Rate lecture video)

● E.g. Stochastic Gradient Descent (SGD)

torch.optim.SGD(model.parameters(), lr, momentum = 0)

torch.optim
optimizer = torch.optim.SGD(model.parameters(), lr, momentum = 0)

● For every batch of data:

1. Call optimizer.zero_grad() to reset gradients of model parameters.
2. Call loss.backward() to backpropagate gradients of prediction loss.
3. Call optimizer.step() to adjust model parameters.

See oﬃcial documentation for more optimization algorithms.

Training & Testing Neural Networks – in Pytorch

Load Data
Define Neural
Network

Loss Function Training Validation Testing

Optimization
Algorithm Step 5.
Entire Procedure
Neural Network Training Setup

dataset = MyDataset(file) read data via MyDataset

tr_set = DataLoader(dataset, 16, shuffle=True) put dataset into Dataloader

model = MyModel().to(device) construct model and move to device (cpu/cuda)

criterion = nn.MSELoss() set loss function

optimizer = torch.optim.SGD(model.parameters(), 0.1) set optimizer

Neural Network Training Loop
for epoch in range(n_epochs): iterate n_epochs

model.train() set model to train mode

for x, y in tr_set: iterate through the dataloader

optimizer.zero_grad() set gradient to zero

x, y = x.to(device), y.to(device) move data to device (cpu/cuda)

pred = model(x) forward pass (compute output)

loss = criterion(pred, y) compute loss

loss.backward() compute gradient (backpropagation)

optimizer.step() update model with optimizer

Neural Network Validation Loop
model.eval() set model to evaluation mode

total_loss = 0

for x, y in dv_set: iterate through the dataloader

x, y = x.to(device), y.to(device) move data to device (cpu/cuda)

with torch.no_grad(): disable gradient calculation

pred = model(x) forward pass (compute output)

loss = criterion(pred, y) compute loss

total_loss += loss.cpu().item() * len(x) accumulate loss

avg_loss = total_loss / len(dv_set.dataset) compute averaged loss

Neural Network Testing Loop
model.eval() set model to evaluation mode

preds = []

for x in tt_set: iterate through the dataloader

x = x.to(device) move data to device (cpu/cuda)

with torch.no_grad(): disable gradient calculation

pred = model(x) forward pass (compute output)

preds.append(pred.cpu()) collect prediction

Notice - model.eval(), torch.no_grad()
● model.eval()

Changes behaviour of some model layers, such as dropout and batch

normalization.

● with torch.no_grad()

Prevents calculations from being added into gradient computation

graph. Usually used to prevent accidental training on validation/testing
data.
Save/Load Trained Models
● Save

torch.save(model.state_dict(), path)

● Load

ckpt = torch.load(path)

model.load_state_dict(ckpt)
More About PyTorch
● torchaudio
○ speech/audio processing
● torchtext
○ natural language processing
● torchvision
○ computer vision
● skorch
○ scikit-learn + pyTorch
More About PyTorch
● Useful github repositories using PyTorch
○ Huggingface Transformers (transformer models: BERT, GPT, ...)
○ Fairseq (sequence modeling for NLP & speech)
○ ESPnet (speech recognition, translation, synthesis, ...)
○ Most implementations of recent deep learning papers
○ ...
References
● Machine Learning 2021 Spring Pytorch Tutorial
● Oﬃcial Pytorch Tutorials
● https://numpy.org/
Any questions?

Sharp J. Exam Ref AI-900 Microsoft Azure AI Fundamentals 2022 PDF
100% (4)
Sharp J. Exam Ref AI-900 Microsoft Azure AI Fundamentals 2022 PDF
366 pages
3900 & 5900 Series Base Station Model Description (15) (PDF) - EN
No ratings yet
3900 & 5900 Series Base Station Model Description (15) (PDF) - EN
893 pages
Pytorch Cheatsheet EN
No ratings yet
Pytorch Cheatsheet EN
1 page
Pytorch Tutorial 1 Rev 1
No ratings yet
Pytorch Tutorial 1 Rev 1
48 pages
PyTorch CrashCourse
No ratings yet
PyTorch CrashCourse
16 pages
PyTorch Crash Course 1713016363
No ratings yet
PyTorch Crash Course 1713016363
15 pages
PyTorch CrashCourse
No ratings yet
PyTorch CrashCourse
17 pages
Module02 PyTorch
No ratings yet
Module02 PyTorch
36 pages
Chapter1 Intro
No ratings yet
Chapter1 Intro
35 pages
Deep Learning Lab: How To Train Your First Neural Network
No ratings yet
Deep Learning Lab: How To Train Your First Neural Network
68 pages
Pytorch Slides
No ratings yet
Pytorch Slides
31 pages
CS236 Introduction To PyTorch
100% (4)
CS236 Introduction To PyTorch
33 pages
Unit 4 Part 3
No ratings yet
Unit 4 Part 3
8 pages
2c PyTorch4
No ratings yet
2c PyTorch4
4 pages
Pytorch Basics - For Absolute Beginners - Sel, Tam (Sel, Tam) - 2021 - Anna's Archive - Copie
No ratings yet
Pytorch Basics - For Absolute Beginners - Sel, Tam (Sel, Tam) - 2021 - Anna's Archive - Copie
62 pages
Pytorch Neural Networks Guide 1717173717
No ratings yet
Pytorch Neural Networks Guide 1717173717
17 pages
Lec 3
No ratings yet
Lec 3
30 pages
DL Pytorch
No ratings yet
DL Pytorch
8 pages
Harvard CS197 Lecture 6 & 7 Notes
No ratings yet
Harvard CS197 Lecture 6 & 7 Notes
18 pages
Chapter 1
No ratings yet
Chapter 1
50 pages
(Deep Learning Using PyTorch) (Cheatsheet)
No ratings yet
(Deep Learning Using PyTorch) (Cheatsheet)
7 pages
Beginner's PyTorch Guide
No ratings yet
Beginner's PyTorch Guide
35 pages
PyTorch PDF
No ratings yet
PyTorch PDF
72 pages
Pytorch Tutorial For Beginner: Department of Computer Science & Engineering University of Washington
No ratings yet
Pytorch Tutorial For Beginner: Department of Computer Science & Engineering University of Washington
11 pages
Intro To PyTorch and Neural Networks - Intro To PyTorch and Neural Networks Cheatsheet - Codecademy
No ratings yet
Intro To PyTorch and Neural Networks - Intro To PyTorch and Neural Networks Cheatsheet - Codecademy
8 pages
Day 45 PyTorch Presentation
No ratings yet
Day 45 PyTorch Presentation
67 pages
Pytorch
No ratings yet
Pytorch
38 pages
DIP Lab 10
No ratings yet
DIP Lab 10
11 pages
Chapter 3 - Training Deep Neural Networks
No ratings yet
Chapter 3 - Training Deep Neural Networks
25 pages
Pytorch Demo 1749471354
No ratings yet
Pytorch Demo 1749471354
10 pages
PyTorch Guide With Code
No ratings yet
PyTorch Guide With Code
4 pages
Deep Learning With PyTorch Guide For Beginners and Intermediate
100% (7)
Deep Learning With PyTorch Guide For Beginners and Intermediate
120 pages
Pytorch Tutorial: - Ntu Machine Learning Course
No ratings yet
Pytorch Tutorial: - Ntu Machine Learning Course
64 pages
Py Torch
No ratings yet
Py Torch
786 pages
PyTorch - A Comprehensive Overview
No ratings yet
PyTorch - A Comprehensive Overview
7 pages
Deep Learning Unit 4
No ratings yet
Deep Learning Unit 4
11 pages
Chapter 3
No ratings yet
Chapter 3
26 pages
Introduction To PyTorch
No ratings yet
Introduction To PyTorch
35 pages
یادگیری پایتورچ
No ratings yet
یادگیری پایتورچ
30 pages
PyTorch Fundamentals - Zero To Mastery Learn PyTorch For Deep Learning
No ratings yet
PyTorch Fundamentals - Zero To Mastery Learn PyTorch For Deep Learning
45 pages
Chapter 1
No ratings yet
Chapter 1
37 pages
Chapter 3
No ratings yet
Chapter 3
26 pages
Tensors
No ratings yet
Tensors
12 pages
PyTorch 1 - 0 - Bringing Research and Production Together Presentation
No ratings yet
PyTorch 1 - 0 - Bringing Research and Production Together Presentation
108 pages
WWW Learnpytorch
No ratings yet
WWW Learnpytorch
14 pages
00 Pytorch and Deep Learning Fundamentals PDF
No ratings yet
00 Pytorch and Deep Learning Fundamentals PDF
44 pages
Py Torch
No ratings yet
Py Torch
19 pages
S06 DNN Tensorflow PyTorch Wip
No ratings yet
S06 DNN Tensorflow PyTorch Wip
24 pages
Pytorch Tutorial: Narges Honarvar Nazari January 30
No ratings yet
Pytorch Tutorial: Narges Honarvar Nazari January 30
29 pages
Pytorch 101: Deep Learning PHD Course 2017/2018
No ratings yet
Pytorch 101: Deep Learning PHD Course 2017/2018
19 pages
Ilovepdf Merged
No ratings yet
Ilovepdf Merged
10 pages
Unit 4 Part 3 DL - 1
No ratings yet
Unit 4 Part 3 DL - 1
5 pages
Apurv Notes - Foundations of Pytorch
No ratings yet
Apurv Notes - Foundations of Pytorch
15 pages
Pytorch Tutorial PDF
No ratings yet
Pytorch Tutorial PDF
27 pages
Tutorials Sources Beginner Ptcheat
No ratings yet
Tutorials Sources Beginner Ptcheat
7 pages
A Brief Introduction To Pytorch: (A Deep Learning Library)
No ratings yet
A Brief Introduction To Pytorch: (A Deep Learning Library)
32 pages
NN From Scratch
No ratings yet
NN From Scratch
5 pages
Solving Math Problems
From Everand
Solving Math Problems
George N. Frempong
No ratings yet
Worked Examples in Mathematics for Scientists and Engineers
From Everand
Worked Examples in Mathematics for Scientists and Engineers
G. Stephenson
No ratings yet
Matrices with MATLAB (Taken from "MATLAB for Beginners: A Gentle Approach")
From Everand
Matrices with MATLAB (Taken from "MATLAB for Beginners: A Gentle Approach")
Peter Kattan
3/5 (4)
A Brief Introduction to MATLAB: Taken From the Book "MATLAB for Beginners: A Gentle Approach"
From Everand
A Brief Introduction to MATLAB: Taken From the Book "MATLAB for Beginners: A Gentle Approach"
Peter Kattan
2.5/5 (2)
Graphs with MATLAB (Taken from "MATLAB for Beginners: A Gentle Approach")
From Everand
Graphs with MATLAB (Taken from "MATLAB for Beginners: A Gentle Approach")
Peter Kattan
4/5 (2)
Chapter 2 - IP Static Routing
No ratings yet
Chapter 2 - IP Static Routing
43 pages
Mde 5180F
No ratings yet
Mde 5180F
47 pages
EC-Council CEH Printables Sample
No ratings yet
EC-Council CEH Printables Sample
9 pages
Downloading, Setting Up, and Using The SolidWorks Drawing Templates
No ratings yet
Downloading, Setting Up, and Using The SolidWorks Drawing Templates
3 pages
Moshell Commands
100% (4)
Moshell Commands
76 pages
Zfs Internals Uli Graef
No ratings yet
Zfs Internals Uli Graef
32 pages
Subhash Arun Dwivedi - CV - Lab Report!
No ratings yet
Subhash Arun Dwivedi - CV - Lab Report!
26 pages
H8S, H8/300 Series C/C++ Compiler Supplementary Information
No ratings yet
H8S, H8/300 Series C/C++ Compiler Supplementary Information
68 pages
Bhushan Kishor Shende Resume
No ratings yet
Bhushan Kishor Shende Resume
2 pages
I4850 Datasheet 1 - 8
No ratings yet
I4850 Datasheet 1 - 8
2 pages
11 Computer Science: Volume-I (Unit I & Ii)
No ratings yet
11 Computer Science: Volume-I (Unit I & Ii)
8 pages
Consensus Algorithm
No ratings yet
Consensus Algorithm
34 pages
The Top Issues in IBM MQ and IIB: Barry D. Lamkin Executive IT Specialist
No ratings yet
The Top Issues in IBM MQ and IIB: Barry D. Lamkin Executive IT Specialist
50 pages
10-WCDMA RNO Introduction To GENEX Probe and Assistant - 20051214
No ratings yet
10-WCDMA RNO Introduction To GENEX Probe and Assistant - 20051214
62 pages
MTech Dec 2019 - Jan 2020
No ratings yet
MTech Dec 2019 - Jan 2020
6 pages
Analog Circuits Versus Digital Circuits
No ratings yet
Analog Circuits Versus Digital Circuits
2 pages
Charak - An Introduction
No ratings yet
Charak - An Introduction
29 pages
Operators and Control Statements in Java
No ratings yet
Operators and Control Statements in Java
44 pages
Mohini Patil ServiceNow Resume
100% (1)
Mohini Patil ServiceNow Resume
1 page
Moore Derek Resume 2023 12 06
No ratings yet
Moore Derek Resume 2023 12 06
11 pages
MUC1004 - 2008 - 2016 Administrator - Guide V1.1
No ratings yet
MUC1004 - 2008 - 2016 Administrator - Guide V1.1
112 pages
Common IP Network Topologies
No ratings yet
Common IP Network Topologies
7 pages
FB Weighing
No ratings yet
FB Weighing
8 pages
Cambridge ICT SYLLABUS (Terabytes Connect With Computers) Class 7
56% (9)
Cambridge ICT SYLLABUS (Terabytes Connect With Computers) Class 7
3 pages
S4hana Analytics B
No ratings yet
S4hana Analytics B
52 pages
Active Directory Documentation
No ratings yet
Active Directory Documentation
3 pages
Network Devices (Hub, Repeater, Bridge, Switch, Router, Gateways and Brouter)
No ratings yet
Network Devices (Hub, Repeater, Bridge, Switch, Router, Gateways and Brouter)
4 pages
Digital Pressure Gauges Additel 680 Series
No ratings yet
Digital Pressure Gauges Additel 680 Series
3 pages

Pytorch Tutorial 1

Uploaded by

Pytorch Tutorial 1

Uploaded by

Machine Learning

Some knowledge of NumPy will also be useful!

Define Neural Optimization

Training Validation Testing

Guide for training/validation/testing can be found here.

Training Validation Testing

More info about batches and shuﬄing here.

def __getitem__(self, index):

dataloader = DataLoader(dataset, batch_size=5, shuffle=False)

1-D tensor 2-D tensor 3-D tensor

dim 0 dim 0 dim 1 dim 0 dim 1 dim 2

Note: dim in PyTorch == axis in NumPy

x = torch.from_numpy(np.array([[1, -1], [-1, 1]]))

● Tensor of constant zeros & ones tensor([[0., 0.],

x = torch.ones([1, 2, 5]) tensor([[[1., 1., 1., 1., 1.],

>>> x = torch.zeros([2, 3])

>>> x = torch.zeros([1, 2, 3])

>>> x = torch.zeros([2, 3]) 2

>>> x = x.unsqueeze(1) (dim = 1)

● Cat: concatenate multiple tensors

>>> z = torch.zeros([2, 2, 3]) z

>>> w = torch.cat([x, y, z], dim=1) 3

Data type dtype tensor

32-bit ﬂoating point torch.float torch.FloatTensor

64-bit integer (signed) torch.long torch.LongTensor

see oﬃcial documentation for more information on data types.

see oﬃcial documentation for more information on data types.

Use .to() to move tensors to appropriate devices.

● Multiple GPUs: specify ‘cuda:0’, ‘cuda:1’, ‘cuda:2’, ...

● Why use GPUs?

See here to learn about gradient calculation.

Loss Function Training Validation Testing

Input Tensor Output Tensor

can be any shape (but last dimension must be 32)

ref: last year's lecture video

>>> layer = torch.nn.Linear(32, 64)

See here to learn about why we need activation functions.

def forward(self, x):

class MyModel(nn.Module): class MyModel(nn.Module):

Loss Function Training Validation Testing

● Cross Entropy (for classiﬁcation tasks)

● loss = criterion(model_output, expected_value)

Loss Function Training Validation Testing

● E.g. Stochastic Gradient Descent (SGD)

torch.optim.SGD(model.parameters(), lr, momentum = 0)

● For every batch of data:

See oﬃcial documentation for more optimization algorithms.

Loss Function Training Validation Testing

dataset = MyDataset(file) read data via MyDataset

tr_set = DataLoader(dataset, 16, shuffle=True) put dataset into Dataloader

model = MyModel().to(device) construct model and move to device (cpu/cuda)

criterion = nn.MSELoss() set loss function

optimizer = torch.optim.SGD(model.parameters(), 0.1) set optimizer

model.train() set model to train mode

for x, y in tr_set: iterate through the dataloader

optimizer.zero_grad() set gradient to zero

x, y = x.to(device), y.to(device) move data to device (cpu/cuda)

pred = model(x) forward pass (compute output)

loss = criterion(pred, y) compute loss

loss.backward() compute gradient (backpropagation)

optimizer.step() update model with optimizer

for x, y in dv_set: iterate through the dataloader

x, y = x.to(device), y.to(device) move data to device (cpu/cuda)

with torch.no_grad(): disable gradient calculation

pred = model(x) forward pass (compute output)

loss = criterion(pred, y) compute loss

total_loss += loss.cpu().item() * len(x) accumulate loss

avg_loss = total_loss / len(dv_set.dataset) compute averaged loss

for x in tt_set: iterate through the dataloader

x = x.to(device) move data to device (cpu/cuda)

with torch.no_grad(): disable gradient calculation

pred = model(x) forward pass (compute output)

preds.append(pred.cpu()) collect prediction

Changes behaviour of some model layers, such as dropout and batch

Prevents calculations from being added into gradient computation

You might also like

def getitem(self, index):