PyTorch Crash Course
Overview:
1. Tensor Basics
• Create, Operations, NumPy, GPU Support
2. Autograd
• Linear regression example
3. Training Loop with: Model, Loss & Optimizer
• A typical PyTorch training pipeline
4. Neural Network
• Also: GPU, Datasets, DataLoader, Transforms & Evaluation
5. Convolutional Neural Network
• Also: Save/Load model
1. Tensors
Everything in PyTorch is based on Tensor operations. A Tensor is a multi-dimensional matrix
containing elements of a single data type:
import torch
# torch.empty(size): uninitialized
x = torch.empty(1) # scalar
print("empty(1):", x)
x = torch.empty(3) # vector
print("empty(3):", x)
x = torch.empty(2, 3) # matrix
print("empty(2,3):", x)
x = torch.empty(2, 2, 3) # tensor, 3 dimensions
#x = torch.empty(2, 2, 2, 3) # tensor, 4 dimensions
print("empty(2, 2, 3):", x)
# check size
print("size", x.size()) # x.size(0)
print("shape", x.shape) # x.shape[0]
# check data type
print(x.dtype)
# requires_grad argument
# This tells PyTorch that it will need to calculate the gradients for this
# tensor later in your optimization steps,
# i.e. this is a variable in your model that you want to optimize
x = torch.tensor([5.5, 3], requires_grad=True)
print(x)
# elementwise addition
x = torch.rand(2, 2)
y = torch.rand(2, 2)
z = x + y
# z = torch.add(x, y)
print(x)
print(y)
print(z)
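PyTorch also has in-place variants of these operations: any method whose name ends with an underscore modifies the tensor it is called on. A small sketch using the x and y from above:
# in-place addition: modifies y directly
y.add_(x)
print(y)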
# subtraction
z = x - y
z = torch.sub(x, y)
# multiplication
z = x * y
z = torch.mul(x, y)
# division
z = x / y
z = torch.div(x, y)
# Slicing
x = torch.rand(5,3)
print(x)
print("x[:, 0]", x[:, 0]) # all rows, column 0
print("x[1, :]", x[1, :]) # row 1, all columns
print("x[1, 1]", x[1,1]) # element at 1, 1
NumPy
Converting a Torch Tensor to a NumPy array and vice versa is very easy:
a = torch.ones(5)
print(a)
b = a.numpy()
print(b)
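The reverse direction uses torch.from_numpy. Note that on the CPU the tensor and the NumPy array share the same memory, so changing one changes the other:
import numpy as np
c = np.ones(5)
d = torch.from_numpy(c)
c += 1 # modifies the shared buffer
print(c) # [2. 2. 2. 2. 2.]
print(d) # tensor([2., 2., 2., 2., 2.], dtype=torch.float64)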
GPU Support
By default all tensors are created on the CPU, but we can also move them to the GPU (if one is available), or create them directly on the GPU:
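A minimal sketch of both approaches:
device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')
# move an existing tensor to the GPU
x = torch.rand(2, 2)
x = x.to(device)
# or create it directly on the GPU
y = torch.rand(2, 2, device=device)
# move back to the CPU (e.g. before converting to NumPy)
z = (x + y).to('cpu')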
2. Autograd
The autograd package provides automatic differentiation for all operations on Tensors.
Generally speaking, torch.autograd is an engine for computing the vector-Jacobian product. It
computes partial derivatives while applying the chain rule.
import torch
x = torch.randn(3, requires_grad=True)
y = x + 2 # y was created from an operation on x, so it has a grad_fn
# Do more operations on y
z = y * y * 3
print(z)
z = z.mean()
print(z)
print(x.grad) # None: backward() has not been called yet
z.backward()
print(x.grad) # dz/dx
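Because backward() computes a vector-Jacobian product, calling it on a non-scalar output requires passing a gradient vector of matching shape. A small sketch:
x = torch.randn(3, requires_grad=True)
y = x * 2
v = torch.tensor([0.1, 1.0, 0.0001])
y.backward(v) # computes v^T @ J
print(x.grad)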
Stop a tensor from tracking history, e.g. during the weight update step or during evaluation, when these operations should not be part of the gradient computation:
• x.requires_grad_(False)
• x.detach()
• wrap the code in with torch.no_grad(): (see the sketch after the .detach() example below)
# .requires_grad_(...) changes an existing flag in-place.
a = torch.randn(2, 2)
b = (a * a).sum()
print(a.requires_grad)
print(b.grad_fn)
a.requires_grad_(True)
b = (a * a).sum()
print(a.requires_grad)
print(b.grad_fn)
# .detach(): get a new Tensor with the same content but no gradient
# computation:
a = torch.randn(2, 2, requires_grad=True)
b = a.detach()
print(a.requires_grad)
print(b.requires_grad)
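And the third option, torch.no_grad(), excludes everything inside the with-block from gradient tracking:
a = torch.randn(2, 2, requires_grad=True)
print((a ** 2).requires_grad) # True
with torch.no_grad():
    b = a ** 2
print(b.requires_grad) # False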
Putting it together: linear regression trained manually with autograd.
import torch
# Linear regression
# f = w * x
# here : f = 2 * x
# training samples
X = torch.tensor([1, 2, 3, 4, 5, 6, 7, 8], dtype=torch.float32)
Y = torch.tensor([2, 4, 6, 8, 10, 12, 14, 16], dtype=torch.float32)
# weight to optimize
w = torch.tensor(0.0, dtype=torch.float32, requires_grad=True)
# model output
def forward(x):
    return w * x
# loss = MSE
def loss(y, y_pred):
    return ((y_pred - y)**2).mean()
X_test = 5.0
print(f'Prediction before training: f({X_test}) = {forward(X_test).item():.3f}')
# Training
learning_rate = 0.01
n_epochs = 100
for epoch in range(n_epochs):
    # predict = forward pass
    y_pred = forward(X)
    # loss
    l = loss(Y, y_pred)
    # calculate gradients = backward pass
    l.backward()
    # update weights
    #w.data = w.data - learning_rate * w.grad
    with torch.no_grad():
        w -= learning_rate * w.grad
    # zero the gradients after updating
    w.grad.zero_()
    if (epoch+1) % 10 == 0:
        print(f'epoch {epoch+1}: w = {w.item():.3f}, loss = {l.item():.3f}')
print(f'Prediction after training: f({X_test}) = {forward(X_test).item():.3f}')
3. Training Loop with Model, Loss & Optimizer
A typical PyTorch pipeline: 1) design the model, 2) construct loss and optimizer, 3) run the training loop (forward pass, backward pass, weight update).
import torch
import torch.nn as nn
# Linear regression
# f = w * x
# here : f = 2 * x
# 0) Training samples, shape (n_samples, n_features)
X = torch.tensor([[1], [2], [3], [4], [5], [6], [7], [8]], dtype=torch.float32)
Y = torch.tensor([[2], [4], [6], [8], [10], [12], [14], [16]], dtype=torch.float32)
n_samples, n_features = X.shape
# 1) Design model
class LinearRegression(nn.Module):
    def __init__(self, input_dim, output_dim):
        super(LinearRegression, self).__init__()
        # define different layers
        self.lin = nn.Linear(input_dim, output_dim)
    def forward(self, x):
        return self.lin(x)
model = LinearRegression(n_features, n_features)
# 2) Define loss and optimizer
learning_rate = 0.01
n_epochs = 100
loss = nn.MSELoss()
optimizer = torch.optim.SGD(model.parameters(), lr=learning_rate)
# 3) Training loop
for epoch in range(n_epochs):
    # predict = forward pass with our model
    y_predicted = model(X)
    # loss
    l = loss(Y, y_predicted)
    # calculate gradients = backward pass
    l.backward()
    # update weights
    optimizer.step()
    # zero the gradients after updating
    optimizer.zero_grad()
    if (epoch+1) % 10 == 0:
        w, b = model.parameters() # unpack parameters
        print('epoch ', epoch+1, ': w = ', w[0][0].item(), ' loss = ', l.item())
4. Neural Network
import torch
import torch.nn as nn
import torchvision
import torchvision.transforms as transforms
import matplotlib.pyplot as plt
# Device configuration
device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')
# Hyper-parameters
input_size = 784 # 28x28
hidden_size = 500
num_classes = 10
num_epochs = 2
batch_size = 100
learning_rate = 0.001
# MNIST dataset
train_dataset = torchvision.datasets.MNIST(root='./data',
                                           train=True,
                                           transform=transforms.ToTensor(),
                                           download=True)
test_dataset = torchvision.datasets.MNIST(root='./data',
                                          train=False,
                                          transform=transforms.ToTensor())
# Data loader
train_loader = torch.utils.data.DataLoader(dataset=train_dataset,
                                           batch_size=batch_size,
                                           shuffle=True)
test_loader = torch.utils.data.DataLoader(dataset=test_dataset,
                                          batch_size=batch_size,
                                          shuffle=False)
examples = iter(test_loader)
example_data, example_targets = next(examples)
for i in range(6):
    plt.subplot(2, 3, i+1)
    plt.imshow(example_data[i][0], cmap='gray')
plt.show()
# Fully connected neural network with one hidden layer
class NeuralNet(nn.Module):
    def __init__(self, input_size, hidden_size, num_classes):
        super(NeuralNet, self).__init__()
        self.l1 = nn.Linear(input_size, hidden_size)
        self.relu = nn.ReLU()
        self.l2 = nn.Linear(hidden_size, num_classes)
    def forward(self, x):
        out = self.l1(x)
        out = self.relu(out)
        out = self.l2(out)
        return out
model = NeuralNet(input_size, hidden_size, num_classes).to(device)
# Loss and optimizer
criterion = nn.CrossEntropyLoss()
optimizer = torch.optim.Adam(model.parameters(), lr=learning_rate)
# Training loop
n_total_steps = len(train_loader)
for epoch in range(num_epochs):
    for i, (images, labels) in enumerate(train_loader):
        # flatten images: [100, 1, 28, 28] -> [100, 784]
        images = images.reshape(-1, 28*28).to(device)
        labels = labels.to(device)
        # Forward pass
        outputs = model(images)
        loss = criterion(outputs, labels)
        # Backward pass and update
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
        if (i+1) % 100 == 0:
            print(f'Epoch [{epoch+1}/{num_epochs}], Step [{i+1}/{n_total_steps}], Loss: {loss.item():.4f}')
# Evaluation: gradients are not needed here
with torch.no_grad():
    n_correct = 0
    n_samples = len(test_loader.dataset)
    for images, labels in test_loader:
        images = images.reshape(-1, 28*28).to(device)
        labels = labels.to(device)
        outputs = model(images)
        # torch.max returns (values, indices)
        _, predicted = torch.max(outputs, 1)
        n_correct += (predicted == labels).sum().item()
    acc = 100.0 * n_correct / n_samples
    print(f'Accuracy of the network on the test images: {acc} %')
5. Convolutional Neural Network
This section covers:
• Convolutional Layers
• MaxPooling
• Save/Load model
import torch
import torch.nn as nn
import torch.nn.functional as F
import torchvision
import torchvision.transforms as transforms
import matplotlib.pyplot as plt
import numpy as np
# Device configuration
device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')
# Hyper-parameters
num_epochs = 10
batch_size = 32
learning_rate = 0.001
# normalize to [-1, 1] (matches the unnormalize step in imshow below)
transform = transforms.Compose(
    [transforms.ToTensor(),
     transforms.Normalize((0.5, 0.5, 0.5), (0.5, 0.5, 0.5))])
train_dataset = torchvision.datasets.CIFAR10(root='./data',
                                             train=True,
                                             download=True,
                                             transform=transform)
test_dataset = torchvision.datasets.CIFAR10(root='./data',
                                            train=False,
                                            download=True,
                                            transform=transform)
train_loader = torch.utils.data.DataLoader(train_dataset,
                                           batch_size=batch_size,
                                           shuffle=True)
test_loader = torch.utils.data.DataLoader(test_dataset,
                                          batch_size=batch_size,
                                          shuffle=False)
classes = ('plane', 'car', 'bird', 'cat',
           'deer', 'dog', 'frog', 'horse', 'ship', 'truck')
def imshow(imgs):
    imgs = imgs / 2 + 0.5 # unnormalize
    npimgs = imgs.numpy()
    plt.imshow(np.transpose(npimgs, (1, 2, 0)))
    plt.show()
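To peek at a few training images, imshow can be combined with torchvision.utils.make_grid. A small usage sketch:
dataiter = iter(train_loader)
images, labels = next(dataiter)
imshow(torchvision.utils.make_grid(images[0:4]))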
class ConvNet(nn.Module):
    def __init__(self):
        super().__init__()
        self.conv1 = nn.Conv2d(3, 32, 3)
        self.pool = nn.MaxPool2d(2, 2)
        self.conv2 = nn.Conv2d(32, 64, 3)
        self.conv3 = nn.Conv2d(64, 64, 3)
        self.fc1 = nn.Linear(64*4*4, 64)
        self.fc2 = nn.Linear(64, 10)
    def forward(self, x):
        x = self.pool(F.relu(self.conv1(x))) # [b, 3, 32, 32] -> [b, 32, 15, 15]
        x = self.pool(F.relu(self.conv2(x))) # -> [b, 64, 6, 6]
        x = F.relu(self.conv3(x))            # -> [b, 64, 4, 4]
        x = torch.flatten(x, 1)              # -> [b, 64*4*4]
        x = F.relu(self.fc1(x))
        x = self.fc2(x)
        return x
model = ConvNet().to(device)
criterion = nn.CrossEntropyLoss()
optimizer = torch.optim.Adam(model.parameters(), lr=learning_rate)
n_total_steps = len(train_loader)
for epoch in range(num_epochs):
    running_loss = 0.0
    for images, labels in train_loader:
        images = images.to(device)
        labels = labels.to(device)
        # Forward pass
        outputs = model(images)
        loss = criterion(outputs, labels)
        # Backward pass and update
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
        running_loss += loss.item()
    print(f'[{epoch + 1}] loss: {running_loss / n_total_steps:.3f}')
print('Finished Training')
PATH = './cnn.pth'
torch.save(model.state_dict(), PATH)
loaded_model = ConvNet()
loaded_model.load_state_dict(torch.load(PATH)) # takes the loaded dictionary, not the path itself
loaded_model.to(device)
loaded_model.eval()
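One detail worth knowing: if the checkpoint was saved on a GPU and is loaded on a CPU-only machine (or vice versa), torch.load accepts a map_location argument:
# remap storages to the current device while loading
loaded_model.load_state_dict(torch.load(PATH, map_location=device))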
with torch.no_grad():
    n_correct = 0
    n_correct2 = 0
    n_samples = len(test_loader.dataset)
    for images, labels in test_loader:
        images = images.to(device)
        labels = labels.to(device)
        # original model
        outputs = model(images)
        _, predicted = torch.max(outputs, 1)
        n_correct += (predicted == labels).sum().item()
        # loaded model
        outputs2 = loaded_model(images)
        _, predicted2 = torch.max(outputs2, 1)
        n_correct2 += (predicted2 == labels).sum().item()
    acc = 100.0 * n_correct / n_samples
    print(f'Accuracy of the model: {acc} %')
    acc2 = 100.0 * n_correct2 / n_samples
    print(f'Accuracy of the loaded model: {acc2} %')