Chapter 2

Running a forward pass
INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Maham Faisal Khan


Senior Data Science Content Developer
What is a forward pass?
Input data is passed forward, or propagated, through the network

Computations are performed at each layer

Outputs of each layer are passed to the subsequent layer

Output of the final layer: the "prediction"

Used for both training and prediction

Some possible outputs:

Binary classification: a single probability between 0 and 1

Multiclass classification: a distribution of probabilities summing to 1

Regression: continuous numerical predictions



Is there also a backward pass?
The backward pass, or backpropagation, is used to update weights and biases during training.

In the "training loop", we:

1. Propagate data forward
2. Compare outputs to true values (ground truth)
3. Backpropagate to update model weights and biases
4. Repeat until weights and biases are tuned to produce useful outputs



Binary classification: forward pass
import torch
import torch.nn as nn

# Create input data of shape 5x6
input_data = torch.tensor(
    [[-0.4421,  1.5207,  2.0607, -0.3647,  0.4691,  0.0946],
     [-0.9155, -0.0475, -1.3645,  0.6336, -1.9520, -0.3398],
     [ 0.7406,  1.6763, -0.8511,  0.2432,  0.1123, -0.0633],
     [-1.6630, -0.0718, -0.1285,  0.5396, -0.0288, -0.8622],
     [-0.7413,  1.7920, -0.0883, -0.6685,  0.4745, -0.4245]])

# Create binary classification model
model = nn.Sequential(
    nn.Linear(6, 4),  # First linear layer
    nn.Linear(4, 1),  # Second linear layer
    nn.Sigmoid()      # Sigmoid activation function
)

# Pass input data through model
output = model(input_data)



Binary classification: forward pass
print(output)

tensor([[0.5188],
        [0.3761],
        [0.5015],
        [0.3718],
        [0.4663]], grad_fn=<SigmoidBackward0>)

Outputs:
Five probabilities between zero and one

One value for each sample (row) in the data

Classification:
Class = 1 for the first and third values: 0.5188, 0.5015

Class = 0 for the second, fourth and fifth values: 0.3761, 0.3718, 0.4663
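
To turn these probabilities into hard class labels, we can threshold them at 0.5. A minimal sketch (the 0.5 cutoff is the usual convention, not something fixed by the model):

# Threshold the sigmoid outputs at 0.5 to get class labels
preds = (output >= 0.5).int()
print(preds)

tensor([[1],
        [0],
        [1],
        [0],
        [0]], dtype=torch.int32)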



Multi-class classification: forward pass
# Specify that the model has three classes
n_classes = 3

# Create multiclass classification model
model = nn.Sequential(
    nn.Linear(6, 4),          # First linear layer
    nn.Linear(4, n_classes),  # Second linear layer
    nn.Softmax(dim=-1)        # Softmax activation
)

# Pass input data through model
output = model(input_data)
print(output.shape)

torch.Size([5, 3])



Multi-class classification: forward pass
print(output)

tensor([[0.4969, 0.3606, 0.1425],
        [0.5105, 0.3262, 0.1633],
        [0.3253, 0.3174, 0.3572],
        [0.5499, 0.3361, 0.1141],
        [0.4117, 0.3366, 0.2517]], grad_fn=<SoftmaxBackward0>)

Outputs:
The output dimension is 5 × 3

Each row sums to one

The value with the highest probability is assigned the predicted label in each row

Row 1 = class 1 (mammal), row 2 = class 1 (mammal), row 3 = class 3 (reptile)
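
In code, the predicted label for each row can be read off with argmax; a minimal sketch using the output above (note that argmax returns zero-based indices, so class 1 in the slide corresponds to index 0 here):

# Pick the index of the highest probability in each row
predicted_classes = output.argmax(dim=-1)
print(predicted_classes)

tensor([0, 0, 2, 0, 0])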



Regression: forward pass
# Create regression model
model = nn.Sequential(
    nn.Linear(6, 4),  # First linear layer
    nn.Linear(4, 1)   # Second linear layer
)

# Pass input data through model
output = model(input_data)

# Return output
print(output)

tensor([[0.3818],
        [0.0712],
        [0.3376],
        [0.0231],
        [0.0757]], grad_fn=<AddmmBackward0>)


Let's practice!
Using loss functions to assess model predictions

Why do we need a loss function?
A loss function:

Gives feedback to the model during training

Takes in the model prediction ŷ and the ground truth y

Outputs a float



Why do we need a loss function?
Consider a single sample with the following features and true class label:

hair feathers eggs milk airborne aquatic predator toothed backbone breathes venomous fins legs tail domestic catsize class
1    0        0    1    0        0       1        1       1        1        0         0    4    0    0        1       0

Predicted class = 0 -> correct = low loss

Predicted class = 1 -> wrong = high loss

Predicted class = 2 -> wrong = high loss



One-hot encoding concepts
loss = F(y, ŷ)

y is a single integer (the class label),
e.g. y = 0 when the sample is a mammal

ŷ is a tensor (the output of the softmax)

If N is the number of classes, e.g. N = 3,
ŷ is a tensor with N dimensions,
e.g. ŷ = [0.57492, 0.034961, 0.15669]

How do we compare an integer with a tensor?



One-hot encoding concepts
Transforming the true label into a tensor of zeros and ones:

import numpy as np

one_hot_numpy = np.array([1, 0, 0])



Transforming labels with one-hot encoding
import torch
import torch.nn.functional as F

F.one_hot(torch.tensor(0), num_classes=3)

tensor([1, 0, 0])

F.one_hot(torch.tensor(1), num_classes=3)

tensor([0, 1, 0])

F.one_hot(torch.tensor(2), num_classes=3)

tensor([0, 0, 1])
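
F.one_hot can also encode a whole batch of labels in a single call; a quick sketch:

# One-hot encode several labels at once
F.one_hot(torch.tensor([0, 2, 1]), num_classes=3)

tensor([[1, 0, 0],
        [0, 0, 1],
        [0, 1, 0]])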



Cross entropy loss in PyTorch
import torch
from torch.nn import CrossEntropyLoss

scores = torch.tensor([[-0.1211, 0.1059]])
one_hot_target = torch.tensor([[1, 0]])

criterion = CrossEntropyLoss()
criterion(scores.double(), one_hot_target.double())

tensor(0.8131, dtype=torch.float64)



Bringing it all together
The loss function takes:

scores: model predictions before the final softmax function

one_hot_target: the one-hot encoded ground-truth label

and outputs:

loss: a single float

Our training goal is to minimize the loss.
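
Putting these pieces together, here is a minimal end-to-end sketch (the score values below are made up for illustration):

import torch
import torch.nn.functional as F
from torch.nn import CrossEntropyLoss

# Raw model outputs (scores) for one sample, before softmax
scores = torch.tensor([[1.2, 0.3, -0.8]])
# Ground truth: class 0 (mammal), one-hot encoded
one_hot_target = F.one_hot(torch.tensor([0]), num_classes=3)

criterion = CrossEntropyLoss()
loss = criterion(scores.double(), one_hot_target.double())
print(loss)  # a single float tensor; lower means a better prediction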



Let's practice!
Using derivatives to update model parameters

Minimizing the loss
We need to minimize loss

High loss: model prediction is wrong

Low loss: model prediction is correct



An analogy for derivatives
Hiking down a mountain to the valley floor:

Steep slopes: a step makes us lose a lot of elevation = the derivative is high

Gentler slopes: a step makes us lose a little bit of elevation = the derivative is low

Valley floor: a step does not change our elevation = the derivative is zero
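
The same intuition can be checked directly with PyTorch's autograd; a minimal sketch using y = x² as an example function:

import torch

# The derivative of y = x**2 is 2*x, so it should be 4 at x = 2
x = torch.tensor(2.0, requires_grad=True)
y = x ** 2
y.backward()   # backpropagate to compute dy/dx
print(x.grad)  # tensor(4.)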



Connecting derivatives and model training
Model training: updating a model's parameters to minimize the loss.



Backpropagation concepts
Consider a network made of three layers: L0, L1, and L2.

Using backpropagation, we calculate local gradients for L0, L1, and L2:

We first calculate the loss gradients with respect to L2

We then use the L2 gradients to calculate the L1 gradients, and so on


Backpropagation in PyTorch
import torch.nn as nn
from torch.nn import CrossEntropyLoss

# Create the model and run a forward pass
# (sample is an input tensor with 16 features; target holds the true labels)
model = nn.Sequential(nn.Linear(16, 8),
                      nn.Linear(8, 4),
                      nn.Linear(4, 2))
prediction = model(sample)

# Calculate the loss and compute the gradients
criterion = CrossEntropyLoss()
loss = criterion(prediction, target)
loss.backward()

# Access each layer's gradients
model[0].weight.grad, model[0].bias.grad
model[1].weight.grad, model[1].bias.grad
model[2].weight.grad, model[2].bias.grad



Updating model parameters
We update the weights by subtracting the local gradients scaled by the learning rate:

# Learning rate is typically small
lr = 0.001

# Update the weights
weight = model[0].weight
weight_grad = model[0].weight.grad
weight = weight - lr * weight_grad

# Update the biases
bias = model[0].bias
bias_grad = model[0].bias.grad
bias = bias - lr * bias_grad
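
Note that these reassignments create new tensors rather than modifying the model's parameters in place. A sketch of an in-place version (this mirrors what the optimizer does for us in the next section):

# Update the parameters in place, without recording these
# operations in the computational graph
with torch.no_grad():
    model[0].weight -= lr * model[0].weight.grad
    model[0].bias -= lr * model[0].bias.grad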



Convex and non-convex functions
A convex function has a single global minimum, while a non-convex function can have several local minima.



Gradient descent
For non-convex functions, we use an iterative process such as gradient descent

In PyTorch, an optimizer takes care of the weight updates

The most common optimizer is stochastic gradient descent (SGD)

import torch.optim as optim

# Create the optimizer
optimizer = optim.SGD(model.parameters(), lr=0.001)

The optimizer handles updating the model parameters (or weights) after the local gradients have been calculated:

optimizer.step()
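
In practice, optimizer.step() is used together with the forward and backward passes; a minimal sketch of one full update step (assuming loss has been computed as before):

optimizer.zero_grad()  # clear gradients from the previous step
loss.backward()        # compute the local gradients
optimizer.step()       # update the model parameters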



Let's practice!
Writing our first training loop

Training a neural network
1. Create a model
2. Choose a loss function
3. Create a dataset
4. Define an optimizer
5. Run a training loop, where for each sample of the dataset, we repeat:
   Calculating the loss (forward pass)
   Calculating the local gradients
   Updating the model parameters


Introducing the Data Science Salary dataset
This dataset contains salary data for data science-related jobs.
The features are experience_level, employment_type, remote_ratio and company_size.
They were encoded as categories.

experience_level  employment_type  remote_ratio  company_size  salary_in_usd
0                 0                0.5           1             0.036
1                 0                1.0           2             0.133
2                 0                0.0           1             0.234
1                 0                1.0           0             0.076
2                 0                1.0           1             0.170

The target, salary in US dollars, is not a category but a continuous quantity

For regression problems, we cannot use softmax or sigmoid as the last activation function

We need a different loss function than cross-entropy



Introducing the Mean Squared Error Loss
The mean squared error loss (MSE loss) is the mean of the squared differences between the predictions and the ground truth.

def mean_squared_loss(prediction, target):
    return np.mean((prediction - target)**2)

In PyTorch:

criterion = nn.MSELoss()
# Prediction and target are float tensors
loss = criterion(prediction, target)

This loss is used for regression problems (e.g., when fitting a linear regression model).
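
A quick numeric check (the values below are made up for illustration):

import torch
import torch.nn as nn

prediction = torch.tensor([2.5, 0.0])
target = torch.tensor([2.0, 1.0])

criterion = nn.MSELoss()
# Mean of (0.5**2 + 1.0**2) = 0.625
print(criterion(prediction, target))  # tensor(0.6250)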



Before the training loop
import torch
import torch.nn as nn
import torch.optim as optim
from torch.utils.data import TensorDataset, DataLoader

# Create the dataset and the dataloader
dataset = TensorDataset(torch.tensor(features).float(), torch.tensor(target).float())
dataloader = DataLoader(dataset, batch_size=4, shuffle=True)

# Create the model
model = nn.Sequential(nn.Linear(4, 2),
                      nn.Linear(2, 1))

# Create the loss and optimizer
criterion = nn.MSELoss()
optimizer = optim.SGD(model.parameters(), lr=0.001)



The training loop
# Loop through the dataset multiple times
for epoch in range(num_epochs):
    for data in dataloader:
        # Set the gradients to zero
        optimizer.zero_grad()
        # Get the feature and target from the data loader
        feature, target = data
        # Run a forward pass
        pred = model(feature)
        # Compute the loss and gradients
        loss = criterion(pred, target)
        loss.backward()
        # Update the parameters
        optimizer.step()



Let's practice!