Lecture 04: Back Propagation
Lecturer: Hongpu Liu, PyTorch Tutorial @ SLAM Research Group
Compute the gradient in a simple network
A neuron as a linear model

Linear model: $\hat{y} = x \cdot \omega$ (the inputs $x$ and $\omega$ feed a multiplication node that outputs $\hat{y}$).

Gradient descent update:

$\omega = \omega - \alpha \frac{\partial loss}{\partial \omega}$

For a single sample, the gradient has the analytic form

$\frac{\partial loss_n}{\partial \omega} = 2 \cdot x_n \cdot (x_n \cdot \omega - y_n)$
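As a concrete illustration (a sketch, not from the slides; the sample values and learning rate are assumed), one step of this analytic update in plain Python:

# One gradient-descent step for the linear model y_hat = x * w,
# using the analytic gradient d(loss)/dw = 2 * x * (x * w - y).
x, y = 2.0, 4.0          # one training sample (assumed values)
w = 1.0                  # current weight
alpha = 0.01             # learning rate (assumed)

grad = 2 * x * (x * w - y)      # = 2 * 2 * (2 - 4) = -8
w = w - alpha * grad            # w moves from 1.0 to 1.08
print(w)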
What about a complicated network?

With many layers and many weights, writing the analytic gradient $\frac{\partial loss}{\partial \omega} = \,?$ by hand for every parameter becomes infeasible. This motivates computing gradients on a computational graph.
Computational Graph

A two-layer neural network:

$\hat{y} = W_2 (W_1 \cdot X + b_1) + b_2$

The graph is built node by node from the input $X$. First layer: $X$ and the weight $W_1$ feed a matrix-multiplication node (MM), and the bias $b_1$ is added (ADD). Second layer: the result and $W_2$ feed another MM node, and $b_2$ is added, producing $\hat{y}$.
What is the problem with this two-layer network?

Expanding the expression shows that the two linear layers collapse into one:

$\hat{y} = W_2 (W_1 \cdot X + b_1) + b_2$
$\;\;\; = W_2 W_1 \cdot X + (W_2 b_1 + b_2)$
$\;\;\; = W \cdot X + b$

However many linear layers we stack, the composition is still a linear model. A nonlinear function is therefore required by each layer: in the graph, a nonlinearity $\sigma$ is applied to the output of the first layer ($W_1$ MM, $b_1$ ADD) before it enters the second layer ($W_2$ MM).
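To make the collapse concrete, here is a small numerical check (a sketch, not from the slides; the layer sizes and the use of torch.nn.Linear are my own choices):

import torch

# Two stacked linear layers with no activation in between.
lin1 = torch.nn.Linear(3, 4)
lin2 = torch.nn.Linear(4, 2)

x = torch.randn(5, 3)
y_two_layers = lin2(lin1(x))

# Collapse them into a single linear map: W = W2 @ W1, b = W2 @ b1 + b2.
W = lin2.weight @ lin1.weight
b = lin2.weight @ lin1.bias + lin2.bias
y_one_layer = x @ W.t() + b

print(torch.allclose(y_two_layers, y_one_layer, atol=1e-6))  # True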
The Composition of Functions and the Chain Rule
Chain Rule, step by step

Consider a single node $f(x, \omega)$ with inputs $x$ and $\omega$ and output $z$, which flows onward to the final loss $L$.

1. Create the computational graph (forward): compute $z = f(x, \omega)$ and pass it forward toward the loss.

2. Compute the local gradients at the node: $\frac{\partial z}{\partial x}$ and $\frac{\partial z}{\partial \omega}$.

3. Receive the gradient from the successive node: the backward pass hands the node $\frac{\partial L}{\partial z}$.

4. Use the chain rule to compute the gradients (backward):

$\frac{\partial L}{\partial x} = \frac{\partial L}{\partial z} \cdot \frac{\partial z}{\partial x}$

$\frac{\partial L}{\partial \omega} = \frac{\partial L}{\partial z} \cdot \frac{\partial z}{\partial \omega}$

These are then passed back to the preceding nodes.
Example: $f = x \cdot \omega$ with $x = 2$, $\omega = 3$

Local gradients: $\frac{\partial z}{\partial x} = \omega$ and $\frac{\partial z}{\partial \omega} = x$.

Forward: $z = x \cdot \omega = 6$.

Backward: the successive node supplies $\frac{\partial L}{\partial z} = 5$. By the chain rule:

$\frac{\partial L}{\partial x} = \frac{\partial L}{\partial z} \cdot \frac{\partial z}{\partial x} = 5 \cdot \omega = 15$

$\frac{\partial L}{\partial \omega} = \frac{\partial L}{\partial z} \cdot \frac{\partial z}{\partial \omega} = 5 \cdot x = 10$
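The same numbers can be reproduced with PyTorch autograd (a sketch, not in the slides; the loss $L = 5z$ is a hypothetical stand-in whose gradient with respect to $z$ is exactly 5):

import torch

x = torch.tensor(2.0, requires_grad=True)
w = torch.tensor(3.0, requires_grad=True)

z = x * w             # forward: z = 6
L = 5 * z             # stand-in loss with dL/dz = 5
L.backward()          # backward pass through the graph

print(x.grad.item())  # 15.0 = 5 * w
print(w.grad.item())  # 10.0 = 5 * x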
Computational Graph of Linear Model

Linear model: $\hat{y} = x \cdot \omega$, with training sample $x = 1$, $y = 2$ and weight $\omega = 1$.

Forward pass: the multiplication node computes $\hat{y} = x \cdot \omega = 1$, the subtraction node computes the residual $r = \hat{y} - y = -1$, and the squaring node computes $loss = r^2 = 1$.

Local gradients at each node:

$\frac{\partial (x\omega)}{\partial \omega} = x, \qquad \frac{\partial (\hat{y} - y)}{\partial \hat{y}} = 1, \qquad \frac{\partial r^2}{\partial r} = 2r$

Backward pass, from the loss back to the weight:

$\frac{\partial loss}{\partial r} = 2r = -2$

$\frac{\partial loss}{\partial \hat{y}} = \frac{\partial loss}{\partial r} \cdot \frac{\partial r}{\partial \hat{y}} = -2 \cdot 1 = -2$

$\frac{\partial loss}{\partial \omega} = \frac{\partial loss}{\partial \hat{y}} \cdot \frac{\partial \hat{y}}{\partial \omega} = -2 \cdot x = -2 \cdot 1 = -2$
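This hand computation can be checked directly with autograd (a sketch; it mirrors the code the following slides build up):

import torch

w = torch.tensor([1.0], requires_grad=True)
x, y = 1.0, 2.0

y_hat = x * w               # forward: y_hat = 1
loss = (y_hat - y) ** 2     # loss = 1
loss.backward()             # backward through the graph

print(w.grad.item())        # -2.0, matching the hand computation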
Exercise 4-1: Compute the gradient with the computational graph

Same graph as above ($\hat{y} = x \cdot \omega$, $r = \hat{y} - y$, $loss = r^2$), now with $x = 2$, $y = 4$: compute $\frac{\partial loss}{\partial \omega}$.
Exercise 4-2: Compute the gradient of the affine model

Affine model $\hat{y} = x \cdot \omega + b$, with $x = 1$, $y = 2$, $b = 2$, and $loss = loss(\hat{y}, y)$: compute $\frac{\partial loss}{\partial \omega}$ and $\frac{\partial loss}{\partial b}$.
Tensor in PyTorch
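In PyTorch, a Tensor carries its value in .data and, once backward() has run, the gradient of the loss with respect to it in .grad (itself a Tensor). A minimal sketch:

import torch

w = torch.Tensor([1.0])   # the value lives in w.data
w.requires_grad = True    # ask autograd to track operations on w

print(w.data)             # tensor([1.])
print(w.grad)             # None: no backward pass has run yet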
Implementation of linear model with PyTorch
w = torch.Tensor([1.0])   # initial weight, w = 1.0
w.requires_grad = True    # track gradients for w in the computational graph
The forward computation is an ordinary Python function:

def forward(x):
    return x * w          # linear model: y_hat = x * w

Because w requires a gradient, every expression involving it (x * w here, and the loss built from it) is recorded as a computational graph. One detail recurs in the implementation: backward() accumulates gradients, so after each weight update the stored gradient must be reset:

w.grad.data.zero_()       # clear w.grad before the next backward pass
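Assembled into a complete training loop (a sketch in the style of these slides: the dataset x_data = [1.0, 2.0, 3.0], y_data = [2.0, 4.0, 6.0], the learning rate 0.01, and the epoch count are assumptions, not visible in the extracted slides):

import torch

x_data = [1.0, 2.0, 3.0]           # assumed running example
y_data = [2.0, 4.0, 6.0]

w = torch.Tensor([1.0])
w.requires_grad = True

def forward(x):
    return x * w                    # y_hat = x * w

def loss(x, y):
    y_pred = forward(x)
    return (y_pred - y) ** 2        # squared error, as in the graph

for epoch in range(100):
    for x, y in zip(x_data, y_data):
        l = loss(x, y)              # forward: builds the graph
        l.backward()                # backward: fills w.grad
        w.data = w.data - 0.01 * w.grad.data   # update outside the graph
        w.grad.data.zero_()         # clear the gradient for the next step
    print("epoch:", epoch, "loss:", l.item())

print("predict x=4:", forward(4).item())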
Forward/Backward in PyTorch

The running example again: $x = 1$, $y = 2$, with the graph $\hat{y} = x \cdot \omega$, $r = \hat{y} - y$, $loss = r^2$.
Forward in PyTorch

w = torch.Tensor([1.0])   # weight tensor
w.requires_grad = True    # track gradients

l = loss(x, y)            # the forward pass builds the graph; here loss = 1
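A quick way to see that the forward pass really built a graph (a sketch): the loss tensor carries a grad_fn linking it to the operation that produced it.

x, y = 1.0, 2.0
l = loss(x, y)
print(l)                  # tensor([1.], grad_fn=<PowBackward0>)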
Backward in PyTorch

Calling backward() on the loss walks the graph in reverse and deposits each gradient in the corresponding tensor; afterwards $\frac{\partial loss}{\partial \omega}$ is available as w.grad.

l.backward()
Update weight in PyTorch

After l.backward() has filled w.grad, the weight is updated from the stored gradient. The update must act on w.data so that the update itself is not recorded in the computational graph.
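A sketch of the update step (the learning rate 0.01 is an assumed value):

w.data = w.data - 0.01 * w.grad.data   # plain gradient-descent step, outside the graph
w.grad.data.zero_()                    # reset the gradient for the next iteration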
Exercise 4-3: Compute gradients using the computational graph
Exercise 4-4: Compute gradients using PyTorch
PyTorch Tutorial
04. Back Propagation