Fundamentals of Deep Learning
Part 2: How a Neural Network Trains
Part 1: An Introduction to Deep Learning
[Figure: a handwritten digit as a 28 × 28 grid of pixel values, flattened into a single array, e.g. [0, 0, 0, 24, 75, 184, 185, 78, 32, 55, 0, 0, 0, …]]
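A minimal sketch of this flattening step, assuming the image is held in a NumPy array (the array name and sample pixel values are illustrative):

```python
import numpy as np

image = np.zeros((28, 28), dtype=np.uint8)                 # hypothetical 28 x 28 grayscale digit
image[10, 3:13] = [24, 75, 184, 185, 78, 32, 55, 0, 0, 0]  # a few sample pixel values

flattened = image.reshape(-1)   # one value per pixel
print(flattened.shape)          # (784,)
```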
DATA PREPARATION
Targets as categories
0 [1,0,0,0,0,0,0,0,0,0]
1 [0,1,0,0,0,0,0,0,0,0]
2 [0,0,1,0,0,0,0,0,0,0]
3 [0,0,0,1,0,0,0,0,0,0]
…
9 [0,0,0,0,0,0,0,0,0,1]
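A minimal sketch of this one-hot encoding in NumPy (the array name `labels` is illustrative):

```python
import numpy as np

labels = np.array([0, 1, 2, 3, 9])   # example digit labels
num_classes = 10

# Build a (num_labels, 10) matrix of zeros, then set a 1 in each row
# at the column matching that row's digit.
one_hot = np.zeros((labels.size, num_classes), dtype=int)
one_hot[np.arange(labels.size), labels] = 1

print(one_hot[0])   # [1 0 0 0 0 0 0 0 0 0]
```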
AN UNTRAINED MODEL
[Figure: an untrained network. Input vector [0, 0, …, 0] of size (784,), two hidden layers of size (512,), and an output layer of size (10,).]
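A minimal NumPy sketch of what this untrained model amounts to, assuming randomly initialized weights (names like `W1` and `b1` are illustrative, not from the slides):

```python
import numpy as np

rng = np.random.default_rng(0)

# Randomly initialized weights and biases: 784 -> 512 -> 512 -> 10
W1, b1 = rng.normal(size=(784, 512)), np.zeros(512)
W2, b2 = rng.normal(size=(512, 512)), np.zeros(512)
W3, b3 = rng.normal(size=(512, 10)), np.zeros(10)

x = np.zeros(784)        # a flattened 28 x 28 image
h1 = x @ W1 + b1         # shape (512,)
h2 = h1 @ W2 + b2        # shape (512,)
y_hat = h2 @ W3 + b3     # shape (10,), meaningless until trained
print(y_hat.shape)
```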
A SIMPLER MODEL
A SIMPLER MODEL
ŷ = m·x + b

x   y
1   3
2   5

m = ?   b = ?

[Figure: the two data points plotted on the x–y plane; the slope m and intercept b of the line are still unknown.]
A SIMPLER MODEL
ŷ = m·x + b
Start with random parameters: m = −1, b = 5

x   y   ŷ
1   3   4
2   5   3

[Figure: the randomly initialized line plotted against the two data points.]
A SIMPLER MODEL
ŷ = m·x + b

x   y   ŷ   (y − ŷ)²
1   3   4   1
2   5   3   4

RMSE = √( (1/n) · Σᵢ (yᵢ − ŷᵢ)² )

MSE = 2.5
RMSE ≈ 1.6

[Figure: the current line against the data points, with the prediction errors marked.]
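A minimal sketch of this error calculation in NumPy (variable names are illustrative):

```python
import numpy as np

x = np.array([1.0, 2.0])
y = np.array([3.0, 5.0])

m, b = -1.0, 5.0        # the random starting parameters
y_hat = m * x + b       # predictions: [4.0, 3.0]

mse = np.mean((y - y_hat) ** 2)   # (1 + 4) / 2 = 2.5
rmse = np.sqrt(mse)               # about 1.58
print(mse, rmse)
```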
THE LOSS CURVE
[Figure: the loss surface, MSE plotted over combinations of m and b.]
THE LOSS CURVE
The current parameters (m = −1, b = 5) sit at one point on the loss surface; the target is the combination of m and b with the lowest MSE.
[Figure: the current line against the data (left) and the loss surface (right), with the current position and the target marked.]
THE LOSS CURVE
Changing b from 5 to 4 moves us from the old position to a new current position on the loss surface.
[Figure: the updated line (left) and the loss surface (right), with the old position, the current position (m = −1, b = 4), and the target marked.]
THE LOSS CURVE
Changing m from −1 to 0 moves the current position (m = 0, b = 4) closer to the target.
[Figure: the updated line (left) and the loss surface (right), with the current position and the target marked.]
THE LOSS CURVE
• The gradient: which direction the loss decreases the most
• The learning rate: how far to travel on each step
• Batch: a sample of the full dataset used to compute each step
[Figure: the loss surface with the target marked.]
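A minimal sketch tying these three terms together on the line-fitting example (plain NumPy; the gradient expressions below are derived directly from the two-point MSE for this sketch, not copied from the slides):

```python
import numpy as np

x = np.array([1.0, 2.0])
y = np.array([3.0, 5.0])

m, b = -1.0, 5.0      # random starting point
learning_rate = 0.1   # how far to travel on each step

for step in range(1000):
    # Batch: here the "batch" is the full (tiny) dataset; with more data we
    # could sample a subset of the rows instead.
    y_hat = m * x + b
    error = y_hat - y

    # Gradient of the MSE with respect to m and b.
    grad_m = np.mean(2 * error * x)
    grad_b = np.mean(2 * error)

    # Step against the gradient, scaled by the learning rate.
    m -= learning_rate * grad_m
    b -= learning_rate * grad_b

print(m, b)   # converges toward m = 2, b = 1, which fits both points exactly
```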
OPTIMIZERS
FROM NEURON TO NETWORK
BUILDING A NETWORK
[Figure: a single neuron; weighted inputs (weights w1 and w2) combine to produce the prediction ŷ.]
BUILDING A NETWORK
• Scales to more inputs
• Can chain neurons
[Figure: inputs x1 and x2 feed two hidden neurons through weights w1–w4; the hidden outputs feed the prediction ŷ through weights w5 and w6.]
BUILDING A NETWORK
• Scales to more inputs
• Can chain neurons
• If all the regressions are linear, then the output will also be a linear regression (see the expansion below)
[Figure: the same two-input network producing ŷ.]
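To make the last point concrete, here is the chained network from the figure written out (a worked expansion; for simplicity the biases are dropped):

ŷ = w5·(w1·x1 + w2·x2) + w6·(w3·x1 + w4·x2)
  = (w5·w1 + w6·w3)·x1 + (w5·w2 + w6·w4)·x2

This is just another linear regression in x1 and x2 with different coefficients; stacking more purely linear layers never changes that.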
ACTIVATION FUNCTIONS

Linear:   ŷ = w·x + b
ReLU:     ŷ = w·x + b  if  w·x + b > 0,  otherwise 0
Sigmoid:  ŷ = 1 / (1 + e^(−(w·x + b)))

[Figure: the three functions plotted: a straight line, a hinge that is flat below zero, and an S-shaped curve bounded between 0 and 1.]
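A minimal NumPy sketch of these three activations (the function names are illustrative):

```python
import numpy as np

def linear(z):
    return z

def relu(z):
    # Pass positive values through, clamp everything else to 0.
    return np.maximum(z, 0.0)

def sigmoid(z):
    # Squash any real value into the range (0, 1).
    return 1.0 / (1.0 + np.exp(-z))

z = np.array([-2.0, 0.0, 3.0])   # example pre-activations (w*x + b)
print(linear(z))    # [-2.  0.  3.]
print(relu(z))      # [0. 0. 3.]
print(sigmoid(z))   # [~0.12  0.5  ~0.95]
```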
ACTIVATION FUNCTIONS
[Figure: the two-input network (x1, x2) with an activation function applied at each neuron; the surface plotted for each hidden neuron and for the output ŷ is no longer a flat plane.]
OVERFITTING
Why not have a super large neural network?
OVERFITTING
Which trendline is better?
[Figure: two trendlines fit to the same points. Left: a curve that passes through every point (MSE = .0000). Right: a simpler fit (MSE = .0113).]
OVERFITTING
Which trendline is better?
[Figure: the same two trendlines judged against additional data. Left: MSE = .0308. Right: MSE = .0062. The simpler fit now has the lower error.]
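A minimal sketch of the same idea using NumPy polynomial fits (the data and degrees here are illustrative, not the curves from the figure): a high-degree fit can hit every training point exactly, yet a straight line typically does better on points it has not seen.

```python
import numpy as np

rng = np.random.default_rng(1)

def make_data(n):
    x = rng.uniform(0, 1, n)
    y = 0.3 + 0.5 * x + rng.normal(scale=0.05, size=n)   # noisy line
    return x, y

x_train, y_train = make_data(10)
x_new, y_new = make_data(50)

for degree in (1, 9):
    coeffs = np.polyfit(x_train, y_train, degree)
    train_mse = np.mean((np.polyval(coeffs, x_train) - y_train) ** 2)
    new_mse = np.mean((np.polyval(coeffs, x_new) - y_new) ** 2)
    # Degree 9 interpolates the 10 training points (train MSE near 0)
    # but usually scores worse than degree 1 on the new points.
    print(degree, round(train_mse, 4), round(new_mse, 4))
```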
TRAINING VS VALIDATION DATA
Avoid memorization

Validation data
• New data for the model, used to check whether it truly understands (can generalize)

Overfitting
• When the model performs well on the training data but not on the validation data (evidence of memorization)
• Ideally, accuracy and loss should be similar between the two datasets

[Figure: MSE vs. epoch for the training MSE, the expected validation MSE, and a validation MSE that diverges from the training curve when overfitting.]
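A minimal tf.keras sketch of tracking both curves (this exact code is illustrative, not taken from the course materials):

```python
import tensorflow as tf

# Load MNIST: 60,000 training and 10,000 validation images of handwritten digits.
(x_train, y_train), (x_valid, y_valid) = tf.keras.datasets.mnist.load_data()

# Flatten the 28 x 28 images to (784,) vectors and scale pixels to [0, 1].
x_train = x_train.reshape(-1, 784).astype("float32") / 255
x_valid = x_valid.reshape(-1, 784).astype("float32") / 255

model = tf.keras.Sequential([
    tf.keras.Input(shape=(784,)),
    tf.keras.layers.Dense(512, activation="relu"),
    tf.keras.layers.Dense(512, activation="relu"),
    tf.keras.layers.Dense(10, activation="softmax"),
])
model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])

history = model.fit(x_train, y_train,
                    validation_data=(x_valid, y_valid),  # data the model never trains on
                    epochs=10)

# A growing gap between these two curves is the signature of overfitting.
print(history.history["loss"])
print(history.history["val_loss"])
```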
FROM REGRESSION TO CLASSIFICATION
AN MNIST MODEL

Layer                   Size
Input [0, 0, …, 0]      (784,)
Hidden                  (512,)
Hidden                  (512,)
Output                  (10,)
AN MNIST MODEL

Layer                   Size      Activation
Input [0, 0, …, 0]      (784,)
Hidden                  (512,)    ReLU
Hidden                  (512,)    ReLU
Output                  (10,)     Sigmoid
AN MNIST MODEL

Layer                   Size      Activation
Input [0, 0, …, 0]      (784,)
Hidden                  (512,)    ReLU
Hidden                  (512,)    ReLU
Output                  (10,)     Softmax
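Extending the earlier untrained-model sketch with these activations (again a minimal NumPy sketch; the small random weights are illustrative):

```python
import numpy as np

def relu(z):
    return np.maximum(z, 0.0)

def softmax(z):
    e = np.exp(z - z.max())   # subtract the max for numerical stability
    return e / e.sum()

rng = np.random.default_rng(0)
W1, b1 = rng.normal(scale=0.01, size=(784, 512)), np.zeros(512)
W2, b2 = rng.normal(scale=0.01, size=(512, 512)), np.zeros(512)
W3, b3 = rng.normal(scale=0.01, size=(512, 10)), np.zeros(10)

x = rng.random(784)              # a flattened, scaled 28 x 28 image
h1 = relu(x @ W1 + b1)           # (512,)
h2 = relu(h1 @ W2 + b2)          # (512,)
probs = softmax(h2 @ W3 + b3)    # (10,), one probability per digit

print(probs.sum())               # 1.0, softmax outputs a probability distribution
```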
RMSE FOR PROBABILITIES?
[Figure: the earlier scatter plot and fitted line (x from 0 to 3.5, y from 0 to 4); is RMSE the right loss when the outputs are probabilities?]
CROSS ENTROPY
[Figure: the cross-entropy loss for a blue point prediction, plotted against the assigned probability; the loss stays near 0 for confident correct predictions and grows steeply as the assigned probability of the correct answer approaches 0.]

Loss = −[ t(x) · log(p(x)) + (1 − t(x)) · log(1 − p(x)) ]

t(x) = target (0 if False, 1 if True)
p(x) = predicted probability for point x

When t(x) = 1 only the first term contributes (the loss if True); when t(x) = 0 only the second term contributes (the loss if False).
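A minimal worked example of this loss in Python (the function name and probabilities are illustrative):

```python
import math

def cross_entropy(t, p):
    # t: 1 if the point is truly "blue" (True), 0 otherwise
    # p: the model's assigned probability that the point is blue
    return -(t * math.log(p) + (1 - t) * math.log(1 - p))

print(cross_entropy(1, 0.99))   # about 0.01: confident and correct, tiny loss
print(cross_entropy(1, 0.50))   # about 0.69: unsure, moderate loss
print(cross_entropy(1, 0.05))   # about 3.0:  confident and wrong, large loss
```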
LET’S GO!
APPENDIX: GRADIENT DESCENT
HELPING THE COMPUTER CHEAT CALCULUS
Learning From Error

MSE = ½ · [ (3 − (m·1 + b))² + (5 − (m·2 + b))² ]

∂MSE/∂m = 9m + 5b − 23        ∂MSE/∂b = 5m + 3b − 13

At m = −1, b = 5:
∂MSE/∂m = −7        ∂MSE/∂b = −3
THE LOSS CURVE
[Figure: the loss surface with the current position and the target marked.]
THE LOSS CURVE
∂MSE/∂m = −7        ∂MSE/∂b = −3
[Figure: the loss surface with the gradient at the current position and the target marked.]
THE LOSS CURVE
∂MSE/∂m = −7        ∂MSE/∂b = −3

Update rules (λ is the learning rate):
m := m − λ · ∂MSE/∂m
b := b − λ · ∂MSE/∂b

[Figure: the loss surface with the target marked.]
THE LOSS CURVE
∂MSE/∂m = −7        ∂MSE/∂b = −3
m := m − λ · ∂MSE/∂m
b := b − λ · ∂MSE/∂b

With λ = .6 the step is large and overshoots the target.
[Figure: the resulting step on the loss surface.]
THE LOSS CURVE
∂MSE/∂m = −7        ∂MSE/∂b = −3
m := m − λ · ∂MSE/∂m
b := b − λ · ∂MSE/∂b

With λ = .005 the step is tiny and progress toward the target is slow.
[Figure: the resulting small step on the loss surface.]
THE LOSS CURVE
With λ = .1, b steps from 5 to 4.7, a step size that makes steady progress toward the target.
[Figure: the resulting step on the loss surface.]