
Humanity – Service – Liberation

Chapter 8

Neural Networks & Deep Learning


Machine Learning
CONTENTS

• Perceptron

• Neural networks

• Gradient descent

• Backpropagation
Perceptron

1950s Age of the Perceptron


1957 The Perceptron (Rosenblatt)
1969 Perceptrons (Minsky, Papert)

1980s Age of the Neural Network


1986 Backpropagation (Hinton)

1990s Age of the Graphical Model


2000s Age of the Support Vector Machine

2010s Age of the Deep Network

Deep Learning = Known algorithms + Computing power + Big data


…Perceptron
Inspiration from Biology

• Neural nets/perceptrons are loosely inspired by biology.


• But they certainly are not a model of how the brain works,
or even how neurons work.
…Perceptron

The input is an N-d binary vector x and the weights form a vector w;
the perceptron predicts the label ŷ = sign(w · x).

The perceptron is just one line of code!

(by convention, the sign of zero is taken to be +1)
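A minimal C++ sketch of that one line (the helper name and types here are ours, not the slides'):

#include <vector>
using std::vector;

// Perceptron prediction: the sign of the weighted sum w · x
int predict(const vector<float>& x, const vector<float>& w) {
    float a = 0.0f;
    for (size_t i = 0; i < x.size(); ++i) a += w[i] * x[i];   // weighted sum
    return (a >= 0.0f) ? +1 : -1;                             // sign of zero is +1
}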
…Perceptron

A worked run of the perceptron learning rule:

initialize w = (0, 0)

observation x = (1, -1), label y = -1
prediction: sign(w · x) = sign(0) = +1, which does not match the label
update w: w ← w + y · x = (0, 0) + (-1) · (1, -1) = (-1, 1)

observation x = (-1, 1), label y = +1
prediction: sign(w · x) = sign((-1)(-1) + (1)(1)) = sign(2) = +1, which matches the label
update w: no change needed, w stays (-1, 1)

repeat over the training data until every observation is predicted correctly …
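A short C++ sketch of one pass of that rule, assuming ±1 labels and the mismatch-only update w ← w + y · x shown above (the container layout and names are illustrative):

#include <vector>
using std::vector;

// One epoch of perceptron learning on data (X[n], Y[n]) with labels in {-1, +1}
void perceptron_epoch(const vector<vector<float>>& X, const vector<int>& Y,
                      vector<float>& w) {
    for (size_t n = 0; n < X.size(); ++n) {
        float a = 0.0f;
        for (size_t i = 0; i < w.size(); ++i) a += w[i] * X[n][i];   // w · x
        int pred = (a >= 0.0f) ? +1 : -1;                            // sign of zero is +1
        if (pred != Y[n])                                            // no match: update
            for (size_t i = 0; i < w.size(); ++i) w[i] += Y[n] * X[n][i];
    }
}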
…Perceptron
Another way to draw it…

The same perceptron drawn as a diagram: inputs flow along weighted edges into an output node.

(1) Combine the sum and the activation function into a single node; the output is the
activation function (e.g., a sigmoid) applied to the weighted sum.

(2) Suppress the bias term (less clutter).
…Perceptron
Programming the 'forward pass'

Activation function (sigmoid, logistic function):

// needs: #include <cmath>, #include <vector>, using std::vector;
float f(float a){
    return 1.0f / (1.0f + exp(-a));   // squashes any real value into (0, 1)
}

// dot(x, w): inner product of the input and weight vectors
float dot(vector<float> x, vector<float> w){
    float s = 0.0f;
    for (size_t i = 0; i < x.size(); ++i) s += x[i] * w[i];
    return s;
}

Perceptron function (logistic regression): apply the activation to the weighted sum of the inputs.

float perceptron(vector<float> x, vector<float> w){
    float a = dot(x, w);   // weighted sum
    return f(a);           // output of the neuron
}
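A quick usage sketch, assuming the functions above are in scope (the input and weight values are made up for illustration):

#include <cstdio>
#include <vector>
using std::vector;

// f, dot and perceptron as defined above

int main() {
    vector<float> x = {1.0f, -1.0f, 0.5f};   // hypothetical input
    vector<float> w = {0.3f, -0.2f, 0.8f};   // hypothetical weights
    std::printf("output = %f\n", perceptron(x, w));   // a value in (0, 1)
    return 0;
}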
CONTENTS

• Perceptron

• Neural networks
• Gradient descent

• Backpropagation

• Stochastic gradient descent

Neural networks

Connect a bunch of perceptrons together …

a collection of connected perceptrons

‘six perceptrons’
Neural networks

Some terminology…

‘hidden’ layer
‘input’ layer ‘output’ layer

…also called a Multi-layer Perceptron (MLP)


Neural networks
this layer is a
‘fully connected layer’

all pairwise neurons between layers are connected


Neural networks

For the network shown (3 inputs, a hidden layer of 4 neurons, an output layer of 2 neurons):

How many neurons (perceptrons)? 4 + 2 = 6

How many weights (edges)? (3 x 4) + (4 x 2) = 20

How many learnable parameters in total (weights plus one bias per neuron)? 20 + 6 = 26
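A hedged C++ sketch of one fully connected (dense) layer of the kind being counted; the 3-4-2 shape is inferred from the counts above, and the sigmoid activation and all names are illustrative assumptions:

#include <cmath>
#include <vector>
using std::vector;

// One fully connected layer: out[j] = sigmoid( b[j] + sum_i W[j][i] * in[i] )
vector<float> dense(const vector<vector<float>>& W, const vector<float>& b,
                    const vector<float>& in) {
    vector<float> out(W.size());
    for (size_t j = 0; j < W.size(); ++j) {
        float a = b[j];                                   // bias of neuron j
        for (size_t i = 0; i < in.size(); ++i) a += W[j][i] * in[i];
        out[j] = 1.0f / (1.0f + std::exp(-a));            // sigmoid activation
    }
    return out;
}

// For a 3-4-2 network: hidden = dense(W1, b1, x) with W1 of size 4x3,
// output = dense(W2, b2, hidden) with W2 of size 2x4,
// giving (3*4 + 4*2) = 20 weights plus (4 + 2) = 6 biases, 26 parameters in all.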


Neural networks

performance usually tops out at 2-3 layers,


deeper networks don’t really improve performance...

...with the exception of Convolutional Neural Networks for images


CONTENTS

• Perceptron.

• Neural networks.

• Gradient descent
• Backpropagation.

• Stochastic gradient descent.

Gradient descent

Loss function: defines what it means to be close to the true solution.

You choose the loss function! (some loss functions are better than others, depending on what you want to do)

Squared Error (a popular loss function): L(ŷ, y) = (ŷ - y)^2
Gradient descent

world's smallest perceptron!
(a.k.a. the line equation, linear regression): a single weight w, with ŷ = w · x

Given several examples (x, y)

and a perceptron ŷ = w · x,

modify the weight w (the perceptron parameter) such that the perceptron output ŷ gets 'closer' to the true label y.
…Gradient descent

Code to train the perceptron: the weight update is just one line of code!

Now where does this update come from?

…Gradient descent

update rule: w ← w - η · dL/dw   (take a small step, of size η, against the gradient of the loss)
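A minimal sketch of that one-line update in C++, assuming the tiny model ŷ = w · x and the loss L = ½(ŷ - y)^2 so that dL/dw = (ŷ - y) · x; the names and learning rate are illustrative:

// One gradient-descent step for the world's smallest perceptron: yhat = w * x
float gd_step(float w, float x, float y, float lr) {
    float yhat = w * x;            // forward pass
    float grad = (yhat - y) * x;   // dL/dw for L = 0.5 * (yhat - y)^2
    return w - lr * grad;          // the one-line update: w <- w - lr * dL/dw
}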
CONTENTS

• Perceptron.

• Neural networks.

• Gradient descent.

• Backpropagation.
• Stochastic gradient descent.

Backpropagation

Training the world's smallest perceptron: the loss is a function of ONE parameter!

This is just gradient descent; that means the quantity we subtract in the update should be the gradient of the loss function with respect to the weight.

Now where does this come from?


Backpropagation

The derivative dL/dw is the rate at which this will change (the loss function) per unit change of this (the weight parameter).

Let's compute the derivative…


Backpropagation

Compute the derivative of the loss with respect to the weight ("dL/dw" is just shorthand for that).

That means the weight update for gradient descent is
w ← w - η · dL/dw,
i.e., move in the direction of the negative gradient.
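Worked out as a sketch, assuming the tiny model ŷ = w x and the squared error with the common 1/2 convention (the slides' exact constant may differ):

L(w) = \tfrac{1}{2}\,(\hat{y} - y)^2, \qquad \hat{y} = w\,x

\frac{dL}{dw} = \frac{dL}{d\hat{y}} \cdot \frac{d\hat{y}}{dw} = (\hat{y} - y)\,x

w \leftarrow w - \eta\,(\hat{y} - y)\,x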
Backpropagation

Gradient Descent (world’s smallest perceptron)

For each sample

1. Predict

a. Forward pass

b. Compute Loss

2. Update

a. Back Propagation

b. Gradient update
Backpropagation

world’s (second) smallest perceptron!

function of two parameters!


Backpropagation
Derivative computation

Gradient Update
Backpropagation

Gradient Descent

For each sample

1. Predict

a. Forward pass

b. Compute Loss (a side computation to track the loss; not needed for backprop)

2. Update (two lines now: one per parameter)

a. Back Propagation

b. Gradient update (adjustable step size)
Backpropagation

multi-layer perceptron

function of FOUR parameters and FOUR layers!


Backpropagation

The network is a chain of four layers: the input (layer 1) is multiplied by a weight and summed, and an activation gives hidden layer 2; another weight and activation give hidden layer 3; a final weight and activation give the output (layer 4).
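As a sketch of that chain in symbols, assuming one weight per layer transition and a shared activation f (the notation is ours, not the slides'):

a_2 = f(w_1\,x), \qquad a_3 = f(w_2\,a_2), \qquad \hat{y} = f(w_3\,a_3)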
Backpropagation

The entire network can be written out as one long (nested) equation.

We need to train the network:

What is known? What is unknown?

Known: the training samples (the inputs and their labels).
Unknown: the weights; the activation function sometimes has unknown parameters too.
Backpropagation

Learning an MLP

Given a set of samples and an MLP,

Estimate the parameters of the MLP


Backpropagation

Gradient Descent

For each random sample

1. Predict

a. Forward pass

b. Compute Loss

2. Update

a. Back Propagation (gives the vector of parameter partial derivatives)

b. Gradient update (the vector of parameter update equations)
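A sketch of one such predict/update step for the scalar chain above, assuming sigmoid activations, the loss L = ½(ŷ - y)^2, and one weight per layer transition (all names are illustrative, not the slides' code):

#include <cmath>

struct Params { float w1, w2, w3; };

float sigmoid(float a) { return 1.0f / (1.0f + std::exp(-a)); }

// One gradient-descent step on a single sample (x, y)
void train_step(Params& p, float x, float y, float lr) {
    // 1. Predict: forward pass through the chain
    float a2   = sigmoid(p.w1 * x);    // hidden layer 2
    float a3   = sigmoid(p.w2 * a2);   // hidden layer 3
    float yhat = sigmoid(p.w3 * a3);   // output layer 4
    // (the loss 0.5 * (yhat - y)^2 could be tracked here; not needed for backprop itself)

    // 2a. Back Propagation: chain rule, re-using terms from later layers
    float d4 = (yhat - y) * yhat * (1.0f - yhat);  // dL/d(pre-activation of layer 4)
    float d3 = d4 * p.w3 * a3 * (1.0f - a3);       // propagated back to layer 3
    float d2 = d3 * p.w2 * a2 * (1.0f - a2);       // propagated back to layer 2

    // 2b. Gradient update: one update per parameter
    p.w3 -= lr * d4 * a3;
    p.w2 -= lr * d3 * a2;
    p.w1 -= lr * d2 * x;
}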
Backpropagation

So we need to compute the partial derivatives of the loss with respect to each weight.

Backpropagation

According to the chain rule…

Intuitively, the effect of a weight on the loss function is felt through the rest of the network: the loss depends on the output, the output depends on the activations downstream of the weight, and those depend on the weight itself.
Backpropagation

The derivative of the loss with respect to a weight factors through the rest of the network. Chain Rule!
Backpropagation

The first factor is just the partial derivative of the L2 loss; everything else belongs to the rest of the network.
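For reference, that first factor under the common 1/2 convention for the squared (L2) loss (the slides' constant may differ):

L = \tfrac{1}{2}\,(\hat{y} - y)^2 \;\Rightarrow\; \frac{\partial L}{\partial \hat{y}} = \hat{y} - y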
Backpropagation

The next factor is the derivative of the activation; let's use a Sigmoid function.
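The sigmoid's derivative has a convenient closed form, which is what makes it handy here (a standard identity):

\sigma(a) = \frac{1}{1 + e^{-a}}, \qquad \frac{d\sigma}{da} = \sigma(a)\,\bigl(1 - \sigma(a)\bigr)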


Backpropagation

The Chain Rule (a.k.a. backpropagation)

The chain rule says: the loss depends on the output, the output depends on the last activation, which depends on its weighted sum, which depends on the previous layer's activation, and so on back to the weight we care about. The partial derivative of the loss with respect to a weight is the product of the local derivatives along this chain of dependencies.

Moving from a later weight to an earlier one, most of those factors have already been computed; re-use (propagate) them instead of recomputing. Working backwards through the layers and re-using these terms is exactly backpropagation.
Gradient Descent

For each training sample

1. Predict

a. Forward pass

b. Compute Loss

2. Update

a. Back Propagation (gives the vector of parameter partial derivatives)

b. Gradient update (the vector of parameter update equations)
SUMMARY

• Perceptron

• Neural networks

• Gradient descent

• Backpropagation

MNIST database
Experiments with the MNIST database
• The MNIST database of handwritten digits
• Training set of 60,000 examples, test set of 10,000 examples
• Vectors in R^784 (28x28 images)
• Labels are the digits they represent
• Various methods have been tested with this training set and test set

• Linear models: 7% - 12% error


• KNN: 0.5%- 5% error
• Neural networks: 0.35% - 4.7% error
• Convolutional NN: 0.23% - 1.7% error
Demo
Tinker With a Neural Network Right Here in Your Browser
• Open source software (the TensorFlow Playground) to play with neural networks in your browser.
• The dots are colored orange or blue for positive and negative examples.
• It's possible to choose the activation function, architecture, learning rate, etc.
• Very well done! Let’s check it out!

Humanity – Service – Liberation

Enjoy the Course…!
