Week 2 - Intro to Neural Nets

Motivation for Neural Nets

• Use biology as inspiration for a mathematical model
• Get signals from previous neurons
• Generate signals (or not) according to inputs
• Pass signals on to next neurons
• By layering many neurons, we can create a complex model

2
Neural Net Structure

[Figure: a network mapping an Input (Feature Vector) to an Output (Label)]

• Can think of it as a complicated computation engine
• We will "train it" using our training data
• Then (hopefully) it will give good answers on new data

3
Basic Neuron Visualization

[Figure: a single neuron, drawn as a node containing an activation function]

4
Basic Neuron Visualization

[Figure: data from the previous layer flows into the neuron's activation function]

5
Basic Neuron Visualization
[Figure: inside the neuron, some form of computation (the activation function) transforms the inputs]

6
Basic Neuron Visualization

[Figure: the neuron outputs the transformed data to the next layer]

7
Basic Neuron Visualization
[Figure: inputs x1, x2, x3 feed into the neuron, each connection carrying a weight (e.g., w2)]

8
Basic Neuron Visualization
[Figure: in addition to the weighted inputs x1, x2, x3, a constant input of 1 feeds into the neuron (the bias)]

9
Basic Neuron Visualization
[Figure: the neuron computes the net input z = x1w1 + x2w2 + x3w3 + b and outputs f(z), where f is the activation function]

10
In Vector Notation
z = "net input"             z = b + \sum_{i=1}^{m} x_i w_i
b = "bias term"
f = activation function     z = b + x^T w
a = output to next layer    a = f(z)
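As a concrete illustration (not from the slides), here is a minimal NumPy sketch of this computation; x, w, b, and f are just the quantities defined above, with made-up example values.

```python
import numpy as np

def neuron(x, w, b, f):
    """Single neuron: net input z = b + x.w, then activation a = f(z)."""
    z = b + np.dot(x, w)        # z = b + sum_i x_i * w_i
    return f(z)

# Example with arbitrary numbers and the identity activation
a = neuron(np.array([1.0, 2.0]), np.array([0.5, -0.25]), 0.1, lambda z: z)
print(a)  # 0.1 + 1.0*0.5 + 2.0*(-0.25) = 0.1
```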

11
Relation to Logistic Regression
When we choose:  f(z) = 1 / (1 + e^{-z})

and  z = b + \sum_{i=1}^{m} x_i w_i = x_1 w_1 + x_2 w_2 + ... + x_m w_m + b

then a neuron is simply a "unit" of logistic regression!

weights ↔ coefficients        inputs ↔ variables
bias term ↔ constant term

12
Relation to Logistic Regression
This is called the "sigmoid" function:  σ(z) = 1 / (1 + e^{-z})

13
Nice Property of Sigmoid Function
σ(z) = 1 / (1 + e^{-z})

Quotient rule:  d/dx [ f(x) / g(x) ] = ( f'(x) g(x) - f(x) g'(x) ) / g(x)^2

σ'(z) = ( 0 - (-e^{-z}) ) / (1 + e^{-z})^2
      = e^{-z} / (1 + e^{-z})^2
      = ( (1 + e^{-z}) - 1 ) / (1 + e^{-z})^2
      = 1 / (1 + e^{-z}) - 1 / (1 + e^{-z})^2
      = ( 1 / (1 + e^{-z}) ) * ( 1 - 1 / (1 + e^{-z}) )

σ'(z) = σ(z)(1 - σ(z))        This will be helpful!
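A quick numerical sanity check of this identity (a sketch, not from the slides): compare σ(z)(1 − σ(z)) against a central-difference estimate of the derivative.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

z = np.linspace(-5.0, 5.0, 11)
analytic = sigmoid(z) * (1.0 - sigmoid(z))                     # sigma'(z) = sigma(z)(1 - sigma(z))
eps = 1e-6
numeric = (sigmoid(z + eps) - sigmoid(z - eps)) / (2.0 * eps)  # central difference
print(np.allclose(analytic, numeric, atol=1e-8))               # True
```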

14
Example Neuron Computation
[Figure: a neuron with inputs x1, x2, x3, net input z = x1w1 + x2w2 + x3w3 + b, and a sigmoid activation function producing f(z)]

15
Example Neuron Computation
[Figure: the same neuron with concrete inputs x1 = .9, x2 = .2, x3 = .3; net input z = x1w1 + x2w2 + x3w3 + b; sigmoid activation]

16
Example Neuron Computation
[Figure: plugging in the values: z = .9(2) + .2(3) + .3(-1) + .5 = 2.6; sigmoid activation f(z)]

17
Example Neuron Computation
[Figure: z = .9(2) + .2(3) + .3(-1) + .5 = 2.6; applying the sigmoid: f(z) = f(2.6) = 1/(1 + exp(-2.6)) ≈ .93]

18
Example Neuron Computation
[Figure: z = .9(2) + .2(3) + .3(-1) + .5 = 2.6; f(z) = f(2.6) = 1/(1 + exp(-2.6)) ≈ .93]

The neuron would output the value .93
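This worked example is easy to reproduce directly; here is a small sketch using the numbers from the slide.

```python
import numpy as np

x = np.array([0.9, 0.2, 0.3])    # inputs
w = np.array([2.0, 3.0, -1.0])   # weights
b = 0.5                          # bias term

z = b + np.dot(x, w)             # .9(2) + .2(3) + .3(-1) + .5 = 2.6
a = 1.0 / (1.0 + np.exp(-z))     # sigmoid(2.6) is about 0.93
print(round(z, 2), round(a, 2))  # 2.6 0.93
```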

19
Why Neural Nets?
• Why not just use a single neuron? Why do we need a larger network?
• A single neuron (like logistic regression) only permits a linear decision boundary.
• Most real-world problems are considerably more complicated!

20
Feedforward Neural Network
[Figure: a feedforward network with inputs x1, x2, x3, two hidden layers of four sigmoid (σ) units each, and outputs y1, y2, y3]

21
Weights
[Figure: the same network, highlighting the weights on the connections between layers]

22
Input Layer
[Figure: the same network, highlighting the input layer (x1, x2, x3)]

23
Hidden Layers
[Figure: the same network, highlighting the two hidden layers of σ units]

24
Output Layer
[Figure: the same network, highlighting the output layer (y1, y2, y3)]

25
Weights (represented by matrices)
[Figure: the weights between consecutive layers collected into matrices W(1), W(2), W(3)]

26
Net Input (sum of weighted inputs, before activation function)
[Figure: the net inputs to the second, third, and fourth layers labeled z(2), z(3), z(4)]

27
Activations (output of neurons to next layer)
[Figure: the activations of each layer labeled a(1) (the input x), a(2), a(3), and a(4) (the output)]

28
Matrix representation of computation
[Figure: the input layer (x1, x2, x3) connected through W(1) to the first hidden layer of four σ units]

x = (x1, x2, x3)        (x = a(1))
W(1) is a 3x4 matrix
z(2) = x W(1)           z(2) is a 4-vector
a(2) = σ(z(2))          a(2) is a 4-vector
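To make these shapes concrete, here is a minimal NumPy sketch; the entries of W(1) are arbitrary placeholders, since the slides do not specify any values.

```python
import numpy as np

rng = np.random.default_rng(0)
x = np.array([[0.9, 0.2, 0.3]])   # a(1) = x, a row vector of length 3 (shape (1, 3))
W1 = rng.normal(size=(3, 4))      # W(1) is a 3x4 matrix

z2 = x @ W1                       # z(2) = x W(1), a 4-vector (shape (1, 4))
a2 = 1.0 / (1.0 + np.exp(-z2))    # a(2) = sigmoid(z(2)), also a 4-vector
print(z2.shape, a2.shape)         # (1, 4) (1, 4)
```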

29
Continuing the Computation
For a single training instance (data point):
Input: vector x (a row vector of length 3)
Output: vector y (a row vector of length 3)

z(2) = x W(1)        a(2) = σ(z(2))
z(3) = a(2) W(2)     a(3) = σ(z(3))
z(4) = a(3) W(3)     y = softmax(z(4))
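A minimal end-to-end sketch of this forward pass for one instance, assuming four units per hidden layer and randomly initialized (untrained) weights; the slides do not specify these values, and bias terms are omitted here just as in the equations above.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def softmax(z):
    e = np.exp(z - z.max())          # subtract the max for numerical stability
    return e / e.sum()

rng = np.random.default_rng(0)
W1 = rng.normal(size=(3, 4))         # W(1): input (3) -> hidden layer 1 (4)
W2 = rng.normal(size=(4, 4))         # W(2): hidden layer 1 (4) -> hidden layer 2 (4)
W3 = rng.normal(size=(4, 3))         # W(3): hidden layer 2 (4) -> output (3)

x = np.array([0.9, 0.2, 0.3])        # a(1) = x

z2 = x @ W1;  a2 = sigmoid(z2)       # z(2) = x W(1),    a(2) = sigmoid(z(2))
z3 = a2 @ W2; a3 = sigmoid(z3)       # z(3) = a(2) W(2), a(3) = sigmoid(z(3))
z4 = a3 @ W3; y_hat = softmax(z4)    # z(4) = a(3) W(3), y = softmax(z(4))
print(y_hat, y_hat.sum())            # three probabilities that sum to 1
```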

30
Multiple data points
In practice, we do these computations for many data points at the same time,
by "stacking" the rows into a matrix. But the equations look the same!
Input: matrix x (an n×3 matrix) (each row a single instance)
Output: matrix y (an n×3 matrix) (each row a single prediction)

z(2) = x W(1)        a(2) = σ(z(2))
z(3) = a(2) W(2)     a(3) = σ(z(3))
z(4) = a(3) W(3)     y = softmax(z(4))
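The batched version is the same sketch with the input stacked into an n×3 matrix; the matrix products handle all rows at once, and softmax is simply applied row-wise (again with placeholder weights not taken from the slides).

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

rng = np.random.default_rng(0)
W1, W2, W3 = rng.normal(size=(3, 4)), rng.normal(size=(4, 4)), rng.normal(size=(4, 3))

X = np.array([[0.9, 0.2, 0.3],
              [0.1, 0.7, 0.4]])                  # n = 2 instances, one per row

A2 = sigmoid(X @ W1)                              # shape (n, 4)
A3 = sigmoid(A2 @ W2)                             # shape (n, 4)
Z4 = A3 @ W3                                      # shape (n, 3)
E = np.exp(Z4 - Z4.max(axis=1, keepdims=True))    # row-wise softmax
Y_hat = E / E.sum(axis=1, keepdims=True)
print(Y_hat.shape, Y_hat.sum(axis=1))             # (2, 3), each row sums to 1
```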

31
Now we know how feedforward NNs do computations.
Next, we will learn how to adjust the weights to learn from data.

32
