Machine Learning Lecture 11
Second, all the weighted inputs are added together with a bias b:
(w · x) + b = w1 * x1 + w2 * x2 + b
Function             Value at x = 7    Value at x = -7
Unit Step            1                 0
Linear Function      7                 -7
Hyperbolic Tangent   0.9999            -1
Sigmoid              0.999             0
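As a quick check of these values, here is a minimal numpy sketch (the helper names and the weights/inputs are my own, chosen so that the weighted sum equals 7) that computes (w · x) + b and passes it through each activation:

import numpy as np

def unit_step(x):
    return 1 if x > 0 else 0

def linear(x):
    return x

def sigmoid(x):
    return 1 / (1 + np.exp(-x))

# Hypothetical weights and inputs chosen so that (w . x) + b = 7
w = np.array([0.5, 0.5])
x = np.array([10.0, 4.0])
b = 0.0
z = np.dot(w, x) + b   # 7.0

for value in (z, -z):
    print(value, unit_step(value), linear(value), np.tanh(value), sigmoid(value))
#  7.0 -> 1,  7.0,  0.99999..., 0.99909...
# -7.0 -> 0, -7.0, -0.99999..., 0.00091...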
Linear Classifier
The simple neuron can solve linearly separable problems such as OR.
The simple neuron (perceptron) cannot classify problems that are not linearly separable, such as
XOR.
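For illustration, here is a minimal sketch of a single step-activation neuron with hand-picked weights (hypothetical values of my own, not from the lecture) that reproduces OR; no choice of w1, w2, b can do the same for XOR, because XOR is not linearly separable:

def perceptron(x1, x2, w1=1.0, w2=1.0, b=-0.5):
    # Unit-step activation on the weighted sum (w . x) + b
    return 1 if w1 * x1 + w2 * x2 + b > 0 else 0

for x1 in (0, 1):
    for x2 in (0, 1):
        print(x1, x2, perceptron(x1, x2))   # output matches OR(x1, x2)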
This network has 2 inputs, a hidden layer with 2 neurons (h1 and h2), and an output
layer with 1 neuron (o1).
Notice that the inputs for o1 are the outputs from h1 and h2.
A hidden layer is any layer between the input (first) layer and output (last) layer.
There can be multiple hidden layers! (This is called Deep Learning.)
Example
Given this neural network
w1 = w3 = 0, w2 = w4 = 1, and b = 0 for h1, h2, and o1
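As a worked sketch (assuming a sigmoid activation f and an example input x = (2, 3), which is not given in the slide), feedforward gives:

h1 = h2 = f(0*2 + 1*3 + 0) = f(3) ≈ 0.9526
o1 = f(0*h1 + 1*h2 + 0) = f(0.9526) ≈ 0.7216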
Second: we initialize w1, w2, w3, w4, w5, w6, b1, b2, b3 with random values.
Third: we determine the predicted Sex from the given Weight and Height as follows.
o1 is called y_pred.
Training a Neural Network
Loss
We first need a way to quantify how “good” the network is doing so that it can try to do “better”.
That’s what the loss is.
We’ll use the mean squared error (MSE) loss:
MSE = (1/n) * Σ (y_true − y_pred)²
Where
• n is the number of samples, which is 4
• y represents the variable being predicted, which is Sex.
• y_true is the true value of the variable (the “correct answer”).
• y_pred is the predicted value of the variable. It’s whatever our network outputs.
• (y_true − y_pred)² is known as the squared error.
Our loss function is simply taking the average over all squared errors.
The better our predictions are, the lower our loss will be!
Better predictions = Lower loss.
Training a network = trying to minimize its loss.
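A minimal numpy sketch of this loss (the function name mse_loss is my own):

import numpy as np

def mse_loss(y_true, y_pred):
    # Average of the squared errors over all n samples
    return ((y_true - y_pred) ** 2).mean()

# Example: y_true for the 4 samples, with all-zero predictions
y_true = np.array([1, 0, 0, 1])
y_pred = np.array([0.0, 0.0, 0.0, 0.0])
print(mse_loss(y_true, y_pred))   # 0.5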
Training a Neural Network
The process of calculating h1, h2, and o1 is called feedforward, where the calculation goes from the input layer to the
hidden layer and then to the output layer.
The process of modifying the weights (w1, w2, w3, …, w6) is called backpropagation, where the modifications
go from the output layer to the hidden layer and then to the input layer.
In feedforward we calculate:
1. h1 and h2
2. o1
Training loop (flowchart):
1. Determine the mean squared error (MSE) loss.
2. If the loss is acceptable, stop.
3. Otherwise, modify the weights and biases.
4. Repeat for the specified iterations; when all iterations are done, stop.
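To make the loop concrete, here is a runnable sketch of the same stop-when-acceptable-or-out-of-iterations structure on a toy one-weight model (my own example, not the lecture's network):

import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0])
y_true = np.array([2.0, 4.0, 6.0, 8.0])       # toy targets: y = 2 * x
w = np.random.normal()                        # random initial weight
learn_rate = 0.01
acceptable_loss = 1e-4

for iteration in range(1000):                 # repeat for the specified iterations
    y_pred = w * x                            # feedforward
    loss = ((y_true - y_pred) ** 2).mean()    # MSE loss
    if loss <= acceptable_loss:               # stop when the loss is acceptable
        break
    dL_dw = (-2 * x * (y_true - y_pred)).mean()   # gradient of the loss w.r.t. w
    w = w - learn_rate * dL_dw                # modify the weight

print(w)   # close to 2.0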
First Step: initialize w1, w2, w3, w4, w5, w6, b1, b2, b3
# Weights
self.w1 = np.random.normal()   # w1 = -0.17078787256065822
self.w2 = np.random.normal()   # w2 = 0.8018910260223238
self.w3 = np.random.normal()   # w3 = 2.042028648489558
self.w4 = np.random.normal()   # w4 = 0.9472266245782457
self.w5 = np.random.normal()   # w5 = 0.11610745255156371
self.w6 = np.random.normal()   # w6 = -0.04474672280574983
# Biases
self.b1 = np.random.normal()   # b1 = 0.049210103523584146
self.b2 = np.random.normal()   # b2 = -0.7372822297715569
self.b3 = np.random.normal()   # b3 = 0.6148445824873799
Second Step: Determine Hidden values (h1, h2) and Output (o1)
h1 = f(x1), where x1 = w1 * weight + w2 * height + b1
h2 = f(x2), where x2 = w3 * weight + w4 * height + b2
o1 = f(x3), where x3 = w5 * h1 + w6 * h2 + b3
(f is the activation function)
w1 = -0.170    w2 = 0.802     b1 = 0.049
w3 = 2.042     w4 = 0.947     b2 = -0.737
w5 = 0.116     w6 = -0.045    b3 = 0.615
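A minimal numpy sketch of this step, assuming a sigmoid activation f (consistent with the y_pred values between 0 and 1) and a hypothetical pre-scaled (weight, height) input:

import numpy as np

def sigmoid(x):
    return 1 / (1 + np.exp(-x))

# Weights and biases from the slide
w1, w2, b1 = -0.170, 0.802, 0.049
w3, w4, b2 = 2.042, 0.947, -0.737
w5, w6, b3 = 0.116, -0.045, 0.615

def feedforward(weight, height):
    h1 = sigmoid(w1 * weight + w2 * height + b1)
    h2 = sigmoid(w3 * weight + w4 * height + b2)
    o1 = sigmoid(w5 * h1 + w6 * h2 + b3)   # o1 is y_pred
    return o1

print(feedforward(-2.0, -1.0))   # hypothetical input values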
Determine mean squared error (MSE) loss
y_pred    y_true
0.65      1
0.67      0
0.67      0
0.65      1
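Plugging these four samples into the MSE formula as a check:
L = [(1 − 0.65)² + (0 − 0.67)² + (0 − 0.67)² + (1 − 0.65)²] / 4
  = (0.1225 + 0.4489 + 0.4489 + 0.1225) / 4
  ≈ 0.286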
We use partial differentiation to determine how much each weight affects the loss L, so that the
new weight = old weight − learning rate * ∂L/∂weight.
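For example, a single gradient-descent update in code (the learning rate and gradient value here are hypothetical):

learn_rate = 0.1
w1 = -0.170
dL_dw1 = 0.05                     # assumed value of the partial derivative dL/dw1
w1 = w1 - learn_rate * dL_dw1     # new weight = old weight - learning rate * dL/dw1
print(w1)                         # -0.175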
Third Step: Modify Weights And Biases
L = (y_true − y_pred)², where y_pred = o1
∂L/∂y_pred = 2(y_true − y_pred) * (−1) = −2(y_true − y_pred)
Third Step: Modify Weights And Biases
Imagine we wanted to tweak w1. How would loss L change if we changed w1?
That’s a question the partial derivative can answer. How do we calculate it?
To start, let’s rewrite the partial derivative in terms of ∂y_pred/∂w1 instead:
∂L/∂w1 = ∂L/∂y_pred * ∂y_pred/∂w1
where L = (1 − y_pred)², since y_true = 1 in this example.
Third Step: Modify Weights And Biases
We can break down ∂L/∂w1 into
several parts we can calculate. Since w1 only affects h1 (not h2), we can write ∂y_pred/∂w1 = ∂y_pred/∂h1 * ∂h1/∂w1.
So ∂L/∂w1 = ∂L/∂y_pred * ∂y_pred/∂h1 * ∂h1/∂w1
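Putting the three factors together, here is a minimal numpy sketch of ∂L/∂w1, assuming sigmoid activations (so f'(x) = f(x)(1 − f(x))) and a hypothetical (weight, height, y_true) sample:

import numpy as np

def sigmoid(x):
    return 1 / (1 + np.exp(-x))

def deriv_sigmoid(x):
    s = sigmoid(x)
    return s * (1 - s)

# Weights and biases from the slides
w1, w2, b1 = -0.170, 0.802, 0.049
w3, w4, b2 = 2.042, 0.947, -0.737
w5, w6, b3 = 0.116, -0.045, 0.615

weight, height, y_true = -2.0, -1.0, 1    # hypothetical sample

# Feedforward, keeping the pre-activation sums
sum_h1 = w1 * weight + w2 * height + b1
h1 = sigmoid(sum_h1)
sum_h2 = w3 * weight + w4 * height + b2
h2 = sigmoid(sum_h2)
sum_o1 = w5 * h1 + w6 * h2 + b3
y_pred = sigmoid(sum_o1)

# Chain rule: dL/dw1 = dL/dy_pred * dy_pred/dh1 * dh1/dw1
dL_dypred = -2 * (y_true - y_pred)
dypred_dh1 = w5 * deriv_sigmoid(sum_o1)
dh1_dw1 = weight * deriv_sigmoid(sum_h1)
dL_dw1 = dL_dypred * dypred_dh1 * dh1_dw1
print(dL_dw1)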