Artificial Neural Networks - Lect. 2

The document summarizes the basic architecture and components of neural networks: a neural network consists of neurons whose weighted inputs are summed and passed through an activation function; the weights are initialized and then updated during training using an algorithm such as the perceptron learning rule to minimize error. Common activation functions include the sign, sigmoid, tanh, and ReLU functions.


The basic architecture of neural networks

Simple model of a neuron
[Figure: neuron j receives inputs y1, y2, y3, ..., yi through weights w1j, w2j, w3j, ..., wij and produces output O.]

• Each neuron has a threshold value.

• Each neuron has weighted inputs from other neurons.

• The input signals form a weighted sum.

• The threshold value is subtracted from this weighted sum to give the neuron's
activation level.
Cont.
• If the weighted sum exceeds the threshold (i.e. the activation level is positive), the neuron "fires".
Otherwise, it does not fire (its output is ignored).

• Note: the most common fixed value of the threshold is zero.

• The activation level is passed through a sigmoid activation function to
determine the output, as in the sketch below.
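A minimal sketch of this neuron model in Python, assuming NumPy; the input values, weights, and threshold below are made-up illustration numbers, not values from the slides.

import numpy as np

def sigmoid(z):
    """Sigmoid activation: maps any real value into (0, 1)."""
    return 1.0 / (1.0 + np.exp(-z))

# Hypothetical inputs y_i and weights w_ij for a single neuron j.
y = np.array([0.5, -1.2, 0.3])      # incoming signals from other neurons
w = np.array([0.4,  0.7, -0.2])     # weights w1j, w2j, w3j
threshold = 0.0                      # most common fixed threshold value

activation_level = np.dot(w, y) - threshold   # weighted sum minus threshold
output = sigmoid(activation_level)            # post-activation output of the neuron
print(activation_level, output)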
Mathematical representation of a perceptron
• Consider a training instance in the form (X, y), where the
input vector X is X = [x1, x2, ..., xd]
and the desired output y is y ∈ {-1, +1}.
• Note: d is the number of input nodes; -1 and +1 are the observed values of the binary class.
• The basic steps of the learning process in the perceptron work as follows:
1) Compute the weighted sum W·X = ∑j wj xj.
2) The sign function is applied to the previous value to determine the
actual output: ŷ = sign(W·X).
Cont.
• The sign function maps a real value to -1 or +1 (e.g. sign(2.3) is +1
and sign(-2.3) is -1).
Note: the sign function is one type of "activation function"; other
forms will be discussed later.

3) The error of the prediction is then computed in the form: E(X) = y - ŷ.

4) If the error value is nonzero, then the weights must be updated in
the negative direction of the error gradient (see the next slides). A minimal sketch of steps 1-3 follows.
Note: E(X) is called the loss function (different forms of E will be discussed later).
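A minimal sketch of steps 1-3 (weighted sum, sign activation, prediction error), assuming NumPy; the instance (x, y) and the current weights are hypothetical values, not taken from the slides.

import numpy as np

def sign(v):
    """Map a real value to -1 or +1 (sign(0) treated as +1 here)."""
    return 1 if v >= 0 else -1

def perceptron_predict(w, x):
    """Steps 1-2: weighted sum followed by the sign activation."""
    return sign(np.dot(w, x))

# Hypothetical training instance (X, y) with d = 3 inputs.
x = np.array([1.0, -2.0, 0.5])
y = +1                              # desired binary output
w = np.array([0.2, 0.1, -0.4])      # current weights

y_hat = perceptron_predict(w, x)    # step 2: actual output
error = y - y_hat                   # step 3: prediction error E(X)
print(y_hat, error)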
Cont.
• The typical form of the perceptron equation has an additional fixed value, called
a bias.
• By incorporating the bias value b, we can write the actual output
equation in the form: ŷ = sign(W·X + b).

• Note: for simplicity, the bias may not appear in the following equations.
Cont.
• The perceptron algorithm uses a smooth approximation of the gradient with
respect to each example: ∇E ≈ -(y - ŷ) X.

• The weights are updated according to:
W(current) = W(previous) + ∆W,   where ∆W = α (y - ŷ) X

α is the learning rate (commonly 0.1). A sketch of one update step follows.
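A minimal sketch of one weight update under this rule, assuming NumPy; the weights, instance, and learning rate are hypothetical illustration values.

import numpy as np

def perceptron_update(w, x, y, alpha=0.1):
    """One perceptron update: w <- w + alpha * (y - y_hat) * x."""
    y_hat = 1 if np.dot(w, x) >= 0 else -1   # sign of the weighted sum
    delta_w = alpha * (y - y_hat) * x        # zero when the prediction is already correct
    return w + delta_w

# Hypothetical example, continuing the instance above.
w = np.array([0.2, 0.1, -0.4])
x = np.array([1.0, -2.0, 0.5])
y = +1
w = perceptron_update(w, x, y, alpha=0.1)
print(w)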


Weight initialization in NNs
We can use one of the following rules to initialize the weights (a sketch follows this list):

1. Zero initialization: this choice causes all weights to have the same value in
subsequent iterations.

2. Random initialization: this is a better choice because it breaks the symmetry.
However, initializing the weights with very high or very low values can result in
slower optimization.

3. Using an extra scaling factor, as in schemes like Xavier initialization:
this method addresses the above issue (slower optimization), which is why it
is the most recommended weight initialization method of the three.
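A minimal sketch comparing the three schemes, assuming NumPy; the layer sizes and the 1/n_in variance used for the Xavier-style scaling factor are assumptions, since the slide does not give the exact formula.

import numpy as np

rng = np.random.default_rng(0)
n_in, n_out = 64, 32   # hypothetical layer sizes

# 1. Zero initialization: every weight keeps the same value across iterations.
w_zero = np.zeros((n_in, n_out))

# 2. Plain random initialization: breaks symmetry, but too large or too small
#    a scale can slow optimization.
w_random = rng.standard_normal((n_in, n_out)) * 0.5

# 3. Xavier-style initialization: scale the random values by an extra factor.
#    The sqrt(1/n_in) factor shown here is one common variant (an assumption).
w_xavier = rng.standard_normal((n_in, n_out)) * np.sqrt(1.0 / n_in)
print(w_xavier.std())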
Convergence
• The perceptron algorithm always converges to zero error on
the training data when the data are linearly separable.
• However, it is not guaranteed to converge when the data are not
linearly separable.
Activation function
• The choice of activation function is a critical part of NN design.

• A neuron computes two functions in the node: a weighted sum of its inputs,
followed by an activation function.

The value computed before applying the activation function is called the pre-activation
value (computed by the summation ∑), whereas the value computed after applying the
activation function is called the post-activation value (the output of the activation
function), as in the sketch below.
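A minimal sketch of the two values computed in one node, assuming NumPy and tanh as the activation; the weights and inputs are hypothetical.

import numpy as np

def neuron(w, x, activation=np.tanh):
    """The two functions computed in one node."""
    pre_activation = np.dot(w, x)                   # weighted sum (the summation)
    post_activation = activation(pre_activation)    # value after the activation function
    return pre_activation, post_activation

# Hypothetical weights and inputs.
pre, post = neuron(np.array([0.3, -0.8]), np.array([1.0, 0.5]))
print(pre, post)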
Cont.
• Every neuron contains an activation function, which determines whether the
signal is strong enough to produce an output.

• Classical activation functions (sketched below):

• Sign function (or bipolar binary function): used to map to binary outputs at
prediction time (-1 or +1).
• Sigmoid function (or unipolar continuous function): outputs a value in the interval
(0, 1), so it can produce probabilistic outputs (real values) and be used to build a loss
function based on a maximum-likelihood model.
• Tanh function: similar to the sigmoid function, but preferred when the outputs are
desired to be positive or negative. Furthermore, its larger gradient makes it easier
to train.
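Minimal sketches of the three classical activation functions, assuming NumPy; the test values are arbitrary.

import numpy as np

def sign_fn(v):
    """Bipolar binary activation: -1 or +1."""
    return np.where(v >= 0, 1.0, -1.0)

def sigmoid(v):
    """Unipolar continuous activation: output in (0, 1)."""
    return 1.0 / (1.0 + np.exp(-v))

def tanh_fn(v):
    """Like the sigmoid but with output in (-1, 1) and a larger gradient."""
    return np.tanh(v)

v = np.linspace(-3, 3, 7)
print(sign_fn(v), sigmoid(v), tanh_fn(v), sep="\n")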
Cont.
• Piecewise linear activation functions (sketched below):

• Both ReLU and hard tanh have largely replaced the sigmoid and soft tanh
activation functions in modern neural networks because of their ease of training.
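A minimal sketch of the two piecewise linear functions, assuming NumPy; the test values are arbitrary.

import numpy as np

def relu(v):
    """ReLU: max(0, v)."""
    return np.maximum(0.0, v)

def hard_tanh(v):
    """Hard tanh: clip the value to the interval [-1, 1]."""
    return np.clip(v, -1.0, 1.0)

v = np.linspace(-2, 2, 9)
print(relu(v), hard_tanh(v), sep="\n")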
Learning algorithms
During the learning process, weights can be updated by different rules (each rule is sketched in code below). In the following
rules: c is the learning rate, d the desired output, o the actual output, and net = WᵀX.
• Perceptron learning rule:
∆wi = c [di - oi] xi
• Hebbian learning rule:
∆wi = c [oi] xi
• Delta learning rule:
∆wi = c [di - oi] f'(net) xi
f'(net) = 1/2 (1 - oi²)
• Widrow-Hoff learning rule (d is independent of the activation function):
∆wi = c [di - net] f'(net) xi
with f'(net) = 1, since f(net) = net
• Correlation learning rule (a special case of the Hebbian rule):
∆wi = c [di] xi
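A minimal sketch computing ∆w under each of the listed rules, assuming NumPy; the sign of net is used as the actual output o for the perceptron and Hebbian rules, and a bipolar continuous (tanh-based) output is assumed for the delta rule, since the slide does not specify these choices. The weights, input, and desired output are hypothetical.

import numpy as np

def weight_updates(w, x, d, c=0.1):
    """Delta-w under each learning rule, for one training example."""
    net = np.dot(w, x)
    o_sign = 1.0 if net >= 0 else -1.0   # actual output with the sign activation
    o_cont = np.tanh(net / 2.0)          # bipolar continuous output (assumed for the delta rule)

    return {
        "perceptron":  c * (d - o_sign) * x,
        "hebbian":     c * o_sign * x,
        "delta":       c * (d - o_cont) * 0.5 * (1.0 - o_cont**2) * x,  # f'(net) = 1/2 (1 - o^2)
        "widrow_hoff": c * (d - net) * x,                               # f(net) = net, so f'(net) = 1
        "correlation": c * d * x,
    }

# Hypothetical instance.
print(weight_updates(np.array([0.5, -0.5]), np.array([1.0, 2.0]), d=+1))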
Example 1: illustrates the perceptron learning rule

Sol : when c = 0.1
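Since the worked numbers of Example 1 are not shown in the text, the sketch below only illustrates the perceptron learning rule with c = 0.1 on a small made-up, linearly separable dataset; the data, the number of epochs, and the bias-as-extra-input convention are assumptions, not the slide's values.

import numpy as np

def train_perceptron(X, D, c=0.1, epochs=10):
    """Train with the perceptron learning rule: w <- w + c * (d - o) * x."""
    w = np.zeros(X.shape[1])
    for _ in range(epochs):
        for x, d in zip(X, D):
            o = 1 if np.dot(w, x) >= 0 else -1   # sign activation
            w = w + c * (d - o) * x              # update only when o differs from d
    return w

# Hypothetical data; the last component of each input acts as a bias input.
X = np.array([[ 1.0,  1.0, 1.0],
              [ 2.0,  1.0, 1.0],
              [-1.0, -1.5, 1.0],
              [-2.0, -1.0, 1.0]])
D = np.array([+1, +1, -1, -1])
print(train_perceptron(X, D, c=0.1))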
