Neural Network Basics
2.1 Neurons or Nodes and Layers
Fig. 2.1 shows the structure with just one building component. Such nodes are chained together with many artificial neurons to construct a network. In Fig. 2.2 there are three neurons. Now the activation functions and intermediate outputs are included implicitly in the nodes, and the weights in the arcs (connections) between nodes. The structure in Fig. 2.2 could still be a part of a larger network. The input and output are often a special type of neuron, either to accept input data or to generate output values of the network.

Fig. 2.2 An artificial neural network with four layers: input nodes {I1, I2, I3, I4}, hidden nodes {N1, N2} and {N3}, and output node {O}.
In Fig. 2.2 there are four layers: the input layer, two hidden layers and the output layer. Normally, all nodes of a single layer share the same properties, such as the activation function, and the same type, i.e., input, hidden or output. Note that these node types are used in feedforward networks, that is, multilayer perceptrons. Still, virtually always the nodes of the same layer are of the same type, and the input and output have to be taken care of.

The network in Fig. 2.2 is extended in Fig. 2.3, which depicts a common type of feedforward network, although a small one as to the numbers of nodes in its layers. Note that in the sense of a directed graph data structure it is "complete" as to the arcs between the layers: there exist all possible arcs from each node of a layer to the nodes of the following layer. On the other hand, there are no lateral arcs between the nodes of the same layer in feedforward networks.

Fig. 2.3 A fully connected feedforward or multilayer perceptron network with input layer {I1, I2, I3, I4}, hidden layers {N1, N2} and {N3, N4}, and output layer {O}.
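To make the structure of Fig. 2.3 concrete, the following Python sketch runs one forward pass through a fully connected 4-2-2-1 network. The random weight values and the sigmoid activation are illustrative assumptions made for this example only, not values from the text.

```python
import numpy as np

def sigmoid(z):
    # Illustrative activation; activation functions are introduced later in the chapter.
    return 1.0 / (1.0 + np.exp(-z))

# Layer sizes follow Fig. 2.3: 4 inputs, two hidden layers of 2 nodes, 1 output.
rng = np.random.default_rng(0)
W1, b1 = rng.normal(size=(2, 4)), np.zeros(2)   # input layer  -> hidden layer 1
W2, b2 = rng.normal(size=(2, 2)), np.zeros(2)   # hidden 1     -> hidden layer 2
W3, b3 = rng.normal(size=(1, 2)), np.zeros(1)   # hidden 2     -> output layer

def forward(x):
    # Every node of a layer feeds every node of the following layer,
    # and there are no lateral connections inside a layer.
    h1 = sigmoid(W1 @ x + b1)
    h2 = sigmoid(W2 @ h1 + b2)
    return sigmoid(W3 @ h2 + b3)

print(forward(np.array([0.2, 0.5, 0.1, 0.9])))
```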
2.2 Types of neurons or nodes

The basic forms of neural networks are typically feedforward ones. Recursive networks also exist, even if they obviously do not have as many and as versatile forms as the former. As mentioned above, the types or roles of nodes also vary. Sometimes the same node may have more than one role. For instance, Boltzmann machines are an example of a neural network architecture in which nodes are both input and output.

Normally the input to a neural network is represented as an array or vector as in Equation 2.1, in which the vector is of dimension p = d_j and j denotes the layer. For the input layer the dimension d_1 is equal to the number of input variables.

Input, hidden and output nodes

Notice that input nodes do not have activation functions. Thus, they are little more than placeholders. The input is simply weighted and summed. Furthermore, the sizes of the input and output vectors will be the same if the neural network has nodes that are both input and output.

Hidden nodes have two important characteristics. First, they only receive input from other nodes, such as input or preceding hidden nodes. Second, they only output to other nodes, either output nodes or other, following hidden nodes. Hidden nodes are not directly connected to the incoming data or to the eventual output. They are often grouped into fully connected hidden layers.
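As a small illustration of "the input is simply weighted and summed", the sketch below computes the net input of one hidden node from an input vector; all numbers are made up for the example.

```python
import numpy as np

# Input vector x of dimension d_1 = 4 (one value per input variable).
x = np.array([0.2, 0.5, 0.1, 0.9])

# Illustrative weights of one hidden node, one weight per incoming arc.
w = np.array([0.4, -0.6, 0.3, 0.8])
b = 0.1  # bias term (bias nodes are discussed below)

net_input = np.dot(w, x) + b   # weighted sum of the inputs
print(net_input)               # an activation function would be applied to this value
```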
A common question concerns the number of hidden nodes in a network. Since the answer is complex, this question will be considered in different contexts. It is good to notice that the numbers of layers and nodes affect the time complexity of using a neural network.

Prior to the time of deep learning, it was suggested that one or two hidden layers are enough for a feedforward network to function virtually as a universal approximator for any mathematical function. Let us remember that if there is one hidden layer, there are two processing layers, the hidden layer and the output layer. The above-mentioned approximation of any function is, however, a theoretical result, because it does not express how the approximation could be made.

Another reason why additional hidden layers seemed to be a problem was that they would require a very extensive training set to be able to compute the weights for the network. Before deep learning, this was actually a problem, since deep learning means networks of several hidden layers. Although networks of one or two hidden layers are able to learn "everything" in theory, deep learning facilitates a more complex representation of patterns in the data.
Bias nodes

Bias nodes are added to feedforward neural networks to help them learn patterns. A bias node functions like an input node that always produces the constant value 1 or some other constant. Because of this property, bias nodes are not connected to the previous layer. The constant 1 here is called the bias activation. Not all neural networks have bias nodes. Fig. 2.4 depicts a two-hidden-layer network with bias nodes. The network includes three bias nodes. Bias neurons allow the output of an activation function to be shifted. This will be presented later on, in the context of activation functions.

Regardless of the type of neuron, node or processing unit, neural networks are almost always constructed of weighted connections between these units.

Fig. 2.4 A feedforward network with bias nodes B1, B2 and B3 attached to the input layer {I1, I2} and the hidden layers {N1, N2} and {N3, N4}, with output node O.
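To illustrate why a bias node lets the output of an activation function be shifted, the following sketch evaluates a sigmoid node (the sigmoid is assumed here only for illustration) at the same input with different bias weights; the curve moves sideways while its shape stays the same, as in Fig. 2.11(b).

```python
import math

def sigmoid_node(x, w, b):
    # One node with a single weighted input plus a bias connection
    # whose activation is the constant 1, weighted by b.
    return 1.0 / (1.0 + math.exp(-(w * x + b * 1.0)))

x = 0.0
for b in (0.5, 1.0, 1.5, 2.0):       # bias weights as in Fig. 2.11(b)
    print(b, round(sigmoid_node(x, w=1.0, b=b), 3))
# Larger bias weights shift the sigmoid to the left, so the output at x = 0 grows;
# this is the shifting effect mentioned above.
```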
$$f(x) = \begin{cases} 1, & x \ge 0.5 \\ 0, & x < 0.5 \end{cases} \qquad (2.3)$$

Equation (2.3) outputs the value 1 for inputs of 0.5 or greater and 0 for all other values. Step functions are also called threshold functions because they only return 1 (true) for values above some given threshold, e.g., according to Fig. 2.7(a). The next phase is to form a "ramp" as in Fig. 2.7(b).

Fig. 2.7 (a) Step activation function; (b) linear threshold between bounds, otherwise 0 or 1.
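A minimal sketch of the step (threshold) activation of Equation (2.3); the threshold value 0.5 comes from the text, while the function name is only illustrative.

```python
def step_activation(x, threshold=0.5):
    # Returns 1 for inputs at or above the threshold, otherwise 0 (Eq. 2.3).
    return 1 if x >= threshold else 0

print([step_activation(v) for v in (-1.0, 0.4, 0.5, 2.0)])  # [0, 0, 1, 1]
```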
(2.4)
The hyperbolic tangent function is also one of the most important activation functions. It is restricted to the range between -1 and 1.

$$f(x) = \tanh(x) = \frac{e^{x} - e^{-x}}{e^{x} + e^{-x}} \qquad (2.5)$$
Teh and Hinton (2000) introduced the rectified linear unit (ReLU). It is simple and is seen as a good choice for hidden layers.

$$f(x) = \max(0, x) \qquad (2.6)$$

The advantage of the rectified linear unit comes partly from the fact that it is a piecewise linear, non-saturating function. Unlike the sigmoid or hyperbolic tangent activation functions, ReLU does not saturate to -1, 0 or 1. See Fig. 2.10. A saturating activation function moves towards and eventually approaches a limit value. For instance, the hyperbolic tangent saturates to -1 as x decreases and to 1 as x increases.

Fig. 2.10 Rectified linear unit activation function.
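The sketch below contrasts the saturating hyperbolic tangent (2.5) with the non-saturating ReLU (2.6) at a few small and large inputs; it is only a numerical illustration of the saturation remark above.

```python
import math

def tanh_act(x):
    return math.tanh(x)            # Eq. (2.5), bounded to (-1, 1)

def relu(x):
    return max(0.0, x)             # Eq. (2.6), unbounded for positive x

for x in (-10.0, -2.0, 0.0, 2.0, 10.0):
    # tanh flattens out near -1 and 1, while ReLU keeps growing with x.
    print(f"x={x:6.1f}  tanh={tanh_act(x):7.4f}  relu={relu(x):5.1f}")
```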
The final activation function is the softmax function. Along with the linear function, softmax is usually found in the output layer of a neural network. The node that has the greatest value claims the input as a member of its class. The softmax activation function is preferable because it forces the output of the neural network to represent the probability that the input falls into each of the classes. Without the softmax, the nodes' outputs are simply numeric values, with the greatest indicating the winning class.

Let us recall the iris data containing flowers from three iris species. When we input a data case to a neural network applying the softmax activation function, this allows the network to give the probability that these measurements belong to each of the three species. For example, the probabilities could be 80%, 15% and 5%. Since these are probabilities, their sum must add up to 100%. Output nodes do not inherently specify the probabilities of the classes. Therefore, softmax is useful because it produces such probabilities. The softmax function is as follows:

$$f(x_i) = \frac{e^{x_i}}{\sum_{j} e^{x_j}}$$
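As a sketch of the softmax output for a three-class (iris-like) case, the raw output values below are made up so that the resulting probabilities come out near the 80%, 15% and 5% mentioned above.

```python
import math

def softmax(values):
    # Standard softmax: exponentiate each output and normalise by the sum,
    # so that the results are positive and add up to 1.
    exps = [math.exp(v) for v in values]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical raw output-node values for the three iris species.
raw_outputs = [2.77, 1.10, 0.0]
probs = softmax(raw_outputs)
print([round(p, 3) for p in probs])   # roughly [0.8, 0.15, 0.05]
print(sum(probs))                     # 1.0 (i.e., 100%)
```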
Fig. 2.11 (a) Sigmoids for weights w in {0.5, 1.0, 1.5, 2.0}: the greater the weight, the steeper the curve; (b) sigmoids for bias b in {0.5, 1.0, 1.5, 2.0} with w = 1.0: the greater the bias, the further left the curve lies, although the shift is not complete since all the curves merge together at the top and bottom left.

2.5 Logic with neural networks

Logical operators can be implemented with neural networks. Let us look at the truth tables of the operators AND, OR and NOT. Neural networks can represent these according to Fig. 2.12.

AND: 0 AND 0 = 0, 0 AND 1 = 0, 1 AND 0 = 0, 1 AND 1 = 1
OR: 0 OR 0 = 0, 0 OR 1 = 1, 1 OR 0 = 1, 1 OR 1 = 1
NOT: NOT 0 = 1, NOT 1 = 0

Fig. 2.12 The logical operators as networks: AND with input weights 1, 1 and bias weight -1.5; OR with input weights 1, 1 and bias weight -0.5; NOT with input weight -1 and bias weight 0.5. For example, AND with inputs 1 and 1: 1*1 + 1*1 + (-1.5) = 0.5 > 0, i.e., true.
In Fig. 2.12 the following function is used,

$$y = f_h\Big(\sum_{i=1}^{p} w_i x_i + b\Big) \qquad (2.9)$$

where $f_h$ is a step function named the Heaviside function, with p variables and bias b,

$$f_h(z) = \begin{cases} 1, & z > 0 \\ 0, & z \le 0 \end{cases} \qquad (2.10)$$

and which produces outputs of either 1 or 0.

From Eq. (2.9) (with $b = w_0$ and $x_0 = 1$) the expression

$$\sum_{i=0}^{p} w_i x_i = 0 \qquad (2.11)$$

can be represented as a line mapped in 2-dimensional (two-variable) Euclidean space to distinguish two separate classes of cases or data points. Vector x represents any case in the variable space. (A situation of two fully separate classes is idealistic and, in fact, not encountered in actual data sets.)

Fig. 2.13 Two distinct sets of cases or patterns (classes A and B) in 2-dimensional space.
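Putting Eq. (2.9), the Heaviside step (2.10) and the weights of Fig. 2.12 together, the sketch below checks the AND, OR and NOT truth tables with single-layer units; the weight values are those read off Fig. 2.12, the function names are mine.

```python
def heaviside(z):
    # Eq. (2.10): output 1 for positive net input, otherwise 0.
    return 1 if z > 0 else 0

def unit(inputs, weights, bias):
    # Eq. (2.9): weighted sum of the inputs plus the bias, then the step.
    return heaviside(sum(w * x for w, x in zip(weights, inputs)) + bias)

AND = lambda p, q: unit((p, q), (1, 1), -1.5)   # Fig. 2.12: weights 1, 1, bias -1.5
OR  = lambda p, q: unit((p, q), (1, 1), -0.5)   # Fig. 2.12: weights 1, 1, bias -0.5
NOT = lambda p:    unit((p,),   (-1,),   0.5)   # Fig. 2.12: weight -1, bias 0.5

for p in (0, 1):
    print("NOT", p, "=", NOT(p))
    for q in (0, 1):
        print(p, "AND", q, "=", AND(p, q), "|", p, "OR", q, "=", OR(p, q))
```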
Exclusive or (XOR)

One can easily find out that XOR is not possible to implement with a single (processing) layer of nodes by looking at Fig. 2.14 and noticing that a single feedforward or perceptron layer can correspond to linear mappings only. Namely, by locating a line in whatever position, it is not possible to distinguish the class of true outputs for {(1,0), (0,1)} from the class of false outputs for {(0,0), (1,1)} by using one line only.

Fig. 2.14 The two classes of XOR, at points (0,1), (1,0) and (0,0), (1,1), cannot be separated with one line.

Using two or more processing layers, XOR (operator ⊕) for inputs p and q,

$$p \oplus q = (p \lor q) \land \lnot (p \land q) \qquad (2.12)$$

can be implemented as depicted in Fig. 2.15.

Fig. 2.15 The two classes of XOR can be separated with more than one processing layer.
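Continuing the earlier sketch, XOR can be checked by composing the single-layer units according to Eq. (2.12); this is one possible wiring consistent with the equation, not necessarily the exact weights of Fig. 2.15.

```python
def heaviside(z):
    return 1 if z > 0 else 0

def unit(inputs, weights, bias):
    return heaviside(sum(w * x for w, x in zip(weights, inputs)) + bias)

def xor(p, q):
    # First processing layer: an OR unit and an AND unit (weights as in Fig. 2.12).
    n1 = unit((p, q), (1, 1), -0.5)    # p OR q
    n2 = unit((p, q), (1, 1), -1.5)    # p AND q
    # Second processing layer: (p OR q) AND NOT (p AND q), cf. Eq. (2.12).
    return unit((n1, n2), (1, -1), -0.5)

for p in (0, 1):
    for q in (0, 1):
        print(p, "XOR", q, "=", xor(p, q))
```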