
Chapter-4

Neural Network

By: Yeshambel A.
Neural Network

The Power of Brain vs. Machine

• The Brain
  – Pattern Recognition
  – Association
  – Complexity
  – Noise Tolerance

• The Machine
  – Calculation
  – Precision
Features of the Brain

• Ten billion (10^10) neurons
• Face recognition in ~0.1 s
• On average, each neuron has several thousand connections
• Hundreds of operations per second
• High degree of parallel computation
• Distributed representations
• Neurons die off frequently (and are never replaced)
Neural Network classifier
⚫ It is represented as a layered set of interconnected processors.
⚫ These processor nodes and their connections resemble the neurons of the brain and their links.
⚫ Each node has a weighted connection to several other nodes in adjacent layers.
⚫ Individual nodes take the input received from connected nodes and use the weights together to compute output values.
⚫ The inputs are fed simultaneously into the input layer.
⚫ The weighted outputs of these units are fed into the hidden layer.
⚫ The weighted outputs of the last hidden layer are inputs to the units making up the output layer.
Neural Networks Applications
There are two basic goals for neural network research:
Brain modelling
• Aid our understanding of how the brain works. This helps us understand the nature of perception, action, learning, memory, thought and intelligence, and/or formulate medical solutions for brain-damaged patients.
Artificial system construction / real-world applications
• Financial modelling – predicting the stock market
• Time series prediction – climate, weather, seizures
• Computer games – intelligent agents, chess, backgammon
• Robotics – autonomous adaptable robots
• Pattern recognition – speech recognition, seismic activity, sonar signals
• Data analysis – data compression, data mining
• Bioinformatics – DNA sequencing, alignment
Architecture of Neural network
⚫ Neural networks are used to look for patterns in data, learn these patterns, and then classify new patterns and make forecasts.
⚫ A network with the input and output layer only is called a single-layer neural network, whereas a multilayer neural network is a generalized one with one or more hidden layers.
⚫ A network containing two hidden layers is called a three-layer neural network, and so on.
A Multilayer Neural Network
⚫ Input layer: corresponds to the input attributes, with normalized attribute values.
  – There are as many nodes as attributes, X = {x1, x2, …, xm}, where m is the number of attributes.
• Hidden layer
  – Neither its input nor its output can be observed from outside.
  – The number of nodes in the hidden layer and the number of hidden layers depend on the implementation.
  – Different numbers of hidden layers and nodes mostly produce different results.
• Output layer: corresponds to the class attribute.
  – There are as many nodes as classes (values of the class attribute).
Multi-layer Perceptron (MLP)
• One of the most popular neural network models is the multi-layer perceptron (MLP).
• In an MLP, neurons are arranged in layers. There is one input layer, one output layer, and several (or many) hidden layers.
Hidden layer: Neuron with Activation
⚫ The neuron is the basic information processing unit of a NN.
⚫ It consists of:
  1. A set of links, describing the neuron inputs, with weights W1, W2, …, Wm.
  2. An adder function (linear combiner) for computing the weighted sum of the inputs (real numbers):

     y = ∑_{j=1}^{m} w_j x_j

  3. An activation function (also called a squashing function) for limiting the output behavior of the neuron.
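As a minimal sketch of this neuron model in Python (the input values, the weights, and the choice of a sigmoid activation are illustrative assumptions, not taken from the slides):

```python
import numpy as np

def neuron(x, w, activation):
    """Compute a neuron's output: activation of the weighted sum of its inputs."""
    y = np.dot(w, x)       # adder function: y = sum_j w_j * x_j
    return activation(y)   # squashing function limits the output

sigmoid = lambda y: 1.0 / (1.0 + np.exp(-y))

x = np.array([0.5, -1.0, 2.0])   # example inputs (assumed values)
w = np.array([0.4, 0.6, -0.1])   # example weights (assumed values)
print(neuron(x, w, sigmoid))     # a single output value in (0, 1)
```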
Activation Functions

(a) Step function or threshold function (hard-limiting)
(b) Sigmoid function: 1/(1 + e^(-x))

⚫ Changing the bias weight W0 moves the threshold location.
⚫ Bias helps the neural network to be more flexible, since it shifts the activation function left or right, centering it on some value other than x = 0. To this effect, an additional node is added to the input layer with a constant input, say 1 or -1. When this constant is multiplied by its weight, it provides a bias to the activation function.
Activation Functions

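To illustrate the bias effect described above, a small sketch with assumed weights: a bias weight of -2 on a constant input of 1 shifts the sigmoid's center from x = 0 to x = 2.

```python
import numpy as np

def sigmoid(y):
    return 1.0 / (1.0 + np.exp(-y))

w, w0 = 1.0, -2.0        # input weight and bias weight (assumed values)
for x in [-1.0, 0.0, 2.0, 4.0]:
    # the constant bias input (1 * w0) shifts the activation's center
    # from x = 0 to x = -w0/w = 2, where the sigmoid crosses 0.5
    print(x, sigmoid(w * x + 1.0 * w0))
```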
Two Topologies of neural network
⚫ A NN can be designed in a feed-forward or recurrent manner.
⚫ In a feed-forward neural network, connections between the units do not form a directed cycle.
⚫ In this network, information moves in only one direction: forward, from the input nodes, through the hidden nodes (if any), to the output nodes. There are no cycles, loops or feedback connections in the network, that is, no connections extending from outputs of units to inputs of units in the same layer or previous layers.
⚫ In recurrent networks, data circulates back and forth until the activation of the units stabilizes.
⚫ Recurrent networks have a feedback loop where data can be fed back into the input at some point before it is fed forward again for further processing and final output.
Training the neural network
⚫ The purpose is to learn to generalize using a set of sample patterns where the desired output is known.
⚫ Back propagation is the most commonly used method for training multilayer feed-forward NNs.
⚫ Back propagation learns by iteratively processing a set of training data (samples).
⚫ For each sample, weights are modified to minimize the error between the desired output and the actual output.
⚫ After propagating an input through the network, the error is calculated and propagated back through the network while the weights are adjusted.
Training Algorithm
⚫ The learning algorithm is as follows:
⚫ Initialize the weights and threshold to small random numbers.
⚫ Present a vector x to the neuron inputs and calculate the output using the adder function:

   y = ∑_{j=1}^{m} w_j x_j

⚫ Apply the activation function (in this case a step function) such that

   y = 0 if y ≤ 0
   y = 1 if y > 0

⚫ Update the weights according to the error:

   W_j = W_j + η (y_T − y) x_j

  where y_T is the target (desired) output.
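Putting these steps together, below is a minimal sketch of the training loop for a single perceptron. The AND dataset, learning rate, and epoch count are assumptions made for illustration:

```python
import numpy as np

rng = np.random.default_rng(0)
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
X = np.hstack([np.ones((4, 1)), X])       # prepend a constant bias input
t = np.array([0, 0, 0, 1], dtype=float)   # targets for the AND function

w = rng.uniform(-0.5, 0.5, size=3)  # initialize weights to small random numbers
eta = 0.1                           # learning rate (assumed)

for epoch in range(20):
    for x, target in zip(X, t):
        y = 1.0 if np.dot(w, x) > 0 else 0.0   # adder + step activation
        w += eta * (target - y) * x            # W_j = W_j + eta*(y_T - y)*x_j

print(w)  # learned weights; should separate the AND function after convergence
```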
Training Multi-layer NN

[Figure sequence: the layers are trained in order – train this layer first, then this layer, then this layer, finally this one.]
Calculating the Error
⚫ Evaluate the predicted output: calculate the error as the difference between the predicted output and the target output of sample n, and pass it to a loss function.

Calculating the Error: Example
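As a concrete example, a common loss function is the squared error. A minimal sketch, with made-up predicted and target values:

```python
import numpy as np

y_pred = np.array([0.8, 0.3])   # network outputs (assumed values)
y_true = np.array([1.0, 0.0])   # desired targets (assumed values)

error = y_pred - y_true            # per-output error
loss = 0.5 * np.sum(error ** 2)    # squared-error loss
print(error, loss)                 # [-0.2  0.3] 0.065
```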
Reducing Error
• The main goal of training is to reduce the error, i.e. the difference between the prediction and the actual output.
• By decomposing the prediction into its basic elements, we find that the weights are the variable elements affecting the prediction value. In other words, to change the prediction value, we need to change the weight values.

How do we change/update the weight values so that the error is reduced?

The answer is Backpropagation!
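Backpropagation updates each weight in the direction that reduces the error, using the chain rule to obtain the gradient. A minimal sketch for a single sigmoid output unit (the inputs, weights, target, and learning rate are all assumed values):

```python
import numpy as np

def sigmoid(y):
    return 1.0 / (1.0 + np.exp(-y))

x = np.array([0.5, 1.0])      # inputs to the unit (assumed)
w = np.array([0.2, -0.4])     # current weights (assumed)
t, eta = 1.0, 0.5             # target output and learning rate (assumed)

y = sigmoid(np.dot(w, x))     # forward pass
# gradient of the loss 0.5*(y - t)^2 w.r.t. w, via the chain rule:
grad = (y - t) * y * (1 - y) * x
w -= eta * grad               # gradient-descent step reduces the error
print(y, w)
```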
Pros and Cons of Neural Network
• Useful for learning complex data like handwriting, speech and image recognition.

Pros
 Can learn more complicated class boundaries
 Fast application
 Can handle a large number of features
 High tolerance to noisy and incomplete data

Cons
 Slow training time: neural networks need a long time for training
 Hard to interpret and understand the learned function (weights)
 Hard to implement: trial and error for choosing the number of nodes

Conclusion: use neural nets only if decision trees fail.
Deep Learning…

What exactly is deep learning?

1. 'Deep Learning' means using a neural network with several layers of nodes between input and output.
2. The series of layers between input and output do feature identification and processing in a series of stages, just as our brains seem to.
Convolutional Neural Networks (CNNs)

• CNNs are a special kind of multi-layer neural network, designed for processing data that has an input shape like a 2D matrix, such as images.

• CNNs are typically used for image detection and classification.

• Images are 2D matrices of pixels on which we run a CNN to either recognize or classify the image.

• Example: identify whether an image is of a human being, a car, or just digits on an address.
Convolutional Neural Network Architecture

• A CNN typically has three kinds of layers:
  • Convolutional layer,
  • Pooling layer, and
  • Fully connected layer.

• The convolutional layer is the core building block of a CNN, and it is where the majority of computation occurs.
• The term convolution refers to the mathematical combination of two functions to produce a third function. It merges two sets of information.
• In the case of a CNN, the convolution is performed on the input data with the use of a filter or kernel to produce a feature map.
Convolution Operation

The convolution layer (CONV) uses filters that perform convolution operations as they scan the input with respect to its dimensions. Its hyperparameters include the filter size and stride. The resulting output is called a feature map / activation map.
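A minimal sketch of the convolution operation described above, with the filter size and stride as explicit hyperparameters (the toy image and kernel values are assumptions for illustration):

```python
import numpy as np

def conv2d(image, kernel, stride=1):
    """Slide the kernel over the image, producing a feature map.
    (Strictly this is cross-correlation, which is what CNN libraries
    implement under the name 'convolution'.)"""
    kh, kw = kernel.shape
    oh = (image.shape[0] - kh) // stride + 1
    ow = (image.shape[1] - kw) // stride + 1
    fmap = np.zeros((oh, ow))
    for i in range(oh):
        for j in range(ow):
            patch = image[i*stride:i*stride+kh, j*stride:j*stride+kw]
            fmap[i, j] = np.sum(patch * kernel)  # elementwise multiply, then sum
    return fmap

image = np.arange(16, dtype=float).reshape(4, 4)   # toy 4x4 "image"
kernel = np.array([[1., 0.], [0., -1.]])           # toy 2x2 filter
print(conv2d(image, kernel, stride=1))             # 3x3 feature map
```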
Pooling Layer
The pooling layer is a mechanism of down-sampling. It is usually appended after convolutional layers to progressively decrease the spatial size of the feature maps.

• Max pooling takes the largest value from the window of the image currently covered by the kernel.
• Average pooling takes the average of all values in the window.
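A minimal sketch of both pooling variants with a 2×2 window and stride 2 (the feature-map values are made up):

```python
import numpy as np

def pool2d(fmap, size=2, stride=2, mode="max"):
    """Down-sample a feature map by taking the max or mean of each window."""
    oh = (fmap.shape[0] - size) // stride + 1
    ow = (fmap.shape[1] - size) // stride + 1
    out = np.zeros((oh, ow))
    reduce = np.max if mode == "max" else np.mean
    for i in range(oh):
        for j in range(ow):
            out[i, j] = reduce(fmap[i*stride:i*stride+size,
                                    j*stride:j*stride+size])
    return out

fmap = np.array([[1., 3., 2., 4.],
                 [5., 6., 1., 2.],
                 [7., 2., 9., 0.],
                 [4., 8., 3., 5.]])
print(pool2d(fmap, mode="max"))   # [[6. 4.] [8. 9.]]
print(pool2d(fmap, mode="avg"))   # [[3.75 2.25] [5.25 4.25]]
```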
The whole CNN

[Figure: input image -> Convolution -> Max Pooling (this pair can repeat many times) -> Flatten -> Fully Connected Feedforward network -> output, e.g. "cat" or "dog"]
Recurrent Neural Networks (RNN)
• A recurrent neural network (RNN) is an extension of a regular feedforward neural network that is able to handle variable-length sequential data and time-series prediction.
• Example: if you want to predict the next word in a sentence, you need to know which words came before it.
• In sequence problems, the output depends on:
  • the current input
  • the previous output
• Example: sequence order is important for part-of-speech (POS) tagging.
• Traditional neural networks cannot capture such relationships.
Typical RNN Architecture

An RNN can be seen as an MLP network with the addition of loops to the architecture.
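The loop corresponds to a hidden state that is fed back at every time step. A minimal sketch of one recurrent step, h_t = tanh(Wx·x_t + Wh·h_(t-1)); the dimensions and the random stand-in weights are illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)
input_dim, hidden_dim = 3, 4
Wx = rng.normal(size=(hidden_dim, input_dim))   # input-to-hidden weights
Wh = rng.normal(size=(hidden_dim, hidden_dim))  # hidden-to-hidden (the loop)

def rnn_step(x_t, h_prev):
    """One recurrent step: the new state mixes current input and previous state."""
    return np.tanh(Wx @ x_t + Wh @ h_prev)

h = np.zeros(hidden_dim)                  # initial hidden state
sequence = [rng.normal(size=input_dim) for _ in range(5)]
for x_t in sequence:                      # process a variable-length sequence
    h = rnn_step(x_t, h)
print(h)                                  # final state summarizes the sequence
```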
RNN Example: Guess part of speech (POS)

RNN Example: Sentiment Analysis

Recurrent Neural Networks: Process Sequences

• e.g. Image Captioning: image -> sequence of words
• e.g. Sentiment Classification: sequence of words -> sentiment
• e.g. Machine Translation: seq of words -> seq of words
• e.g. Video classification on frame level
RNN Applications
• Natural language processing
  • E.g. given a sequence of words, an RNN predicts the probability of the next word given the previous ones.
• Machine translation: similar to language modelling
  • E.g. Google translator (English to Amharic)
• Speech recognition
  • Given a sequence of acoustic signals as input, produce phonetic segments as output.
• Image tagging: RNN + CNN jointly trained
  • The CNN generates features (hidden state representation).
  • The RNN reads the CNN features and produces the output (end-to-end training).
• Time series prediction: forecast of future values in a time series from past seen values.
  • e.g. weather forecasts, financial time series
THANK YOU
