
BIT 33603 : DATA MINING

LECTURE 7 : NEURAL NETWORKS


A Neural Network is a set of connected INPUT/OUTPUT UNITS, where each connection has a WEIGHT associated with it.

Neural Network learning is also called CONNECTIONIST learning, due to the connections between units.

A Neural Network learns by adjusting the weights so as to correctly classify the training data and hence, after the testing phase, to classify unknown data.
[Figure: a single unit. Inputs 2.7, -8.6 and 0.002 arrive on connections with weights w1 = -0.06, w2 = -2.5 and w3 = 1.4; the unit computes the weighted sum x = (-0.06)(2.7) + (-2.5)(-8.6) + (1.4)(0.002) = 21.34 and passes it through an activation function f(x).]
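To make the figure concrete, here is a minimal sketch of that single-unit computation (the slide leaves f(x) unspecified; a sigmoid is assumed here, and numpy is assumed available):

```python
import numpy as np

# Inputs and weights taken from the figure above.
inputs = np.array([2.7, -8.6, 0.002])
weights = np.array([-0.06, -2.5, 1.4])

x = np.dot(weights, inputs)      # weighted sum: approx. 21.34
f_x = 1.0 / (1.0 + np.exp(-x))   # assumed sigmoid activation; f(x) is close to 1.0
print(x, f_x)
```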
[Figure: perceptron architectures — the Perceptron, the Single Layer Perceptron (SLP), and the Multilayer Perceptron (MLP).]
• INPUT: records without the class attribute, with normalized attribute values.

• INPUT VECTOR: X = {x1, x2, …, xn}, where n is the number of (non-class) attributes.

• INPUT LAYER – there are as many nodes as non-class attributes, i.e. as the length of the input vector.

• HIDDEN LAYER – the number of nodes in the hidden layer and the number of hidden layers depends on the implementation.

• OUTPUT LAYER – corresponds to the class attribute; there are as many nodes as classes (values of the class attribute), k = 1, 2, …, #classes.
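As an illustrative sketch only, here is how those layer sizes follow from the data, with counts chosen to match the example dataset used later in this lecture (numpy assumed):

```python
import numpy as np

n_attributes = 3   # INPUT LAYER: one node per non-class attribute (x1, x2, x3)
n_hidden = 2       # HIDDEN LAYER: size is an implementation choice
n_classes = 2      # OUTPUT LAYER: one node per value of the class attribute

# Every connection between consecutive layers carries one weight.
rng = np.random.default_rng(seed=0)
W_input_to_hidden = rng.uniform(-1, 1, size=(n_attributes, n_hidden))
W_hidden_to_output = rng.uniform(-1, 1, size=(n_hidden, n_classes))
```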
Neural Network Learning
TWO MAJOR PROCESSES ARE INVOLVED:

1) Forward propagation
• The inputs are fed simultaneously into the input layer.
• The weighted outputs of these units are fed into the hidden layer.
• The weighted outputs of the last hidden layer are the inputs to the units making up the output layer.

2) Back-propagation
• Back-propagation learns by iteratively processing a set of training data (samples).
• For each sample, the weights are modified to minimize the error between the network's classification and the actual classification.
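A minimal sketch of forward propagation under these rules, assuming sigmoid units and a single hidden layer (variable names are illustrative):

```python
import numpy as np

def sigmoid(I):
    return 1.0 / (1.0 + np.exp(-I))

def forward(x, W_h, b_h, W_o, b_o):
    """Feed one input vector through the network, layer by layer."""
    O_h = sigmoid(x @ W_h + b_h)    # weighted inputs into the hidden layer
    O_o = sigmoid(O_h @ W_o + b_o)  # hidden outputs feed the output layer
    return O_h, O_o
```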
Steps in training the MLP
• STEP ONE: initialize the weights and biases.

• The weights in the network are initialized to random numbers from the interval [-1,1] or [0,1]

• Each unit has a BIAS associated with it

• The biases are similarly initialized to random numbers from the interval [-1,1].

• STEP TWO: feed the training sample.

• STEP THREE: Propagate the inputs forward; we compute the net input and output of each unit in the hidden and output layers.

• STEP FOUR: back propagate the error.

• STEP FIVE: update weights and biases to reflect the propagated errors.

• STEP SIX: check the terminating conditions.
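Putting the six steps together, here is a minimal sketch of one training pass for a single sample, assuming sigmoid units (names are illustrative; the error formulas used here are derived in the Backpropagation Formulas section below):

```python
import numpy as np

rng = np.random.default_rng(seed=1)

def init_network(n_in, n_hidden, n_out):
    """STEP ONE: weights and biases initialized randomly in [-1, 1]."""
    return {"W_h": rng.uniform(-1, 1, (n_in, n_hidden)),
            "b_h": rng.uniform(-1, 1, n_hidden),
            "W_o": rng.uniform(-1, 1, (n_hidden, n_out)),
            "b_o": rng.uniform(-1, 1, n_out)}

def train_step(net, x, target, lr):
    """STEPS TWO to FIVE for one training sample."""
    # STEP THREE: propagate the inputs forward.
    O_h = 1 / (1 + np.exp(-(x @ net["W_h"] + net["b_h"])))
    O_o = 1 / (1 + np.exp(-(O_h @ net["W_o"] + net["b_o"])))
    # STEP FOUR: back-propagate the error.
    err_o = O_o * (1 - O_o) * (target - O_o)
    err_h = O_h * (1 - O_h) * (net["W_o"] @ err_o)
    # STEP FIVE: update weights and biases to reflect the propagated errors.
    net["W_o"] += lr * np.outer(O_h, err_o)
    net["b_o"] += lr * err_o
    net["W_h"] += lr * np.outer(x, err_h)
    net["b_h"] += lr * err_h
    return O_o
```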


Terminating Conditions
• When to stop the training:
• all Δwij in the previous epoch are below some threshold, or
• the percentage of samples misclassified in the previous epoch is below some threshold, or
• a pre-specified number of epochs has expired.

• In practice, several hundred thousand epochs may be required before the weights converge.

• Training a neural network with the backpropagation learning algorithm usually requires that the whole input set (one full presentation of it is called an epoch) be presented many times. For example, the ANN may need hundreds to thousands of epochs.
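A sketch of a training loop with these three stopping rules, reusing the train_step sketch from above (the thresholds are illustrative choices, not prescribed values):

```python
def train(net, samples, targets, lr=0.9, max_epochs=500_000,
          w_threshold=1e-6, err_threshold=0.01):
    for epoch in range(max_epochs):               # rule 3: epoch budget expires
        old = {k: v.copy() for k, v in net.items()}
        misclassified = 0
        for x, t in zip(samples, targets):        # one epoch = every sample once
            out = train_step(net, x, t, lr)
            misclassified += int((out.round() != t).any())
        max_delta_w = max(abs(net[k] - old[k]).max() for k in net)
        if max_delta_w < w_threshold:             # rule 1: all delta w_ij tiny
            break
        if misclassified / len(samples) < err_threshold:  # rule 2: error rate low
            break
```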
Backpropagation Formulas

(Read the network from the input vector x_i at the input nodes, through the hidden nodes, to the output nodes that emit the output vector.)

Net input and output of a unit j:
  I_j = Σ_i w_ij O_i + θ_j
  O_j = 1 / (1 + e^(-I_j))

Error at an output node k (T_k is the target output):
  Err_k = O_k (1 - O_k)(T_k - O_k)

Error at a hidden node j:
  Err_j = O_j (1 - O_j) Σ_k Err_k w_jk

Weight and bias updates (l is the learning rate):
  w_ij = w_ij + (l) Err_j O_i
  θ_j = θ_j + (l) Err_j
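The same formulas transcribed directly into Python, as a sketch with explicit subscripts mirroring the i, j, k indices above (the helper names are invented for illustration):

```python
def output_unit_error(O_k, T_k):
    """Err_k = O_k (1 - O_k)(T_k - O_k) for an output node k."""
    return O_k * (1 - O_k) * (T_k - O_k)

def hidden_unit_error(O_j, downstream):
    """Err_j = O_j (1 - O_j) * sum_k Err_k w_jk; `downstream` holds (Err_k, w_jk) pairs."""
    return O_j * (1 - O_j) * sum(err_k * w_jk for err_k, w_jk in downstream)

def updated_weight(w_ij, l, err_j, O_i):
    """w_ij <- w_ij + (l) Err_j O_i."""
    return w_ij + l * err_j * O_i

def updated_bias(theta_j, l, err_j):
    """theta_j <- theta_j + (l) Err_j."""
    return theta_j + l * err_j
```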
A dataset:

Inputs class
1.4 2.7 1.9 0
3.8 3.4 3.2 0
6.4 2.8 1.7 1
4.1 0.1 0.2 0
etc …
Training the neural network
(using the training data above; the slides animate one network over the following steps)

• Initialise with random weights.
• Present a training pattern, e.g. inputs 1.4, 2.7, 1.9 (class 0).
• Feed it through to get the output, e.g. 0.8.
• Compare with the target output: the target is 0, so the error is 0.8.
• Adjust the weights based on the error.
• Present the next training pattern, e.g. inputs 6.4, 2.8, 1.7 (class 1).
• Feed it through to get the output, e.g. 0.9; compare with the target 1: the error is -0.1; adjust the weights.
• And so on …

Repeat this thousands, maybe millions of times – each time taking a random training instance, and making slight weight adjustments.

Algorithms for weight adjustment are designed to make changes that will reduce the error.
The decision boundary perspective…

[Figure sequence: starting from initial random weights, the decision boundary sits at a random position; each time a training instance is presented and the weights are adjusted, the boundary shifts slightly; eventually it separates the two classes.]
The points are:
• weight-learning algorithms for NNs are a slow, tedious process;

• they work by making thousands and thousands of tiny adjustments, each making the network do better on the most recent pattern, but perhaps a little worse on many others;

• but eventually this tends to be good enough to learn effective classifiers for many real applications.
Some other points

If f(x) is non-linear, a network with one hidden layer can, in theory, learn any classification problem perfectly: a set of weights exists that can produce the targets from the inputs. The problem is finding them.
Some other 'by the way' points
• If f(x) is linear, the NN can only draw straight decision boundaries (like a single-layer perceptron, SLP).
• NNs use nonlinear f(x), so they can draw complex boundaries while keeping the data unchanged.
• SVMs, by contrast, only draw straight lines, but they transform the data first in a way that makes that OK.
So how are the NN weights adjusted?
Example of Backpropagation
Input = 3, Hidden neurons = 2, Output = 1

Initialize the weights and biases to random numbers from -1.0 to 1.0.

Initial input and weights:

x1  x2  x3  w14  w15   w24  w25  w34   w35  w46   w56
1   0   1   0.2  -0.3  0.4  0.1  -0.5  0.2  -0.3  -0.2

Initial biases:

θ4    θ5   θ6
-0.4  0.2  0.1


Net Input and Output Calculation

Unit j   Net input I_j                                 Output O_j
4        0.2 + 0 - 0.5 - 0.4 = -0.7                    O_4 = 1 / (1 + e^(0.7)) = 0.332
5        -0.3 + 0 + 0.2 + 0.2 = 0.1                    O_5 = 1 / (1 + e^(-0.1)) = 0.525
6        (-0.3)(0.332) - (0.2)(0.525) + 0.1 = -0.105   O_6 = 1 / (1 + e^(0.105)) = 0.475
Calculation of the Error at Each Node

Err_k = O_k (1 - O_k)(T_k - O_k)
Err_j = O_j (1 - O_j) Σ_k Err_k w_jk

We assume the target output T_6 = 1, where T is the target output and O the current output.

Unit j   Error_j
6        0.475 (1 - 0.475)(1 - 0.475) = 0.1311
5        0.525 × (1 - 0.525) × 0.1311 × (-0.2) = -0.0065
4        0.332 × (1 - 0.332) × 0.1311 × (-0.3) = -0.0087
Calculation of Weight and Bias Updates

Let the learning rate be l = 0.9, and apply w_ij = w_ij + (l) Err_j O_i and θ_j = θ_j + (l) Err_j:

Weight/bias   New value
w46           -0.3 + (0.9)(0.1311)(0.332) = -0.261
w56           -0.2 + (0.9)(0.1311)(0.525) = -0.138
w14           0.2 + (0.9)(-0.0087)(1) = 0.192
w15           -0.3 + (0.9)(-0.0065)(1) = -0.306
θ6            0.1 + (0.9)(0.1311) = 0.218
…and similarly for the remaining weights and biases.
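A short script, as a sketch, that reproduces these numbers end to end (all values are taken from the tables above; numpy is assumed):

```python
import numpy as np

def sigmoid(I):
    return 1.0 / (1.0 + np.exp(-I))

# Initial inputs, weights and biases from the tables above.
x = np.array([1.0, 0.0, 1.0])         # x1, x2, x3
W_h = np.array([[0.2, -0.3],          # w14, w15
                [0.4, 0.1],           # w24, w25
                [-0.5, 0.2]])         # w34, w35
W_o = np.array([-0.3, -0.2])          # w46, w56
theta_h = np.array([-0.4, 0.2])       # theta4, theta5
theta_o = 0.1                         # theta6
T, l = 1.0, 0.9                       # target output and learning rate

# Forward pass: O4 = 0.332, O5 = 0.525, O6 = 0.475 (approx.)
O_h = sigmoid(x @ W_h + theta_h)
O_o = sigmoid(O_h @ W_o + theta_o)

# Errors: Err6 = 0.1311, Err5 = -0.0065, Err4 = -0.0087 (approx.)
err_o = O_o * (1 - O_o) * (T - O_o)
err_h = O_h * (1 - O_h) * (W_o * err_o)

# Updates: w46 = -0.261, w56 = -0.138, w14 = 0.192, w15 = -0.306, theta6 = 0.218
W_o += l * err_o * O_h
theta_o += l * err_o
W_h += l * np.outer(x, err_h)
theta_h += l * err_h
print(O_h, O_o, err_o, err_h)
```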
Developing a Neural Network-Based System

Applications
• Forecasting/market prediction: finance and banking
• Manufacturing: quality control, fault diagnosis
• Medicine: analysis of electrocardiogram data, RNA & DNA sequencing, drug development without animal testing
• Control: process control, robotics
Time Series Prediction
• Time series prediction: given an existing data series, we observe or model the series to make accurate forecasts (a data-preparation sketch follows below).

• Example time series:
• Financial (e.g., stocks, exchange rates)
• Physically observed (e.g., weather, sunspots, river flow)

• Why is it important?
• Preventing undesirable events by forecasting the event, identifying the circumstances preceding it, and taking corrective action so the event can be avoided (e.g., an inflationary economic period)
• Forecasting undesirable, yet unavoidable, events to preemptively lessen their impact (e.g., solar maximum w/ sunspots)
• Profiting from forecasting (e.g., financial markets)
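A minimal sketch of how a series is commonly prepared for an NN forecaster: a sliding window of past values becomes the input vector and the next value becomes the target (the window length and the sample values are illustrative):

```python
import numpy as np

def make_windows(series, window=4):
    """Turn a 1-D series into (inputs, targets) pairs for supervised training."""
    X = np.array([series[i:i + window] for i in range(len(series) - window)])
    y = np.array(series[window:])   # the value right after each window
    return X, y

# e.g. monthly observations; each row of X is used to predict the matching y
X, y = make_windows([112, 118, 132, 129, 121, 135, 148, 148, 136], window=4)
```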
