
NN Models & Architecture of NN

CSE-4619
Machine Learning

~ from the lecture of Noureddin Sadawi


McCulloch-Pitts Model
In 1943, the neurophysiologist Warren McCulloch and the logician Walter
Pitts published the first paper describing what we would now call a neural
network. Their "neurons" operated under the following assumptions:
➔ They are binary devices (Vi ∈ {0, 1}).
➔ Each neuron has a fixed threshold, θ.
➔ The neuron receives inputs from excitatory synapses, all having
identical weights.
➔ Inhibitory inputs have an absolute veto power over any excitatory
inputs.
➔ At each time step the neurons are simultaneously (synchronously)
updated by summing the weighted excitatory inputs and setting the
output (Vi) to 1 iff the sum is greater than or equal to the threshold
AND the neuron receives no inhibitory input (see the sketch below).
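As a concrete illustration, here is a minimal sketch of such a neuron in
Python; the function name and argument layout are our own choices, not the
original paper's notation:

def mp_neuron(excitatory, inhibitory, theta):
    """McCulloch-Pitts unit: binary inputs, fixed threshold theta.
    Fires (returns 1) iff the summed excitatory inputs reach the
    threshold AND no inhibitory input is active (absolute veto)."""
    if any(inhibitory):                       # inhibitory veto
        return 0
    return 1 if sum(excitatory) >= theta else 0

# Example: a 2-input AND unit (theta = 2, no inhibitory inputs)
assert mp_neuron([1, 1], [], theta=2) == 1
assert mp_neuron([1, 0], [], theta=2) == 0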
McCulloch-Pitts Model

XOR cannot be represented with a single neuron, but the relationship

XOR = (V1 OR V2) AND NOT (V1 AND V2)

suggests that it can be represented with the network below:

[Figure: a small two-layer network of McCulloch-Pitts units computing XOR
from OR and AND sub-units.]
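As a quick check, here is a sketch of this construction in Python, reusing
the mp_neuron function from the sketch above; the network layout is the
standard OR/AND decomposition just stated:

# XOR = (V1 OR V2) AND NOT (V1 AND V2), assembled from three units
def mp_xor(v1, v2):
    v_or  = mp_neuron([v1, v2], [], theta=1)  # OR unit
    v_and = mp_neuron([v1, v2], [], theta=2)  # AND unit
    # Output unit: the AND result arrives on an inhibitory (veto) synapse
    return mp_neuron([v_or], [v_and], theta=1)

assert [mp_xor(a, b) for a, b in [(0, 0), (0, 1), (1, 0), (1, 1)]] == [0, 1, 1, 0]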
The Perceptron vs the McCulloch-Pitts Model
The next major advance was the perceptron, introduced by Frank
Rosenblatt in his 1958 paper. The perceptron differed from the
McCulloch-Pitts neuron in the following ways:
➔ The weights and thresholds were not all identical.
➔ Weights can be positive or negative.
➔ There is no absolute inhibitory synapse.
➔ Although the neurons were still two-state, the output function f(u)
takes values in {-1, 1} rather than {0, 1}. (This is no big deal, as a
suitable change in the threshold lets you transform from one convention
to the other.)
➔ Most importantly, there was a learning rule (sketched below).
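A minimal sketch of that learning rule in Python, assuming the {-1, +1}
output convention with the threshold folded into a bias weight (the function
name and hyperparameters are our own choices):

import numpy as np

def perceptron_train(X, y, eta=0.1, epochs=100):
    """Rosenblatt's learning rule: nudge the weights whenever a training
    pattern is misclassified. X: inputs (n_samples, n_features),
    y: targets in {-1, +1}. A bias weight w[0] replaces the threshold."""
    w = np.zeros(X.shape[1] + 1)
    for _ in range(epochs):
        errors = 0
        for xi, target in zip(X, y):
            pred = 1 if w[0] + xi @ w[1:] >= 0 else -1
            if pred != target:               # update only on mistakes
                w[0]  += eta * target        # bias update
                w[1:] += eta * target * xi
                errors += 1
        if errors == 0:                      # all patterns classified correctly
            break
    return w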
Decision Boundaries for AND and OR

We can now plot the decision boundaries of our logic gates


AND: w1 = 1, w2 = 1, θ = 1.5

I1  I2  out
0   0   0
0   1   0
1   0   0
1   1   1

OR: w1 = 1, w2 = 1, θ = 0.5

I1  I2  out
0   0   0
0   1   1
1   0   1
1   1   1

[Figure: the four input points (0, 0), (0, 1), (1, 0), (1, 1) plotted in
the I1-I2 plane; for each gate a single straight line separates the inputs
with output 0 from those with output 1.]
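These weights can be checked directly; a minimal sketch in Python reproduces
both truth tables:

# One threshold unit with w1 = w2 = 1; only theta differs between the gates
def gate(i1, i2, theta):
    return 1 if 1 * i1 + 1 * i2 >= theta else 0

for i1, i2 in [(0, 0), (0, 1), (1, 0), (1, 1)]:
    print(i1, i2, gate(i1, i2, 1.5), gate(i1, i2, 0.5))  # inputs, AND, OR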
Decision Boundary for XOR

The difficulty in dealing with XOR is rather obvious. We need two straight
lines to separate the different outputs/decisions:
XOR

I1  I2  out
0   0   0
0   1   1
1   0   1
1   1   0

[Figure: the four input points in the I1-I2 plane; XOR's 0 and 1 outputs
cannot be separated by any single straight line.]

Solution: either change the transfer function so that it has more than one
decision boundary, or use a more complex network that is able to generate
more complex decision boundaries.
ANN Architectures
Mathematically, ANNs can be represented as weighted directed graphs. The
most common ANN architectures are:

Single-Layer Feed-Forward NNs: One input layer and one output layer of
processing units. No feedback connections (e.g. a Perceptron)

Multi-Layer Feed-Forward NNs: One input layer, one output layer, and one or
more hidden layers of processing units. No feedback connections (e.g. a
Multi-Layer Perceptron)

Recurrent NNs: Any network with at least one feedback connection. It may, or
may not, have hidden units

Further interesting variations include: sparse connections, time-delayed
connections, moving windows, …
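To make the distinction concrete, here is a minimal NumPy sketch of the two
feed-forward variants; the layer sizes, random weights, and activation
choices are arbitrary assumptions for illustration:

import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(size=3)                  # one input pattern with 3 features

# Single-layer feed-forward: input units connect straight to output units
W = rng.normal(size=(2, 3))             # 2 output units, 3 inputs
y_single = np.where(W @ x >= 0, 1, 0)   # threshold outputs, no feedback

# Multi-layer feed-forward: one hidden layer in between, still no feedback
W_hidden = rng.normal(size=(4, 3))      # 4 hidden units
W_output = rng.normal(size=(2, 4))
h = np.tanh(W_hidden @ x)               # hidden activations
y_multi = np.where(W_output @ h >= 0, 1, 0)

# A recurrent network would additionally feed some outputs back as inputs
# on the next time step (not shown here).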

Examples of Network Architectures

[Figure: three example architectures, from left to right: a single-layer
feed-forward network, a multi-layer feed-forward network, and a recurrent
network.]

Example: A Classification Task

A typical neural network application is classification. Consider the simple
example of classifying trucks given their masses and lengths:

Mass  Length  Class
10.0  6       Lorry
20.0  5       Lorry
5.0   4       Van
2.0   5       Van
2.0   5       Van
3.0   6       Lorry
10.0  7       Lorry
15.0  8       Lorry
5.0   9       Lorry
How do we construct a neural network that can correctly classify any lorry
or van?
Cookbook Recipe for Building Neural Networks
Formulating neural network solutions for particular problems is a multi-stage
process:
1. Understand and specify the problem in terms of inputs and required
outputs
2. Take the simplest form of network you think might be able to solve your
problem
3. Try to find the appropriate connection weights (including neuron
thresholds) so that the network produces the right outputs for each input
in its training data
4. Make sure that the network works on its training data and test its
generalization by checking its performance on new testing data
5. If the network doesn’t perform well enough, go back to stage 3 and try
harder
6. If the network still doesn’t perform well enough, go back to stage 2 and
try harder
7. If the network still doesn’t perform well enough, go back to stage 1 and
try harder
8. Problem solved – or not

Building a Neural Network (stages 1 & 2)

For our truck example, our inputs can be direct encodings of the masses and
lengths. Generally we would have one output unit for each class, with
activation 1 for ‘yes’ and 0 for ‘no’. In our two-class example, a single
output unit suffices: activation 1 corresponds to ‘lorry’ and 0 to ‘van’
(or vice versa). The simplest network we should try first is the
single-layer Perceptron. We can further simplify things by replacing the
threshold with an extra weight w0 on a constant input of 1, as we discussed
before. This gives us:

Class = sgn(w0 + w1·Mass + w2·Length)

[Figure: a single-layer Perceptron with a constant input 1 and inputs Mass
and Length connected to the output unit Class by weights w0, w1, and w2.]
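As a sketch, here is this decision function in Python with one hand-picked
set of weights; these particular values (w0 = -14, w1 = 1, w2 = 2) are our
own illustration, not from the lecture, but they do separate the nine
training examples above:

def classify(mass, length, w0=-14.0, w1=1.0, w2=2.0):
    """Class = sgn(w0 + w1*Mass + w2*Length); +1 -> Lorry, -1 -> Van."""
    return 1 if w0 + w1 * mass + w2 * length >= 0 else -1

assert classify(20.0, 5) == 1    # Lorry: -14 + 20 + 10 = 16 >= 0
assert classify(2.0, 5) == -1    # Van:   -14 +  2 + 10 = -2 <  0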

Training the Neural Network (stage 3)
Whether our neural network is a simple Perceptron or a much more
complicated multi-layer network, we need to develop a systematic procedure
for determining appropriate connection weights.

The common procedure is to have the network learn the appropriate weights
from a representative set of training data.

For classification, a simple Perceptron uses decision boundaries (lines or
hyperplanes), which it shifts around until each training pattern is
correctly classified.

The process of “shifting around” in a systematic way is called learning.

The learning process can then be divided into a number of small steps, as
the sketch below illustrates.
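A minimal sketch of this learning process applied to the truck data, reusing
the perceptron_train function from the earlier learning-rule sketch (the
learning rate and epoch budget are arbitrary assumptions):

import numpy as np

# Truck training data: (Mass, Length) -> +1 for Lorry, -1 for Van
X = np.array([[10.0, 6], [20.0, 5], [5.0, 4], [2.0, 5], [2.0, 5],
              [3.0, 6], [10.0, 7], [15.0, 8], [5.0, 9]])
y = np.array([1, 1, -1, -1, -1, 1, 1, 1, 1])

w = perceptron_train(X, y, eta=0.1, epochs=10_000)

# Because this data is linearly separable (e.g. Mass + 2*Length - 14
# splits it), the perceptron rule should eventually classify every
# training pattern correctly; increase epochs if it has not converged.
preds = np.where(w[0] + X @ w[1:] >= 0, 1, -1)
print((preds == y).mean())   # expect 1.0 once training has converged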

