
AIN-3001

Machine Learning
Introduction to ANN

Dr. Fatih KAHRAMAN


[email protected]
Module Content
Core of Deep Learning: ANNs
> ANNs
– Versatile, Powerful, Scalable
e.g.
– classifying billions of images → Google Images
– speech recognition service → Apple’s Siri
– recommending the best videos → YouTube
– beating the world champion at Go → DeepMind's AlphaGo
These strengths make ANNs ideal for tackling large and highly
complex Machine Learning tasks.
Module Content
> Introduction to artificial neural networks
– quick tour of the very first ANN architectures
– Multi-Layer Perceptrons (MLPs)
> implement neural networks using the popular Keras API
– beautifully designed and simple high-level API for
building, training, evaluating and running neural
networks
Let’s go back in time to see how artificial neural networks
came to be!
From Biological to Artificial Neurons

> Look at the brain's architecture for inspiration on how to
build an intelligent machine.
> This is the key idea that sparked Artificial Neural Networks
(ANNs).
From Biological to Artificial Neurons
> Biological Neurons
Let’s take a quick look at a biological neuron

Biological neuron
From Biological to Artificial Neurons
> Biological Neurons

Multiple layers in a biological neural network (human cortex)


From Biological to Artificial Neurons
> ANNs were first introduced back in 1943 by the neurophysiologist
Warren McCulloch and the mathematician Walter Pitts.
> The first artificial neural network architecture
– a simplified computational model of how biological neurons might
perform complex computations using propositional logic.
> Until the 1960s there was a widespread belief that we would soon be
conversing with truly intelligent machines. When it became clear
that this promise would go unfulfilled, funding flew elsewhere
and ANNs entered a long winter.
> In the early 1980s there was a revival of interest in
connectionism, as new architectures were invented and better
training techniques were developed. But progress was slow.
> By the 1990s, other powerful Machine Learning techniques, such as
Support Vector Machines (SVMs), were invented; once again
the study of neural networks entered a long winter.
From Biological to Artificial Neurons
Many of the core concepts for deep learning were in place
by the 1980s and 1990s, so what has happened in recent years
that changed things?
Massive Labeled Data Sets and GPU Computing
> Appearance of large, high-quality labeled datasets
> Massively parallel computing with GPUs
> Backprop-friendly activation functions
> Improved architectures
> New regularization techniques
> Robust optimizers
From Biological to Artificial Neurons
> Scalable?
– Biological: grow a bigger brain
– Artificial: add more neurons and layers
From Biological to Artificial Neurons
> Biological Neurons vs. Artificial Neurons

Artificial Neuron vs. Biological Neuron


From Biological to Artificial Neurons
> A given input is perceived at multiple levels of abstraction:
edges, corners and contours, shapes, and object parts, up to
the whole object.

The signal path from the retina to human lateral occipital cortex (LOC)
which finally recognizes the object.
Figure credit to Jonas Kubilius
From Biological to Artificial Neurons
> A given input is perceived at multiple levels of abstraction:
edges, corners and contours, shapes, and object parts, up to
the whole object.

Facial image response to Gabor Filters


From Biological to Artificial Neurons
> Convolution operation and Filter response
Artificial Neural Nets - Frameworks
> Keras.js Demo
– Interactive Keras demonstration in the web browser

Run Keras models (TensorFlow backend) in the browser, with GPU
support, and investigate all layers visually.
Artificial Neural Nets - Frameworks
> Basic Principles
– Convolution and Max-Pooling Operation

Illustrations of convolution and max-pooling operation:


(a) convolutional operation; and (b) max-pooling operation.
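To make the two operations concrete, here is a minimal NumPy sketch of a valid (no padding) convolution and a non-overlapping max-pooling step; the image size and kernel are illustrative assumptions, not taken from the slides.

```python
import numpy as np

def conv2d(image, kernel):
    """Valid 2D convolution (really cross-correlation, as used in CNNs)."""
    kh, kw = kernel.shape
    oh, ow = image.shape[0] - kh + 1, image.shape[1] - kw + 1
    out = np.zeros((oh, ow))
    for i in range(oh):
        for j in range(ow):
            out[i, j] = np.sum(image[i:i + kh, j:j + kw] * kernel)
    return out

def max_pool2d(feature_map, size=2):
    """Non-overlapping max-pooling with a size x size window."""
    h, w = feature_map.shape
    h, w = h - h % size, w - w % size          # drop ragged edges
    pooled = feature_map[:h, :w].reshape(h // size, size, w // size, size)
    return pooled.max(axis=(1, 3))

image = np.arange(36, dtype=float).reshape(6, 6)
edge_kernel = np.array([[1., 0., -1.],
                        [1., 0., -1.],
                        [1., 0., -1.]])        # simple vertical-edge detector
fmap = conv2d(image, edge_kernel)              # shape (4, 4)
print(max_pool2d(fmap))                        # shape (2, 2)
```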
Artificial Neural Nets - Frameworks
> Basic Principles
– Filters (Gabor kernels)

Gabor filters for 8 orientations and 5 wavelengths
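A short sketch of how such a filter bank can be generated and applied with OpenCV's cv2.getGaborKernel; the kernel size, sigma, gamma values and the file name face.jpg are illustrative assumptions.

```python
import numpy as np
import cv2  # OpenCV

# Build a bank of Gabor kernels: 8 orientations x 5 wavelengths
# (parameter values here are illustrative, not taken from the slides).
orientations = [i * np.pi / 8 for i in range(8)]
wavelengths = [4, 6, 8, 10, 12]              # lambda, in pixels

kernels = []
for theta in orientations:
    for lambd in wavelengths:
        kernel = cv2.getGaborKernel(
            ksize=(31, 31), sigma=4.0, theta=theta,
            lambd=lambd, gamma=0.5, psi=0)
        kernels.append(kernel)

# Filter responses for a grayscale image, one per kernel
image = cv2.imread("face.jpg", cv2.IMREAD_GRAYSCALE)
responses = [cv2.filter2D(image, cv2.CV_32F, k) for k in kernels]
```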


From Biological to Artificial Neurons
> Logical Computations with Neurons
Warren McCulloch and Walter Pitts proposed a very simple model of the
biological neuron, which later became known as an artificial neuron:
it has one or more binary (on/off) inputs and one binary output.

ANNs performing simple logical computations
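A minimal sketch of this idea, assuming the classic McCulloch-Pitts setup in which a neuron fires when enough of its binary inputs are active (and any active inhibitory input silences it):

```python
def mp_neuron(excitatory, threshold, inhibitory=()):
    """Binary neuron: fires (1) iff the number of active excitatory inputs
    reaches the threshold and no inhibitory input is active."""
    if any(inhibitory):
        return 0
    return int(sum(excitatory) >= threshold)

def AND(a, b):         return mp_neuron([a, b], threshold=2)
def OR(a, b):          return mp_neuron([a, b], threshold=1)
def A_AND_NOT_B(a, b): return mp_neuron([a], threshold=1, inhibitory=[b])

for a in (0, 1):
    for b in (0, 1):
        print(a, b, "AND:", AND(a, b), "OR:", OR(a, b),
              "A AND NOT B:", A_AND_NOT_B(a, b))
```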


From Biological to Artificial Neurons
> The Perceptron
From Biological to Artificial Neurons
> The Perceptron
The Perceptron is one of the simplest ANN architectures, invented in
1957 by Frank Rosenblatt. It is based on a slightly different artificial
neuron called a threshold logic unit (TLU), or sometimes a linear
threshold unit (LTU):

Common step functions


Threshold logic unit
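The equations behind these figures, reconstructed from the standard TLU formulation (the figures themselves are not reproduced here): the TLU computes a weighted sum of its inputs and then applies a step function.

```latex
z = w_1 x_1 + w_2 x_2 + \dots + w_n x_n = \mathbf{w}^\top \mathbf{x},
\qquad
h_{\mathbf{w}}(\mathbf{x}) = \operatorname{step}(z)

% Common step functions
\operatorname{heaviside}(z) =
\begin{cases} 0 & \text{if } z < 0 \\ 1 & \text{if } z \ge 0 \end{cases}
\qquad
\operatorname{sgn}(z) =
\begin{cases} -1 & \text{if } z < 0 \\ 0 & \text{if } z = 0 \\ +1 & \text{if } z > 0 \end{cases}
```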
From Biological to Artificial Neurons
> The Perceptron
A Perceptron is simply composed of a single layer of TLUs with each
TLU connected to all the inputs.
Fully Connected Layer (Dense Layer): all the neurons in a layer are
connected to every neuron in the previous layer.
Bias Neuron: outputs 1 (x0 = 1) all the time.
Input Layer: all the input neurons form the input layer.

Perceptron diagram
From Biological to Artificial Neurons
> The Perceptron
How are the outputs of a fully connected layer computed?

• X represents the matrix of input features.


• The weight matrix W (except for the ones from the bias
neuron)
• The bias vector b
• ϕ is called the activation function: when the artificial
neurons are TLUs, it is a step function (but we will discuss
other activation functions shortly).
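The equation these bullets describe (shown only as a figure in the original) is the standard fully connected layer computation:

```latex
h_{\mathbf{W},\mathbf{b}}(\mathbf{X}) = \phi(\mathbf{X}\mathbf{W} + \mathbf{b})
```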
From Biological to Artificial Neurons
> The Perceptron
How is a Perceptron trained?
• The Perceptron training algorithm proposed by Frank Rosenblatt was largely inspired
by Hebb’s rule.
• Hebb's rule (or Hebbian learning): the connection weight between two neurons is
increased whenever they have the same output.

Perceptron learning rule (weight update)


wi, j : connection weight between the ith input neuron and the jth output
neuron.
xi : ith input value of the current training instance.
yj : output of the jth output neuron for the current training instance.
ŷj : target output of the jth output neuron for the current training instance.
η : learning rate.
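A reconstruction of the weight-update equation behind the figure, written with the variable definitions above (note that here ŷj denotes the target and yj the actual output):

```latex
w_{i,j}^{(\text{next step})} = w_{i,j} + \eta \,(\hat{y}_j - y_j)\, x_i
```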
From Biological to Artificial Neurons
> The Perceptron - Example
Scikit-Learn provides a Perceptron class that implements a single
TLU network.
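A minimal sketch of such an example; the iris dataset, the petal features and the setosa-vs-rest target are assumptions made for illustration:

```python
import numpy as np
from sklearn.datasets import load_iris
from sklearn.linear_model import Perceptron

# Load the iris dataset; use petal length and width as the two input features
iris = load_iris()
X = iris.data[:, (2, 3)]                  # petal length, petal width
y = (iris.target == 0).astype(int)        # 1 if Iris setosa, else 0

# A Perceptron is equivalent to a single-TLU network
per_clf = Perceptron(random_state=42)
per_clf.fit(X, y)

y_pred = per_clf.predict([[2.0, 0.5]])    # predict for a new flower
print(y_pred)
```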
From Biological to Artificial Neurons
> The Perceptron - Example
From Biological to Artificial Neurons
> The Perceptron
• In 1969, Marvin Minsky and Seymour Papert highlighted a number
of serious weaknesses of Perceptrons, notably their incapability of
solving some trivial problems (e.g. the Exclusive OR (XOR)
classification problem).
• Some of the limitations of Perceptrons can be eliminated by stacking
multiple Perceptrons. The resulting ANN is called a Multi-Layer
Perceptron (MLP).

XOR classification problem and an MLP that solves it


From Biological to Artificial Neurons
> Multi-Layer Perceptron and Backpropagation

Multi-Layer Perceptron
From Biological to Artificial Neurons
> Multi-Layer Perceptron and Backpropagation
For many years researchers struggled to find a way to train MLPs,
without success.
In 1986, David Rumelhart, Geoffrey Hinton and Ronald Williams
published a groundbreaking paper introducing the backpropagation
training algorithm.
forward pass: for each training instance the backpropagation algorithm
first makes a prediction
measures the error: calculate output error by using a loss function
backward pass: goes through each layer in reverse to measure the
error contribution from each connection
Gradient Descent step: slightly tweaks the connection weights to
reduce the error
From Biological to Artificial Neurons
> Multi-Layer Perceptron and Backpropagation
Algorithm Details
• It handles one mini-batch at a time and it goes through the full training set
multiple times. Each pass is called an epoch.
• Forward pass: each mini-batch's instances are passed to the network's input
layer, and the algorithm computes the output of all the neurons, layer by layer,
until we get the output of the last layer, the output layer. This is the same as
making predictions, except all intermediate results are preserved since they are
needed for the backward pass.
• Next, the algorithm measures the network’s output error by using a loss
function.
• Backward pass: then it computes how much each output connection
contributed to the error by simply applying the chain rule. The error
contribution calculation continues until the algorithm reaches the input layer.
This reverse pass measures the error gradient across all the connection
weights in the network by propagating the error gradient backward through
the network.
• Finally, the algorithm performs a Gradient Descent step to tweak all the
connection weights in the network, using the error gradients it just computed.
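A compact NumPy sketch of one such step for a tiny two-layer MLP with a sigmoid hidden layer and MSE loss; the layer sizes, data and learning rate are illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(42)
X = rng.normal(size=(32, 4))              # one mini-batch: 32 instances, 4 features
y = rng.normal(size=(32, 1))              # regression targets

W1, b1 = rng.normal(size=(4, 8)) * 0.1, np.zeros(8)
W2, b2 = rng.normal(size=(8, 1)) * 0.1, np.zeros(1)
eta = 0.01                                # learning rate

sigmoid = lambda z: 1.0 / (1.0 + np.exp(-z))

# Forward pass (intermediate results are kept for the backward pass)
z1 = X @ W1 + b1
a1 = sigmoid(z1)
y_hat = a1 @ W2 + b2

# Measure the error with the MSE loss
loss = np.mean((y_hat - y) ** 2)

# Backward pass: chain rule, layer by layer, from output back to input
d_yhat = 2.0 * (y_hat - y) / len(X)       # dLoss/dy_hat
dW2 = a1.T @ d_yhat
db2 = d_yhat.sum(axis=0)
d_a1 = d_yhat @ W2.T
d_z1 = d_a1 * a1 * (1.0 - a1)             # sigmoid derivative
dW1 = X.T @ d_z1
db1 = d_z1.sum(axis=0)

# Gradient Descent step: tweak every weight using its gradient
W2 -= eta * dW2; b2 -= eta * db2
W1 -= eta * dW1; b1 -= eta * db1
print("mini-batch loss:", loss)
```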
From Biological to Artificial Neurons
> Multi-Layer Perceptron and Backpropagation
Activation Functions
In order for this algorithm to work properly, the authors made a key
change to the MLP’s architecture:
• they replaced the step function with the logistic function
σ(z) =1 / (1 + exp(–z))
• This was essential because the step function contains only flat
segments, so there is no gradient to work with while the logistic
function has a well-defined nonzero derivative everywhere.
• Other popular activation functions
– Hyperbolic tangent function: tanh(z) = 2σ(2z) – 1
– Rectified Linear Unit function: ReLU(z) = max(0, z)
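For reference, a small NumPy sketch of these activation functions and their derivatives (as plotted on the next slide):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def d_sigmoid(z):
    s = sigmoid(z)
    return s * (1.0 - s)

def tanh(z):
    return 2.0 * sigmoid(2.0 * z) - 1.0   # equivalent to np.tanh(z)

def d_tanh(z):
    return 1.0 - np.tanh(z) ** 2

def relu(z):
    return np.maximum(0.0, z)

def d_relu(z):
    return (z > 0).astype(float)          # 0 for z < 0, 1 for z > 0 (undefined at 0)

def step(z):
    return (z >= 0).astype(float)         # flat everywhere: zero gradient

z = np.linspace(-5, 5, 11)
print(relu(z), d_sigmoid(z))
```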
From Biological to Artificial Neurons
> Multi-Layer Perceptron and Backpropagation
Activation Functions

Activation functions and their derivatives


From Biological to Artificial Neurons
> Regression MLPs
• MLPs can be used for regression tasks. You need one output neuron
per output dimension.
• E.g. predict the price of a house (one output neuron), the location of the
center of an object (two output neurons), or the bounding box of an object
(four output neurons).
• To leave the output free to take on any range of values, do not use any
activation function in the output layer.
• To guarantee that the output will always be positive, use the ReLU
activation function or the softplus activation function in the output
layer.
• To guarantee that the predictions will fall within a given range of
values, use the logistic function (range 0 to 1) or the hyperbolic
tangent function (range –1 to 1).
From Biological to Artificial Neurons
> Regression MLPs
Typical Regression MLP Architecture
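A minimal Keras sketch of a typical regression MLP as described above; the layer sizes, optimizer and synthetic data are illustrative assumptions:

```python
import numpy as np
from tensorflow import keras

# Illustrative data: 8 input features, 1 regression target
X_train = np.random.rand(1000, 8)
y_train = X_train.sum(axis=1, keepdims=True) + np.random.randn(1000, 1) * 0.1

model = keras.models.Sequential([
    keras.layers.Dense(50, activation="relu", input_shape=(8,)),
    keras.layers.Dense(50, activation="relu"),
    keras.layers.Dense(1)                   # no activation: output can take any value
])
model.compile(loss="mse", optimizer="sgd")  # MSE loss is typical for regression
model.fit(X_train, y_train, epochs=5, batch_size=32)
print(model.predict(X_train[:3]))
```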
From Biological to Artificial Neurons
> Classification MLPs
• MLPs can also be used for classification tasks.
• Binary classification: Single output neuron using the logistic
activation function: the output will be a number between 0 and 1.
• Multilabel binary classification: one output neuron per label, each
using the logistic activation function; each output gives the probability
of one label. E.g. email classification (spam or ham, urgent or
non-urgent) needs two output neurons.
• Multiclass classification: you need one output neuron per class,
and you should use the softmax activation function for the whole
output layer.
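A minimal Keras sketch of a multiclass classification MLP with a softmax output layer; the input dimension, layer sizes and random data are illustrative assumptions:

```python
import numpy as np
from tensorflow import keras

# Illustrative multiclass setup: 784 input features (e.g. flattened 28x28
# images) and 10 classes; the data here is random, just to show the API.
X_train = np.random.rand(1000, 784).astype("float32")
y_train = np.random.randint(0, 10, size=1000)

model = keras.models.Sequential([
    keras.layers.Dense(300, activation="relu", input_shape=(784,)),
    keras.layers.Dense(100, activation="relu"),
    keras.layers.Dense(10, activation="softmax")   # one neuron per class
])
model.compile(loss="sparse_categorical_crossentropy",
              optimizer="sgd", metrics=["accuracy"])
model.fit(X_train, y_train, epochs=5, batch_size=32)
print(model.predict(X_train[:1]).round(2))         # class probabilities sum to 1
```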
From Biological to Artificial Neurons
> Classification MLPs

A modern MLP (including ReLU and softmax) for classification


Machine / Deep Learning Frameworks
Neural Networks in Your Browser

https://playground.tensorflow.org/
