
Chapter 5

Introduction to

Artificial Neural Networks


Outline
 Biological Neurons
 The Perceptron
 Multi-Layer Perceptron and Backpropagation
 Neural Network Architecture
 Activation Functions
 Loss Function
 Limitations of Neural Networks
Introduction

 Artificial Neural Networks (ANNs) are popular machine learning techniques that simulate the mechanism of learning in biological organisms.
 An ANN is a supervised learning system built from a large number of simple elements, called neurons or perceptrons.
 Each neuron makes simple decisions and feeds those decisions to other neurons, organized in interconnected layers.
Biological Neurons
 Individual biological neurons seem to behave in a
rather simple way, but they are organized in a vast
network of billions of neurons, each neuron
typically connected to thousands of other neurons.
The Perceptron
 Is one of the simplest ANN architectures
 It is based on a slightly different artificial neuron
called a linear threshold unit (LTU): the inputs and
outputs are now numbers (instead of binary on/off
values) and each input connection is associated
with a weight.
 The LTU computes a weighted sum of its inputs:
z = w1·x1 + w2·x2 + … + wn·xn = wᵀx
 Then it applies a step function to that sum and outputs the result: h(x) = step(z)
Cont.
 Perceptrons are the simplest types of artificial neurons, invented as a simple model for binary classification.
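
To make the perceptron concrete, here is a small illustrative sketch (not from the slides; the weights and inputs are made-up assumptions) of an LTU in Python/NumPy: a weighted sum followed by a step function.

import numpy as np

def ltu(x, w, threshold=0.0):
    # weighted sum of the inputs
    z = np.dot(w, x)
    # step function: output 1 if the sum reaches the threshold, else 0
    return 1 if z >= threshold else 0

# hypothetical weights and inputs, for illustration only
x = np.array([1.0, 0.5, -0.2])
w = np.array([0.4, 0.3, 0.9])
print(ltu(x, w))   # prints 1 or 0 depending on the weighted sum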
Multi-Layer Perceptron and Backpropagation
 An MLP is composed of one (pass through) input
layer, one or more layers of LTUs, called hidden
layers, and one final layer of LTUs called the
output layer.
 Every layer except the output layer includes a bias
neuron and is fully connected to the next layer.
When an ANN has two or more hidden layers, it is
called a deep neural network (DNN).
Cont.
 Backpropagation
 For each training instance, the algorithm feeds it to the network and computes the output of every neuron in each consecutive layer (this is the forward pass, just like when making predictions).
 The reverse pass then efficiently measures the error gradient across all the connection weights in the network by propagating the error gradient backward through the network.
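
As a hedged sketch of the forward and reverse passes (not from the slides; the layer sizes, variable names, and squared-error loss are illustrative assumptions), one training step for a tiny one-hidden-layer MLP in NumPy might look like this:

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# illustrative sizes: 3 inputs, 4 hidden units, 1 output
rng = np.random.default_rng(0)
W1, b1 = rng.normal(size=(4, 3)), np.zeros(4)
W2, b2 = rng.normal(size=(1, 4)), np.zeros(1)

x = np.array([0.5, -1.0, 2.0])   # one training instance
t = np.array([1.0])              # its target

# forward pass: compute the output of every neuron, layer by layer
h = sigmoid(W1 @ x + b1)
y = sigmoid(W2 @ h + b2)

# reverse pass: propagate the error gradient backward (squared-error loss)
delta2 = (y - t) * y * (1 - y)            # output-layer error term
grad_W2 = np.outer(delta2, h)
delta1 = (W2.T @ delta2) * h * (1 - h)    # hidden-layer error term
grad_W1 = np.outer(delta1, x)

# gradient-descent step (bias updates omitted for brevity)
lr = 0.1
W2 -= lr * grad_W2
W1 -= lr * grad_W1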
Neural Network Architecture
 An Artificial Neural Network (ANN) is composed of four principal objects:
 Layers – all the learning occurs in the layers. There are three kinds of layers: input, hidden, and output.
 The input data and corresponding targets
 Loss function – the metric used to estimate the performance of the learning phase. It defines the feedback signal.
 Optimizer – improves the learning by updating the knowledge in the network.
Cont.
 Layers
 A layer is where all the learning takes place.
 Inside a layer there is a large number of weights (neurons).
 A typical neural network is built from densely connected layers (also called fully connected layers).
 This means all the inputs are connected to all the outputs.
 A typical neural network takes a vector of inputs and a scalar that contains the labels.
 The simplest setup is binary classification with only two classes: 0 and 1.
 The network takes an input, sends it to all connected nodes, and computes the signal with an activation function, as sketched below.
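
A minimal sketch of a densely connected (fully connected) layer; the sizes, weights, and activation are made-up assumptions for illustration:

import numpy as np

def dense_layer(x, W, b, activation=np.tanh):
    # fully connected: every input contributes to every output through W
    return activation(W @ x + b)

x = np.array([0.2, 0.7, 1.5])     # input vector (3 features)
W = np.full((2, 3), 0.1)          # 2 output units, each connected to all 3 inputs
b = np.zeros(2)
print(dense_layer(x, W, b))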


Activation Function
 The activation function of a node defines the output given a set of inputs.
 You need an activation function to allow the network to learn non-linear patterns.
 A common activation function is the ReLU (Rectified Linear Unit).
 The function outputs zero for all negative values and the identity for positive values.
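
For illustration, ReLU can be written in one line of NumPy (the sample values below are made up):

import numpy as np

def relu(z):
    # zero for all negative values, identity for positive values
    return np.maximum(0, z)

print(relu(np.array([-2.0, -0.5, 0.0, 1.5])))   # -> [0.  0.  0.  1.5]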
Loss Function
 After you have defined the hidden layers and the
activation function, you need to specify the loss
function and the optimizer.
 For binary classification, it is common practice to use a binary cross-entropy loss function.
 For linear regression, you use the mean squared error.
 The loss function is an important metric for estimating the model's performance during optimization.
 During training, this metric will be minimized.
 You need to select this quantity carefully
depending on the type of problem you are dealing
with.
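
As an illustrative sketch (not from the slides; the example labels and predictions are assumptions), the two losses mentioned above can be written as:

import numpy as np

def binary_cross_entropy(y_true, y_pred, eps=1e-7):
    # average negative log-likelihood for labels in {0, 1}
    y_pred = np.clip(y_pred, eps, 1 - eps)
    return -np.mean(y_true * np.log(y_pred) + (1 - y_true) * np.log(1 - y_pred))

def mean_squared_error(y_true, y_pred):
    return np.mean((y_true - y_pred) ** 2)

y_true = np.array([1, 0, 1])
y_pred = np.array([0.9, 0.2, 0.7])
print(binary_cross_entropy(y_true, y_pred), mean_squared_error(y_true, y_pred))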
Optimizer
 The loss function is a measure of the model’s
performance.
 The optimizer will help improve the weights of
the network in order to decrease the loss.
 There are different optimizers available, but
the most common one is Stochastic Gradient
Descent.
 Other conventional optimizers include:
 Momentum optimization
 Nesterov Accelerated Gradient
 AdaGrad
 Adam optimization
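
As a rough sketch of what an optimizer does, one plain stochastic gradient descent step moves each weight against its loss gradient (the values below are illustrative assumptions):

import numpy as np

def sgd_step(weights, gradients, learning_rate=0.01):
    # move each weight a small step against its loss gradient
    return weights - learning_rate * gradients

w = np.array([0.5, -0.3])
g = np.array([0.2, -0.1])     # hypothetical gradient of the loss w.r.t. w
print(sgd_step(w, g))         # -> [ 0.498 -0.299]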
Limitations of Neural Networks
 Overfitting
 A common problem with complex neural networks is the difficulty of generalizing to unseen data.
 A neural network with lots of weights can identify specific details in the training set very well, but this often leads to overfitting.
 If the data are unbalanced within groups (i.e., not enough data available in some groups), the network will learn very well during training but will not be able to generalize such patterns to never-seen-before data.
Cont.
 There is a trade-off in machine learning between optimization and generalization.
 Optimizing a model requires finding the best parameters that minimize the loss on the training set.
 Generalization, however, tells how the model behaves on unseen data.
 To prevent the model from capturing specific details or unwanted patterns of the training data, you can use different regularization techniques.
Cont.
 The best method is to have a balanced dataset with a sufficient amount of data.
 The art of reducing overfitting is called regularization.
 Let's review some of the conventional techniques (two of them are sketched below):
 Network size
 Weight regularization
 Dropout
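
A hedged sketch of weight regularization and dropout, written with the TensorFlow 1.x API this chapter uses later; the layer sizes, penalty coefficient, and variable names are assumptions, and the data loss is assumed to be defined elsewhere:

import tensorflow as tf

x = tf.placeholder(tf.float32, shape=[None, 784])
keep_prob = tf.placeholder(tf.float32)     # e.g. 0.5 during training, 1.0 at test time

W = tf.Variable(tf.truncated_normal([784, 100], stddev=0.1))
b = tf.Variable(tf.zeros([100]))
hidden = tf.nn.relu(tf.matmul(x, W) + b)

# Dropout: randomly zero a fraction of the activations during training
hidden_dropped = tf.nn.dropout(hidden, keep_prob)

# Weight regularization: add an L2 penalty on the weights to the loss
l2_penalty = 0.001 * tf.nn.l2_loss(W)
# total_loss = data_loss + l2_penalty   (data_loss defined elsewhere)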
Introduction to TensorFlow
 TensorFlow is an open source software library for
numerical computation using data flow
graphs.
 The concept of a computational graph is very
important in TensorFlow and was specifically
designed for creating deep learning models.
 TensorFlow provides multiple APIs:
 Low level: TensorFlow Core, the lowest-level API, which gives complete programming control with a high degree of flexibility.
 High level: high-level APIs such as tf.contrib.learn, Keras, and TF-Slim, which take care of repetitive tasks and low-level details.
 They are designed for the fast implementation of commonly used models.
TensorFlow Basics
 There are some major concepts that we need to understand before actually using the TensorFlow library:
 Tensors
 Computational Graphs
 Sessions
 Variables
 Placeholders
 Constants
Cont.
 Tensors
 A tensor is the primary data structure of TensorFlow.
 A tensor is a vector or matrix of n dimensions that represents all types of data.
 All values in a tensor hold an identical data type with a known (or partially known) shape.
 The shape of the data is the dimensionality of the matrix or array.
 Feature vectors (in ML) will be represented as tensors.
Cont.
 In TensorFlow, a tensor is a collection of feature vectors (i.e., arrays) of n dimensions.
 For instance, if we have a 2x3 matrix with values from 1 to 6, we write:
[[1, 2, 3],
 [4, 5, 6]]
 TensorFlow represents this matrix as a tensor of shape [2, 3], as sketched below.
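
For illustration, assuming the TensorFlow 1.x API used in this chapter, the matrix can be created as a constant tensor:

import tensorflow as tf

m = tf.constant([[1, 2, 3],
                 [4, 5, 6]])
print(m.shape)     # (2, 3)
print(m.dtype)     # <dtype: 'int32'>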


Computational Graph
 A computational graph is a series of TensorFlow operations.
 TensorFlow Core programs follow two steps:
 Build the computational graph
 Run the computational graph
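
A minimal sketch of these two steps, written in the TensorFlow 1.x style used in this chapter (the values are illustrative):

import tensorflow as tf

# step 1: build the computational graph (nothing is computed yet)
a = tf.constant(2.0)
b = tf.constant(3.0)
c = a * b

# step 2: run the graph inside a session
with tf.Session() as sess:
    print(sess.run(c))    # 6.0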
Session and Placeholders
 Session: an object that encapsulates the environment in which operation objects are executed.
 Sessions are objects that place operations onto devices such as CPUs or GPUs.
 Placeholders: a placeholder is a promise to provide a value later.
 These objects are usually used to feed training data into the graph, as in the sketch below.
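
A small sketch of a placeholder being filled at run time (TensorFlow 1.x style; the values are illustrative):

import tensorflow as tf

x = tf.placeholder(tf.float32)     # promise to provide a value later
y = x * 2.0

with tf.Session() as sess:
    # the placeholder is filled through feed_dict when the graph is run
    print(sess.run(y, feed_dict={x: 3.0}))    # 6.0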
Variables and Constants
 Variables: objects initialized with a value; that value can change during the execution of the graph.
 Typically, they are used as trainable variables.
 Constants: objects whose values never change.
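
A short illustrative sketch of the difference, in the TensorFlow 1.x style used in this chapter (names and values are assumptions):

import tensorflow as tf

w = tf.Variable(tf.zeros([2]))     # trainable; value can change during execution
c = tf.constant([1.0, 2.0])        # value never changes

update = w.assign(w + c)           # an operation that changes the variable

with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    print(sess.run(update))        # [1. 2.]
    print(sess.run(update))        # [2. 4.]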
Example
 Step 1: Importing libraries
 Then, we define some TensorFlow objects, placeholders, and variables by executing the following:
Cont.
import tensorflow as tf
from tensorflow.examples.tutorials.mnist import input_data

# load the MNIST dataset with one-hot encoded labels
mnist = input_data.read_data_sets('mnist_data', one_hot=True)

sess = tf.InteractiveSession()

# placeholders for the input images and target labels
x = tf.placeholder(tf.float32, shape=[None, 784])
y_ = tf.placeholder(tf.float32, shape=[None, 10])

# variables (trainable weights and biases)
w = tf.Variable(tf.zeros([784, 10]))
b = tf.Variable(tf.zeros([10]))
sess.run(tf.global_variables_initializer())

# predicted class (logits) and loss function
y = tf.matmul(x, w) + b
Cont.
cross_entropy = tf.reduce_mean(
    tf.nn.softmax_cross_entropy_with_logits(
        labels=y_, logits=y))

# train the model
train_step = tf.train.AdamOptimizer(0.5).minimize(cross_entropy)
for _ in range(1000):
    batch = mnist.train.next_batch(100)
    train_step.run(feed_dict={x: batch[0], y_: batch[1]})

# evaluate the model
correct_prediction = tf.equal(tf.argmax(y, axis=1), tf.argmax(y_, axis=1))
Cont.
accuracy = tf.reduce_mean(tf.cast(correct_prediction, tf.float32))

print(accuracy.eval(feed_dict={x: mnist.test.images,
                               y_: mnist.test.labels}))