
21PG2IT7 - DATA SCIENCE

Single Layer Perceptron


A perceptron is a neural network unit that performs a precise computation to detect
features in the input data. The perceptron is mainly used to classify data into two
classes; it is therefore also known as a Linear Binary Classifier.


The perceptron consists of 4 parts.

• Input value or one input layer: The input layer of the perceptron is made
of artificial input neurons and takes the initial data into the system for
further processing.
• Weights and bias:
Weight: It represents the strength of the connection between units. If the
weight from node 1 to node 2 has a larger magnitude, then neuron 1 has a
greater influence on neuron 2.
Bias: It is the same as the intercept added in a linear equation. It is an
additional parameter whose task is to shift the output computed from the
weighted sum of the inputs to the next neuron.
• Net sum: It calculates the weighted sum of the inputs.
• Activation function: Whether a neuron is activated or not is determined by
an activation function. The activation function takes the weighted sum,
adds the bias to it, and produces the result.

How does it work?

The perceptron works in the simple steps given below, with a code sketch after the steps:

a. In the first step, all the inputs x are multiplied by their weights w.

b. In this step, add all the multiplied values and call the result the weighted sum.

c. In the last step, apply the correct activation function to the weighted sum.
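
A minimal Python sketch of steps a-c (NumPy assumed; the AND-gate weights here are hand-picked for illustration, not learned):

import numpy as np

def step(z):
    # Step activation: fire (1) if the weighted sum reaches zero, else 0.
    return 1 if z >= 0 else 0

def perceptron_output(x, w, b):
    # Steps a-c: multiply inputs by weights, sum them up, apply the activation.
    z = np.dot(w, x) + b  # weighted sum plus bias
    return step(z)

# Example: a perceptron acting as a logical AND gate.
w = np.array([1.0, 1.0])
b = -1.5
for x in [(0, 0), (0, 1), (1, 0), (1, 1)]:
    print(x, perceptron_output(np.array(x), w, b))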

• In a single layer perceptron, the weights to each input node are assigned
randomly, since there is no a priori knowledge associated with the nodes.
• Also, a threshold value is assigned randomly.
• The SLP then sums all the weighted inputs, and if the sum is above the
threshold, the network is activated.
• If the calculated value matches the desired value, then the model is
successful.
• If it does not, then, since there is no back-propagation technique involved,
the error needs to be calculated and the weights adjusted again, as in the
sketch after this list.
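
The formula referenced above does not survive in this copy; the sketch below assumes the standard perceptron learning rule, error = desired − predicted and w ← w + η · error · x:

import numpy as np

def train_perceptron(X, y, lr=0.1, epochs=20):
    # Weights and threshold (bias) start random: no a priori knowledge.
    w = np.random.randn(X.shape[1])
    b = np.random.randn()
    for _ in range(epochs):
        for x, desired in zip(X, y):
            predicted = 1 if np.dot(w, x) + b >= 0 else 0
            error = desired - predicted  # zero when the output matches the target
            w += lr * error * x          # adjust weights toward the desired value
            b += lr * error
    return w, b

# Example: learn the logical AND function.
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])
y = np.array([0, 0, 0, 1])
w, b = train_perceptron(X, y)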

The single layer perceptron (SLP) model is the simplest form of neural network
and the basis for the more advanced models that have been developed in deep
learning. Typically, we use SLP in classification problems where we need to give
the data observations labels (binary or multinomial) based on inputs. The values
in the input layer are directly sent to the output layer after they are multiplied by
weights and a bias is added to the cumulative sum. This cumulative sum is then
put into an activation function, which is simply a function that defines the output.
The final output is then determined by whether that value falls above or below a
user-determined threshold.

Multilayer Perceptron Model


The main intuition for using a multilayer perceptron is that the data is not
linearly separable. Each node in a layer applies a non-linear activation function
for processing. These functions are typically the Sigmoid/Logistic function, the
tanh/Hyperbolic Tangent function, ReLU (Rectified Linear Unit), and Softmax. This
neural network is fully connected and also has the capability to learn by itself,
changing the connection weights after each data point is processed according to
the amount of error it generates. The multi-layer perceptron defines the most
complex architecture of artificial neural networks. It is essentially formed from
multiple layers of perceptrons.
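
A minimal sketch of the activation functions named above (NumPy assumed):

import numpy as np

def sigmoid(z):
    # Sigmoid/logistic: squashes any real number into (0, 1).
    return 1.0 / (1.0 + np.exp(-z))

def tanh(z):
    # Hyperbolic tangent: squashes into (-1, 1) and is zero-centered.
    return np.tanh(z)

def relu(z):
    # Rectified Linear Unit: keeps positive values, zeroes out negatives.
    return np.maximum(0.0, z)

def softmax(z):
    # Softmax: turns a vector of scores into probabilities that sum to 1.
    e = np.exp(z - np.max(z))  # subtract the max for numerical stability
    return e / e.sum()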

MLP networks are used in a supervised learning setting. The typical learning
algorithm for MLP networks is called the back-propagation algorithm.

A multilayer perceptron (MLP) is a feed-forward artificial neural network that
generates a set of outputs from a set of inputs. An MLP is characterized by several
layers of input nodes connected as a directed graph between the input and output
layers. MLP uses backpropagation for training the network. MLP is a deep learning
method.

Very similar to the SLP, the multilayer perceptron (MLP) model features multiple
layers that are interconnected in such a way that they form a feed-forward neural
network. Each neuron in one layer has directed connections to the neurons of the
next layer. One of the key distinguishing factors between this model and the single
layer perceptron model is the back-propagation algorithm, a common method of
training neural networks. Back-propagation passes the error calculated at the
output layer back to the input layer, so that we can see each layer's contribution
to the error and alter the network accordingly. Here, we use a gradient descent
algorithm to determine the degree to which the weights should change on each
iteration. Gradient descent, another popular machine learning/optimization
algorithm, uses the derivative (gradient) of the loss function, which points in the
direction of steepest ascent. By repeatedly subtracting the gradient, we move to a
solution that is more optimal than the current one until we reach an optimum.
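
A minimal sketch of a gradient-descent update (the quadratic loss here is hypothetical, chosen so the gradient is easy to verify by hand):

def gradient_descent(grad, w, lr=0.1, steps=100):
    # Repeatedly subtract the gradient to move toward a minimum of the loss.
    for _ in range(steps):
        w = w - lr * grad(w)  # step against the direction of steepest ascent
    return w

# Example: minimize L(w) = (w - 3)^2, whose gradient is 2 * (w - 3).
w_opt = gradient_descent(lambda w: 2 * (w - 3.0), w=0.0)
print(w_opt)  # converges toward 3.0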

Convolutional Neural Network


A convolutional neural network is an advanced version of the multilayer
perceptron. In this type, there are one or more convolutional layers. Now the basic
question is: what exactly is a convolutional layer? Convolution is nothing but a
simple filtering mechanism that enables activation. When this filtering mechanism
is repeated, it yields the location and strength of a detected feature. As a result
of this ability, these networks are widely used in image processing, natural
language processing, and recommender systems, where the detected features yield
effective results.

A Convolutional Neural Network (ConvNet/CNN) is a Deep Learning algorithm which
can take in an input image, assign importance (learnable weights and biases) to
various aspects/objects in the image, and be able to differentiate one from the
other. The pre-processing required in a ConvNet is much lower as compared to other
classification algorithms. While in primitive methods filters are hand-engineered,
with enough training, ConvNets have the ability to learn these
filters/characteristics.

How do convolutional neural networks work?

Convolutional neural networks are distinguished from other neural networks by
their superior performance with image, speech, or audio signal inputs. They have
three main types of layers, which are:

• Convolutional layer
• Pooling layer
• Fully-connected (FC) layer

The convolutional layer is the first layer of a convolutional network. While
convolutional layers can be followed by additional convolutional layers or pooling
layers, the fully-connected layer is the final layer. With each layer, the CNN
increases in its complexity, identifying greater portions of the image. Earlier
layers focus on simple features, such as colors and edges. As the image data
progresses through the layers of the CNN, it starts to recognize larger elements or
shapes of the object until it finally identifies the intended object.

Convolutional Layer

The convolutional layer is the core building block of a CNN, and it is where the
majority of computation occurs. It requires a few components, which are input
data, a filter, and a feature map.
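
A minimal sketch of how the three components interact: a filter slides over the input data, and the dot product at each position fills in the feature map (NumPy assumed; padding and stride are omitted for simplicity):

import numpy as np

def convolve2d(image, kernel):
    # Slide the filter over the input and record the dot product at
    # each position; the collected results form the feature map.
    kh, kw = kernel.shape
    oh = image.shape[0] - kh + 1
    ow = image.shape[1] - kw + 1
    feature_map = np.zeros((oh, ow))
    for i in range(oh):
        for j in range(ow):
            patch = image[i:i + kh, j:j + kw]
            feature_map[i, j] = np.sum(patch * kernel)
    return feature_map

# Example: a hand-made 3x3 vertical-edge filter (a CNN would learn this).
image = np.random.rand(5, 5)
kernel = np.array([[1, 0, -1]] * 3)
print(convolve2d(image, kernel).shape)  # (3, 3)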

Pooling Layer

Pooling layers, also known as downsampling layers, conduct dimensionality
reduction, reducing the number of parameters in the input. Similar to the
convolutional layer, the pooling operation sweeps a filter across the entire input,
but the difference is that this filter does not have any weights.

There are two main types of pooling:

• Max pooling: As the filter moves across the input, it selects the pixel with
the maximum value to send to the output array. As an aside, this approach
tends to be used more often compared to average pooling.
• Average pooling: As the filter moves across the input, it calculates the
average value within the receptive field to send to the output array.
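
A minimal sketch of both pooling types (non-overlapping 2x2 windows assumed):

import numpy as np

def pool2d(x, size=2, mode="max"):
    # Sweep a weightless size-by-size window over the input and keep
    # either the maximum or the average of each window.
    oh, ow = x.shape[0] // size, x.shape[1] // size
    out = np.zeros((oh, ow))
    for i in range(oh):
        for j in range(ow):
            window = x[i * size:(i + 1) * size, j * size:(j + 1) * size]
            out[i, j] = window.max() if mode == "max" else window.mean()
    return out

x = np.arange(16, dtype=float).reshape(4, 4)
print(pool2d(x, mode="max"))  # max pooling
print(pool2d(x, mode="avg"))  # average pooling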

Fully-Connected Layer

The name of the fully-connected layer aptly describes itself. The pixel values of the
input image are not directly connected to the output layer in partially connected
layers. However, in the fully-connected layer, each node in the output layer
connects directly to a node in the previous layer.

This layer performs the task of classification based on the features extracted
through the previous layers and their different filters. While convolutional and
pooling layers tend to use ReLU functions, FC layers usually leverage a softmax
activation function to classify inputs appropriately, producing a probability from 0
to 1.
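
A minimal sketch of a fully-connected layer feeding a softmax classifier (the weights here are random for illustration; in practice they are learned):

import numpy as np

def fully_connected(features, W, b):
    # Every input feature connects to every output node; softmax turns
    # the resulting scores into class probabilities between 0 and 1.
    scores = W @ features + b
    e = np.exp(scores - scores.max())  # stabilized softmax
    return e / e.sum()

features = np.random.rand(8)  # flattened features from conv/pooling layers
W = np.random.randn(3, 8)     # 3 classes, 8 features
b = np.zeros(3)
probs = fully_connected(features, W, b)
print(probs, probs.sum())     # class probabilities summing to 1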

Recurrent Neural Networks


An RNN works on the principle of saving the output of a particular layer and
feeding it back to the input in order to predict the output of the layer. In this
way, a feed-forward neural network is converted into a recurrent neural network.

In an RNN, the output of the current layer is the input of the next layer. In
traditional neural networks, all the inputs and outputs are independent of each
other, but in cases such as predicting the next word of a sentence, the previous
words are required, and hence there is a need to remember them.

An RNN can use Long Short Term Memory (LSTM) to remember previous outputs. In an
RNN, the same weights and bias are used for the input at every step, because the
network performs the same task on all the inputs of the hidden layers to produce
the output. The main and most important feature of an RNN is the hidden state,
which remembers some information about a sequence. An RNN is best used for
sequential data, such as predicting the next word of a sentence or the next
position of a moving ball. The recurrent neural network consists of multiple fixed
activation function units, one for each time step. Each unit has an internal state,
called the hidden state of the unit. This hidden state signifies the past knowledge
that the network currently holds at a given time step. This hidden state is updated
at every time step to signify the change in the knowledge of the network about the
past.
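
A minimal sketch of this hidden-state update, h_t = tanh(W_xh · x_t + W_hh · h_{t-1} + b), with the same weights shared across every time step (the shapes below are illustrative):

import numpy as np

def rnn_forward(inputs, W_xh, W_hh, b_h):
    # One fixed tanh unit applied at every time step; the same weights are
    # reused, and the hidden state carries past knowledge forward.
    h = np.zeros(W_hh.shape[0])  # initial hidden state: no past knowledge yet
    states = []
    for x in inputs:  # one update per time step
        h = np.tanh(W_xh @ x + W_hh @ h + b_h)  # hidden-state update
        states.append(h)
    return states

# Example: a sequence of five 3-dimensional inputs with hidden size 4.
rng = np.random.default_rng(0)
inputs = [rng.standard_normal(3) for _ in range(5)]
states = rnn_forward(inputs, rng.standard_normal((4, 3)),
                     rng.standard_normal((4, 4)), np.zeros(4))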

How Do Recurrent Neural Networks Work?

In recurrent neural networks, the information cycles through a loop to the middle
hidden layer. The input layer ‘x’ takes in the input to the neural network,
processes it, and passes it on to the middle layer.

The middle layer ‘h’ can consist of multiple hidden layers, each with its own
activation functions, weights, and biases. If you have a neural network where the
various parameters of the different hidden layers are not affected by the previous
layer, i.e., the neural network does not have memory, then you can use a recurrent
neural network instead.

The Recurrent Neural Network will standardize the different activation functions
and weights and biases so that each hidden layer has the same parameters. Then,
instead of creating multiple hidden layers, it will create one and loop over it as
many times as required.

Feed-Forward Neural Networks vs Recurrent Neural Networks

A feed-forward neural network allows information to flow only in the forward
direction, from the input nodes, through the hidden layers, and to the output
nodes. There are no cycles or loops in the network.

Applications of Recurrent Neural Networks

Image Captioning

RNNs are used to caption an image by analyzing the activities present in it.

Time Series Prediction

Any time series problem, like predicting the prices of stocks in a particular month,
can be solved using an RNN.

Natural Language Processing

Text mining and Sentiment analysis can be carried out using an RNN for Natural
Language Processing (NLP).

Types of Recurrent Neural Networks

There are four types of Recurrent Neural Networks:

1. One to One
2. One to Many
3. Many to One
4. Many to Many

One to One RNN

This type of neural network is known as the Vanilla Neural Network. It's used for
general machine learning problems that have a single input and a single output.

One to Many RNN

This type of neural network has a single input and multiple outputs. An example of
this is image captioning.

Many to One RNN

This RNN takes a sequence of inputs and generates a single output. Sentiment
analysis is a good example of this kind of network where a given sentence can be
classified as expressing positive or negative sentiments.

Many to Many RNN

This RNN takes a sequence of inputs and generates a sequence of outputs. Machine
translation is one example.

Recurrent neural networks (RNNs) are models of artificial neural networks (ANNs)
where the connections between units form a directed cycle. Specifically, a directed
cycle is a sequence where the walk along the vertices and edges is completely
determined by the set of edges used, and therefore has some semblance of a specific
order. RNNs are often specifically used for speech and handwriting recognition.
