0% found this document useful (0 votes)

44 views7 pages

Types of Neural Networks

Uploaded by

Sai someone

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

44 views7 pages

Types of Neural Networks

Uploaded by

Sai someone

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

DESIGN OF OUTPUT LAYER:

Let's discuss the design of the last layer of network in particular. In addition to all hidden layers, it
completes the functions of dimensional transformation and feature extraction, and it is also used as an
output layer.
It is necessary to decide whether to use the activation function and what type of activation function to
use according to the specific tasks. We will classify the discussions based on the range of output
values.

[0, 1] Interval:
It is also common for output values to belong to interval [0, 1], such as image generation, and binary
classification problems. The binary classification network with single output node looks like:

In this case, you only need to add the Sigmoid function after the value of the output layer to translate
the output into a probability value.
Below figure shows the output layer of the binary classification network is two nodes.

The output value of the first node represents the probability of the occurrence of event A P(x), and the
output value of the second node represents the probability of the occurrence of the opposite event
P(x). The function can only compress a single value to the interval (0, 1) and does not consider the
relationship between the two node values. We hope that in addition to satisfy oi ∈ [0, 1], they can
satisfy the constraint that the sum of probabilities is 1:

[0,1] Interval with Sum 1:

For cases that the output value oi ∈ [0, 1], and the sum of all output values is 1, it is the most common
problem with multi-classification. As shown in Figure each output node of the output layer represents
a category.
The network structure in the figure is used to handle three classification tasks. The output value
distribution of the three nodes represents the probability that the current sample belongs to category
A, B, and C: P(x), P(B| x), and P(C| x). Because the sample in the multi-classification
problem can only belong to one of the categories, so the sum of the probabilities of all categories
should be 1.
This can be achieved by adding a Softmax function to the output layer. The Softmax function is
defined as:

The Softmax function can not only map the output value to the interval [0, 1] but also satisfy the
characteristic that the sum of all output values is 1.

z = tf.constant([2.,1.,0.1])
tf.nn.softmax(z)
Out[12]:
<tf.Tensor: id=19, shape=(3,), dtype=float32, numpy=array([0.6590012, 0.242433 , 0.0985659],
dtype=float32)>

(-1, 1) Interval
If you want the range of output values to be distributed in intervals (−1, 1), you can simply use the
tanh activation function:
I
x = tf.linspace(-6.,6.,10)
tf.tanh(x)
Out[15]:
<tf.Tensor: id=264, shape=(10,), dtype=float32, numpy= array([-0.9999877 , -0.99982315, -0.997458
, -0.9640276 ,-0.58278286, 0.5827831 , 0.9640276 , 0.997458 , 0.99982315,0.9999877 ],
dtype=float32)>

The design of the output layer has a certain flexibility, which can be designed according to the actual
application scenario, and make full use of the characteristics of the existing activation function.

ERROR CALCULATION:
 After building the model structure, the next step is to select the appropriate error function to
calculate the error.
 Common error functions are mean square error, cross-entropy, KL divergence, and hinge
loss.
 Among them, the mean square error function and cross-entropy function are more
common in deep learning.
 The mean square error function is mainly used for regression problems, and the cross-entropy
function is mainly used for classification problem.
Mean Square Error Function
Mean square error (MSE) function maps the output vector and the true vector to two points in the
Cartesian coordinate system, by calculating the Euclidean distance between these two points (to be
precise, the square of Euclidean distance) to measure the difference between the two vectors:

The value of MSE is always greater than or equal to 0. When the MSE function reaches the minimum
value of 0, the output is equal to the true label, and the parameters of the neural network reach the
optimal state.

o = tf.random.normal([2,10]) # Network output

y_onehot = tf.constant([1,3]) # Real label
y_onehot = tf.one_hot(y_onehot, depth=10)
loss = keras.losses.MSE(y_onehot, o) # Calculate MSE
loss
Out[16]:
<tf.Tensor: id=27, shape=(2,), dtype=float32,
numpy=array([0.779179 , 1.6585705], dtype=float32)>

You need to average again in the sample dimension to obtain the mean square error of the average
sample. The implementation is as follows:

loss = tf.reduce_mean(loss)
loss
Out[17]:
<tf.Tensor: id=30, shape=(), dtype=float32, numpy=1.2188747>
It can also be implemented in layer mode. The corresponding class is
keras.losses.MeanSquaredError().
Like other classes, the __call__ function can be called to complete the forward calculation. The code
is as follows:

criteon = keras.losses.MeanSquaredError()
loss = criteon(y_onehot,o)
loss
Out[18]:
<tf.Tensor: id=54, shape=(), dtype=float32, numpy=1.2188747>

Cross-Entropy Error Function:

Calculating the cross-entropy error function in a neural network involves

computing the loss between the predicted values (often probabilities) generated
by the model and the true labels or target values. As mentioned earlier, there are
two common variants of cross-entropy loss: binary cross-entropy and
categorical cross-entropy.

Binary Cross-Entropy (Binary Log Loss):

For binary classification problems, where there are only two classes (0 and 1),
the binary cross-entropy loss is used. Given a true label
y (0 or 1) and a predicted probability y^ (a value between 0 and 1), the binary
cross-entropy loss is calculated as follows:
L(y,y^)=−(y⋅log(y^)+(1−y)⋅log(1−y^))
Where, L(y, y^) is the binary cross-entropy loss.
y is the true label (0 or 1).
y^is the predicted probability of belonging to class 1.
To calculate this loss for a batch of samples, you typically average the
individual losses.

Categorical Cross-Entropy (Multi-Class Log Loss):

For multi-class classification problems, where there are more than two classes,
the categorical cross-entropy loss is used. Given a true label y (a one-hot
encoded vector) and predicted class probabilities y^ (a vector of predicted
probabilities), the categorical cross-entropy loss is calculated as follows:
L(y,y^)=−∑i=1Nyi⋅log(y^i)
Where, L(y, y^) is the categorical cross-entropy loss.
y is a one-hot encoded vector representing the true class.
y^is a vector of predicted class probabilities for each class.
N is the number of classes.

To calculate this loss for a batch of samples, you typically average the
individual losses.

Here's an example of how to set the loss function in Keras:

from tensorflow.keras.models import Sequential

from tensorflow.keras.layers import Dense

model = Sequential([
Dense(units=64, activation='relu', input_dim=input_dim),
Dense(units=1, activation='sigmoid') # For binary classification
])

model.compile(optimizer='adam', loss='binary_crossentropy',
metrics=['accuracy'])
For multi-class classification, you would typically use
'categorical_crossentropy' as the loss function and adapt your model architecture
accordingly.

During training, the optimization algorithm minimizes the chosen loss function,
which means it adjusts the model's parameters to make the predicted values
(probabilities) closer to the true labels.
TYPES OF NEURAL NETWORKS:
There are various types of neural networks, each designed to address specific
types of machine learning tasks. Here are some of the most common types of
neural networks:

Feedforward Neural Network (FNN):

 Also known as Multi-Layer Perceptrons (MLP).

 The simplest form of neural network.
 Consists of an input layer, one or more hidden layers, and an output layer.
 Used for tasks like classification and regression.

Convolutional Neural Network (CNN):

 Specifically designed for processing grid-like data, such as images.

 Employs convolutional layers to automatically learn spatial hierarchies of
features.
 Widely used in image classification, object detection, and image
segmentation.

Recurrent Neural Network (RNN):

 Designed for sequential data and time-series analysis.

 Contains recurrent layers that maintain hidden states to capture temporal
dependencies.
 Suitable for tasks like natural language processing, speech recognition,
and time-series prediction.

Long Short-Term Memory (LSTM):

 A type of RNN with improved ability to capture long-term dependencies.

 Utilizes memory cells and gates to control the flow of information.
 Excellent for sequential tasks where context over long sequences is
essential.

Gated Recurrent Unit (GRU):

 Similar to LSTM but with a simpler architecture.
 Uses gating mechanisms to control information flow.
 Offers a balance between performance and complexity compared to
LSTM.
Autoencoder (AE):

 Unsupervised learning neural network used for dimensionality reduction

and feature learning.
 Comprises an encoder to reduce input data dimensions and a decoder to
reconstruct the original data.
 Used in image denoising, anomaly detection, and recommendation
systems.

Variational Autoencoder (VAE):

 An extension of autoencoders with probabilistic properties.

 Encourages the model to generate data points similar to those in the
training dataset.
 Commonly used in generating new data samples and data representation
learning.

Generative Adversarial Network (GAN):

 Comprises a generator network and a discriminator network.
 Trains by having the generator and discriminator compete against each
other.
 Used for generating synthetic data, image-to-image translation, and style
transfer.

Radial Basis Function Network (RBFN):

 Utilizes radial basis functions as activation functions.
 Suitable for interpolation, approximation, and function approximation
tasks.

Self-Organizing Maps (SOM):

 Used for clustering and dimensionality reduction.

 Organizes data points in a low-dimensional grid while preserving
topological relationships.

Residual Neural Network (ResNet):

 Addresses the vanishing gradient problem by using skip connections.

 Enables the training of extremely deep neural networks.
 Commonly used in image recognition tasks.
Siamese Network:

 Designed for tasks involving similarity or dissimilarity comparisons.

 Consists of two identical subnetworks with shared weights.
 Often used in face recognition and signature verification.

Transformers:

 Introduced in the field of natural language processing (NLP).

 Utilizes attention mechanisms to capture contextual information.
 The basis for models like BERT, GPT, and T5 for various NLP tasks.

Graph Convolutional Neural Network (Graph CNN or GCN):

 Designed for processing graph-structured data.

 Utilizes graph convolutional layers to propagate information between
connected nodes in a graph.
 Used in tasks such as node classification, link prediction, and graph
classification in areas like social network analysis and recommendation
systems.

Attention Mechanism:
 Not a standalone network architecture but a mechanism integrated into
various neural networks.
 Introduced in models like Transformers.
 Allows the model to focus on different parts of the input sequence when
making predictions.
 Essential for capturing long-range dependencies in sequential data.
 Used in natural language processing for tasks like machine translation,
text summarization, and question-answering.

These are some of the fundamental types of neural networks, and there are
many more specialized architectures and variations tailored to specific
applications and research areas. The choice of the neural network architecture
depends on the nature of the problem you want to solve.

Syllabus COMP001-IntrotoCompBSIT
No ratings yet
Syllabus COMP001-IntrotoCompBSIT
5 pages
DL Unit-2
No ratings yet
DL Unit-2
24 pages
Question Dissection Strategy Example
No ratings yet
Question Dissection Strategy Example
2 pages
Corpuz
100% (2)
Corpuz
2 pages
Execution and Control of Operations
No ratings yet
Execution and Control of Operations
1 page
APKA Report
No ratings yet
APKA Report
3 pages
Ch2-Training, Optimization and Regularization of DNN-new
No ratings yet
Ch2-Training, Optimization and Regularization of DNN-new
114 pages
Practical-5 - 2CEIT606 - Artificial Intelligence
No ratings yet
Practical-5 - 2CEIT606 - Artificial Intelligence
14 pages
DL Practical 02 Binary Class Classifier Using ANN
No ratings yet
DL Practical 02 Binary Class Classifier Using ANN
5 pages
Loss Functions
No ratings yet
Loss Functions
15 pages
Notes Chapter8
No ratings yet
Notes Chapter8
4 pages
DL Mannual For Reference
No ratings yet
DL Mannual For Reference
58 pages
DeepLearning Recap
No ratings yet
DeepLearning Recap
104 pages
Shivansh Exp8
No ratings yet
Shivansh Exp8
5 pages
Day 2 - Loss & Activation Functions
No ratings yet
Day 2 - Loss & Activation Functions
8 pages
Crashcourse DL Pytorch Parr
No ratings yet
Crashcourse DL Pytorch Parr
39 pages
Kagan Lecture2
No ratings yet
Kagan Lecture2
118 pages
Deep Learning
No ratings yet
Deep Learning
15 pages
DeepLearning Workshop Humayun
No ratings yet
DeepLearning Workshop Humayun
63 pages
Activation - Loss - Accuracy
No ratings yet
Activation - Loss - Accuracy
16 pages
AML 03 Dense Neural Networks
No ratings yet
AML 03 Dense Neural Networks
20 pages
Unit 2.1
No ratings yet
Unit 2.1
37 pages
Neural Network Notes
No ratings yet
Neural Network Notes
8 pages
Tensors - Operations - and - Deep - Learning - Cycle - Jupyter Notebook
No ratings yet
Tensors - Operations - and - Deep - Learning - Cycle - Jupyter Notebook
24 pages
CHAPTER 3.3 - Activation - Loss - Accuracy
No ratings yet
CHAPTER 3.3 - Activation - Loss - Accuracy
14 pages
Neural Networks - 2
No ratings yet
Neural Networks - 2
79 pages
NLP-NeuralNetworks Reading Notes
No ratings yet
NLP-NeuralNetworks Reading Notes
13 pages
HODL Lec 2 Training NNs Intro TF
No ratings yet
HODL Lec 2 Training NNs Intro TF
83 pages
Domnic Object Detecion Basics
No ratings yet
Domnic Object Detecion Basics
62 pages
Video 7 - Building A Multilayer Feedforward Network For Classification in PyTorch
No ratings yet
Video 7 - Building A Multilayer Feedforward Network For Classification in PyTorch
18 pages
Introduction To Artificial Neural Networks
No ratings yet
Introduction To Artificial Neural Networks
31 pages
Coding Neural Networks-Classification & Regression
No ratings yet
Coding Neural Networks-Classification & Regression
39 pages
L6 Multilayer FeedForward Network XOR & MNIST DIGIT
No ratings yet
L6 Multilayer FeedForward Network XOR & MNIST DIGIT
51 pages
Module 2
No ratings yet
Module 2
44 pages
Deep Learning
No ratings yet
Deep Learning
19 pages
AI & ML Unit 5 Notes
No ratings yet
AI & ML Unit 5 Notes
23 pages
cs188 sp23 Note25
No ratings yet
cs188 sp23 Note25
8 pages
Deep Learning
No ratings yet
Deep Learning
20 pages
This Code Fragment Defines A Single Layer With Artificial Neurons, and It Expects Input Variables
No ratings yet
This Code Fragment Defines A Single Layer With Artificial Neurons, and It Expects Input Variables
9 pages
This Code Fragment Defines A Single Layer With Artificial Neurons, and It Expects Input Variables
No ratings yet
This Code Fragment Defines A Single Layer With Artificial Neurons, and It Expects Input Variables
9 pages
Unit 2 DL
No ratings yet
Unit 2 DL
70 pages
4-Tensors and Opeartions - Probability Basics-Gradient Descent-27!07!2024
No ratings yet
4-Tensors and Opeartions - Probability Basics-Gradient Descent-27!07!2024
18 pages
A Imprimer 4
No ratings yet
A Imprimer 4
4 pages
6.neural Networks 2
No ratings yet
6.neural Networks 2
44 pages
Cheatsheet Reflex Models
No ratings yet
Cheatsheet Reflex Models
4 pages
DL Unit2
No ratings yet
DL Unit2
22 pages
3 ArtificialNeuralNetworks PDF
No ratings yet
3 ArtificialNeuralNetworks PDF
77 pages
Lec 04 Deep Networks 2
No ratings yet
Lec 04 Deep Networks 2
78 pages
03-Linear Classification
No ratings yet
03-Linear Classification
17 pages
Week 2 Artificial Neural Networks
No ratings yet
Week 2 Artificial Neural Networks
62 pages
26 Neural Nets
No ratings yet
26 Neural Nets
77 pages
16 DL 1
No ratings yet
16 DL 1
9 pages
Lecture 09 Slides - After
No ratings yet
Lecture 09 Slides - After
57 pages
ML Lec 10 Neural Networks
No ratings yet
ML Lec 10 Neural Networks
87 pages
Ece18898g Neural Networks
No ratings yet
Ece18898g Neural Networks
47 pages
DNN - M2 - Deep Feedforward NN 23dec
No ratings yet
DNN - M2 - Deep Feedforward NN 23dec
97 pages
CS 461 - Fall 2021 - Neural Networks - Machine Learning
No ratings yet
CS 461 - Fall 2021 - Neural Networks - Machine Learning
5 pages
P95 Course Slides
No ratings yet
P95 Course Slides
86 pages
Project Report
No ratings yet
Project Report
20 pages
cs188 sp24 Note22
No ratings yet
cs188 sp24 Note22
8 pages
Deep Learning
No ratings yet
Deep Learning
21 pages
Rec Ex 11
No ratings yet
Rec Ex 11
13 pages
Line Drawing Algorithm: Mastering Techniques for Precision Image Rendering
From Everand
Line Drawing Algorithm: Mastering Techniques for Precision Image Rendering
Fouad Sabry
No ratings yet
Introduction to Algorithms
From Everand
Introduction to Algorithms
S VASIST
No ratings yet
Ember Electricity Data Methodology
No ratings yet
Ember Electricity Data Methodology
93 pages
Cpunit Iv
No ratings yet
Cpunit Iv
41 pages
Cpunit Iii
No ratings yet
Cpunit Iii
57 pages
ACD-UNIT-4 Notes
No ratings yet
ACD-UNIT-4 Notes
32 pages
DLL - English 6 - Q2 - W4
No ratings yet
DLL - English 6 - Q2 - W4
10 pages
Block Planning Template For Non-Xseed Teachers
No ratings yet
Block Planning Template For Non-Xseed Teachers
6 pages
Article Humanistic
No ratings yet
Article Humanistic
5 pages
The European Profiling Grid: 2011-1-FR1-LEO05-24446
No ratings yet
The European Profiling Grid: 2011-1-FR1-LEO05-24446
35 pages
Business Skills
No ratings yet
Business Skills
16 pages
Assignment 2nd Semester 2022-2024
No ratings yet
Assignment 2nd Semester 2022-2024
3 pages
Midterm Activity 1-FortunaCarol
No ratings yet
Midterm Activity 1-FortunaCarol
2 pages
Cookery 10 LESSON PLAN
No ratings yet
Cookery 10 LESSON PLAN
3 pages
Code No.:OO Paper - I Subject: General Paper On Teaching Research Aptitude Syllabus
No ratings yet
Code No.:OO Paper - I Subject: General Paper On Teaching Research Aptitude Syllabus
7 pages
Kelsie Whitehall Adjectives of Quality Lesson Plan
No ratings yet
Kelsie Whitehall Adjectives of Quality Lesson Plan
3 pages
BT4395 RR Final
No ratings yet
BT4395 RR Final
32 pages
Article 2 Peterson
No ratings yet
Article 2 Peterson
13 pages
Workshop On Fundamentals of Adr
No ratings yet
Workshop On Fundamentals of Adr
2 pages
Master Teachers Tabs
No ratings yet
Master Teachers Tabs
2 pages
CelebAI Hiring Notice - Campus
No ratings yet
CelebAI Hiring Notice - Campus
3 pages
The Impact of Multilingualism and Learning Patterns On Student Achievement in English and Other Subjects in Higher Education
No ratings yet
The Impact of Multilingualism and Learning Patterns On Student Achievement in English and Other Subjects in Higher Education
21 pages
Fluency Assignment
No ratings yet
Fluency Assignment
17 pages
Year 8 English Unit Plan Nips Xi
50% (2)
Year 8 English Unit Plan Nips Xi
3 pages
Taylor Spratt Cosmo The Cat
No ratings yet
Taylor Spratt Cosmo The Cat
4 pages
Your Guide To Developing Thinking Skills in Science 1726418158
No ratings yet
Your Guide To Developing Thinking Skills in Science 1726418158
19 pages
Lesson Plans 33 and 34: MODULE 2: English All Around 2.1 English Rocks CONTENTS: The Importance of The English Language
No ratings yet
Lesson Plans 33 and 34: MODULE 2: English All Around 2.1 English Rocks CONTENTS: The Importance of The English Language
5 pages
Internship Report
No ratings yet
Internship Report
26 pages
Unit Planner-EY2 - WWA
No ratings yet
Unit Planner-EY2 - WWA
4 pages
My Field Study 6 Portfolio
No ratings yet
My Field Study 6 Portfolio
76 pages
Math Fact Bingo
No ratings yet
Math Fact Bingo
17 pages
Inclusive Education Management Through Sen (Special Education Needs) Program: A Case Study in Buin Batu Primary School West Sumbawa Indonesia
No ratings yet
Inclusive Education Management Through Sen (Special Education Needs) Program: A Case Study in Buin Batu Primary School West Sumbawa Indonesia
9 pages

Types of Neural Networks

Uploaded by

Types of Neural Networks

Uploaded by

DESIGN OF OUTPUT LAYER:

[0,1] Interval with Sum 1:

o = tf.random.normal([2,10]) # Network output

Cross-Entropy Error Function:

Calculating the cross-entropy error function in a neural network involves

Binary Cross-Entropy (Binary Log Loss):

Categorical Cross-Entropy (Multi-Class Log Loss):

Here's an example of how to set the loss function in Keras:

from tensorflow.keras.models import Sequential

Feedforward Neural Network (FNN):

 Also known as Multi-Layer Perceptrons (MLP).

Convolutional Neural Network (CNN):

 Specifically designed for processing grid-like data, such as images.

Recurrent Neural Network (RNN):

 Designed for sequential data and time-series analysis.

Long Short-Term Memory (LSTM):

 A type of RNN with improved ability to capture long-term dependencies.

Gated Recurrent Unit (GRU):

 Unsupervised learning neural network used for dimensionality reduction

 An extension of autoencoders with probabilistic properties.

Generative Adversarial Network (GAN):

Radial Basis Function Network (RBFN):

Self-Organizing Maps (SOM):

 Used for clustering and dimensionality reduction.

Residual Neural Network (ResNet):

 Addresses the vanishing gradient problem by using skip connections.

 Designed for tasks involving similarity or dissimilarity comparisons.

 Introduced in the field of natural language processing (NLP).

Graph Convolutional Neural Network (Graph CNN or GCN):

 Designed for processing graph-structured data.

You might also like