
Implementing the activation function layer

By,
Dr. TINTU GEORGE
Assistant Professor,
Department of Computer Science with Data Analytics
Sri Ramakrishna College of Arts & Science
Implementing the activation function layer
•The notion of a computational graph is applied to a neural network.
• A computational graph is a graphical representation of mathematical
expressions involving variables and operations.

• In the context of neural networks, it represents how data flows through the
network, including operations like matrix multiplications, activations, and
backpropagation.

•Here, the layers that form a neural network are implemented as classes; this section covers the ReLU and sigmoid activation layers, each with forward and backward propagation.
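Each layer in what follows can be thought of as an object with a forward method and a backward method. The sketch below only illustrates this common interface; the class and method names are assumptions for illustration, not taken from the slides.

class Layer:
    # Illustrative common interface assumed for the layers in this section.

    def forward(self, x):
        # Compute and return the layer's output during forward propagation.
        raise NotImplementedError

    def backward(self, dout):
        # Receive the upstream gradient dout and return the gradient with
        # respect to this layer's input during backpropagation.
        raise NotImplementedError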
ReLU function

• The activation function of a rectified linear unit (ReLU) is expressed by the following equation:

        ReLU(x) = max(0, x)

• This function sets all negative values in the input to zero.
• For any input value, if it is negative, the output is 0; if it is positive, the value passes through unchanged.
• Let us now look at how the ReLU activation function behaves in the context of a computational graph during both forward and backward propagation in a neural network.

1. Forward Propagation with ReLU:
• During forward propagation in a neural network, the input x passes through the ReLU activation function.
• If x > 0, the output ReLU(x) is equal to x itself.
• If x ≤ 0, the output ReLU(x) is 0.
2. Computational Graph Representation:
• A computational graph visually represents the flow of data and computations
in a neural network.
• Nodes in the graph represent variables or operations, and edges represent
dependencies or flows between them.

3. Backward Propagation (Backpropagation) with ReLU:


• During backpropagation, gradients (derivatives of the loss function with
respect to the network's parameters) are computed and propagated backward
through the network to adjust the parameters (weights).
• As the ReLU definition above shows, if x was greater than 0 during forward propagation, backpropagation passes the upstream gradient downstream without any alteration.

• However, if x was equal to or less than 0, the signal stops there during backward propagation (the gradient becomes 0). This behaviour can be drawn in a computational graph, and implemented as a layer class, as in the sketch below.
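A minimal sketch of a ReLU layer implementing this forward/backward behaviour (assuming NumPy; the class name Relu and the mask attribute are illustrative, following the forward/backward interface described earlier):

import numpy as np

class Relu:
    def __init__(self):
        self.mask = None  # remembers where the input was <= 0 in the forward pass

    def forward(self, x):
        self.mask = (x <= 0)
        out = x.copy()
        out[self.mask] = 0          # negative (and zero) inputs become 0
        return out

    def backward(self, dout):
        dout = dout.copy()
        dout[self.mask] = 0         # the signal stops where x <= 0; elsewhere it passes unchanged
        return dout

# Example usage
x = np.array([[1.0, -0.5], [-2.0, 3.0]])
relu = Relu()
print(relu.forward(x))                 # [[1. 0.] [0. 3.]]
print(relu.backward(np.ones_like(x)))  # [[1. 0.] [0. 1.]]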
Sigmoid Activation
• The sigmoid function maps input values to a range between 0 and 1.
• It is defined as:

        sigmoid(x) = 1 / (1 + exp(-x))

• Here, the exponential function ensures that large positive values result in outputs near 1 and large negative values result in outputs near 0.

• The computational graph for the sigmoid function in forward propagation (shown as a diagram in the original slides) can be read step by step as follows:
1. Input x:
The input to the sigmoid function is x.
2. Multiplication by −1:
x is multiplied by −1 to give −x.
3. Exponentiation:
The value −x is passed through an exponentiation node to compute exp(−x).
4. Addition:
The value 1 is added to exp(−x) to give 1 + exp(−x).
5. Division:
Finally, 1 is divided by 1 + exp(−x), producing the sigmoid output y = 1 / (1 + exp(−x)).
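Putting these five steps together, a minimal sketch of a sigmoid layer (assuming NumPy; the class name Sigmoid and the cached out attribute are illustrative). The backward pass uses the standard identity that for y = sigmoid(x), dy/dx = y(1 − y):

import numpy as np

class Sigmoid:
    def __init__(self):
        self.out = None   # cache the forward output for use in the backward pass

    def forward(self, x):
        self.out = 1.0 / (1.0 + np.exp(-x))
        return self.out

    def backward(self, dout):
        return dout * self.out * (1.0 - self.out)   # dy/dx = y * (1 - y)

# Example usage
x = np.array([-1.0, 0.0, 2.0])
sig = Sigmoid()
y = sig.forward(x)                  # values between 0 and 1
dx = sig.backward(np.ones_like(x))  # gradient of the sigmoid at each input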
Implementing the affine and softmax layers
Affine layer
• An affine layer, also known as a fully connected layer or a dense layer, is a fundamental building block of neural networks.

• It's a type of layer where each input is connected to each output by a learnable
weight.

• Affine layers are commonly used in both traditional neural networks and deep
learning models to transform input features into outputs that the network can use
for prediction or classification tasks.
• Mathematically, the output of an affine layer can be described by the equation:

• Y=np.dot(X,W) + B

where:

• X is the input matrix.

• W is the weight matrix.

• B is the bias vector.

• It consists of a linear transformation (the matrix multiplication) and a translation (the bias addition), which is why it is called an affine layer.
import numpy as np

X = np.random.rand(2)     # input values
W = np.random.rand(2, 3)  # weights
B = np.random.rand(3)     # biases

X.shape  # (2,)
W.shape  # (2, 3)
B.shape  # (3,)

Y = np.dot(X, W) + B      # affine output; Y.shape is (3,)
Program for batch affine layer

• The biases are added to each piece of data (each row of the batch) during forward propagation.

• Therefore, when backpropagation takes place, the gradients of every piece of data must be summed into the elements of the bias gradient.

• In this example, the gradient dY coming from the loss function covers two data items (N = 2), and the output Y is a 2x3 array.

• The np.sum function sums the elements of dY along the specified axis (axis=0, i.e. over the data items) to obtain the bias gradient, as sketched below.
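A minimal sketch of a batch-capable Affine layer along the lines described above (assuming NumPy; the class name Affine and the attribute names are illustrative assumptions):

import numpy as np

class Affine:
    def __init__(self, W, B):
        self.W = W
        self.B = B
        self.x = None
        self.dW = None
        self.dB = None

    def forward(self, x):
        self.x = x
        return np.dot(x, self.W) + self.B   # B is broadcast to every row (data item)

    def backward(self, dY):
        dx = np.dot(dY, self.W.T)           # gradient with respect to the input
        self.dW = np.dot(self.x.T, dY)      # gradient with respect to the weights
        self.dB = np.sum(dY, axis=0)        # sum over the N data items for the bias gradient
        return dx

# Batch example with N = 2 data items
X = np.random.rand(2, 2)                    # two inputs, each with 2 features
layer = Affine(np.random.rand(2, 3), np.random.rand(3))
Y = layer.forward(X)                        # Y is a 2x3 array
dY = np.ones_like(Y)                        # stand-in for the gradient from the loss
dX = layer.backward(dY)                     # dX has the same shape as X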
Softmax Layer
• A softmax layer is often used in neural networks, particularly in the final
layer of a classification model. It converts the logits (raw prediction values)
of a neural network into probabilities.
1. Input Layer: The network starts with an input layer, represented by the number 2 in the diagram. This could mean the second input instance, or it could signify that the input data has 2 dimensions/features.

2. Affine (Fully Connected) Layers: These layers perform linear transformations on the input data. An affine transformation involves multiplying the input by a weight matrix and then adding a bias vector. Mathematically, it is represented as Y = X·W + B.

3. ReLU Activation Functions: After each affine layer, a ReLU (Rectified Linear Unit) activation function is applied.

4. Multiple Affine and ReLU Layers: The network contains multiple alternating affine and ReLU layers.

5. Final Affine Layer: The last affine layer produces the raw output scores for each class. These are often called logits.

6. Scores: The scores are the raw outputs from the final affine layer. Each score corresponds to a particular class.
• Softmax Layer: The softmax function is applied to the scores to convert them into probabilities.

• Probabilities: The output of the softmax function is a probability distribution over the classes. Each
value represents the probability of the input belonging to a particular class. For instance, in the
diagram, the probabilities are 0.008,0.00005,0.991,0.00004,…

• Each probability is calculated as exp(score) / sum(exp(scores)), where the sum runs over all class scores (i.e. along the class axis); see the sketch below.
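A quick NumPy illustration of this calculation on a vector of scores (a sketch; subtracting the maximum score before exponentiating is a common numerical-stability trick, not something stated on the slides):

import numpy as np

def softmax(scores):
    shifted = scores - np.max(scores, axis=-1, keepdims=True)  # avoid overflow in exp()
    exp_scores = np.exp(shifted)
    return exp_scores / np.sum(exp_scores, axis=-1, keepdims=True)

scores = np.array([2.0, -3.0, 6.5, -3.5])   # raw scores (logits) for one input
probs = softmax(scores)
print(probs)        # approximately [0.011, 0.00007, 0.989, 0.00004]
print(probs.sum())  # the probabilities sum to 1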

Summary of the Process


• Input: The network receives an input.

• Affine Transformation + ReLU Activation: The input is transformed through multiple affine layers
with ReLU activations in between.

• Final Affine Layer: The last affine layer produces scores for each class.

• Softmax Transformation: The scores are converted into probabilities using the softmax function.
Program to implement affine and softmax layers
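The program itself is not included in this extract; a minimal end-to-end sketch along the lines described above might look as follows (assuming NumPy; the layer sizes and variable names are illustrative assumptions):

import numpy as np

def softmax(scores):
    shifted = scores - np.max(scores, axis=-1, keepdims=True)
    exp_scores = np.exp(shifted)
    return exp_scores / np.sum(exp_scores, axis=-1, keepdims=True)

# Batch of N = 2 inputs with 2 features each (sizes chosen only for illustration).
X = np.random.rand(2, 2)

# Parameters of two affine layers: 2 -> 4 hidden units, then 4 -> 3 classes.
W1, B1 = np.random.rand(2, 4), np.random.rand(4)
W2, B2 = np.random.rand(4, 3), np.random.rand(3)

# Forward pass: Affine -> ReLU -> Affine -> Softmax.
hidden = np.dot(X, W1) + B1        # first affine layer
hidden = np.maximum(0, hidden)     # ReLU activation
scores = np.dot(hidden, W2) + B2   # final affine layer produces the raw class scores (logits)
probs = softmax(scores)            # softmax converts the scores into probabilities

print(probs)               # a 2x3 array; each row is a probability distribution over 3 classes
print(probs.sum(axis=1))   # each row sums to 1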
