How To Build Your Own Neural Network From Scratch in Python
This article contains what I’ve learned, and hopefully it’ll be useful for you as well!
Most introductory texts to Neural Networks bring up brain analogies when describing
them. Without delving into brain analogies, I find it easier to simply describe a Neural
Network as a mathematical function that maps a given input to a desired output.
A Neural Network consists of the following components:
- An input layer, x
- An arbitrary number of hidden layers
- An output layer, ŷ
- A set of weights and biases between each layer, W and b
- A choice of activation function for each hidden layer, σ. In this tutorial, we'll use a Sigmoid activation function (sketched just below this list).
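The Sigmoid function and its derivative are used by all of the code in this article, but their definitions never appear in the excerpts below. Here is a minimal sketch of how they might be written; note that sigmoid_derivative takes a value that is already a Sigmoid output, which is how the backpropagation code later calls it:

    import numpy as np

    def sigmoid(x):
        # squash any real number into the range (0, 1)
        return 1.0 / (1.0 + np.exp(-x))

    def sigmoid_derivative(x):
        # derivative of the Sigmoid, where x is already a Sigmoid output:
        # sigma'(z) = sigma(z) * (1 - sigma(z))
        return x * (1.0 - x)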
The diagram below shows the architecture of a 2-layer Neural Network (note that the
input layer is typically excluded when counting the number of layers in a Neural
Network).
class NeuralNetwork:
    def __init__(self, x, y):
        self.input = x
        # weights between the input layer and a hidden layer of 4 neurons
        self.weights1 = np.random.rand(self.input.shape[1], 4)
        # weights between the hidden layer and the single output neuron
        self.weights2 = np.random.rand(4, 1)
        self.y = y
        self.output = np.zeros(y.shape)
You might notice that the weights W and the biases b are the only variables that affect
the output ŷ.
Naturally, the right values for the weights and biases determine the strength of the
predictions. The process of fine-tuning the weights and biases from the input data is
known as training the Neural Network.
Feedforward
As we've seen in the sequential graph above, feedforward is just a chain of matrix
multiplications and activation functions. For a basic 2-layer neural network, the output
of the Neural Network is:

ŷ = σ(W₂ σ(W₁x + b₁) + b₂)

Let's add a feedforward function in our Python code to do exactly that. Note that for
simplicity, we have assumed the biases to be 0.
class NeuralNetwork:
    def __init__(self, x, y):
        self.input = x
        self.weights1 = np.random.rand(self.input.shape[1], 4)
        self.weights2 = np.random.rand(4, 1)
        self.y = y
        self.output = np.zeros(self.y.shape)

    def feedforward(self):
        self.layer1 = sigmoid(np.dot(self.input, self.weights1))
        self.output = sigmoid(np.dot(self.layer1, self.weights2))
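As a quick sanity check, here's how the class might be exercised on random data. The shapes below are illustrative assumptions, not the article's actual dataset:

    # illustrative only: 4 samples, 3 features each
    X = np.random.rand(4, 3)
    y = np.random.rand(4, 1)

    nn = NeuralNetwork(X, y)
    nn.feedforward()
    print(nn.output.shape)  # (4, 1): one prediction per sample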
However, we still need a way to evaluate the "goodness" of our predictions (i.e., how far
off our predictions are). The Loss Function allows us to do exactly that.
Loss Function
There are many available loss functions, and the nature of our problem should dictate
our choice of loss function. In this tutorial, we'll use a simple sum-of-squares error as
our loss function.
That is, the sum-of-squares error is simply the sum of the squared differences between
each predicted value and the actual value. The difference is squared so that the error is
always positive, regardless of its sign.
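In symbols, that is Loss(y, ŷ) = Σᵢ (yᵢ − ŷᵢ)². As a sketch, it could be computed like this; this helper is not part of the article's class and is shown only for illustration:

    def sum_of_squares_loss(y, y_hat):
        # sum of squared differences between actual and predicted values
        return np.sum((y - y_hat) ** 2)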
Our goal in training is to find the best set of weights and biases that minimizes the
loss function.
Backpropagation
Now that we’ve measured the error of our prediction (loss), we need to find a way to
propagate the error back, and to update our weights and biases.
In order to know the appropriate amount to adjust the weights and biases by, we need to
know the derivative of the loss function with respect to the weights and biases.
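Once we know that derivative, we can nudge each weight and bias a small step in the direction that decreases the loss. This is the gradient descent update; in LaTeX, for a learning rate η:

    W \leftarrow W - \eta \, \frac{\partial \text{Loss}}{\partial W},
    \qquad
    b \leftarrow b - \eta \, \frac{\partial \text{Loss}}{\partial b}

(The code later in this article effectively uses η = 1 and folds the minus sign into the quantity it computes, so it simply adds d_weights to the weights.)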
Recall from calculus that the derivative of a function is simply the slope of the function.
However, we can't directly calculate the derivative of the loss function with respect to
the weights and biases, because the loss function is written in terms of the prediction ŷ
rather than the weights and biases themselves. Therefore, we need the chain rule to help
us calculate it.
Chain rule for calculating derivative of the loss function with respect to the weights.
Note that for simplicity, we have only displayed the partial derivative assuming a 1-
layer Neural Network.
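Written out for that simplified 1-layer case, with z = Wx + b and ŷ = σ(z), the chain rule expands to:

    \frac{\partial \text{Loss}(y, \hat{y})}{\partial W}
      = \frac{\partial \text{Loss}}{\partial \hat{y}}
        \cdot \frac{\partial \hat{y}}{\partial z}
        \cdot \frac{\partial z}{\partial W}
      = -2(y - \hat{y}) \cdot \sigma'(z) \cdot x

The leading minus sign is why the code below can compute 2(y − ŷ) · σ′ · x as d_weights and add it to the weights: adding the negative of the gradient is the same descent step.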
Phew! That was ugly but it allows us to get what we needed — the derivative (slope) of
the loss function with respect to the weights, so that we can adjust the weights
accordingly.
Now that we have that, let’s add the backpropagation function into our python code.
class NeuralNetwork:
    def __init__(self, x, y):
        self.input = x
        self.weights1 = np.random.rand(self.input.shape[1], 4)
        self.weights2 = np.random.rand(4, 1)
        self.y = y
        self.output = np.zeros(self.y.shape)

    def feedforward(self):
        self.layer1 = sigmoid(np.dot(self.input, self.weights1))
        self.output = sigmoid(np.dot(self.layer1, self.weights2))

    def backprop(self):
        # application of the chain rule to find the derivative of the
        # loss function with respect to weights2 and weights1
        d_weights2 = np.dot(self.layer1.T,
                            (2 * (self.y - self.output) *
                             sigmoid_derivative(self.output)))
        d_weights1 = np.dot(self.input.T,
                            (np.dot(2 * (self.y - self.output) *
                                    sigmoid_derivative(self.output),
                                    self.weights2.T) *
                             sigmoid_derivative(self.layer1)))

        # update the weights with the derivative (slope) of the loss function
        self.weights1 += d_weights1
        self.weights2 += d_weights2
For a deeper understanding of the application of calculus and the chain rule in
backpropagation, I strongly recommend this tutorial by 3Blue1Brown.
Now that we have our complete Python code for doing feedforward and
backpropagation, let's apply our Neural Network to an example and see how well it
does.
Our Neural Network should learn the ideal set of weights to represent this function (a
stand-in for the training data appears just below). Note that it isn't exactly trivial to
work out the weights by inspection alone.
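The article's original training table isn't reproduced here, so the dataset below is an illustrative stand-in: four samples of three binary features each, with a single binary label. The training loop mirrors the 1500-iteration run described next:

    # illustrative stand-in for the article's training data
    X = np.array([[0, 0, 1],
                  [0, 1, 1],
                  [1, 0, 1],
                  [1, 1, 1]])
    y = np.array([[0], [1], [1], [0]])

    nn = NeuralNetwork(X, y)

    # each iteration is one feedforward pass followed by one backprop update
    for i in range(1500):
        nn.feedforward()
        nn.backprop()

    print(nn.output)  # predictions should be close to y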
Let’s train the Neural Network for 1500 iterations and see what happens. Looking at the
loss per iteration graph below, we can clearly see the loss monotonically decreasing
towards a minimum. This is consistent with the gradient descent algorithm that we’ve
discussed earlier.
Let’s look at the final prediction (output) from the Neural Network after 1500 iterations.
Predictions after 1500 training iterations
We did it! Our feedforward and backpropagation algorithm trained the Neural Network
successfully and the predictions converged on the true values.
Note that there’s a slight difference between the predictions and the actual values. This
is desirable, as it prevents overfitting and allows the Neural Network to generalize
better to unseen data.
What’s Next?
Fortunately for us, our journey isn’t over. There’s still much to learn about Neural
Networks and Deep Learning. For example:
- What other activation functions can we use besides the Sigmoid function? (a sketch follows this list)
- Using a learning rate when training the Neural Network
- Using convolutions for image classification tasks
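As a taste of the first two items, here's a sketch of one alternative activation (ReLU) and of scaling the weight update by a learning rate. Neither is part of the network built above, so treat this purely as illustration:

    def relu(x):
        # rectified linear unit: max(0, x), a common alternative to Sigmoid
        return np.maximum(0, x)

    def relu_derivative(x):
        # gradient is 1 where the input was positive, 0 elsewhere
        return (x > 0).astype(float)

    # a learning-rate-scaled update (the code above uses an implicit
    # learning rate of 1):
    #     self.weights1 += learning_rate * d_weights1
    #     self.weights2 += learning_rate * d_weights2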
I'll be writing more on these topics soon, so do follow me on Medium and keep an eye
out for them!
Final Thoughts
I’ve certainly learnt a lot writing my own Neural Network from scratch.
Although Deep Learning libraries such as TensorFlow and Keras make it easy to build
deep nets without fully understanding the inner workings of a Neural Network, I find
that it's beneficial for aspiring data scientists to gain a deeper understanding of Neural
Networks.
This exercise has been a great investment of my time, and I hope that it’ll be useful for
you as well!