

Implementing a Neural Network from Scratch

Neural networks are powerful algorithms for classification.
Dataset: Iris dataset
Link: http://scikit-learn.org/stable/auto_examples/datasets/plot_iris_dataset.html

Import required libraries

In [ ]: from sklearn import datasets #for dataset
import numpy as np #for maths
import matplotlib.pyplot as plt #for plotting

Get Dataset

In [ ]: iris = datasets.load_iris() #load the dataset

data = iris.data #get features
target = iris.target #get labels

shape = data.shape #shape of data

#convert into numpy arrays
data = np.array(data).reshape(shape[0],shape[1])
target = np.array(target).reshape(shape[0],1)

#print shape
print("Data Shape = {}".format(data.shape))
print("Target Shape = {}".format(target.shape))
print('Classes : {}'.format(np.unique(target)))
print('Sample data : {} , Target = {}'.format(data[70],target[70]))
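
Since Iris has 150 samples with 4 features each and 3 classes, this should print Data Shape = (150, 4), Target Shape = (150, 1), and Classes : [0 1 2].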

Define Parameters and Hyperparameters


A neural network with one hidden layer:

Input Units = 4
Hidden Units = 8
Output Units = 3
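
Each Iris sample has 4 features (sepal length/width and petal length/width), hence 4 input units; the 3 output units correspond to the 3 species.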


In [ ]: #HYPERPARAMETERS

#num of target labels
num_classes = len(np.unique(target))

#define layer neurons
input_units = 4 #neurons in input layer
hidden_units = 8 #neurons in hidden layer
output_units = 3 #neurons in output layer

#define hyper-parameters
learning_rate = 0.03

#L2 regularization strength (used in the weight update and the loss)
beta = 0.00001

#num of iterations
iters = 4001

Dimensions of Parameters
Shape of layer1_weights (Wxh) = (4,8)
Shape of layer1_biases (Bh) = (8,1)
Shape of layer2_weights (Why) = (8,3)
Shape of layer2_biases (By) = (3,1)

In [ ]: #PARAMETERS

#initialize parameters i.e. weights
def initialize_parameters():
    #initial weights are drawn with zero mean and 0.03 standard deviation
    mean = 0 #mean of parameters
    std = 0.03 #standard deviation

    layer1_weights = np.random.normal(mean,std,(input_units,hidden_units))
    layer1_biases = np.ones((hidden_units,1))
    layer2_weights = np.random.normal(mean,std,(hidden_units,output_units))
    layer2_biases = np.ones((output_units,1))

    parameters = dict()
    parameters['layer1_weights'] = layer1_weights
    parameters['layer1_biases'] = layer1_biases
    parameters['layer2_weights'] = layer2_weights
    parameters['layer2_biases'] = layer2_biases

    return parameters
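
A quick sanity check (not part of the original notebook) confirms the parameter shapes listed above:

In [ ]: #print the shape of every parameter returned by initialize_parameters
params = initialize_parameters()
for name, value in params.items():
    print(name, value.shape)
#expected: layer1_weights (4, 8), layer1_biases (8, 1),
#          layer2_weights (8, 3), layer2_biases (3, 1)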


Activation Function
Sigmoid

[Plot: the sigmoid curve 1/(1+e^(-x)) over x in [-6, 6], rising from 0 through 0.5 at x = 0 towards 1]

In [ ]: #activation function
def sigmoid(X):
    return 1/(1+np.exp((-1)*X))

#softmax function for output
def softmax(X):
    exp_X = np.exp(X)
    exp_X_sum = np.sum(exp_X,axis=1).reshape(-1,1) #row-wise sum
    exp_X = (exp_X/exp_X_sum)
    return exp_X
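
One caveat: np.exp can overflow for large logits. A mathematically equivalent, numerically stable variant (an assumption here, not part of the original notebook) subtracts the row-wise maximum before exponentiating:

In [ ]: #numerically stable softmax sketch: shifting by the row max leaves the
#result unchanged because the shift cancels in the ratio
def softmax_stable(X):
    shifted = X - np.max(X,axis=1).reshape(-1,1)
    exp_X = np.exp(shifted)
    return exp_X/np.sum(exp_X,axis=1).reshape(-1,1)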


Define Utility Functions

1. Forward Propagation

---- logits1 = matmul(X,Wxh) + Bh
---- A = sigmoid(logits1)
---- logits2 = matmul(A,Why) + By
---- output = softmax(logits2)

Store output and A in a cache for use in backward propagation.

2. Backward Propagation

---- error_output = output - train_labels
---- error_activation = matmul(error_output,Why.T) * A * (1-A)
---- dWhy = matmul(A.T,error_output)/m
---- dWxh = matmul(train_dataset.T,error_activation)/m

where m = len(train_dataset). Store the derivatives in a derivatives dict.

3. Update Parameters

---- Wxh = Wxh - learning_rate*(dWxh + beta*Wxh)
---- Why = Why - learning_rate*(dWhy + beta*Why)

4. Calculate Loss and Accuracy

---- Loss = (-sum(Y*log(predictions) + (1-Y)*log(1-predictions)) + beta*(sum(Wxh^2) + sum(Why^2)))/m
---- Accuracy = sum(argmax(Y) == argmax(predictions))/m
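
To trace the shapes with m = 150 training examples: X is (150,4), logits1 and A are (150,8), logits2 and output are (150,3); error_output is (150,3), error_activation is (150,8), and dWhy and dWxh match the weight shapes (8,3) and (4,8).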

In [ ]: #forward propagation
def forward_propagation(train_dataset,parameters):
    cache = dict() #to store the intermediate values for backward propagation
    m = len(train_dataset) #number of training examples

    #get the parameters
    layer1_weights = parameters['layer1_weights']
    layer1_biases = parameters['layer1_biases']
    layer2_weights = parameters['layer2_weights']
    layer2_biases = parameters['layer2_biases']

    #forward prop (the biases are stored as column vectors, so transpose
    #them to broadcast across the m rows)
    logits = np.matmul(train_dataset,layer1_weights) + layer1_biases.T
    activation1 = np.array(sigmoid(logits)).reshape(m,hidden_units)
    activation2 = np.array(np.matmul(activation1,layer2_weights) + layer2_biases.T).reshape(m,output_units)
    output = np.array(softmax(activation2)).reshape(m,num_classes)

    #fill in the cache
    cache['output'] = output
    cache['activation1'] = activation1

    return cache,output

#backward propagation
def backward_propagation(train_dataset,train_labels,parameters,cache):
    derivatives = dict() #to store the derivatives

    #get stuff from cache
    output = cache['output']
    activation1 = cache['activation1']

    #get parameters
    layer1_weights = parameters['layer1_weights']
    layer2_weights = parameters['layer2_weights']

    #calculate errors
    error_output = output - train_labels
    error_activation1 = np.matmul(error_output,layer2_weights.T)
    error_activation1 = np.multiply(error_activation1,activation1)
    error_activation1 = np.multiply(error_activation1,1-activation1)

    #calculate partial derivatives
    partial_derivatives2 = np.matmul(activation1.T,error_output)/len(train_dataset)
    partial_derivatives1 = np.matmul(train_dataset.T,error_activation1)/len(train_dataset)

    #store the derivatives
    derivatives['partial_derivatives1'] = partial_derivatives1
    derivatives['partial_derivatives2'] = partial_derivatives2

    return derivatives

#update the parameters
def update_parameters(derivatives,parameters):
    #get the parameters
    layer1_weights = parameters['layer1_weights']
    layer2_weights = parameters['layer2_weights']

    #get the derivatives
    partial_derivatives1 = derivatives['partial_derivatives1']
    partial_derivatives2 = derivatives['partial_derivatives2']

    #gradient descent step with L2 weight decay
    layer1_weights -= (learning_rate*(partial_derivatives1 + beta*layer1_weights))
    layer2_weights -= (learning_rate*(partial_derivatives2 + beta*layer2_weights))

    #update the dict
    parameters['layer1_weights'] = layer1_weights
    parameters['layer2_weights'] = layer2_weights

    return parameters

#calculate the loss and accuracy
def cal_loss_accuracy(train_labels,predictions,parameters):
    m = len(train_labels) #number of training examples

    #get the parameters (needed for the regularization term)
    layer1_weights = parameters['layer1_weights']
    layer2_weights = parameters['layer2_weights']

    #cross-entropy loss with L2 regularization
    loss = -1*np.sum(np.multiply(np.log(predictions),train_labels) + np.multiply(np.log(1-predictions),1-train_labels))
    loss += beta*(np.sum(np.square(layer1_weights)) + np.sum(np.square(layer2_weights)))
    loss /= m

    #fraction of examples whose highest-probability class matches the label
    accuracy = np.sum(np.argmax(train_labels,axis=1)==np.argmax(predictions,axis=1))
    accuracy /= m

    return loss,accuracy
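
As an optional sanity check (not in the original notebook), the analytic gradient from backward_propagation can be compared against a finite-difference estimate. Note that error_output = output - train_labels is the gradient of the plain cross-entropy -sum(Y*log(output))/m, so that is the loss used for the comparison; the helper numerical_grad_check below is hypothetical:

In [ ]: #finite-difference gradient check for one entry of layer2_weights
def numerical_grad_check(X,Y,parameters,i=0,j=0,eps=1e-5):
    def data_loss(params):
        _,out = forward_propagation(X,params)
        return -np.sum(np.multiply(Y,np.log(out)))/len(Y)

    cache,_ = forward_propagation(X,parameters)
    analytic = backward_propagation(X,Y,parameters,cache)['partial_derivatives2'][i,j]

    parameters['layer2_weights'][i,j] += eps
    loss_plus = data_loss(parameters)
    parameters['layer2_weights'][i,j] -= 2*eps
    loss_minus = data_loss(parameters)
    parameters['layer2_weights'][i,j] += eps #restore the original weight

    numeric = (loss_plus-loss_minus)/(2*eps)
    print('analytic = %f , numeric = %f'%(analytic,numeric))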

Train Function
1. Initialize Parameters
2. Forward Propagation
3. Backward Propagation
4. Calculate Loss and Accuracy
5. Update the parameters

Repeat steps 2-5 for the given number of iterations.


In [ ]: #Implementation of 3 layer Neural Network

#training function
def train(train_dataset,train_labels,iters=2):
    #to store loss after every iteration
    J = []

    #WEIGHTS (globals so the trained arrays remain accessible after training)
    global layer1_weights
    global layer1_biases
    global layer2_weights
    global layer2_biases

    #initialize the parameters
    parameters = initialize_parameters()

    layer1_weights = parameters['layer1_weights']
    layer1_biases = parameters['layer1_biases']
    layer2_weights = parameters['layer2_weights']
    layer2_biases = parameters['layer2_biases']

    #to store final predictions after training
    final_output = []

    for j in range(iters):
        #forward propagation
        cache,output = forward_propagation(train_dataset,parameters)

        #backward propagation
        derivatives = backward_propagation(train_dataset,train_labels,parameters,cache)

        #calculate the loss and accuracy
        loss,accuracy = cal_loss_accuracy(train_labels,output,parameters)

        #update the parameters
        parameters = update_parameters(derivatives,parameters)

        #append loss
        J.append(loss)

        #update final output
        final_output = output

        #print accuracy and loss every 500 steps
        if(j%500==0):
            print("Step %d"%j)
            print("Loss %f"%loss)
            print("Accuracy %f%%"%(accuracy*100))

    return J,final_output


In [ ]: #shuffle the dataset
z = list(zip(data,target))
np.random.shuffle(z)
data,target = zip(*z)

#make train_dataset and train_labels
train_dataset = np.array(data).reshape(-1,4)
train_labels = np.zeros([train_dataset.shape[0],num_classes])

#one-hot encoding
for i,label in enumerate(target):
    train_labels[i,label] = 1

#normalization: give each feature column zero mean and unit standard deviation
for i in range(input_units):
    mean = train_dataset[:,i].mean()
    std = train_dataset[:,i].std()
    train_dataset[:,i] = (train_dataset[:,i]-mean)/std
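
For example, a sample with target label 2 gets the one-hot row [0., 0., 1.]; after normalization, each of the four feature columns has zero mean and unit standard deviation.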

In [ ]: #train data
J,final_output = train(train_dataset,train_labels,iters=4001)

Reached an Accuracy of 97%

Plot the loss vs iteration graph

In [ ]: #plot loss graph (the first iteration's loss is skipped)
plt.plot(list(range(1,len(J))),J[1:])
plt.xlabel('Iterations')
plt.ylabel('Loss')
plt.title('Iterations vs Loss')
plt.show()
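
After training, the parameters can be reused for prediction. A minimal sketch (not in the original notebook), assuming the globals set inside train() still hold the trained arrays:

In [ ]: #hypothetical prediction sketch: rebuild the parameter dict from the
#globals updated during training, then run one forward pass
trained = {'layer1_weights':layer1_weights, 'layer1_biases':layer1_biases,
           'layer2_weights':layer2_weights, 'layer2_biases':layer2_biases}
sample = train_dataset[:1] #one already-normalized sample
_,probs = forward_propagation(sample,trained)
print('Predicted class:',np.argmax(probs,axis=1),'True class:',np.argmax(train_labels[:1],axis=1))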
