0% found this document useful (0 votes)

10 views14 pages

Lab Report 03

The document details the design and implementation of a multi-layer neural network algorithm using the MNIST dataset for handwritten digit recognition and an XOR dataset for classification. It covers the initialization of weights, forward and backward passes, and training of the network, including accuracy evaluations for different learning rates. Limitations of the multi-layer perceptron learning algorithm are also discussed, highlighting issues like local minima, underfitting, and overfitting.

Uploaded by

Sadbin Mohshin

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

10 views14 pages

Lab Report 03

Uploaded by

Sadbin Mohshin

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 14

Experiment No.

Name of the Experiment: Design and implementation of Multi-layer Neural Networks algorithm
(i.e., Back-propagation learning neural networks algorithm)

Dataset: MINST dataset

The MNIST database (Modified National Institute of Standards and Technology database) is a
large dataset of handwritten digits. It was produced from NIST's original datasets. Half of the
training set and half of the test set were taken from NIST's training dataset, while the other half of
the training set and the other half of the test set were taken from NIST's testing dataset.

Characteristics of Dataset:
• Large dataset of handwritten digits
• Total 70,000 images
• 60,000 training images and 10,000 testing images
• The size of every image is 28x28 pixels.
• Number of total features is 784.
• Total 10 classes

Implementation:

At first all the dependencies are loaded.

Code:

import numpy as np
import time
import matplotlib.pyplot
%matplotlib inline

Then the dataset is stored in csv file

def convert(imgf, labelf, outf, n):

f = open(imgf, "rb")
o = open(outf, "w")
l = open(labelf, "rb")

f.read(16)
l.read(8)
images = []

for i in range(n):
image = [ord(l.read(1))]
for j in range(28*28):
image.append(ord(f.read(1)))
images.append(image)

for image in images:

o.write(",".join(str(pix) for pix in image)+"\n")
f.close()
o.close()
l.close()
convert("/content/drive/MyDrive/4-2/CSE 4203/train-images.idx3-
ubyte", "/content/drive/MyDrive/4-2/CSE 4203/train-labels.idx1-ubyte",
"mnist_train.csv", 60000)
convert("/content/drive/MyDrive/4-2/CSE 4203/t10k-images.idx3-
ubyte", "/content/drive/MyDrive/4-2/CSE 4203/t10k-labels.idx1-ubyte",
"mnist_test.csv", 10000)

After that the train and test dataset are loaded and then scaling is performed

train_file = open("/content/mnist_train.csv", 'r')

train_list = train_file.readlines()
train_file.close()

train_file = open("/content/mnist_train.csv", 'r')

train_list = train_file.readlines()
train_file.close()

scaled_input_train = (np.asfarray(all_values[1:]) / 255.0 * 0.99) + 0.01

test_file = open("/content/mnist_test.csv", 'r')

test_list = test_file.readlines()
test_file.close()

all_values = test_list[100].split(',')
image_array = np.asfarray(all_values[1:]).reshape((28,28))

scaled_input_test = (np.asfarray(all_values[1:]) / 255.0 * 0.99) + 0.01

At starting point of MLP algorithm, weights and threshold are initialized

def init(self, sizes, epochs, lr):

self.sizes = sizes
self.epochs = epochs
self.lr = lr

# number of nodes in each layer

input_layer=self.sizes[0]
hidden_1=self.sizes[1]
hidden_2=self.sizes[2]
output_layer=self.sizes[3]

self.params = {
'W1':np.random.randn(hidden_1, input_layer) * np.sqrt(1. / hidden_
1),
'W2':np.random.randn(hidden_2, hidden_1) * np.sqrt(1. / hidden_2),
'W3':np.random.randn(output_layer, hidden_2) * np.sqrt(1. / output
_layer)
}

Now sigmoid and softmax function is defined

def sigmoid(self, x, derivative=False):

if derivative:
return (np.exp(-x))/((np.exp(-x)+1)**2)
return 1/(1 + np.exp(-x))

def softmax(self, x, derivative=False):

# Numerically stable with large exponentials
exps = np.exp(x - x.max())
if derivative:
return exps / np.sum(exps, axis=0) * (1 - exps / np.sum(exps, ax
is=0))
return exps / np.sum(exps, axis=0)
At the time of forward pass, each layer calculates the output and pass the as input to next layer

def forward_pass(self, x_train):

params = self.params

# input layer activations becomes sample

params['A0'] = x_train

# input layer to hidden layer 1

params['Z1'] = np.dot(params["W1"], params['A0'])
params['A1'] = self.sigmoid(params['Z1'])

# hidden layer 1 to hidden layer 2

params['Z2'] = np.dot(params["W2"], params['A1'])
params['A2'] = self.sigmoid(params['Z2'])

# hidden layer 2 to output layer

params['Z3'] = np.dot(params["W3"], params['A2'])
params['A3'] = self.softmax(params['Z3'])

return params['A3']

Using backpropagation, error is back propagated from output layer to input layer and the weights
to be altered is proportional the error and calculated
def backward_pass(self, y_train, output):
params = self.params
change_w = {}

# Calculate W3 update
error = 2 * (output - y_train) / output.shape[0] * self.softmax(para
ms['Z3'], derivative=True)
change_w['W3'] = np.outer(error, params['A2'])

# Calculate W2 update
error = np.dot(params['W3'].T, error) * self.sigmoid(params['Z2'], d
erivative=True)
change_w['W2'] = np.outer(error, params['A1'])

# Calculate W1 update
error = np.dot(params['W2'].T, error) * self.sigmoid(params['Z1'], d
erivative=True)
change_w['W1'] = np.outer(error, params['A0'])

return change_w

Then, adapt the weights

def update_network_parameters(self, changes_to_w):

for key, value in changes_to_w.items():

self.params[key] -= self.lr * value

Now, it is time to train the network

def compute_accuracy(self, test_data, output_nodes):
predictions = []

for x in train_list:
all_values = x.split(',')
# scale and shift the inputs
inputs = (np.asfarray(all_values[1:]) / 255.0 * 0.99) + 0.01
# create the target output values (all 0.01, except the desired
label which is 0.99)
targets = np.zeros(output_nodes) + 0.01
# all_values[0] is the target label for this record
targets[int(all_values[0])] = 0.99
output = self.forward_pass(inputs)
pred = np.argmax(output)
predictions.append(pred == np.argmax(targets))

return np.mean(predictions)

def train(self, train_list, test_list, output_nodes):

start_time = time.time()
for iteration in range(self.epochs):
for x in train_list:
all_values = x.split(',')
# scale and shift the inputs
inputs = (np.asfarray(all_values[1:]) / 255.0 * 0.99) + 0.01
# create the target output values (all 0.01, except the desi
red label which is 0.99)
targets = np.zeros(output_nodes) + 0.01
# all_values[0] is the target label for this record
targets[int(all_values[0])] = 0.99
output = self.forward_pass(inputs)
changes_to_w = self.backward_pass(targets, output)
self.update_network_parameters(changes_to_w)

accuracy = self.compute_accuracy(test_list, output_nodes)

print('Epoch: {0}, Time Spent: {1:.2f}s, Accuracy: {2:.2f}%'.for
mat(
iteration+1, time.time() - start_time, accuracy * 100
))

nn = NN(sizes=[784, 128, 64, 10], epochs=10, lr=0.001)

nn.train(train_list, test_list, 10)

Table 3.1 :Evaluation the correctness and the accuracy

Learning rate=0.001 Learning rate=0.01 Learning rate=0.05
Epoch: 1, Time Epoch: 1, Time Epoch: 1, Time
Spent: 76.40s, Spent: 85.05s, Spent: 75.38s,
Accuracy: 23.36% Accuracy: 51.47% Accuracy: 73.53%
Epoch: 2, Time Epoch: 2, Time Epoch: 2, Time
Spent: 156.99s, Spent: 164.65s, Spent: 155.20s,
Accuracy: 28.21% Accuracy: 56.15% Accuracy: 75.27%
Epoch: 3, Time Epoch: 3, Time Epoch: 3, Time
Spent: 233.56s, Spent: 242.52s, Spent: 231.23s,
Accuracy: 33.65% Accuracy: 60.13% Accuracy: 79.42%
Epoch: 4, Time Epoch: 4, Time Epoch: 4, Time
Spent: 314.37s, Spent: 320.58s, Spent: 311.87s,
Accuracy: 39.00% Accuracy: 66.85% Accuracy: 81.18%
Epoch: 5, Time Epoch: 5, Time Epoch: 5, Time
Spent: 390.51s, Spent: 397.03s, Spent: 387.57s,
Accuracy: 43.31% Accuracy: 71.10% Accuracy: 82.71%
Epoch: 6, Time Epoch: 6, Time Epoch: 6, Time
Spent: 468.83s, Spent: 476.86s, Spent: 465.46s,
Accuracy: 46.22% Accuracy: 73.64% Accuracy: 83.93%
Epoch: 7, Time Epoch: 7, Time Epoch: 7, Time
Spent: 546.70s, Spent: 551.20s, Spent: 542.02s,
Accuracy: 48.07% Accuracy: 75.20% Accuracy: 84.78%
Epoch: 8, Time Epoch: 8, Time Epoch: 8, Time
Spent: 621.76s, Spent: 628.39s, Spent: 620.20s,
Accuracy: 49.25% Accuracy: 74.55% Accuracy: 85.28%
Dataset: XOR data set

Fig 3.1: Dataset in the feature space

From the above figure, we see that dataset is not linearly separable. So, multi layer perceptron
learning algorithm will be applied to see if the model can classify the data set

Characteristics of Dataset:
● XOR dataset
● Total 4 samples
● 4 training samples and 4 testing samples
● Number of features is 2.
● Total 2 classes

Implementation:

At first all the dependencies are loaded.

Code:

import numpy as np
import math
from matplotlib import pyplot as plt
import pandas as pd
from sklearn.datasets import make_blobs
from sklearn.model_selection import train_test_split

Then the dataset is generated and train data as x_train, train class label as y_train, test data as
x_ test, test class label as y_ test are extracted from the dataset

# Define the input and output data for the XOR problem
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])
y = np.array([[0], [1], [1], [0]])

After that I have initialized weights and bias as follows

#weiight
def __init__(self, input_size, hidden_size, output_size):
# Initialize the weights for the hidden and output layers
self.weights_hidden = np.random.normal(size=(input_size, hidden_si
ze))
self.weights_output = np.random.normal(size=(hidden_size, output_s
ize))

# Initialize the biases for the hidden and output layers

self.bias_hidden = np.zeros((1, hidden_size))
self.bias_output = np.zeros((1, output_size))
Now sigmoid and softmax function is defined

# Define the sigmoid activation function

def sigmoid(x):
return 1 / (1 + np.exp(-x))

# Define the derivative of the sigmoid activation function

def sigmoid_derivative(x):
return x * (1 - x)

At the time of forward pass, each layer calculates the output and pass the as input to next layer

def feedforward(self, X):

# Perform the feedforward pass through the MLP

self.hidden = sigmoid(np.dot(X, self.weights_hidden) + self.bias_h
idden)
self.output = sigmoid(np.dot(self.hidden, self.weights_output) + s
elf.bias_output)

Using backpropagation, error is back propagated from output layer to input layer and the weights
to be altered is proportional the error and calculated
def backpropagation(self, X, y, learning_rate):
# Calculate the error between the predicted output and the true ou
tput
output_error = y - self.output

# Calculate the derivative of the error with respect to the output

output_derivative = sigmoid_derivative(self.output)

# Calculate the derivative of the error with respect to the weight

s and biases of the output layer
output_weights_derivative = np.dot(self.hidden.T, output_error * o
utput_derivative)
output_bias_derivative = np.sum(output_error * output_derivative,
axis=0, keepdims=True)

# Calculate the error for the hidden layer

hidden_error = np.dot(output_error * output_derivative, self.weigh
ts_output.T)

# Calculate the derivative of the error with respect to the hidden

layer
hidden_derivative = sigmoid_derivative(self.hidden)

# Calculate the derivative of the error with respect to the weight

s and biases of the hidden layer
hidden_weights_derivative = np.dot(X.T, hidden_error * hidden_deri
vative)
hidden_bias_derivative = np.sum(hidden_error * hidden_derivative,
axis=0, keepdims=True)

Then, adapt the weights

# Update the weights and biases using the derivatives and the learning r
ate

self.weights_hidden += learning_rate * hidden_weights_derivative

self.bias_hidden += learning_rate * hidden_bias_derivative
self.weights_output += learning_rate * output_weights_derivative
self.bias_output += learning_rate * output_bias_derivative
Now, it is time to train the network
def train(self, X, y, epochs, learning_rate):
# Train the MLP for the specified number of epochs
for epoch in range(epochs):
for i in range(len(X)):
self.feedforward(X[i:i+1])
self.backpropagation(X[i:i+1], y[i:i+1], learning_rate)

def predict(self, X):

# Make a prediction using the trained MLP
self.feedforward(X)
return self.output.round()

mlp = MLP(input_size=2, hidden_size=2, output_size=1)

epochs = 1000
learning_rate = 0.01
mlp.train(X, y, epochs, learning_rate)

y_pred = mlp.predict(X)
print("Predictions:", y_pred)
print("Accuracy:", np.mean(y_pred == y))

Table 3.2 :Evaluation the correctness and the accuracy

Epoch Learning rate Accuracy(%)
1000 0.1 50
1000 0.3 75
1000 0.5 100
Limitations of multi-layer perceptron learning algorithm:

⮚ Can be stable at local minima

⮚ Underfitting

⮚ Overfitting

⮚ Divergency

Conclusion:
In conclusion, MLP is a powerful and versatile neural network model that has proven to be
effective in various machine learning applications. Its ability to learn and generalize from data, as
well as its flexibility in terms of network architecture and activation functions, make it a popular
choice in the field. However, its limitations in terms of overfitting and computational resources
should also be taken into consideration when using MLP in practical applications. Overall, MLP
is a valuable tool in the field of machine learning and can provide valuable insights and predictions
when used appropriately.

ISTQB Agile Tester Exam - Answer
No ratings yet
ISTQB Agile Tester Exam - Answer
139 pages
DLP Lab
No ratings yet
DLP Lab
81 pages
ANN Programs
No ratings yet
ANN Programs
20 pages
Lab Manual DL (New)
No ratings yet
Lab Manual DL (New)
89 pages
Exp 4
No ratings yet
Exp 4
3 pages
Null 0
No ratings yet
Null 0
6 pages
Lab Report 04
No ratings yet
Lab Report 04
10 pages
Toodegrees Fractal Model PDF
No ratings yet
Toodegrees Fractal Model PDF
11 pages
Bananini Chimpanzini
No ratings yet
Bananini Chimpanzini
8 pages
02 ML PDF
No ratings yet
02 ML PDF
5 pages
A Gentle Introduction To Neural Networks With Python
100% (1)
A Gentle Introduction To Neural Networks With Python
85 pages
Math Lab 1
No ratings yet
Math Lab 1
7 pages
Perceptron Numpy
No ratings yet
Perceptron Numpy
3 pages
A-Simple-Neural-Network-From-Scratch - Jupyter Notebook
No ratings yet
A-Simple-Neural-Network-From-Scratch - Jupyter Notebook
9 pages
Genaifile
No ratings yet
Genaifile
39 pages
Building Your Deep Neural Network Step by Step V8a
No ratings yet
Building Your Deep Neural Network Step by Step V8a
16 pages
Deep Record
No ratings yet
Deep Record
44 pages
Ex No:1 Implementing A Perceptron Algorithm For Binary Classification Date: Aim
No ratings yet
Ex No:1 Implementing A Perceptron Algorithm For Binary Classification Date: Aim
41 pages
ML 2.4 Prashant
No ratings yet
ML 2.4 Prashant
3 pages
Soft Computing Lab Manual1
No ratings yet
Soft Computing Lab Manual1
23 pages
A Gentle Introduction To Neural Networks With Python
No ratings yet
A Gentle Introduction To Neural Networks With Python
85 pages
Assignment 1: Q1. Task Description
No ratings yet
Assignment 1: Q1. Task Description
12 pages
CCC
No ratings yet
CCC
25 pages
Sindhuja Assignment-2 AI
No ratings yet
Sindhuja Assignment-2 AI
22 pages
Ex No 11
No ratings yet
Ex No 11
4 pages
DL JOURNAL - Merged
No ratings yet
DL JOURNAL - Merged
27 pages
NN From Scratch PDF 1735495327
No ratings yet
NN From Scratch PDF 1735495327
19 pages
Paper II LDC DMR
No ratings yet
Paper II LDC DMR
9 pages
Deep Learning
No ratings yet
Deep Learning
30 pages
Soft Computing Lab Record
No ratings yet
Soft Computing Lab Record
28 pages
1-Data Mining and Applications
No ratings yet
1-Data Mining and Applications
70 pages
ĐỀ NGHE SỐ 13A
No ratings yet
ĐỀ NGHE SỐ 13A
10 pages
Week 2 - Lab
No ratings yet
Week 2 - Lab
9 pages
Software Laboratory II Code
No ratings yet
Software Laboratory II Code
27 pages
ANN PR Code and Output
No ratings yet
ANN PR Code and Output
25 pages
Preprocessing
No ratings yet
Preprocessing
90 pages
ML Expt 9
No ratings yet
ML Expt 9
9 pages
GEP June 2024 Chapter2 EAP
No ratings yet
GEP June 2024 Chapter2 EAP
64 pages
Python
No ratings yet
Python
3 pages
GEP June 2024 Chapter2 ECA
No ratings yet
GEP June 2024 Chapter2 ECA
60 pages
Deeplg 3
No ratings yet
Deeplg 3
8 pages
INFORMATION MANAGEMENT Unit 2
No ratings yet
INFORMATION MANAGEMENT Unit 2
35 pages
Soft Computing
No ratings yet
Soft Computing
16 pages
ML Record Print
No ratings yet
ML Record Print
20 pages
New Exp
No ratings yet
New Exp
12 pages
Quantum Technology Monitor
No ratings yet
Quantum Technology Monitor
53 pages
Lab 8
No ratings yet
Lab 8
10 pages
GK Deeplearning
No ratings yet
GK Deeplearning
15 pages
Experiment 3
No ratings yet
Experiment 3
9 pages
GEP June 2024 Chapter1 Box1
No ratings yet
GEP June 2024 Chapter1 Box1
39 pages
555610a19 DL Exp4
No ratings yet
555610a19 DL Exp4
11 pages
X OR Problem Using DNN
No ratings yet
X OR Problem Using DNN
3 pages
Lab Manual Ann
No ratings yet
Lab Manual Ann
12 pages
Da 3 Lab DL 21BCE2687
No ratings yet
Da 3 Lab DL 21BCE2687
15 pages
S. NO. Title of The Experiments Page No
No ratings yet
S. NO. Title of The Experiments Page No
11 pages
Week 4 - Lab
No ratings yet
Week 4 - Lab
7 pages
DT RF
No ratings yet
DT RF
64 pages
Using A Three Layer Deep Neural Network To Solve An Unsupervised Learning Problem
No ratings yet
Using A Three Layer Deep Neural Network To Solve An Unsupervised Learning Problem
13 pages
Experiment 2.4 DL
No ratings yet
Experiment 2.4 DL
4 pages
CVPR 2022 MainConference ProgramGuide Final
No ratings yet
CVPR 2022 MainConference ProgramGuide Final
70 pages
Exp 4
No ratings yet
Exp 4
9 pages
Experiment 4 NN
No ratings yet
Experiment 4 NN
3 pages
Week 7 - Lab
No ratings yet
Week 7 - Lab
6 pages
Assignment 3 DS5620
No ratings yet
Assignment 3 DS5620
11 pages
Lab 12
No ratings yet
Lab 12
6 pages
Telecom Knowledge and Experience Sharing - ? LTE KPI
No ratings yet
Telecom Knowledge and Experience Sharing - ? LTE KPI
8 pages
Code For Mean Squared
No ratings yet
Code For Mean Squared
2 pages
Lab 4
No ratings yet
Lab 4
2 pages
Ansible: Architecture
100% (1)
Ansible: Architecture
7 pages
Báo Cáo Java 4
No ratings yet
Báo Cáo Java 4
3 pages
Evaluation Metrics
No ratings yet
Evaluation Metrics
16 pages
Trainina A NN Backpropagation
No ratings yet
Trainina A NN Backpropagation
6 pages
Deep Learning
No ratings yet
Deep Learning
4 pages
ML Assignment-9
No ratings yet
ML Assignment-9
4 pages
ISTN212 Exam 2023 V2 - PRINT
No ratings yet
ISTN212 Exam 2023 V2 - PRINT
21 pages
Paper 4
No ratings yet
Paper 4
33 pages
Safuu X Calculator
No ratings yet
Safuu X Calculator
97 pages
UI/UX Presentation6
No ratings yet
UI/UX Presentation6
39 pages
QP - 12-CS - PB-I 23-24 Set 1
No ratings yet
QP - 12-CS - PB-I 23-24 Set 1
10 pages
DP Failure Mode Effects Analysis Assurance Framework Risk Based Guidance
100% (2)
DP Failure Mode Effects Analysis Assurance Framework Risk Based Guidance
93 pages
Python Code PDF
No ratings yet
Python Code PDF
3 pages
Wide Enterprise Networking
No ratings yet
Wide Enterprise Networking
8 pages
Resume Sia
No ratings yet
Resume Sia
10 pages
Module 5.4 LOGIC
No ratings yet
Module 5.4 LOGIC
11 pages
Cloudera Administrator Training For Apache Hadoop PDF
50% (2)
Cloudera Administrator Training For Apache Hadoop PDF
2 pages
Application Report
No ratings yet
Application Report
1 page
"A Study On Influence of Video Conferencing Apps
No ratings yet
"A Study On Influence of Video Conferencing Apps
25 pages
MV1 2023 IDBC Strategy Plan
No ratings yet
MV1 2023 IDBC Strategy Plan
16 pages
Short Notes Regional Geography
No ratings yet
Short Notes Regional Geography
6 pages
Sce5401 Ay21-22-S2 Tutr7-Sol (R0)
No ratings yet
Sce5401 Ay21-22-S2 Tutr7-Sol (R0)
7 pages
Installation Procedure
No ratings yet
Installation Procedure
9 pages
Flyer Ki M en
No ratings yet
Flyer Ki M en
2 pages
DSB For R PDF
No ratings yet
DSB For R PDF
6 pages
Monolithic Applications and Microservices: Applications Are Made of Multiple Components. The
No ratings yet
Monolithic Applications and Microservices: Applications Are Made of Multiple Components. The
4 pages
Exemple de Contrôle Continu
No ratings yet
Exemple de Contrôle Continu
1 page
Automatic Generation of CNC Codes Based On Machining Features
No ratings yet
Automatic Generation of CNC Codes Based On Machining Features
5 pages
Chapter One Lab-4 - Implement Basic Connectivity
No ratings yet
Chapter One Lab-4 - Implement Basic Connectivity
3 pages
Profound Python Data Science
From Everand
Profound Python Data Science
Onder Teker
No ratings yet