MLP v4
September 8, 2024
INFERENCE:
The same network architecture built with the inbuilt PyTorch and TensorFlow packages runs tremendously
faster than the hand-coded one. The accuracy values and confusion matrices also do not match
exactly, which might be due to approximations or implementation differences inside the inbuilt packages.
For the batch size and learning rate specified in the question, the network takes a long time to execute;
hence, for practical comparison, the code was run for 4 epochs with a batch size of
256 and a learning rate of 0.3, with the following observations:
The accuracy with sigmoid, relu and tanh (all else being the same) is 88.11%, 97.3% and 96.61%
respectively. Hence the network works best with relu, followed by tanh and then sigmoid.
Further hyperparameter tuning was done by varying the batch size, number of epochs and learning
rate. The inference is similar under other settings as well. For example, with a
batch size of 128, a learning rate of 0.3 and a maximum of 8 epochs, the accuracies obtained were 98.10%,
97.64% and 93.06% for relu, tanh and sigmoid respectively.
For the same hyperparameters, the PyTorch implementation with relu activation gave an accuracy of 93.14%.
Additionally, using L2 regularisation with weights of 0.1, 0.01 and 0.001 in all layers did not
change the accuracy much. For example, no regularisation in the hidden layers and 0.0001 regularisation in the
final layer gave an accuracy of 93.16% with sigmoid activation. However, regularisation of
0, 0, 0.0001 and 0.0001 in the respective layers with relu activation improved the accuracy to 99.19%.
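In the Keras implementation, this per-layer weighting is attached through each Dense layer's kernel_regularizer, as in the regularised model later in this notebook. A minimal sketch of the pattern follows; the size of the first hidden layer is not visible in the surviving cells, so the 500 used here is only a placeholder:

import tensorflow as tf
from tensorflow.keras import regularizers

activation1 = 'relu'
lambda1, lambda2, lambda3, lambda4 = 0, 0, 0.0001, 0.0001  # per-layer L2 weights
model = tf.keras.Sequential([
    tf.keras.layers.Flatten(input_shape=(28, 28)),
    # 500 units is a placeholder for the first hidden layer's (unshown) size
    tf.keras.layers.Dense(500, activation=activation1,
                          kernel_regularizer=regularizers.l2(lambda1)),
    tf.keras.layers.Dense(250, activation=activation1,
                          kernel_regularizer=regularizers.l2(lambda2)),
    tf.keras.layers.Dense(100, activation=activation1,
                          kernel_regularizer=regularizers.l2(lambda3)),
    tf.keras.layers.Dense(10, activation='softmax',
                          kernel_regularizer=regularizers.l2(lambda4)),
])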
Apart from accuracy, from an implementation point of view tanh consumed more time per
epoch. A larger batch size also reduced computation time, since it reduces the number of
iterations per epoch. Regularisation did not give much improvement in the results. The training and testing
losses followed nearly aligned trajectories, except that the training-loss curve was somewhat noisier
than the testing one.
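As a quick sanity check on the batch-size point: with the 60,000 MNIST training images, the number of weight updates per epoch falls in inverse proportion to the batch size, which is what drives the timing difference:

# updates per epoch = ceil(60000 / batch_size) for the batch sizes tried here
for bs in (64, 128, 256, 1024):
    print(bs, -(-60000 // bs), 'iterations per epoch')
# 64 -> 938, 128 -> 469, 256 -> 235, 1024 -> 59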
[41]: print('program starts')
program starts
# from tensorflow.keras.layers import Dense, Dropout, Activation, Flatten, Conv2D, MaxPooling2D
update_interval=200
y_train = tf.keras.utils.to_categorical(y_train, num_classes=10)
y_test = tf.keras.utils.to_categorical(y_test, num_classes=10)
train_loss_interval.append(logs['loss'])
validation_data=(x_test, y_test),
verbose=0, # Suppress default output
callbacks=[tf.keras.callbacks.LambdaCallback(on_batch_end=lambda batch, logs: batch_loss_callback(batch, logs))])
# Make predictions
y_predicted = model.predict(x_test)
y_predicted_labels = [np.argmax(i) for i in y_predicted]
update_interval=200
y_train = tf.keras.utils.to_categorical(y_train, num_classes=10)
y_test = tf.keras.utils.to_categorical(y_test, num_classes=10)
tf.keras.layers.Dense(250, activation=activation1, kernel_regularizer=regularizers.l2(lambda2)),
tf.keras.layers.Dense(100, activation=activation1, kernel_regularizer=regularizers.l2(lambda3)),
tf.keras.layers.Dense(10, activation='softmax', kernel_regularizer=regularizers.l2(lambda4))
])
train_loss_interval.append(logs['loss'])
callbacks=[tf.keras.callbacks.LambdaCallback(on_batch_end=lambda batch, logs: batch_loss_callback(batch, logs))])
# Make predictions
y_predicted = model.predict(x_test)
y_predicted_labels = [np.argmax(i) for i in y_predicted]
plt.plot(train_loss_interval, label=f'Training Loss (Every {update_interval} Batches)')
plt.plot(val_loss_interval, label=f'Validation Loss (Every {update_interval} Batches)')
def activate(z,activation):
    # header and relu branch reconstructed from the call sites and the
    # matching derivative function below
    if activation=="relu":
        return np.maximum(z,0)
    elif activation=='tanh':
        x=np.exp(z)
        y=np.exp(-z)
        num=x-y
        den=x+y
        return num/den
    else:
        return 1/(1+np.exp(-z))
def deri_activate(a,activation):
    if activation=="relu":
        array=np.where(a>0,1,0)
        return array
    elif activation=='tanh':
        return 1-a*a
    else:
        return a*(1-a)
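Note that deri_activate takes the activation output a rather than the pre-activation z. This works because each derivative can be written in terms of the activation value itself:

tanh'(z) = 1 - tanh(z)^2 = 1 - a^2
sigmoid'(z) = sigmoid(z) * (1 - sigmoid(z)) = a * (1 - a)
relu'(z) = 1 if z > 0 else 0, and z > 0 exactly where a > 0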
def cross_entropy(epoch,y_hat,y_label):
    loss1=-y_label*np.log(y_hat)
    loss=np.sum(loss1)/(np.size(loss1))
    return loss
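Since y_label here is one-hot with shape (10, m), np.size(loss1) is 10m, so the value returned is the batch-mean cross-entropy scaled by a constant 1/10:

loss = -(1 / (10m)) * sum_{i,k} y[i,k] * log(y_hat[i,k])

The constant factor only rescales the reported loss curves; the gradients used in back_prop are computed directly from a4 - y and are unaffected.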
[52]: def forward_prop(a0,w1,w2,w3,w4,b1,b2,b3,b4,activation):
          m=a0.shape[1]
          z1=np.dot(w1,a0)+b1
          a1=activate(z1,activation)
          z2=np.dot(w2,a1)+b2
          a2=activate(z2,activation)
          z3=np.dot(w3,a2)+b3
          a3=activate(z3,activation)
          z4=np.dot(w4,a3)+b4
          # softmax output layer
          a4=np.exp(z4)
          s4=np.sum(a4,0)
          a4=(1/s4)*a4
          return a1,a2,a3,a4
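One caveat in the softmax at the end of forward_prop: np.exp(z4) can overflow for large logits. A common numerically stable variant (a sketch, not what was run for the results below) subtracts the per-column maximum first, which leaves the output unchanged since softmax is shift-invariant:

# numerically stable softmax sketch; same output as the version above
z4s = z4 - np.max(z4, axis=0, keepdims=True)
a4 = np.exp(z4s)
a4 = a4 / np.sum(a4, axis=0, keepdims=True)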
def back_prop(w2,w3,w4,a0,a1,a2,a3,a4,y_train,activation):
    # header reconstructed from the call sites in the training loops below
    m=a0.shape[1]
    d4=a4-y_train
    dw4=np.dot(d4,np.transpose(a3))
    db4=np.dot(d4,np.ones([m,1]))
    dw4=dw4/m
    db4=db4/m
    df=deri_activate(a3,activation)
    d3=np.dot(np.transpose(w4),d4)*df
    dw3=np.dot(d3,np.transpose(a2))
    db3=np.dot(d3,np.ones([m,1]))
    dw3=dw3/m
    db3=db3/m
    df=deri_activate(a2,activation)
    d2=np.dot(np.transpose(w3),d3)*df
    dw2=np.dot(d2,np.transpose(a1))
    db2=np.dot(d2,np.ones([m,1]))
    dw2=dw2/m
    db2=db2/m
    df=deri_activate(a1,activation)
    d1=np.dot(np.transpose(w2),d2)*df
    dw1=np.dot(d1,np.transpose(a0))
    db1=np.dot(d1,np.ones([m,1]))
    dw1=dw1/m
    db1=db1/m
    return dw1,dw2,dw3,dw4,db1,db2,db3,db4
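The code above implements the standard backpropagation recursion for a softmax output with cross-entropy loss, averaged over the m samples in the batch:

d4 = a4 - y
d_l = (W_{l+1}^T d_{l+1}) ⊙ f'(z_l)   for l = 3, 2, 1
dW_l = (1/m) d_l a_{l-1}^T,   db_l = (1/m) d_l 1_m

where 1_m is the all-ones vector (the np.dot with np.ones([m,1]) sums each delta over the batch).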
def evaluation(y_pred,y_label):
    # header and confusion-matrix construction reconstructed from the call
    # sites; assumes sklearn.metrics imported as metrics, which matches the
    # ConfusionMatrixDisplay-style plotting that survived the export
    confusion_matrix=metrics.confusion_matrix(y_label,y_pred)
    cm_display=metrics.ConfusionMatrixDisplay(confusion_matrix=confusion_matrix)
    cm_display.plot(cmap=plt.cm.Blues)
    plt.title('Confusion Matrix')
    plt.show()
    TP=np.array([confusion_matrix[i][j] for i in range(10) for j in range(10) if i==j])
    accuracy=np.sum(TP)/np.sum(confusion_matrix)
    return confusion_matrix,accuracy
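Since TP collects the diagonal entries of the confusion matrix, the reported accuracy is accuracy = sum_i CM[i,i] / sum_{i,j} CM[i,j], i.e. the fraction of test samples whose predicted class matches the true one.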
#data load and split
data=tf.keras.datasets.mnist
(x_train1,y_train1),(x_test1,y_test1)=data.load_data()
#normalise
x_train1 = x_train1 / 255
x_test1 = x_test1 / 255
#Hyperparameters
batch_size = 1024
alpha=0.01
max_epoch=4
activation='relu'
num_classes=10
x_train_batches, y_train_batches = create_batches(x_train1, y_train1, batch_size)
x_test=x_test1.reshape(x_test1.shape[0], 28 * 28)
x_test=np.transpose(x_test)
[83]: train_loss_array=[]
test_loss_array=[]
iteration=0
for epoch in range(1,max_epoch+1):
# alpha=alpha*np.exp(-0.01*epoch)
for b in range(len(x_train_batches)):
iteration=iteration+1
print('\repoch {} batch {} iteration {}'.format(epoch,b,iteration),end='',flush=True)
y_test=np.zeros((y_test1.shape[0], num_classes))
y_test[np.arange(y_test1.shape[0]),y_test1] = 1
y_test = np.transpose(y_test)
# Training (per-batch input prep reconstructed to mirror the x_test / y_test handling above)
x_train=np.transpose(x_train_batches[b].reshape(-1,28*28))
y_train=np.zeros((y_train_batches[b].shape[0],num_classes))
y_train[np.arange(y_train_batches[b].shape[0]),y_train_batches[b]]=1
y_train=np.transpose(y_train)
a1,a2,a3,a4=forward_prop(x_train,w1,w2,w3,w4,b1,b2,b3,b4,activation)
dw1,dw2,dw3,dw4,db1,db2,db3,db4=back_prop(w2,w3,w4,x_train,a1,a2,a3,a4,y_train,activation)
w1,w2,w3,w4,b1,b2,b3,b4=grad_descent(alpha,w1,w2,w3,w4,b1,b2,b3,b4,dw1,dw2,dw3,dw4,db1,db2,db3,db4)
loss=cross_entropy(epoch,a4,y_train)
train_loss_array.append(loss)
#Testing(Validation)
a1,a2,a3,a4=forward_prop(x_test,w1,w2,w3,w4,b1,b2,b3,b4,activation)
y_pred=np.array([np.argmax(x) for x in np.transpose(a4)])
loss=cross_entropy(epoch,a4,y_test)
test_loss_array.append(loss)
[84]: #Evaluation
c_mat,accuracy=evaluation(y_pred,y_test1)
print("WITH MY FUNCTIONS")
print(f"Final Test Accuracy: {accuracy * 100:.2f}% and conf mat is ")
print(c_mat)
plt.figure()
y1=train_loss_array[199::200]
y2=test_loss_array[199::200]
x=np.arange(199,len(train_loss_array),200)
#x=np.arange(len(y1))
plt.plot(x,y1,label='Train loss',color='blue')
plt.plot(x,y2,label='Test loss',color='red')
plt.legend()
plt.xlabel('Batch')
plt.ylabel('loss')
WITH MY FUNCTIONS
Final Test Accuracy: 81.74% and conf mat is
[[ 941 0 4 4 1 3 17 2 8 0]
[ 0 1111 3 7 0 1 4 0 9 0]
[ 26 27 818 49 22 1 33 19 36 1]
[ 5 2 24 874 2 9 8 28 50 8]
[ 2 12 7 1 777 1 14 3 10 155]
[ 24 40 9 189 56 416 36 29 64 29]
[ 20 10 39 0 14 11 859 0 5 0]
[ 4 44 24 3 9 1 1 885 10 47]
[ 14 26 21 101 18 15 17 20 722 20]
[ 15 19 14 11 99 6 1 55 18 771]]
0.2 Tanh Activation
[14]: batch_size = 1024
alpha=0.3
max_epoch=3
activation='tanh'
num_classes=10
x_train_batches, y_train_batches = create_batches(x_train1, y_train1, batch_size)
x_test=x_test1.reshape(x_test1.shape[0], 28 * 28)
x_test=np.transpose(x_test)
train_loss_array=[]
test_loss_array=[]
w1=init*np.random.uniform(-m1,m1,[n1,n0])
w2=init*np.random.uniform(-m2,m2,[n2,n1])
w3=init*np.random.uniform(-m3,m3,[n3,n2])
w4=init*np.random.uniform(-m4,m4,[n4,n3])
b1=init*np.zeros([n1,1])
b2=init*np.zeros([n2,1])
b3=init*np.zeros([n3,1])
b4=init*np.zeros([n4,1])
y_test=np.zeros((y_test1.shape[0], num_classes))
y_test[np.arange(y_test1.shape[0]),y_test1] = 1
y_test = np.transpose(y_test)
# Training
a1,a2,a3,a4=forward_prop(x_train,w1,w2,w3,w4,b1,b2,b3,b4,activation)
dw1,dw2,dw3,dw4,db1,db2,db3,db4=back_prop(w2,w3,w4,x_train,a1,a2,a3,a4,y_train,activation)
w1,w2,w3,w4,b1,b2,b3,b4=grad_descent(alpha,w1,w2,w3,w4,b1,b2,b3,b4,dw1,dw2,dw3,dw4,db1,db2,db3,db4)
loss=cross_entropy(epoch,a4,y_train)
train_loss_array.append(loss)
#Testing(Validation)
a1,a2,a3,a4=forward_prop(x_test,w1,w2,w3,w4,b1,b2,b3,b4,activation)
y_pred=np.array([np.argmax(x) for x in np.transpose(a4)])
loss=cross_entropy(epoch,a4,y_test)
test_loss_array.append(loss)
c_mat,accuracy=evaluation(y_pred,y_test1)
print("WITH MY FUNCTIONS")
print(f"Final Test Accuracy: {accuracy * 100:.2f}% and conf mat is ")
print(c_mat)
plt.figure()
y1=train_loss_array[199::200]
y2=test_loss_array[199::200]
x=np.arange(199,len(train_loss_array),200)
plt.plot(x,y1,label='Train loss',color='blue')
plt.plot(x,y2,label='Test loss',color='red')
plt.legend()
plt.xlabel('Batch')
plt.ylabel('loss')
WITH MY FUNCTIONS
Final Test Accuracy: 97.32% and conf mat is
[[ 972 0 0 0 1 3 0 1 3 0]
[ 0 1126 1 2 0 1 1 2 2 0]
[ 7 1 1004 3 3 0 2 5 7 0]
[ 1 0 5 976 0 13 0 10 3 2]
[ 0 0 4 0 960 0 2 3 1 12]
[ 3 1 0 5 2 873 1 2 3 2]
[ 6 3 1 0 7 26 914 0 1 0]
[ 2 4 8 4 0 0 0 1003 0 7]
[ 6 2 2 6 4 8 0 6 939 1]
[ 3 3 1 8 12 6 0 9 2 965]]
[14]: Text(0, 0.5, 'loss')
num_classes=10
x_train_batches, y_train_batches = create_batches(x_train1, y_train1, batch_size)
x_test=x_test1.reshape(x_test1.shape[0], 28 * 28)
x_test=np.transpose(x_test)
train_loss_array=[]
test_loss_array=[]
w1=init*np.random.uniform(-m1,m1,[n1,n0])
w2=init*np.random.uniform(-m2,m2,[n2,n1])
w3=init*np.random.uniform(-m3,m3,[n3,n2])
w4=init*np.random.uniform(-m4,m4,[n4,n3])
b1=init*np.zeros([n1,1])
b2=init*np.zeros([n2,1])
b3=init*np.zeros([n3,1])
b4=init*np.zeros([n4,1])
y_test=np.zeros((y_test1.shape[0], num_classes))
y_test[np.arange(y_test1.shape[0]),y_test1] = 1
y_test = np.transpose(y_test)
# Training
a1,a2,a3,a4=forward_prop(x_train,w1,w2,w3,w4,b1,b2,b3,b4,activation)
dw1,dw2,dw3,dw4,db1,db2,db3,db4=back_prop(w2,w3,w4,x_train,a1,a2,a3,a4,y_train,activation)
w1,w2,w3,w4,b1,b2,b3,b4=grad_descent(alpha,w1,w2,w3,w4,b1,b2,b3,b4,dw1,dw2,dw3,dw4,db1,db2,db3,db4)
loss=cross_entropy(epoch,a4,y_train)
train_loss_array.append(loss)
#Testing(Validation)
a1,a2,a3,a4=forward_prop(x_test,w1,w2,w3,w4,b1,b2,b3,b4,activation)
y_pred=np.array([np.argmax(x) for x in np.transpose(a4)])
loss=cross_entropy(epoch,a4,y_test)
test_loss_array.append(loss)
c_mat,accuracy=evaluation(y_pred,y_test1)
print("WITH MY FUNCTIONS")
print(f"Final Test Accuracy: {accuracy * 100:.2f}% and conf mat is ")
print(c_mat)
plt.figure()
y1=train_loss_array[199::200]
y2=test_loss_array[199::200]
x=np.arange(199,len(train_loss_array),200)
plt.plot(x,y1,label='Train loss',color='blue')
plt.plot(x,y2,label='Test loss',color='red')
plt.legend()
plt.xlabel('Batch')
plt.ylabel('loss')
WITH MY FUNCTIONS
Final Test Accuracy: 97.90% and conf mat is
[[ 971 1 1 0 0 2 1 1 1 2]
[ 0 1134 0 0 0 1 0 0 0 0]
[ 1 3 1015 3 1 0 2 5 2 0]
[ 0 1 4 990 0 4 0 3 5 3]
[ 0 0 5 0 963 1 2 0 0 11]
[ 3 1 0 9 1 869 1 1 3 4]
[ 5 4 2 0 8 21 918 0 0 0]
[ 1 3 8 1 2 0 0 1002 1 10]
[ 2 3 4 4 4 6 1 2 944 4]
[ 1 2 0 3 8 3 1 6 1 984]]
[15]: Text(0, 0.5, 'loss')
activation='sigmoid'
max_epoch=15
alpha=0.01
batch_size=64
print("WITH INBUILT FUNCTION AND PACKAGES")
cm,accuracy=ANN_func(x_train1,y_train1,x_test1,y_test1,activation,alpha,batch_size,max_epoch)
print(f"Final Test Accuracy: {accuracy * 100:.2f}% and conf mat is ")
print(cm)
313/313 ━━━━━━━━━━━━━━━━━━━━ 2s 5ms/step
[32]: activation='sigmoid'
max_epoch=3
alpha=0.01
batch_size=1024
print("WITH INBUILT FUNCTION AND PACKAGES-reularisation")
lambda1,lambda2,lambda3,lambda4=0,0,0.0001,0.0001
cm,accuracy=ANN_func_regularised(x_train1,y_train1,x_test1,y_test1,activation,alpha,batch_size,max_epoch)
print(f"Final Test Accuracy: {accuracy * 100:.2f}% and conf mat is ")
print(cm)
[ 25 3 26 130 10 570 24 24 61 19]
[ 28 2 39 1 17 17 844 0 10 0]
[ 12 42 11 2 6 1 0 891 8 55]
[ 6 23 17 45 26 59 13 10 747 28]
[ 13 9 1 4 166 21 2 57 10 726]], shape=(10, 10),
dtype=int32)
[33]: activation='sigmoid'
max_epoch=3
batch_size=1024
alpha=0.01
print("WITH INBUILT FUNCTION AND PACKAGES")
cm,accuracy=ANN_func(x_train1,y_train1,x_test1,y_test1,activation,alpha,batch_size,max_epoch)
print(f"Final Test Accuracy: {accuracy * 100:.2f}% and conf mat is ")
print(cm)
Final Test Accuracy: 80.70% and conf mat is
tf.Tensor(
[[ 909 0 1 1 3 52 10 1 3 0]
[ 0 1106 10 5 0 1 2 0 10 1]
[ 15 41 826 12 20 6 30 17 63 2]
[ 3 15 21 837 2 52 1 22 52 5]
[ 1 1 3 1 702 0 33 8 9 224]
[ 45 8 16 127 11 562 33 13 56 21]
[ 33 2 32 0 20 15 847 1 8 0]
[ 1 53 11 1 5 3 0 893 7 54]
[ 21 29 45 83 13 45 20 19 668 31]
[ 9 5 1 8 138 16 5 99 8 720]], shape=(10, 10),
dtype=int32)
[34]: activation='sigmoid'
max_epoch=3
batch_size=1024
alpha=0.01
print("WITH INBUILT FUNCTION AND PACKAGES-reularisation")
lambda1,lambda2,lambda3,lambda4=0,0,0.0001,0.0001
cm,accuracy=ANN_func_regularised(x_train1,y_train1,x_test1,y_test1,activation,alpha,batch_size,max_epoch)
print(f"Final Test Accuracy: {accuracy * 100:.2f}% and conf mat is ")
print(cm)
Final Test Accuracy: 81.12% and conf mat is
tf.Tensor(
[[ 935 0 2 1 2 30 9 1 0 0]
[ 0 1101 7 4 0 1 1 2 19 0]
[ 17 38 792 26 25 12 72 13 37 0]
[ 12 3 30 820 2 61 1 23 52 6]
[ 1 7 2 0 754 0 32 2 10 174]
[ 28 3 29 167 12 552 40 23 24 14]
[ 26 2 30 0 15 17 865 0 3 0]
[ 7 53 7 1 11 6 0 882 10 51]
[ 7 23 34 99 17 61 23 13 666 31]
[ 10 6 0 8 145 22 4 63 6 745]], shape=(10, 10),
dtype=int32)
[35]: activation='tanh'
max_epoch=3
batch_size=1024
alpha=0.01
print("WITH INBUILT FUNCTION AND PACKAGES")
cm,accuracy=ANN_func(x_train1,y_train1,x_test1,y_test1,activation,alpha,batch_size,max_epoch)
print(f"Final Test Accuracy: {accuracy * 100:.2f}% and conf mat is ")
print(cm)
[69]: activation='tanh'
max_epoch=3
batch_size=128
alpha=0.01
print("WITH INBUILT FUNCTION AND PACKAGES-reularisation")
lambda1,lambda2,lambda3,lambda4=0,0,0.0001,0.0001
cm,accuracy=ANN_func_regularised(x_train1,y_train1,x_test1,y_test1,activation,alpha,batch_size,max_epoch)
print(f"Final Test Accuracy: {accuracy * 100:.2f}% and conf mat is ")
print(cm)
[ 10 3 919 13 14 1 16 17 33 6]
[ 3 0 21 921 0 23 3 15 15 9]
[ 1 2 4 2 908 1 16 2 6 40]
[ 11 5 7 49 10 749 14 10 30 7]
[ 11 3 6 1 11 16 906 1 3 0]
[ 3 11 28 6 11 1 0 937 2 29]
[ 11 9 10 28 9 27 16 11 840 13]
[ 12 7 4 10 44 10 0 31 7 884]], shape=(10, 10),
dtype=int32)
[37]: activation='relu'
max_epoch=3
batch_size=1024
alpha=0.01
print("WITH INBUILT FUNCTION AND PACKAGES")
cm,accuracy=ANN_func(x_train1,y_train1,x_test1,y_test1,activation,alpha,batch_size,max_epoch)
print(f"Final Test Accuracy: {accuracy * 100:.2f}% and conf mat is ")
print(cm)
Final Test Accuracy: 97.25% and conf mat is
tf.Tensor(
[[ 965 0 0 1 1 5 4 2 2 0]
[ 0 1124 3 1 0 1 4 1 1 0]
[ 7 2 1000 1 2 0 4 11 5 0]
[ 0 0 9 978 0 7 0 7 6 3]
[ 1 0 6 0 961 0 2 2 2 8]
[ 4 1 0 4 2 869 6 0 4 2]
[ 5 3 2 0 9 5 933 0 1 0]
[ 0 8 9 1 0 0 0 1000 2 8]
[ 4 0 2 7 5 7 6 7 934 2]
[ 3 8 1 7 14 3 1 10 1 961]], shape=(10, 10),
dtype=int32)
[68]: activation='relu'
max_epoch=3
batch_size=128
alpha=0.01
print("WITH INBUILT FUNCTION AND PACKAGES-reularisation")
lambda1,lambda2,lambda3,lambda4=0,0,0.0001,0.0001
cm,accuracy=ANN_func_regularised(x_train1,y_train1,x_test1,y_test1,activation,alpha,batch_size,max_epoch)
print(f"Final Test Accuracy: {accuracy * 100:.2f}% and conf mat is ")
print(cm)
Final Test Accuracy: 91.67% and conf mat is
tf.Tensor(
[[ 951 0 4 2 0 10 8 2 3 0]
[ 0 1116 1 4 0 1 4 2 7 0]
[ 10 7 914 18 19 0 15 16 31 2]
[ 2 1 16 933 1 21 1 18 13 4]
[ 1 3 4 2 928 0 11 2 3 28]
[ 9 2 7 44 11 764 17 8 21 9]
[ 15 3 4 1 15 17 899 2 2 0]
[ 3 20 21 8 8 1 0 941 1 25]
[ 5 7 9 34 12 24 12 12 847 12]
[ 8 8 5 14 56 11 1 27 5 874]], shape=(10, 10),
dtype=int32)