0% found this document useful (0 votes)

1 views21 pages

DeepLearningLab Manual

The document outlines multiple experiments focused on implementing and understanding perceptrons and multilayer perceptrons (MLPs) in machine learning. It covers the structure and activation functions of perceptrons, the implementation of MLPs using Keras on datasets like MNIST and Fashion MNIST, and techniques for hyperparameter tuning using Grid Search and Random Search. Each experiment aims to enhance model performance in tasks such as image classification and pattern recognition.

Uploaded by

chaya02

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

1 views21 pages

DeepLearningLab Manual

Uploaded by

chaya02

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 21

Experiment No-1

Aim: Implementation of perceptron from scratch

Objective: To study the structure, contents and working principle of basic
Perceptron.

Perceptron Neural Networks

Rosenblatt created many variations of the perceptron. One of the simplest was a
single-layer network whose weights and biases could be trained to produce a
correct target vector when presented with the corresponding input vector.The
training technique used is called the perceptron learning rule. The perceptron
generated great interest due to its ability to generalize from its training vectors
and learn from initially randomly distributed connections.
Perceptron are especially suited for simple problems in pattern classification.
They are fast and reliable networks for the problems they can solve. In addition,
an understanding of the operations of the perceptron provides a good basis for
understanding more complex networks.
Activation functions
Activation functions are mathematical functions that can be used in Perceptrons
to determine the output given its input. As we said it determines whether the
neuron(Perceptron) needs to be activated or not. Activation functions take in a
weighted sum of the input data, called the activation, and produce an output that
can be used for prediction.
Activation functions are an essential part of Perceptrons and neural networks
because they allow the model to learn and make decisions based on the input
data. They also help to introduce non-linearity into the model, which is necessary
for learning more complex relationships in the data.
Some common types of activation functions used in Perceptrons are the Sign
function, Heaviside function, Sigmoid function, ReLU function,
Implementing Perceptron in Python

import numpy as np
import tensorflow as tf
from tensorflow import keras
import matplotlib.pyplot as plt
%matplotlib inline

(x_train, y_train),(x_test, y_test) =

keras.datasets.mnist.load_data()
len(x_train)
len(x_test)
x_train[0].shape
plt.matshow(x_train[0])
# Normalizing the dataset
x_train = x_train/255
x_test = x_test/255

# Flatting the dataset in order

# to compute for model building
x_train_flatten = x_train.reshape(len(x_train), 28*28)
x_test_flatten = x_test.reshape(len(x_test), 28*28)
model = keras.Sequential([
keras.layers.Dense(10, input_shape=(784,),
activation='sigmoid')
])
model.compile(
optimizer='adam',
loss='sparse_categorical_crossentropy',
metrics=['accuracy'])

model.fit(x_train_flatten, y_train, epochs=5)

model.evaluate(x_test_flatten, y_test)

Results:

Experiment-02
Aim:
Theory:
A Multilayer Perceptron (MLP) is an extension of the basic perceptron
that can handle more complex, non-linear data by using multiple layers
of neurons. It forms the backbone of many modern deep learning
models and is often called a fully connected neural network because
each neuron in one layer is connected to every neuron in the next
layer.
Structure of a Multilayer Perceptron (MLP)
1. Input Layer: The initial layer where data features are fed into the
network. The number of neurons in this layer corresponds to the
number of features in the input data.
2. Hidden Layers: One or more layers between the input and output
layers that allow the network to learn complex patterns. Each
hidden layer applies a linear transformation followed by a non-
linear activation function, such as ReLU.
3. Output Layer: The final layer that outputs predictions. In a
classification task, this layer has one neuron per class with a
softmax activation function (for multi-class classification) or
sigmoid activation (for binary classification). For regression, it
usually has a single neuron without activation.
Key Hyperparameters in MLP
Training an MLP involves setting various hyperparameters that impact
the network’s performance. These hyperparameters control aspects of
model architecture, training dynamics, and regularization.
1. Number of Hidden Layers and Neurons per Layer
 Definition: The depth (number of layers) and width (number of
neurons per layer) of the network.
 Tuning Impact: More hidden layers and neurons generally allow
the network to learn more complex functions, but it can also lead
to overfitting if the network becomes too large for the available
data.
 Tuning Strategy: Start with a small network (e.g., 1–2 hidden
layers with a moderate number of neurons) and increase
complexity as needed.
2. Activation Functions
 Common Choices: ReLU (Rectified Linear Unit) for hidden layers,
Softmax for multi-class output, and Sigmoid for binary output.
 Impact on Training: Activation functions affect how information
flows through the network. ReLU is often used because it
mitigates the vanishing gradient problem and speeds up training.
 Tuning Strategy: Generally, ReLU is a good default for hidden
layers, but other options like tanh may sometimes be tested to
see if they improve performance.
3. Learning Rate
 Definition: Controls how much the model’s weights are adjusted
with each step of training.
 Impact on Training: A high learning rate can lead to faster
convergence but may overshoot optimal values. A low learning
rate is more stable but may lead to slow convergence.
 Tuning Strategy: Use learning rate schedules or adaptive
optimizers (e.g., Adam, which adapts the learning rate). Start with
a learning rate around 0.001, then fine-tune by increasing or
decreasing as needed.
4. Batch Size
 Definition: The number of training samples processed before
updating model weights.
 Impact on Training: Larger batch sizes offer smoother gradients
but require more memory. Smaller batch sizes may introduce
noise, potentially helping generalization.
 Tuning Strategy: Common batch sizes are powers of 2 (e.g., 32,
64, 128). Try several values to find the best balance between
training stability and performance.
5. Number of Epochs
 Definition: Number of complete passes through the entire training
dataset.
 Impact on Training: Too few epochs can lead to underfitting, while
too many can lead to overfitting.
 Tuning Strategy: Use early stopping to prevent overfitting and
dynamically set the optimal number of epochs based on validation
performance.
6. Regularization Techniques
 Dropout: Randomly drops neurons during training, reducing
overfitting. Common values are 0.2 to 0.5 for dropout rate.
 L2 Regularization (Weight Decay): Penalizes large weights by
adding a regularization term to the loss function, helping to
prevent overfitting.
 Tuning Strategy: Test different dropout rates and regularization
strengths. Dropout is especially effective for large networks with
many layers.
7. Optimizer Choice
 Popular Choices: Adam (adaptive learning rate), SGD (Stochastic
Gradient Descent), RMSprop.
 Impact on Training: The choice of optimizer affects convergence
speed and stability. Adam is widely used because of its adaptive
learning rates.
 Tuning Strategy: Start with Adam, but for some tasks, SGD with
momentum can improve performance.

Results
Experiment No-03:

Aim: Hyper Tuning using Grid search and Random Search

Objective: The objective of hyperparameter tuning using Grid Search and Random
Search is to find the optimal combination of hyperparameter values for a machine
learning model by systematically evaluating different combinations, with Grid
Search exhaustively testing all possible combinations and Random Search
randomly sampling combinations, to ultimately achieve the best possible model
performance on a given task

Theory
Grid search
Grid search is the simplest algorithm for hyperparameter tuning. Basically, we
divide the domain of the hyperparameters into a discrete grid. Then, we try every
combination of values of this grid, calculating some performance metrics using
cross-validation. The point of the grid that maximizes the average value in cross-
validation, is the optimal combination of values for the hyperparameters.

Example of a grid search

Grid search is an exhaustive algorithm that spans all the combinations, so it can
actually find the best point in the domain. The great drawback is that it’s very
slow. Checking every combination of the space requires a lot of time that,
sometimes, is not available. Don’t forget that every point in the grid needs k-fold
cross-validation, which requires k training steps. So, tuning the hyperparameters
of a model in this way can be quite complex and expensive. However, if we look
for the best combination of values of the hyperparameters, grid search is a very
good idea.
Random search
Random search is similar to grid search, but instead of using all the points in the
grid, it tests only a randomly selected subset of these points. The smaller this
subset, the faster but less accurate the optimization. The larger this dataset, the
more accurate the optimization but the closer to a grid search.

Example of random search

Random search is a very useful option when you have several hyperparameters
with a fine-grained grid of values. Using a subset made by 5-100 randomly
selected points, we are able to get a reasonably good set of values of the
hyperparameters. It will not likely be the best point, but it can still be a good set
of values that gives us a good model.
Code:
import tensorflow as tf
from tensorflow.keras.datasets import mnist
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Dense, Flatten
from tensorflow.keras.optimizers import Adam, SGD
from sklearn.model_selection import ParameterGrid, ParameterSampler
import numpy as np

# Load MNIST dataset

(x_train, y_train), (x_test, y_test) = mnist.load_data()

# Normalize the data

x_train = x_train / 255.0
x_test = x_test / 255.0

# Convert labels to categorical

y_train = tf.keras.utils.to_categorical(y_train, 10)
y_test = tf.keras.utils.to_categorical(y_test, 10)
def build_model(hidden_layers, hidden_units, activation, optimizer):
model = Sequential()
model.add(Flatten(input_shape=(28, 28)))
for _ in range(hidden_layers):
model.add(Dense(hidden_units, activation=activation))
model.add(Dense(10, activation='softmax'))
model.compile(optimizer=optimizer, loss='categorical_crossentropy',
metrics=['accuracy'])
return model
# Define parameter grid
param_grid = {
'hidden_layers': [1, 2],
'hidden_units': [32, 64],
'activation': ['relu', 'tanh'],
'optimizer': ['adam', 'sgd'],
'batch_size': [32, 64],
'epochs': [5, 10]
}

grid = ParameterGrid(param_grid)

best_model = None
best_accuracy = 0
# Perform Grid Search
for params in grid:
print(f"Testing params: {params}")
model = build_model(params['hidden_layers'], params['hidden_units'],
params['activation'], params['optimizer'])
model.fit(x_train, y_train, batch_size=params['batch_size'],
epochs=params['epochs'], verbose=0)
_, accuracy = model.evaluate(x_test, y_test, verbose=0)
print(f"Accuracy: {accuracy}")
if accuracy > best_accuracy:
best_accuracy = accuracy
best_model = model

print(f"Best Accuracy: {best_accuracy}")

from sklearn.ensemble import RandomForestClassifier

rf = RandomForestClassifier()
grid_space={'max_depth':[3,5,10,None],
'n_estimators':[10,100,200],
'max_features':[1,3,5,7],
'min_samples_leaf':[1,2,3],
'min_samples_split':[1,2,3]
}
from sklearn.model_selection import GridSearchCV
grid = GridSearchCV(rf,param_grid=grid_space,cv=3,scoring='accuracy')
model_grid = grid.fit(X,y)
print('Best hyperparameters are: '+str(model_grid.best_params_))
print('Best score is: '+str(model_grid.best_score_))
Challenging Experiment 01
Aim: Implementation and Performance Evaluation of a Multi-Layer Perceptron
(MLP) for Cats and Dogs Image Recognition
Experiment no 04

Aim:MLP Implementation using Keras on MNIST Fashion Dataset

Objective: Multilayer Perceptron (MLP) with the Fashion MNIST dataset is to train
a machine learning model to accurately classify grayscale images of clothing items
from different categories (like t-shirts, pants, dresses) by predicting the correct
clothing class for each image

Theory:
Code:
import keras
import numpy as np
import matplotlib.pyplot as plt
# %matplotlib inline

keras.backend.backend()

fm = keras.datasets.fashion_mnist
(X_train, y_train), (X_test, y_test) = fm.load_data()

X_train.shape

X_test.shape

X_train[0]

y_train[0]

plt.matshow(X_train[0])

"""<h3 style='color:purple'>Normalize training data before training the neural

net</h3>"""

X_train = X_train/255
X_test = X_test/255

"""<h3 style='color:purple'>Now build the Sequential Model and add layers into
it</h3>"""

from keras.models import Sequential

from keras.layers import Flatten, Dense, Activation

model = Sequential()
model.add(Flatten(input_shape=[28, 28]))
model.add(Dense(100, activation="relu"))
model.add(Dense(10, activation="softmax"))

"""<img src='fashion_neural_net.png' />"""

model.summary()

model.compile(loss="sparse_categorical_crossentropy",
optimizer="adam",
metrics=["accuracy"])

model.fit(X_train, y_train)

model.evaluate(X_test, y_test)
"""**Above shows accuracy score of 82.76%. The first parameter is loss**"""

plt.matshow(X_test[0])

yp = model.predict(X_test)

np.argmax(yp[0])

class_labels =
["T-shirt/top","Trouser","Pullover","Dress","Coat","Sandal","Shirt","Sneaker","Ba
g","Ankle boot"]

class_labels[np.argmax(yp[0])]

Results:

Deep Learning R18 Jntuh Lab Manual
0% (1)
Deep Learning R18 Jntuh Lab Manual
21 pages
KR23 DL Lab Record
No ratings yet
KR23 DL Lab Record
59 pages
This Code Fragment Defines A Single Layer With Artificial Neurons, and It Expects Input Variables
No ratings yet
This Code Fragment Defines A Single Layer With Artificial Neurons, and It Expects Input Variables
9 pages
This Code Fragment Defines A Single Layer With Artificial Neurons, and It Expects Input Variables
No ratings yet
This Code Fragment Defines A Single Layer With Artificial Neurons, and It Expects Input Variables
9 pages
DL Practical 02 Binary Class Classifier Using ANN
No ratings yet
DL Practical 02 Binary Class Classifier Using ANN
5 pages
Gen Aiml Notes by Piyush
No ratings yet
Gen Aiml Notes by Piyush
39 pages
DL Lab Manual
No ratings yet
DL Lab Manual
52 pages
Day 2 - Loss & Activation Functions
No ratings yet
Day 2 - Loss & Activation Functions
8 pages
AI & ML Unit 5 Notes
No ratings yet
AI & ML Unit 5 Notes
23 pages
Aiml Unit 5
No ratings yet
Aiml Unit 5
16 pages
Aiml Unit 5
No ratings yet
Aiml Unit 5
34 pages
A Imprimer 4
No ratings yet
A Imprimer 4
4 pages
Adaptive Linear Neuron Using Linear (Identity) Activation Function With Batch Gradient Method
No ratings yet
Adaptive Linear Neuron Using Linear (Identity) Activation Function With Batch Gradient Method
19 pages
ML Concepts
No ratings yet
ML Concepts
3 pages
Kenny-230718-The Ultimate Machine Learning Cheat Sheet
No ratings yet
Kenny-230718-The Ultimate Machine Learning Cheat Sheet
20 pages
DL PYTH Keras
No ratings yet
DL PYTH Keras
57 pages
Deep Learning Sem
No ratings yet
Deep Learning Sem
128 pages
PDF Hyperparameter Tuning Batch Normalization
No ratings yet
PDF Hyperparameter Tuning Batch Normalization
11 pages
Pure Optimization
No ratings yet
Pure Optimization
23 pages
Control System Term Paper
No ratings yet
Control System Term Paper
12 pages
Deep MLP's
No ratings yet
Deep MLP's
44 pages
Introduction To Artificial Neural Networks
No ratings yet
Introduction To Artificial Neural Networks
31 pages
NISS Deep Learning Tutorial
No ratings yet
NISS Deep Learning Tutorial
58 pages
ML and DL
No ratings yet
ML and DL
15 pages
Deep Learning
No ratings yet
Deep Learning
21 pages
AIML Unit-5
No ratings yet
AIML Unit-5
26 pages
Assignment 2 QSN 1
No ratings yet
Assignment 2 QSN 1
4 pages
MCQ Deep Learning Engineering Syllabus 1to 5 Unit ..
No ratings yet
MCQ Deep Learning Engineering Syllabus 1to 5 Unit ..
2 pages
Domnic Object Detecion Basics
No ratings yet
Domnic Object Detecion Basics
62 pages
Lecture 9
No ratings yet
Lecture 9
97 pages
5.MLP in Practice
No ratings yet
5.MLP in Practice
19 pages
DL Lab Manual
No ratings yet
DL Lab Manual
29 pages
Hyper Parameters
No ratings yet
Hyper Parameters
24 pages
Lecture 5-Introduction To Neural Network
No ratings yet
Lecture 5-Introduction To Neural Network
42 pages
Secrets of Deep Learning 1716536527
No ratings yet
Secrets of Deep Learning 1716536527
12 pages
Data Analysis ch1
No ratings yet
Data Analysis ch1
13 pages
Unit-1 and 2 and 3
No ratings yet
Unit-1 and 2 and 3
212 pages
AML 03 Dense Neural Networks
No ratings yet
AML 03 Dense Neural Networks
20 pages
DLP Lab
No ratings yet
DLP Lab
81 pages
Deep Learning
No ratings yet
Deep Learning
20 pages
01 - Neural Network Basics
No ratings yet
01 - Neural Network Basics
93 pages
Artificial Intelligence MIDTERM
No ratings yet
Artificial Intelligence MIDTERM
5 pages
3 Non Linear Classifiers
No ratings yet
3 Non Linear Classifiers
74 pages
Artificial Neural Networks - Lect - 4
No ratings yet
Artificial Neural Networks - Lect - 4
17 pages
cst414 - Deep Learning
No ratings yet
cst414 - Deep Learning
34 pages
DL Unit 3 Important Questions and Answers PDF .. - 1
No ratings yet
DL Unit 3 Important Questions and Answers PDF .. - 1
8 pages
Advanced Machine Learning: Neural Networks Decision Trees Random Forest Xgboost
No ratings yet
Advanced Machine Learning: Neural Networks Decision Trees Random Forest Xgboost
61 pages
Machine Learning
No ratings yet
Machine Learning
9 pages
Report 2
No ratings yet
Report 2
17 pages
Deep Learning
No ratings yet
Deep Learning
19 pages
Assignment Mtech
No ratings yet
Assignment Mtech
5 pages
4_NN
No ratings yet
4_NN
25 pages
DL Unit2
No ratings yet
DL Unit2
22 pages
Deep Learning Lab Manual
No ratings yet
Deep Learning Lab Manual
73 pages
Unit 3
No ratings yet
Unit 3
110 pages
3) Multi-Layer Perceptron Learning in Tensorflow
No ratings yet
3) Multi-Layer Perceptron Learning in Tensorflow
7 pages
Unit 3
No ratings yet
Unit 3
9 pages
Exp 3
No ratings yet
Exp 3
7 pages
Richi's Neural Nets Summary
No ratings yet
Richi's Neural Nets Summary
114 pages
Random Sample Consensus: Robust Estimation in Computer Vision
From Everand
Random Sample Consensus: Robust Estimation in Computer Vision
Fouad Sabry
No ratings yet
DATA MINING AND MACHINE LEARNING. PREDICTIVE TECHNIQUES: REGRESSION, GENERALIZED LINEAR MODELS, SUPPORT VECTOR MACHINE AND NEURAL NETWORKS
From Everand
DATA MINING AND MACHINE LEARNING. PREDICTIVE TECHNIQUES: REGRESSION, GENERALIZED LINEAR MODELS, SUPPORT VECTOR MACHINE AND NEURAL NETWORKS
César Pérez López
No ratings yet
Slide 2 Introduction to Text Tokeni
No ratings yet
Slide 2 Introduction to Text Tokeni
5 pages
React Experiments
No ratings yet
React Experiments
8 pages
B22EN0304 Sol Python
No ratings yet
B22EN0304 Sol Python
12 pages
Grade 6 Cloud Computing and AI Notes-2
No ratings yet
Grade 6 Cloud Computing and AI Notes-2
3 pages
Agile Notes
No ratings yet
Agile Notes
138 pages
Optimized Operations: It Allow The Provisioning of Services Ans: It Can Carry Large Payloads
No ratings yet
Optimized Operations: It Allow The Provisioning of Services Ans: It Can Carry Large Payloads
6 pages
Presentation 1
No ratings yet
Presentation 1
20 pages
Opticalnetwork Questionbank
No ratings yet
Opticalnetwork Questionbank
19 pages
VHDL Implementation of Reversible Full Adder Using Peres Gate IJERTV3IS20334 1
No ratings yet
VHDL Implementation of Reversible Full Adder Using Peres Gate IJERTV3IS20334 1
5 pages
Computer Communication and Telecom Networks Syllabus
No ratings yet
Computer Communication and Telecom Networks Syllabus
3 pages
A Jayk Sharma
No ratings yet
A Jayk Sharma
6 pages
Continuous Probability Distributions
No ratings yet
Continuous Probability Distributions
37 pages
MAT6007 Session4 MP Neuron Perceptrons
No ratings yet
MAT6007 Session4 MP Neuron Perceptrons
15 pages
Non-Deterministic Finite Automata: Costas Busch - LSU 1
No ratings yet
Non-Deterministic Finite Automata: Costas Busch - LSU 1
102 pages
Probability Slides
No ratings yet
Probability Slides
12 pages
Normal Forms For Context-Free Grammars
No ratings yet
Normal Forms For Context-Free Grammars
57 pages
Lecture-4 Multi-Layer Perceptrons
No ratings yet
Lecture-4 Multi-Layer Perceptrons
23 pages
Xtune An XAI-Based Hyperparameter Tuning Method Fo
No ratings yet
Xtune An XAI-Based Hyperparameter Tuning Method Fo
19 pages
Performance Analysis of Handwritten Marathi Character Recognition With RBF, Cascade, Elman and Feed Forward Neural Networks
No ratings yet
Performance Analysis of Handwritten Marathi Character Recognition With RBF, Cascade, Elman and Feed Forward Neural Networks
5 pages
Unit - I Artificial Neural Networks
No ratings yet
Unit - I Artificial Neural Networks
23 pages
Artificial Neural Network - Hopfield Networks - Tutorialspoint
No ratings yet
Artificial Neural Network - Hopfield Networks - Tutorialspoint
3 pages
11-Class Diagram
No ratings yet
11-Class Diagram
22 pages
Uni2 NNDL
No ratings yet
Uni2 NNDL
21 pages
Deep Learnig-CNN-new - DMI-compressed
No ratings yet
Deep Learnig-CNN-new - DMI-compressed
118 pages
机器学习绘图模板
No ratings yet
机器学习绘图模板
101 pages
Moore and Mealy Machine
No ratings yet
Moore and Mealy Machine
21 pages
Arima Model
No ratings yet
Arima Model
45 pages
Seminar Slides
No ratings yet
Seminar Slides
28 pages
TAFL 1st Sessional
No ratings yet
TAFL 1st Sessional
2 pages
MODULE 1: Probability Distributions
No ratings yet
MODULE 1: Probability Distributions
2 pages
Discrete Random Variables and Probability Distributions
No ratings yet
Discrete Random Variables and Probability Distributions
56 pages
Document From Rohit Jain-Unlocked
No ratings yet
Document From Rohit Jain-Unlocked
221 pages
Probability and Statistics
No ratings yet
Probability and Statistics
56 pages
Creating A UML Design From Scratch - Object Model + Class Diagram
No ratings yet
Creating A UML Design From Scratch - Object Model + Class Diagram
1 page
Review Problems With Key
No ratings yet
Review Problems With Key
5 pages
Unit 4 CFG
No ratings yet
Unit 4 CFG
137 pages
Dataeng Lq1 Notes
No ratings yet
Dataeng Lq1 Notes
11 pages
BOX Jenkins PDF
No ratings yet
BOX Jenkins PDF
7 pages
Time Series Analysis of Electricity Consumption Forecasting Using ARIMA Model
No ratings yet
Time Series Analysis of Electricity Consumption Forecasting Using ARIMA Model
4 pages