0% found this document useful (0 votes)

22 views21 pages

Gen AI

This document outlines the implementation of various machine learning models, including a perceptron, convolutional neural network (CNN), and recurrent neural network (RNN), along with their training, evaluation, and hyperparameter tuning. Key findings include a CNN achieving 85.87% accuracy on the CIFAR-10 dataset and an RNN achieving 92.70% accuracy on a Shakespeare text dataset. The report emphasizes the importance of data augmentation and hyperparameter optimization in improving model performance.

Uploaded by

i212559

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

22 views21 pages

Gen AI

Uploaded by

i212559

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 21

Generative AI Assignment 01

Malaika Hussain
March 2025

1 Question 1: Implementing Rosenblatt’s Per-

ceptron from Scratch
1.1 Objective
This task involves implementing a single-layer perceptron model from scratch
to understand forward propagation, backward propagation, and weight updates.
The model is evaluated on a synthetic dataset.

1.2 Dataset Generation

A synthetic dataset with 500 samples and two features was generated. The data
points were assigned binary labels based on a non-linear decision boundary. The
dataset was split into 80% training and 20% testing.

1.3 Perceptron Model

The perceptron consists of: - A weight matrix initialized with zeros. - A bias
term. - A step function as the activation function. - Forward pass to compute
the weighted sum of inputs. - Backward pass to update weights using the
perceptron learning rule.
Training was performed over multiple epochs, adjusting weights to minimize
classification errors.

1.4 Training Performance

During training, the number of misclassified samples was recorded at each epoch
to observe convergence.

1.5 Decision Boundary and Evaluation

After training, the perceptron’s decision boundary was visualized. The test data
was overlaid to analyze classification accuracy.
The perceptron achieved a test accuracy of 72%, indicating reasonable clas-
sification performance given the non-linearity of the dataset.

1
Figure 1: Visualization of the generated dataset

Figure 2: Error rate reduction over training epochs

2
Figure 3: Trained perceptron decision boundary with test data

Figure 4: Final test results and model performance

3
2 Question 2: Implementing Convolution from
Scratch
2.1 Objective
The purpose of this assignment is to develop a deeper understanding of convo-
lution operations by manually implementing them without using built-in deep
learning libraries. This will help analyze how different kernels influence image
processing.

2.2 Implementation Details

2.3 Generalized Convolution Function
A convolution function was implemented that takes a grayscale image and ap-
plies a user-defined kernel. The function includes the following parameters:
• Input image: The grayscale image to be processed.
• Kernel: User-defined kernel (defaults to a random kernel if none is pro-
vided).

• Kernel size: Size of the kernel matrix.

• Stride: Step size for sliding the kernel.
• Padding: Can be ”valid” (no padding) or ”same” (zero-padding to main-
tain size).

• Mode: Option to perform either convolution or correlation.

The function manually implements the convolution operation using nested
loops, ensuring that no deep learning libraries are used.

2.4 Application of Different Kernels

The convolution function was applied to a grayscale image using various kernels
to achieve different effects:
• Edge Detection: A Sobel filter was used to detect edges in the image.

• Blurring: A Gaussian blur kernel was applied to smoothen the image.

• Sharpening: A sharpening kernel was applied to enhance image details.

4
2.5 Comparison of Convolution vs. Correlation
To compare convolution and correlation, two different kernels were used:
• A symmetric kernel:
0.250.250.250.25

• A non-symmetric kernel:

121000 − 1 − 2 − 1

The results were analyzed to observe how convolution and correlation yield
different outcomes.

2.6 Results and Analysis

2.7 Visualization of Results
• The original grayscale image was displayed.
• Output images for each applied kernel were visualized.
• A side-by-side comparison was conducted between manually implemented
convolution and NumPy-based convolution.

• The differences between convolution and correlation results were analyzed.

2.8 Effect of Different Kernels

• Edge Detection: The Sobel filter effectively highlighted edges in the
image by detecting intensity changes.
• Blurring: The Gaussian blur kernel smoothed the image by reducing
high-frequency components.
• Sharpening: The sharpening kernel enhanced details by amplifying high-
frequency components.

2.9 Impact of Kernel Size, Stride, and Padding

• Kernel Size: Larger kernels tend to blur the image more, while smaller
kernels maintain finer details.
• Stride: Increasing the stride reduces the output size and can lead to
information loss.
• Padding: Using ”same” padding preserves the original image size, whereas
”valid” padding reduces the output size.

5
2.10 Observations on Convolution vs. Correlation
• Convolution flips the kernel before applying it, while correlation does not.
• The difference in results is more pronounced when using asymmetric ker-
nels.

• Edge detection kernels, such as Sobel, yield different results when used in
convolution versus correlation.

2.11 Application of Multiple Kernels

Applying multiple kernels sequentially can enhance feature extraction. For ex-
ample:
• First applying an edge detection kernel, followed by a sharpening kernel,
enhances detected edges.

• Combining a blur filter followed by an edge detection filter reduces noise

before extracting edges.
This approach is often used in image preprocessing for deep learning models.

2.12 Conclusion
Through manual implementation, we have gained insights into how convolution
operations function and how different kernel types affect image processing. The
analysis of stride, padding, and kernel size provided a better understanding of
convolution’s impact on image transformation.

2.13 Images and Results

6
Figure 5: Original image before processing.

7
Figure 6: Converted grayscale image.

Figure 7: Effect of applying multiple kernels sequentially.

8
Figure 8: Edge detection using the Sobel filter.

Figure 9: Effect of padding on convolution output.

Figure 10: Effect of different stride values on convolution output.

9
Figure 11: Demonstration of different kernel applications.

10
3 Question 3: Implementation of a Convolu-
tional Neural Network for CIFAR-10
3.1 Introduction
The objective of this report is to implement a Convolutional Neural Network
(CNN) for image classification using the CIFAR-10 dataset. The study includes
dataset preparation, CNN model implementation, evaluation, data augmenta-
tion, and an ablation study to analyze hyperparameter effects.

3.2 Dataset
The CIFAR-10 dataset consists of 60,000 32x32 RGB images belonging to 10
classes. The dataset is split into 50,000 training images and 10,000 testing
images.

3.3 Dataset Preparation

• Load the dataset from Hugging Face.
• Normalize pixel values between 0 and 1.
• Convert labels into one-hot encoded format.

• Split dataset into training (80%) and testing (20%) subsets.

3.4 CNN Model Architecture

The Convolutional Neural Network (CNN) designed for CIFAR-10 classification
follows a hierarchical structure to extract spatial features from images effectively.
The architecture consists of multiple layers, each playing a crucial role in the
learning process.

3.5 Convolutional Layers

The network begins with convolutional layers that apply a set of learnable filters
to the input image. These filters help capture local patterns such as edges,
textures, and shapes. Each convolutional layer is followed by a Rectified Linear
Unit (ReLU) activation function, introducing non-linearity into the model and
enhancing its ability to learn complex features.

3.6 Batch Normalization

Batch normalization is applied after each convolutional layer to standardize
activations, stabilize training, and accelerate convergence. This reduces internal
covariate shifts and helps improve generalization.

11
3.7 Pooling Layers
Pooling layers are introduced after groups of convolutional layers to reduce the
spatial dimensions of feature maps. Max pooling is employed to retain the most
significant features while reducing computational complexity. This helps make
the model more robust to small translations and distortions in images.

3.8 Dropout Layers

To prevent overfitting, dropout layers are included after pooling layers. Dropout
randomly disables a fraction of neurons during training, forcing the network to
learn more generalized patterns rather than memorizing specific examples.

3.9 Fully Connected Layers

Following the feature extraction process, the output from the convolutional lay-
ers is flattened into a one-dimensional vector and passed through fully connected
layers. These layers learn complex representations and patterns in the data. The
final fully connected layer outputs class probabilities using a softmax activation
function.

3.10 Optimization and Loss Function

The model is optimized using the Adam optimizer, which combines momentum
and adaptive learning rates for efficient training. Categorical cross-entropy is
used as the loss function since CIFAR-10 is a multi-class classification problem.
This architecture ensures that the CNN captures both low-level and high-
level features, leading to effective classification of CIFAR-10 images.

3.11 Evaluation and Comparison of Model Performance

3.12 Performance Metrics
The CNN models are evaluated using the following metrics:

• Accuracy
• Precision
• Recall
• F1-Score

• Confusion Matrix

12
3.13 Comparison of Models
Two models were trained and evaluated:
• Model without Data Augmentation
• Model with Data Augmentation (flipping, rotation, shifting)

Model Accuracy Precision Recall F1-Score

Without Augmentation 0.8587 0.8604 0.8587 0.8589
With Augmentation 0.8507 0.8564 0.8507 0.8506

Table 1: Performance Metrics Comparison of CNN Models

3.14 Confusion Matrix Visualization

For both models, confusion matrices were generated to analyze classification
performance. The misclassified categories were identified and examined.

3.15 Loss and Accuracy Curves

Training and validation loss/accuracy curves were plotted to analyze conver-
gence and detect overfitting or underfitting.

3.16 Feature Map Visualization

Feature maps were extracted and visualized from different layers to analyze how
the model learns representations. Early layers capture edges, while deeper layers
capture high-level features.

3.17 Ablation Study: Hyperparameter Impact

An ablation study was conducted by modifying hyperparameters:
• Learning Rate: Tested values: 0.001, 0.01, 0.1.
• Batch Size: Compared batch sizes: 16, 32, 64.
• Number of Convolutional Filters: Evaluated 16, 32, and 64 filters.
• Number of Layers: Compared architectures with 3, 5, and 7 convolu-
tional layers.

3.18 Conclusion
This report presented the implementation of a CNN for CIFAR-10 classification.
The effect of data augmentation and hyperparameters on model performance
was analyzed. Results show that data augmentation improves generalization,
while hyperparameters significantly impact accuracy.

13
3.19 Images and Results

Figure 12: Confusion Matrix with Augmentation

14
Figure 13: Confusion Matrix with Augmentation

Figure 14: Loss Accuracy Curves

15
4 Question 4: Implementation of a Vanilla RNN
for Next-Word Prediction
4.1 Introduction
The objective of this project is to implement a Vanilla Recurrent Neural Network
(RNN) trained on a Shakespeare text dataset for next-word prediction. Instead
of using pre-trained embeddings (e.g., Word2Vec or GloVe), the model will learn
its own embeddings using a trainable embedding layer.

4.2 Dataset
The dataset used for training is the publicly available Shakespeare text dataset
from Hugging Face. The dataset consists of a large corpus of Shakespeare’s
works, and preprocessing involves tokenization and vocabulary creation.

4.3 Implementation Steps

4.4 Data Preprocessing
• Load the Shakespeare dataset from Hugging Face.
• Tokenize the text and create a vocabulary.
• Convert words into integer sequences.
• Split the dataset into training (80

4.5 Vanilla RNN Model

• Implement a custom RNN cell without LSTMs or GRUs.
• Utilize a trainable Embedding Layer to learn word representations.

• Process word sequences and predict the next word.

• Use Cross-Entropy Loss and the Adam optimizer for training.

4.6 Training the Model

• Train the model using Backpropagation Through Time (BPTT).
• Monitor training and validation loss across epochs.
• Save the trained model for text generation.

16
4.7 Evaluation and Performance Metrics
4.8 Generated Text Sequences
To test the model, a seed phrase such as “To be or not to” is provided, and the
model generates the next words iteratively.

4.9 Performance Metrics

The model is evaluated using the following metrics:
• Perplexity: Measures the uncertainty of the model.
• Word-level accuracy: Measures how often the predicted words match
the expected words.
• Loss curve visualization: Helps analyze model convergence.

4.10 Experimental Results

The trained model achieved the following results on the test dataset:

• Test Loss: 6.0157

• Test Perplexity: 409.8084
• Test Accuracy: 9.37

4.11 Ablation Study: Impact of Pretrained Word Embed-

dings
To analyze the impact of using pretrained word embeddings, the model is trained
separately using:
• Randomly initialized embeddings.
• Learned embeddings from the trainable embedding layer.

4.12 Conclusion
This report presents the implementation of a Vanilla RNN for next-word pre-
diction using a Shakespeare text dataset. The model successfully learns word
embeddings and generates text sequences. An ablation study is conducted to
compare learned embeddings with random embeddings, showing the effect of
word representation on model performance. The test results indicate that the
model struggles with long-term dependencies, leading to high perplexity and
low accuracy.

4.13 Images and Results

17
Figure 15: Pretrained Embeddings Confusion matrix.

Figure 16: Pretrained Embeddings Learning curves.

18
5 Question 5: Hyperparameter Search for CNN
and RNN
5.1 Introduction
Hyperparameter tuning is crucial for optimizing deep learning models. In this
task, we perform a random search over a predefined set of hyperparameters for
both Convolutional Neural Networks (CNNs) and Recurrent Neural Networks
(RNNs). The best configurations are selected based on validation accuracy, and
the models are evaluated on the test dataset.

5.2 Hyperparameter Search

The following hyperparameters were considered in the search:
• Learning rate
• Number of layers
• Number of neurons (for RNN) or filters (for CNN)
• Batch size
• Optimizer (Adam, SGD, RMSprop)
• Activation functions (ReLU, Tanh, Sigmoid)
• Dropout rate
• Kernel size (for CNN)
• Stride (for CNN)
• Weight initialization method (Xavier, He Normal)
RandomizedSearchCV from Scikit-Learn or a custom random sampling ap-
proach was used for hyperparameter tuning. Multiple models were trained, and
the best-performing configuration was selected based on validation accuracy.

5.3 Best Hyperparameter Configurations

The optimal hyperparameters for both CNN and RNN models are as follows:

5.4 Test Results

The best hyperparameter configurations were evaluated on the test dataset. The
results are summarized below:

5.5 Model Evaluation Metrics

The classification report for the best RNN and CNN models is presented below:

19
Hyperparameter Best Value
Weight Initialization Glorot Uniform
Stride 2
Optimizer SGD
Number of Units (RNN) 128
Number of Layers 2
Number of Filters (CNN) 128
Learning Rate 0.0001
Kernel Size (CNN) 3
Dropout 0.5
Batch Size 32
Activation Function Sigmoid

Table 2: Best Hyperparameter Configurations for CNN and RNN

Model Test Accuracy

Best RNN 92.70%
Best CNN 92.50%

Table 3: Test Accuracy for Best CNN and RNN Models

5.6 RNN Model Performance

Class Precision Recall F1-Score Support

0 0.85 0.93 0.89 310
1 0.97 0.93 0.95 690
Accuracy 0.93
Macro Avg 0.91 0.93 0.92 1000
Weighted Avg 0.93 0.93 0.93 1000

Table 4: RNN Model Evaluation

5.7 CNN Model Performance

20
Class Precision Recall F1-Score Support
0 0.85 0.91 0.88 310
1 0.96 0.93 0.94 690
Accuracy 0.93
Macro Avg 0.91 0.92 0.91 1000
Weighted Avg 0.93 0.93 0.93 1000

Table 5: CNN Model Evaluation

5.8 Conclusion
The hyperparameter search yielded optimal configurations for both CNN and
RNN models. The RNN model achieved slightly better test accuracy (92.70%)
than the CNN model (92.50%). Both models demonstrated high classification
performance, confirming the effectiveness of hyperparameter tuning in improv-
ing model performance.

3 # Deep Learning
No ratings yet
3 # Deep Learning
36 pages
Module5 ML
No ratings yet
Module5 ML
112 pages
AI Facilitators Handbook X
No ratings yet
AI Facilitators Handbook X
42 pages
DL Unit 4&5
No ratings yet
DL Unit 4&5
30 pages
Intro To CNN
No ratings yet
Intro To CNN
93 pages
Convolutional Networks
No ratings yet
Convolutional Networks
37 pages
Week6 - Intro To Convolutional Neural Networks
No ratings yet
Week6 - Intro To Convolutional Neural Networks
25 pages
Convolutional Neural Networks - Part 1
No ratings yet
Convolutional Neural Networks - Part 1
44 pages
22BEECE12
No ratings yet
22BEECE12
5 pages
CNN and RNN Mixed Model For Image Classification
No ratings yet
CNN and RNN Mixed Model For Image Classification
7 pages
ChatGPT - Convolution and Pooling Operations
No ratings yet
ChatGPT - Convolution and Pooling Operations
43 pages
Convolution Nueral Networks
No ratings yet
Convolution Nueral Networks
32 pages
03 - CNN
No ratings yet
03 - CNN
10 pages
PNAL9 CNNs
No ratings yet
PNAL9 CNNs
61 pages
CNN 1
No ratings yet
CNN 1
9 pages
Chap4 CNN (20240205) - DL4H Practioner Guide
No ratings yet
Chap4 CNN (20240205) - DL4H Practioner Guide
23 pages
Convolution Neural Networks: S. Sumitra Department of Mathematics Indian Institute of Space Science and Technology
No ratings yet
Convolution Neural Networks: S. Sumitra Department of Mathematics Indian Institute of Space Science and Technology
123 pages
Convolutional Networks 2024
No ratings yet
Convolutional Networks 2024
44 pages
CH VI - Convolutional Neural Network - 24
No ratings yet
CH VI - Convolutional Neural Network - 24
33 pages
Experiment 10 1
No ratings yet
Experiment 10 1
3 pages
Experiment 10-1
No ratings yet
Experiment 10-1
3 pages
Experiment 2
No ratings yet
Experiment 2
7 pages
Convolutional Neural Networks
No ratings yet
Convolutional Neural Networks
35 pages
Unit 3 CNN 2024
No ratings yet
Unit 3 CNN 2024
58 pages
ML 03
No ratings yet
ML 03
9 pages
ANN Unit 4
No ratings yet
ANN Unit 4
66 pages
Aiml Ece Unit-5
No ratings yet
Aiml Ece Unit-5
48 pages
Convolutional Neural Network
No ratings yet
Convolutional Neural Network
11 pages
MCS-224 Artificial Intelligence and Machine Learning
No ratings yet
MCS-224 Artificial Intelligence and Machine Learning
493 pages
Expt 6
No ratings yet
Expt 6
1 page
Unit 1 - Unit 2
No ratings yet
Unit 1 - Unit 2
288 pages
Explain The Convolution Operation in The Context of Image Processing. How Does It Differ From Standard Matrix Multiplication?
No ratings yet
Explain The Convolution Operation in The Context of Image Processing. How Does It Differ From Standard Matrix Multiplication?
5 pages
Chapter 4 Ann
No ratings yet
Chapter 4 Ann
33 pages
Assignment No 2 - OCR CNN
No ratings yet
Assignment No 2 - OCR CNN
2 pages
MLOA Exp 1 - C121
No ratings yet
MLOA Exp 1 - C121
18 pages
DL6 - Convnets 4
No ratings yet
DL6 - Convnets 4
57 pages
Lecture 6
No ratings yet
Lecture 6
17 pages
CS601 Machine Learning Unit 3
No ratings yet
CS601 Machine Learning Unit 3
47 pages
Building Agenticai
No ratings yet
Building Agenticai
29 pages
Co2 CNN 3
No ratings yet
Co2 CNN 3
31 pages
CNN 1
No ratings yet
CNN 1
23 pages
Selected Algorithm: Mse N y y
No ratings yet
Selected Algorithm: Mse N y y
2 pages
The Math Behind Convolutional Neural Networks - Towards Data Science
No ratings yet
The Math Behind Convolutional Neural Networks - Towards Data Science
37 pages
Convolutinal Neural Networks
No ratings yet
Convolutinal Neural Networks
43 pages
Convolutional Neural Network (CNN)
No ratings yet
Convolutional Neural Network (CNN)
27 pages
Unit Iii Deep Learning
No ratings yet
Unit Iii Deep Learning
31 pages
Module-4 DL
No ratings yet
Module-4 DL
22 pages
CV - T3 - Unit-7
No ratings yet
CV - T3 - Unit-7
36 pages
First Report
No ratings yet
First Report
8 pages
Mnist Handwritten Digit Classification
No ratings yet
Mnist Handwritten Digit Classification
26 pages
Image Recognition Using Neural Networks
No ratings yet
Image Recognition Using Neural Networks
18 pages
Convolutional Neural Networks: Convolutions, Pooling and Cnns. Neural Architectures For Computer Vision
No ratings yet
Convolutional Neural Networks: Convolutions, Pooling and Cnns. Neural Architectures For Computer Vision
64 pages
DLT Unit - 4
No ratings yet
DLT Unit - 4
36 pages
Practical 08 Solutions
No ratings yet
Practical 08 Solutions
6 pages
Module 3 Notes
No ratings yet
Module 3 Notes
22 pages
Lab 5 - Intro To Convolutional Neural Networks
No ratings yet
Lab 5 - Intro To Convolutional Neural Networks
52 pages
Guddu Jha - Organized
No ratings yet
Guddu Jha - Organized
3 pages
Vineela Ann1
No ratings yet
Vineela Ann1
9 pages
3098 15835 1 PB 2011 PDF
No ratings yet
3098 15835 1 PB 2011 PDF
6 pages
Shivani Verma (Editor), Pradeep Tomar (Editor) - Impact of AI Technologies On Teaching, Learning, and Research in Higher Education-IGI Global (2020)
No ratings yet
Shivani Verma (Editor), Pradeep Tomar (Editor) - Impact of AI Technologies On Teaching, Learning, and Research in Higher Education-IGI Global (2020)
321 pages
Assigment 2024
No ratings yet
Assigment 2024
25 pages
Ch3 CNN
No ratings yet
Ch3 CNN
64 pages
Social Psychology Trends
No ratings yet
Social Psychology Trends
5 pages
Convolution Neural Networks U2
No ratings yet
Convolution Neural Networks U2
24 pages
Fake Homework Assignments
100% (1)
Fake Homework Assignments
5 pages
Mirdasm Report File
No ratings yet
Mirdasm Report File
101 pages
Understanding of Convolutional Neural Network (CNN) - Deep Learning
No ratings yet
Understanding of Convolutional Neural Network (CNN) - Deep Learning
7 pages
Rtmnu Machine Learning Paper Winter 2024
100% (1)
Rtmnu Machine Learning Paper Winter 2024
4 pages
Science Fiction and Artificial Intellige
No ratings yet
Science Fiction and Artificial Intellige
45 pages
GANs 1
No ratings yet
GANs 1
25 pages
Report
No ratings yet
Report
33 pages
Kashish Research Paper
No ratings yet
Kashish Research Paper
5 pages
There Are 5 Questions. Answer All Questions
No ratings yet
There Are 5 Questions. Answer All Questions
4 pages
W20 L1 Add Mixed Numbers
No ratings yet
W20 L1 Add Mixed Numbers
20 pages
Industry 4.0 - The Future of Manufacturing - SAP
No ratings yet
Industry 4.0 - The Future of Manufacturing - SAP
9 pages
Assignment 2
No ratings yet
Assignment 2
12 pages
Gen AI - Sessional-II Exam Solution
No ratings yet
Gen AI - Sessional-II Exam Solution
10 pages
200001
100% (1)
200001
21 pages
ML Unit 1
No ratings yet
ML Unit 1
36 pages
Aiml Unit 4
No ratings yet
Aiml Unit 4
20 pages
Black Book Syncaxis
No ratings yet
Black Book Syncaxis
41 pages
EAP5A Blueprint Example 4
No ratings yet
EAP5A Blueprint Example 4
4 pages
CH 12
No ratings yet
CH 12
37 pages
Machine Learning-Lecture 03
No ratings yet
Machine Learning-Lecture 03
19 pages
Module 2 - Intelligent Systems
100% (1)
Module 2 - Intelligent Systems
7 pages
Organizational Dynamics: Daniele Scarpi, Eleonora Pantano
No ratings yet
Organizational Dynamics: Daniele Scarpi, Eleonora Pantano
10 pages
NIM-BSEE-2021-26 Lab 8 67440d1f9613d
No ratings yet
NIM-BSEE-2021-26 Lab 8 67440d1f9613d
7 pages
Unit 3 Clustering
No ratings yet
Unit 3 Clustering
28 pages
AIML Lab Programs
No ratings yet
AIML Lab Programs
13 pages
Six Figure Career Blueprint
No ratings yet
Six Figure Career Blueprint
21 pages
Support Vector Machine
No ratings yet
Support Vector Machine
7 pages
Generating Adversarial Malware Examples For Black-Box Attacks Based On GAN
No ratings yet
Generating Adversarial Malware Examples For Black-Box Attacks Based On GAN
7 pages
CV
No ratings yet
CV
2 pages
Machine Learning - Advanced Concepts
From Everand
Machine Learning - Advanced Concepts
Derrick Mwiti
No ratings yet