
Application of LLMs

Task 1: Design a CNN Architecture for Multi-Class Classification

Objective: Develop a Convolutional Neural Network (CNN) for multi-class classification using a publicly available dataset of your choice from the internet. This task involves designing, experimenting with, and improving your CNN architecture while documenting all key steps, challenges, and insights.

Dataset Selection:

Imagenette (https://s3.amazonaws.com/fast-ai-imageclas/imagenette2-160.tgz) is a subset of the larger ImageNet dataset, designed to be more manageable while still providing a meaningful challenge for image classification tasks. It contains images from 10 classes: cassette player, chain saw, church, English springer, French horn, garbage truck, gas pump, golf ball, parachute, and tench.
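For reference, a minimal sketch of how the archive might be fetched and extracted with a Keras utility; the report does not describe the download step, so this is an assumption:

```python
import tensorflow as tf

# Download and extract the 160px Imagenette archive into the Keras cache.
archive = tf.keras.utils.get_file(
    fname="imagenette2-160.tgz",
    origin="https://s3.amazonaws.com/fast-ai-imageclas/imagenette2-160.tgz",
    extract=True,
)
# The extracted folder contains train/ and val/ subdirectories,
# with one subfolder per class.
```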

Dataset Preprocessing:

The training data was merged into a single directory while the target labels were tracked, and the files were renamed. Zero padding was then added to make each image square, and finally the images were resized to 80 px x 80 px to reduce the number of parameters in the model. The same preprocessing was applied to the test (validation) dataset.
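A minimal sketch of the pad-and-resize step, assuming Pillow is used (the report does not name the library; the function name is illustrative):

```python
from PIL import Image

def pad_to_square_and_resize(path, size=80):
    """Zero-pad an image to a square canvas, then resize to size x size."""
    img = Image.open(path).convert("RGB")
    w, h = img.size
    side = max(w, h)
    # Paste the image onto a black (zero-valued) square canvas, centred.
    canvas = Image.new("RGB", (side, side), (0, 0, 0))
    canvas.paste(img, ((side - w) // 2, (side - h) // 2))
    return canvas.resize((size, size))
```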

MODEL ARCHITECTURE
Initial Approach:

The starting point was the architecture outlined in the TensorFlow Convolutional Neural Network (CNN) documentation (https://storage.googleapis.com/tensorflow_docs/docs/site/en/tutorials/images/cnn.ipynb).
Modifications:

 Applying Data Augmentation: rotate, shift, and flip images.
 Adding Noise to Images Before Training: use Gaussian noise.
 Adding an Extra Convolutional Layer: another convolutional layer in the architecture.
 Using Leaky ReLU: activation function in certain layers.
 Using Dropout: regularization method in specific layers.
 Using L2 Regularization: regularization applied to certain layers.
 Using Early Stopping: callback to prevent overfitting.
 Reducing Learning Rate on Plateau: callback to adjust the learning rate during training.

These modifications were aimed at reducing the chance of overfitting; the extra convolutional layer also improved accuracy.
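A sketch of how several of these modifications might be set up in Keras; the rotation/shift ranges, noise level, and callback patience values are assumptions, not the report's actual settings:

```python
import tensorflow as tf
from tensorflow.keras import layers, callbacks

# Data augmentation: rotate, shift, and flip (ranges are illustrative).
augment = tf.keras.Sequential([
    layers.RandomFlip("horizontal"),
    layers.RandomRotation(0.1),          # rotate up to ~36 degrees
    layers.RandomTranslation(0.1, 0.1),  # shift up to 10% in each axis
])

# Gaussian noise applied to inputs; active during training only.
noise = layers.GaussianNoise(0.05)  # assumes pixel values scaled to [0, 1]

# Callbacks: early stopping and learning-rate reduction on plateau.
training_callbacks = [
    callbacks.EarlyStopping(monitor="val_loss", patience=5,
                            restore_best_weights=True),
    callbacks.ReduceLROnPlateau(monitor="val_loss", factor=0.5, patience=3),
]
```

The augmentation and noise layers can be placed at the front of the model so they run only at training time and are skipped at inference.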

Experiments:
Setting Dropout Too High, Adding Too Much L2 Regularization, and Adding Another Dense Layer:

These modifications collectively slowed the learning process significantly. The excessive dropout rates and high L2 regularization penalized the model's weights too harshly, while the added complexity of another dense layer increased the model's capacity but made it harder to optimize. As a result, the model struggled to reach good accuracy during training, began to overfit prematurely, and failed to generalize well to new data.

Final Structure:

1. Input Layer: Accepts input images of size (80x80x3).

2. Convolutional Layer 1:

 32 filters, 3x3 kernel size, with ReLU activation.

 Output is followed by:


 MaxPooling2D: Pool size of (2x2).

 Dropout: Rate of 0.2 to prevent overfitting.

3. Convolutional Layer 2:

 64 filters, 3x3 kernel size, with ReLU activation.

 Includes L2 regularization with a factor of 0.001.

 Output is followed by:

 AveragePooling2D: Pool size of (2x2).

 Dropout: Rate of 0.1.

4. Convolutional Layer 3:

 64 filters, 3x3 kernel size, with ReLU activation.

 Output is followed by:

 MaxPooling2D: Pool size of (2x2).

 Dropout: Rate of 0.2.

5. Convolutional Layer 4:

 128 filters, 3x3 kernel size.

 Activation is LeakyReLU with an alpha of 0.05.

 Output is followed by:

 Dropout: Rate of 0.1.

6. Flatten Layer: Converts the multi-dimensional feature maps into a 1D vector.

7. Fully Connected Layer 1:

 256 units.

 Activation is LeakyReLU with an alpha of 0.1.

 Output is followed by:

 Dropout: Rate of 0.3.

8. Fully Connected Output Layer:

 10 units, corresponding to the number of output classes.

 Activation is sigmoid.

 Includes L2 regularization with a factor of 0.0001.
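Assembling the layers above, a minimal Keras sketch of the described architecture; the optimizer and loss are assumptions, since the report does not state them:

```python
from tensorflow.keras import layers, models, regularizers

model = models.Sequential([
    layers.Input(shape=(80, 80, 3)),
    # Block 1
    layers.Conv2D(32, (3, 3), activation="relu"),
    layers.MaxPooling2D((2, 2)),
    layers.Dropout(0.2),
    # Block 2 (with L2 regularization)
    layers.Conv2D(64, (3, 3), activation="relu",
                  kernel_regularizer=regularizers.l2(0.001)),
    layers.AveragePooling2D((2, 2)),
    layers.Dropout(0.1),
    # Block 3
    layers.Conv2D(64, (3, 3), activation="relu"),
    layers.MaxPooling2D((2, 2)),
    layers.Dropout(0.2),
    # Block 4 (LeakyReLU activation)
    layers.Conv2D(128, (3, 3)),
    layers.LeakyReLU(alpha=0.05),
    layers.Dropout(0.1),
    # Classifier head
    layers.Flatten(),
    layers.Dense(256),
    layers.LeakyReLU(alpha=0.1),
    layers.Dropout(0.3),
    # Sigmoid output as specified in the report.
    layers.Dense(10, activation="sigmoid",
                 kernel_regularizer=regularizers.l2(0.0001)),
])

# Optimizer and loss below are assumptions, not stated in the report.
model.compile(optimizer="adam",
              loss="categorical_crossentropy",
              metrics=["accuracy"])
```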


Performance of the Model

The model achieves an accuracy of over 76% on test data and over 80% on training
data.
Other metrics:

                   precision  recall  f1-score  support

cassette player         0.79    0.79      0.79      357
chain saw               0.68    0.65      0.66      386
church                  0.76    0.81      0.79      409
English springer        0.92    0.79      0.85      395
French horn             0.63    0.83      0.72      394
garbage truck           0.77    0.85      0.81      389
gas pump                0.71    0.71      0.71      419
golf ball               0.84    0.71      0.77      399
parachute               0.87    0.84      0.85      390
tench                   0.92    0.81      0.86      387

accuracy                                  0.78     3925
macro avg               0.79    0.78      0.78     3925
weighted avg            0.79    0.78      0.78     3925
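These per-class figures have the format of scikit-learn's classification report; a minimal sketch of how such a table could be produced (model, test_images, y_true, and class_names are illustrative names, not taken from the report):

```python
import numpy as np
from sklearn.metrics import classification_report

# y_true: integer labels of the test set; probs: model output per class.
probs = model.predict(test_images)
y_pred = np.argmax(probs, axis=1)
print(classification_report(y_true, y_pred, target_names=class_names))
```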

Analysis of Performance: (Please refer to the following confusion matrix)


The model misclassifies many images labelled chain saw as French horn. This can be explained by the fact that many images of both classes also show a person holding the object. Some other misclassifications can be attributed to similar reasons, though most are due to shortcomings of the model that could not be completely eliminated.
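The confusion matrix figure itself is not reproduced in this extract; a minimal sketch of how it could be generated with scikit-learn, reusing the illustrative names from the previous sketch:

```python
import matplotlib.pyplot as plt
from sklearn.metrics import ConfusionMatrixDisplay

# Plot the confusion matrix for the test-set predictions.
ConfusionMatrixDisplay.from_predictions(
    y_true, y_pred, display_labels=class_names, xticks_rotation=45)
plt.tight_layout()
plt.show()
```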
