Application of LLMs
Dataset Selection:
Imagenette
(https://s3.amazonaws.com/fast-ai-imageclas/imagenette2-160.tgz)
is a subset of the larger ImageNet dataset, designed to be more manageable
while still providing a meaningful challenge for image classification tasks.
The dataset includes images from 10 classes: cassette player, chain saw,
church, English springer, French horn, garbage truck, gas pump, golf ball,
parachute, and tench.
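In the Imagenette archive, each class lives in a folder named after its WordNet ID rather than its readable name. The sketch below shows one way to recover the class name from an image path. The helper name `label_for` and the example path are illustrative assumptions, not part of the original work.

```python
from pathlib import PurePosixPath

# WordNet IDs used as folder names in Imagenette, mapped to readable labels.
WNID_TO_LABEL = {
    "n01440764": "tench",
    "n02102040": "English springer",
    "n02979186": "cassette player",
    "n03000684": "chain saw",
    "n03028079": "church",
    "n03394916": "French horn",
    "n03417042": "garbage truck",
    "n03425413": "gas pump",
    "n03445777": "golf ball",
    "n03888257": "parachute",
}


def label_for(path):
    """Return the class name for a path such as 'train/n01440764/img.JPEG'.

    Hypothetical helper: it assumes the parent folder is the WordNet ID.
    """
    wnid = PurePosixPath(path).parent.name
    return WNID_TO_LABEL[wnid]


print(label_for("imagenette2-160/train/n03028079/example.JPEG"))
```

Keeping the mapping in one dictionary makes it easy to track target labels when files are later renamed or merged.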
Dataset Preprocessing:
The training data was merged into a single set, with the files renamed and
the target labels tracked. Each image was then zero-padded to make it square
and finally resized to 80 × 80 px. The resizing was done to reduce the number
of parameters in the model. Similar preprocessing was applied to the
test (validation) dataset.
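The padding and resizing steps can be sketched as follows. This is a minimal illustration, not the original preprocessing script. It assumes the padding is split evenly around the image (the report does not say where the zeros were added), that Pillow is used for image handling, and that the default resampling filter is acceptable. Both function names are hypothetical.

```python
def square_padding(width, height):
    """Return (left, top, right, bottom) padding that makes the image square.

    Assumption: padding is centred; any odd pixel goes to the right/bottom.
    """
    side = max(width, height)
    pad_w = side - width
    pad_h = side - height
    left = pad_w // 2
    top = pad_h // 2
    return left, top, pad_w - left, pad_h - top


def pad_and_resize(image, size=80):
    """Zero-pad a Pillow image to a square, then resize it to size x size."""
    # Imported lazily so the geometry helper above works without Pillow.
    from PIL import Image

    width, height = image.size
    left, top, _, _ = square_padding(width, height)
    side = max(width, height)
    canvas = Image.new("RGB", (side, side), (0, 0, 0))  # black = zero padding
    canvas.paste(image.convert("RGB"), (left, top))
    return canvas.resize((size, size))


# Example: a 213 x 160 image needs 53 rows of padding, split 26 / 27.
print(square_padding(213, 160))
```

Padding before resizing preserves the aspect ratio of the original content, at the cost of some black borders in the 80 × 80 result.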
MODEL ARCHITECTURE
Initial Approach:
Experiments:
Increasing Dropout Too High, Adding Too Much L2 Regularization, and Adding
Another Dense Layer:
Final structure
2. Convolutional Layer 1:
3. Convolutional Layer 2:
4. Convolutional Layer 3:
5. Convolutional Layer 4:
256 units.
Activation is sigmoid.
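To illustrate why a smaller input keeps the parameter count down, the sketch below counts the weights of a 256-unit dense layer fed by flattened convolutional features. Only the four convolutional layers and the 256 units come from the report. The 2 × 2 pooling after each convolution, the 'same' padding, and the 128 output channels are assumptions made purely for the arithmetic.

```python
def dense_layer_params(input_side, n_pools=4, channels=128, units=256):
    """Count weights and biases of a dense layer after flattening.

    Hypothetical setup: each of n_pools stages halves the spatial side
    (2x2 pooling, 'same' convolutions), ending with `channels` feature maps.
    """
    side = input_side // (2 ** n_pools)
    features = side * side * channels
    return features * units + units  # weights + biases


small = dense_layer_params(80)    # 5 * 5 * 128 features into 256 units
large = dense_layer_params(160)   # 10 * 10 * 128 features into 256 units
print(small, large)
```

Under these assumptions, halving the input side cuts the dense layer to roughly a quarter of its size, because the flattened feature count scales with the image area.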
The model achieves an accuracy of over 76% on test data and over 80% on training
data.
Other metrics:
Class              Precision  Recall  F1-score  Support
cassette player    0.79       0.79    0.79      357
chain saw          0.68       0.65    0.66      386
church             0.76       0.81    0.79      409
English springer   0.92       0.79    0.85      395
French horn        0.63       0.83    0.72      394
garbage truck      0.77       0.85    0.81      389
gas pump           0.71       0.71    0.71      419
golf ball          0.84       0.71    0.77      399
parachute          0.87       0.84    0.85      390
tench              0.92       0.81    0.86      387
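The per-class figures above can be cross-checked with a few lines of arithmetic. The sketch below assumes the four numeric columns are precision, recall, F1-score, and support, in that order. It recomputes F1 as the harmonic mean of precision and recall, and derives the macro-averaged F1 and the support-weighted recall, which for single-label classification equals the overall accuracy. Because the table values are rounded to two decimals, the results are approximate.

```python
# (class, precision, recall, f1, support), copied from the table above.
REPORT = [
    ("cassette player", 0.79, 0.79, 0.79, 357),
    ("chain saw", 0.68, 0.65, 0.66, 386),
    ("church", 0.76, 0.81, 0.79, 409),
    ("English springer", 0.92, 0.79, 0.85, 395),
    ("French horn", 0.63, 0.83, 0.72, 394),
    ("garbage truck", 0.77, 0.85, 0.81, 389),
    ("gas pump", 0.71, 0.71, 0.71, 419),
    ("golf ball", 0.84, 0.71, 0.77, 399),
    ("parachute", 0.87, 0.84, 0.85, 390),
    ("tench", 0.92, 0.81, 0.86, 387),
]


def f1_score(precision, recall):
    """Harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)


def weighted_average(values, weights):
    """Average of values, weighted by the matching weights."""
    return sum(v * w for v, w in zip(values, weights)) / sum(weights)


supports = [row[4] for row in REPORT]
total_support = sum(supports)
macro_f1 = sum(row[3] for row in REPORT) / len(REPORT)
# recall * support is the number of correct predictions per class,
# so the support-weighted recall is the overall accuracy.
overall_accuracy = weighted_average([row[2] for row in REPORT], supports)

print(f"images: {total_support}")
print(f"macro F1: {macro_f1:.3f}")
print(f"accuracy from recalls: {overall_accuracy:.3f}")
```

The accuracy recovered this way is roughly 0.78, in line with the reported test accuracy of over 76%.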