Paper 3887
Paper 3887
IJARSCT
International Journal of Advanced Research in Science, Communication and Technology (IJARSCT)
Abstract: In this project, we work on image classification of the CIFAR-10 dataset using supervised machine
learning techniques. The dataset consists of 60,000 32x32RGB images containing one of 10 object classes,
with 6000 images per class. We experiment with various learning algorithms including nearest neighbor
classifier, one-vs-all classification, Softmax classifier, two-layer fully connected artificial neural network
(ANN), deep convolutional neural network (CNN), and deep residual networks (ResNet). We use cross
validation by splitting the 50,000training data into49,000 training samples and 1,000 validation samples to
select the optimized hyper parameters for each parametric classifier. Among all methods, the 56-layer deep
residual network yields the best performance with a training accuracy above 99% and validation accuracy
of 93.6%.
Keywords: Image Classification; CIFAR-10; Supervised Machine Learning Algorithm; Deep Convolutional
Neural Network (CNN); Deep Residual Network (ResNet).
I. INTRODUCTION
Image classification is an active area of research and has been studied in common applications such as unmanned vehicles
and emergency robotics. In this paper, an integrated neural network (CNN)-based architecture is proposed using the Cifar
10 dataset, which has a total of 60,000 images [1]. These images are divided into training and testing sections, each with
50,000 and 10,000 images respectively. Although the CNN-based image classification methods presented herein are very
efficient, they require a large amount of memory. The purpose of our article is to perform image classification under limited
memory conditions. Such cases often occur in embedded systems. The proposed method achieves an accuracy of 85.9 while
requiring only 2 GB of GPU memory.
4.2 Comparison
The accuracy, as well as running time of all the tested models, are presented in the following table
V. CONCLUSION
The results obtained in this study are important because they suggest that the accuracy of the CNN model can be improved
simply by using a programming language to execute and modify traditional structures. One of the interesting conclusions is
the ratio of accuracy to delay. This is because the last model required the most computational power to achieve the highest
accuracy, while the most traditional CNN structure achieved the highest delay-to accuracy ratio. Based on existing
components, AI implementers need to determine if it is worth relying on a more robust model.
REFERENCES
[1]. Raniah Zaheer , Humera Shaziya “A Study of the Optimization Algorithms in Deep Learning” in IEEE 2019.
[2]. K. Simonyan and A. Zisserman, “Very deep convolutional networks for Large-Scale image recognition,” Sep. 2014.
[3]. Ian Goodfellow, Yoshua Bengio, and Aaron Courville. Deep Learning. MIT Press, 2016.
[4]. https://www.ijcaonline.org/archives/volume176/nu mb er37/pandit-2020-ijca-920489.pdf
[5]. http://www.divaportal.org/smash/get/diva2:11111 44/F ULLTEXT02.pdf
[6]. https://scihub.hkvisa.net/10.1109/IAEAC47372.2019. 8997743
[7]. https://scihub.hkvisa.net/10.1109/AICCSA.2018.861 2 873
[8]. http://www.divaportal.org/smash/get/diva2:11111 44/F ULLTEXT02.pdf
[9]. https://www.ijert.org/rfid-based intelligent- busmanagement-and-monitoring system
[10]. https://scihub.hkvisa.net/10.1109/ICCASM.2010.5 62 0407
[11]. https://scihub.hkvisa.net/10.1109/IAEAC47372.2019 .8997743
[12]. https://scihub.hkvisa.net/10.1109/IAEAC47372.2019 .8997743
[13]. https://ieeexplore.ieee.org/document/8997743
[14]. http://www.divaportal.org/smash/get/diva2:1111144/FULLTEXT02.pdf