DAP Project Group6
DAP Project Group6
Nguyen Thien Ha, Nguyen Xuan An, Nguyen Le Hoang Nam, and Le Nguyen
Dang Khoi
1 Introduction
1.1 Background and Market Overview
With the rapid advancement of artificial intelligence (AI) and deep learning,
image classification has become a crucial task in various fields such as health-
care, security, autonomous vehicles, and e-commerce. Convolutional Neural Net-
works (CNNs) have revolutionized image recognition, significantly outperform-
ing traditional machine learning algorithms. Globally, companies and research
institutions continuously develop and optimize deep learning models to enhance
accuracy and efficiency. In Vietnam and other developing countries, AI appli-
cations in image processing are gaining attention, particularly in smart cities,
automated surveillance, and retail analytics. This project focuses on classify-
ing images from the CIFAR-10 dataset using CNNs, aligning with the trend of
AI-driven computer vision solutions.
By following this structured approach, the project aims to develop a CNN model
that not only achieves high classification accuracy on the CIFAR-10 dataset but
also generalizes effectively to unseen data. The final model will demonstrate re-
duced overfitting, optimized computational efficiency, and enhanced applicability
for real-world image classification tasks.
The expected outcomes of this project include the development of a well-
optimized CNN model with competitive accuracy on CIFAR-10, targeting a
performance range of approximately 90–95%. Additionally, the project will in-
troduce improved training strategies and provide valuable insights into address-
ing challenges associated with small-image classification. By refining existing
methodologies and exploring innovative solutions, this research aims to con-
tribute to the advancement of deep learning techniques in image recognition.
Beyond CIFAR-10, the findings from this study can be applied to various
domains, including real-time video processing, automated surveillance, and AI-
driven diagnostic tools in healthcare. The insights gained from evaluating dif-
ferent architectures and training techniques will contribute to the broader AI
research community, fostering further advancements in deep learning-based im-
age classification.
2 Methodology
The image classification task on the CIFAR-10 dataset requires building a con-
volutional neural network (CNN) model to classify 60,000 color images of size
32×32 pixels into 10 classes (airplane, automobile, bird, cat, deer, dog, frog,
horse, ship, truck).
Solution Name: Using VGG-16 model with Transfer Learning and Data Aug-
mentation.
4 Group 6
Algorithm: VGG16_CIFAR10_Training
Input: Training data (X_train, y_train), Test data (X_test, y_test)
IC by CNN 5
3 Related Works
3.1 Related Solutions
This section reviews studies on image classification using convolutional neural
networks (CNNs) with the CIFAR-10 dataset, emphasizing the VGG-16 model.
It explores existing approaches, practical applications of VGG-16, and research
involving CIFAR-10, providing a foundation for the current project.
6 Group 6
References
1. Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, “Gradient-Based Learning Applied
to Document Recognition,” Proceedings of the IEEE, vol. 86, no. 11, pp. 2278–2324,
1998.
2. A. Krizhevsky, I. Sutskever, and G. E. Hinton, “ImageNet Classification with Deep
Convolutional Neural Networks,” Advances in Neural Information Processing Sys-
tems (NeurIPS), pp. 1097–1105, 2012.
3. K. Simonyan and A. Zisserman, “Very Deep Convolutional Networks for Large-
Scale Image Recognition,” arXiv:1409.1556, 2014.
4. G. Huang, Z. Liu, L. van der Maaten, and K. Q. Weinberger, “Densely Con-
nected Convolutional Networks,” IEEE Conference on Computer Vision and Pat-
tern Recognition (CVPR), pp. 4700–4708, 2020.
8 Group 6