Project Final Report Group 9
Project Final Report Group 9
1. Introduction
The coronavirus, commonly called COVID-19, has impacted the global community
over the last couple of years. According to information from the World Health
Organization (WHO), there have been millions of confirmed cases of COVID-19, with
a significant number of fatalities.
However, mask face recognition technology has penetrated modern society as a
phenomenon of today’s contemporary daily life. This technology has an essential
function in keeping people alive and healthy. However, the contexts differ from those
of 2020 in general and local settings. More importantly, contemporary research explores
the development of mask detection models starting with specific conditions where
CCTV monitoring guarantees adherence to mandatory wearing. Nonetheless, this
approach compromises dynamic changes yet acknowledges targeted preventative
measures.
This project focuses on developing a mask face detection system utilizing multiple
classifiers within a Convolutional Neural Network (CNN). In terms of security
applications, mask face detection exhibits faster processing than alternative security
systems, allowing for simultaneous analysis or detection of multiple faces. Employing
CNN enhances accuracy in identifying masked faces within a specific area. Compared
to other approaches, the system's quickness is made possible by its quick response to
detect faces entering the webcam area.
3. Experimental Evaluation
3.1 Methodology
To objectively evaluate our method, we used a set of essential metrics such as AUC,
accuracy, recall, and precision precisely on the test data to rate its performance
comprehensively. The experiment intended to test the hypothesis that, even though data
augmentation is considered optional for most deep learning algorithms, it can
dramatically improve model performance in a small dataset. The experimental
methodology included preprocessing photos with data augmentation, training the
model, and evaluating its performance on a testing set.
For the experiment, a relatively small dataset was used, with the use of data
augmentation to enhance its size artificially. The training and test data were carefully
chosen to generalize effectively in different instances. 80% of the data was set aside for
training, and 20% was provided in a test, which is standard practice when conducting
machine learning evaluations, ensuring adequate relevant information was used to train.
In contrast, the reserved testing set was an independent measure of the model's
generalization performance.
In addition to the traditional metrics, real-world performance data was collected by
implementing the model with a live webcam to detect whether individuals were wearing
masks. This real-time application assessed the model's performance in a scenario
relevant to its intended use. In the presentation and analysis of results, detailed
consideration was given to each metric, showing the strengths and weaknesses of the
model under discussion. Also, comparisons were drawn between other methods dealing
with similar problems to put the performance of our proposed approach in perspective.
Alongside the 80-20 training split and practical implementation, the selected radical
evaluation framework was built to encompass more than just the model's utility by its
results, providing further opportunities for image classification scholars.
While Support Vector Machines (SVM) have been employed in related papers for face
mask detection, the prevalent advantages of Convolutional Neural Networks (CNNs) in
this context must be considered. SVMs rely on features and require assistance to capture
complex patterns and relationships inside images, particularly in challenging
applications such as face mask recognition. On the other hand, CNN outperforms by
automatically learning hierarchical features, allowing for better adaption to the nuanced
complexity of visual input. However, SVM has been employed in previous research but
the current trend suggests that CNNs are a more practical approach for face mask
recognition, resulting in an efficient model.
3.2 Results
Table 1 Results
0.9
0.8
0.7
0.6
0.5
0.4
0.3
0.2
0.1
0
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
0.7
0.6
0.5
0.4
0.3
0.2
0.1
0
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
Loss
3.3 Discussion
As show above in Table 1 and illustrated in Figure 1, precision, recall, and accuracy are
all equal, it means that the model has correctly predicted all instances of both classes
(true positives and true negatives), and there are no false positives or false negatives.
This is an ideal situation, but it is important to know that it is uncommon.
The initial run of the code runs all epochs and keeps record of the model with the highest
accuracy. As show in Table 1 and in the 2 figures above, epoch 17/20 had the best
accuracy, and therefore the model will be saved and ready to run whenever the program
is needed. This model achieved an accuracy, precision and recall of 86.27%, an AUC
of 93.43% and a loss value of 0.3267.
Although our model produced promising results, it is important to highlight the
drawbacks of our model. The most important drawback and the one that needs attention
in future work is that our model can only detect blue masks, as it was only trained on
this type of mask. It needs to be trained on different types and colors of masks despite
the blue medical masks being the most common ones, this is because there now exists
many variations and colors and it would need to detect them all for it to be practical. It
is also important to mention that the model can’t detect masks when they are resting on
the chin, this is an advantage as many people just place the masks on their chin which
is not safe.
4. Related Work
4.1 Face Mask Detection Using Transfer Learning of InceptionV3
[1] addresses the critical challenge of identifying individuals not complying with the
mask-wearing protocols in public spaces, particularly in the context of the COVID-19
pandemic. The authors employed the InceptionV3 deep learning model through transfer
learning. Trained on the Simulated Masked Face Dataset (SMFD), the model benefits
from image enhancement techniques applied to the training data. The achieved accuracy
is remarkable, achieving 99.9% during training and a perfect 100% during testing,
highlighting the solution's efficiency in successfully detecting individuals without
masks. While the results are undoubtedly attractive, this method comes with a high cost
of resources due to the increased number of layers required. Our method will be more
superior as the resources required for it are significantly lower while maintaining a high
value of accuracy at the same time.
Bibliography
[1] Jignesh Chowdary, G., et al. "Face mask detection using transfer learning of inceptionv3." Big
Data Analytics: 8th International Conference, BDA 2020, Sonepat, India, December 15–18, 2020,
Proceedings 8. Springer International Publishing, 2020.
[2] Ieamsaard, Jirarat, Surapon Nathanael Charoensook, and Suchart Yammen. "Deep learning-
based face mask detection using yolov5." 2021 9th International Electrical Engineering Congress
(iEECON). IEEE, 2021.