Strategies For Improving Object Detection in Real-Time Projects That Use Deep Learning Technology
All content following this page was uploaded by Niloofar Abed on 29 May 2023.
Abstract: Devices equipped with object detection technology and controllable via the Internet of Things (IoT) have become increasingly popular and prevalent, especially in the post-Corona era. Combining neural networks and artificial intelligence with IoT systems has achieved acceptable user satisfaction in adverse conditions by reducing the need for manpower and increasing productivity. As a result, such mechanisms are now used in most fields, from self-driving vehicles to agricultural crops. Because these technologies are designed and applied in interdisciplinary fields, beginners are confronted with a massive amount of complex information. Given the popularity of the You Only Look Once (YOLO) object detection algorithm, this article provides a guideline in the form of a traffic-light subject classification and offers suggested solutions and exclusive approaches to increase the accuracy of object detection in real-time projects, with a practical orientation for enthusiasts and developers who employ YOLO in object detection scenarios.
I. INTRODUCTION
The desire to use gadgets equipped with object detection technology in interdisciplinary fields is undeniable. This article is therefore offered as a traffic light to help select the most appropriate tool from the vast array of possibilities already available. After the principal concepts are introduced, the YOLO algorithm is described briefly, and finally solutions to enhance the accuracy of object detection mechanisms based on deep learning are offered [1].
Techniques for detecting objects constitute the foundation of artificial intelligence (AI) [2]. Machine vision systems provide operational behavior by interpreting and processing visuals collected from their environment. They are integrated collections of computer hardware, electrical components, and software algorithms. The proposed process is controlled and automated through the data obtained from the vision system. Computer vision is a subset of Machine Learning (ML) that enables computers to process, analyze, and interpret the visual environment by analyzing data extracted from images and videos. In the fascinating world of ML, experts have gone further: by trying to bring the cognitive power of artificial intelligence as close as possible to the magnificent functioning of the human brain through algorithms, they have established the field of Deep Learning (DL). Deep learning is a type of machine learning and artificial intelligence that uses artificial neural networks to develop solutions for tasks such as speech recognition, music composition, and pharmaceutical development.
A. Neural Networks
The structure of a neural network is modeled on the network of neurons in the human brain, so its nodes are also called artificial neurons. The nodes are structured in three layers (Fig. 1):
• The input layer
• The hidden layer(s)
• The output layer
Fig. 1. Layers of a neural network
Each node combines the information it receives using weights, which are initially random, adds a bias, and passes the result to a non-linear activation function that estimates and selects whether the neuron fires [3].
B. Deep Learning Algorithms
A deep learning algorithm uses unknown elements in the input distribution to extract features during self-learning. Clustering objects and discovering efficient paths work in the same way. Deep learning algorithms use different layers for modeling, and these models include several algorithms. The deep learning algorithms that can be mentioned, each of which is used for one or more specific fields, are listed in Table 1. Considering the
Authorized licensed use limited to: AMRITA VISHWA VIDYAPEETHAM AMRITA SCHOOL OF ENGINEERING. Downloaded on May 29,2023 at 12:37:33 UTC from IEEE Xplore. Restrictions apply.
Fig. 2. Hyperspectral image in the detection of fruit texture quality [6]
The term "Internet of Things" (IoT) was first coined in 1998, and IoT is divided into three categories: hardware, software, and cloud. The Internet, as a basic element of IoT, provides communication between things or devices for processing and sensing, by means of software and sensors, respectively. The Internet of Things comprises different physical layers and data links; Fig. 3 depicts an IoT network with seven layers. Technology is advancing and being used more often in today's society, with the aim of simplifying many processes and improving efficiency. IoT has enabled short communication, with applications such as Ethernet, Wi-Fi, VoIP, instant messaging, and email. It has also expanded to multiple areas, such as agriculture, due to the increase in population and in the commodities required [9].
Wireless Sensor Networks (WSN) and the Internet of Things (IoT) have become powerful tools that allow businesses to develop more effective and sustainable strategies for their communities. These technological advancements offer a range of potential applications for establishing sustainable communities, since they provide a multitude of options, including environmental monitoring and structural health monitoring [10]. Combining an IoT system with an object detection project helps users put their profits on a sustainable track and become self-dependent. An IoT system is an effective pathway for the sustainability assessment of object detection in complex and remote projects. By retrieving optimal data from the cloud server through an IoT monitoring system, users can make more knowledgeable choices and execute their projects effectively.
To improve the implementation of object recognition projects, users integrate them with Internet of Things technology through platforms such as Google IoT, Microsoft Azure, and Amazon Web Services. In many real-time projects, quantizing the model and transferring it to embedded boards such as Raspberry Pi, Arduino, or Tinker boards yields very effective and practical gadgets.
II. MATERIAL AND METHODS
Evaluating the performance of object detection and prediction models requires two criteria: Average Precision (AP) and Intersection over Union (IoU). IoU is employed to measure the accuracy of localization and to calculate the localization errors in object detection models. A prediction is counted as positive when the IoU value is greater than 0.5 and as negative when the IoU value is less than 0.5.
\[ \mathrm{IoU} = \frac{\text{Area of Overlap}}{\text{Area of Union}} \tag{1} \]
Precision measures how accurately true positives (TP) are identified among all positive predictions (TP + FP). Recall measures how many of the actual objects (TP + FN) are detected, where FP and FN denote false positives and false negatives. The F1 score is the harmonic mean of precision and recall.
\[ \mathrm{Recall} = \frac{TP}{TP + FN} \tag{2} \]
\[ \mathrm{Precision} = \frac{TP}{TP + FP} \tag{3} \]
\[ F1 = \frac{2 \times \mathrm{Precision} \times \mathrm{Recall}}{\mathrm{Precision} + \mathrm{Recall}} \tag{4} \]
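The evaluation criteria in Equations (1) to (4) can be sketched in Python as follows. This is a minimal illustration rather than the authors' implementation: the corner-based box format, the function names, and the sample boxes and counts are assumptions chosen for the example, and F1 is computed in its standard harmonic-mean form.

```python
from typing import Tuple

# Assumption: boxes are given as (x_min, y_min, x_max, y_max) corners.
Box = Tuple[float, float, float, float]


def box_area(box: Box) -> float:
    """Area of an axis-aligned box; inverted or empty boxes count as zero."""
    return max(0.0, box[2] - box[0]) * max(0.0, box[3] - box[1])


def iou(pred: Box, truth: Box) -> float:
    """Equation (1): area of overlap divided by area of union."""
    overlap_box = (
        max(pred[0], truth[0]),
        max(pred[1], truth[1]),
        min(pred[2], truth[2]),
        min(pred[3], truth[3]),
    )
    overlap = box_area(overlap_box)
    union = box_area(pred) + box_area(truth) - overlap
    return overlap / union if union > 0 else 0.0


def is_positive(pred: Box, truth: Box, threshold: float = 0.5) -> bool:
    """A prediction counts as positive when its IoU exceeds the threshold."""
    return iou(pred, truth) > threshold


def precision_recall_f1(tp: int, fp: int, fn: int) -> Tuple[float, float, float]:
    """Equations (2) to (4), returning 0.0 where a denominator would be zero."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


# Hypothetical boxes: one ground-truth box and two candidate predictions.
truth = (0.0, 0.0, 10.0, 10.0)
close_pred = (2.0, 0.0, 12.0, 10.0)  # overlap 80, union 120
loose_pred = (5.0, 0.0, 15.0, 10.0)  # overlap 50, union 150

print(round(iou(close_pred, truth), 3), is_positive(close_pred, truth))
print(round(iou(loose_pred, truth), 3), is_positive(loose_pred, truth))
print(precision_recall_f1(tp=8, fp=2, fn=2))
```

With the 0.5 threshold, the first prediction (IoU of about 0.667) counts as positive and the second (IoU of about 0.333) as negative, which shows how a shift of a few pixels can flip the classification of a detection.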
III. RESULT AND DISCUSSION
TABLE II. DETECTION DOCS IN DIFFERENT MODELS OF YOLOv8 [20]
also selects rectangular parts of an image and erases the pixels in that area. Developers often use this category of methods, and dozens of others, when upgrading different versions of detectors. Although such improvements are effective in increasing accuracy, the design of a new individual version or an ensemble model sometimes shows no significant advantage [18]. The methods listed below can easily be used by any user and produce an effective output.
1) Class balancing: Train the model on more images of rare classes to solve the problem of unbalanced data, because objects that are easier to access usually have more photos in the corresponding class. For example, if the project pursues 4 main targets, 4 groups of 100 photos should be prepared; if one of the groups contains only 25 photos, the accuracy will be greatly reduced.
2) Data augmentation: Modify existing photos to create new images and add them to the corresponding class.
3) Image duplication: Using the same image multiple times during training can help the model capture the data behavior of a particular class.
4) Ensemble: One object recognition model should be trained on the alternating classes in the dataset, and another model should be trained to recognize specific objects in images: objects that are difficult to recognize or for which little training data is available.
5) Real image: In real-time projects, it is better not to rely only on ready-made datasets and to use real photos of the same area in each class. For example, in animal recognition projects, do not use only the classes available on the internet; also take pictures of the animals in the area with a natural background.
6) Different position: Humans are able to identify an object by seeing only a small part of it from different angles. To teach the machine, each class should be improved with photos taken from different angles.
7) Different light saturation: As mentioned, detectors such as YOLOv7, for all their efficiency, are sensitive to changes in the amount and angle of light, and their detection accuracy decreases. In a real project, however, animals often attack the fields during darkness or at sunset, and they are difficult to recognize even for a human driver. To overcome this problem, a number of photos can be altered with the light filters, color filters, and tone tools available on most mobile phones and added to the corresponding class (Fig. 5).
8) Clearing: For training, avoid photos that contain different parts of several objects, and remove unnecessary objects. If boxing methods are used, choose the most suitable anchor for the object in the scene and use a labeling tool that is appropriate for your coding program.
Making anchor boxes smaller to better fit different sizes of objects is a way to improve the accuracy of object detection models. It helps to reduce false positives, which can occur when the anchor boxes are too large or too small to accurately detect objects. The size of the anchor boxes can be adjusted so that it better fits the size of the objects being detected. By doing this, the object detection model can more accurately detect objects of various sizes, reducing the number of false positives and increasing the overall accuracy of the model [17]. Classes can also be taken from different datasets to avoid overlap between two different classes.
9) Controlling: In projects that include the Internet of Things, human users can be given permission to modify and control the detection system, in addition to monitoring it, through an application or any other method. This is effective for increasing the efficiency of object recognition systems that are responsible only for the identification part. In projects where the target object or the observer is moving, such as checking the status of an animal [19], the use of drones in surveillance systems has become increasingly popular in recent years, offering the ability to broaden the scope of data collection and improve the accuracy of the output. Alongside the Internet of Things (IoT), GPS-based systems are also often employed to further enhance the overall effectiveness of the system.
Fig. 5. Different light saturation and color tone
IV. CONCLUSION
Undoubtedly, deep learning detectors are among the appropriate choices for object detection projects because they can be trained and customized. Although algorithms like YOLO are being developed to cover more and more positive attributes, some of the performance enhancement techniques outlined in this article can be applied to improve identification accuracy, depending on the peculiarities of the object being investigated. It should be kept in mind that the developers and creators of a program or algorithm act with the vision of covering general users, so appropriate and personalized solutions should always be used to increase the accuracy and speed of an object recognition project.
ACKNOWLEDGMENT
This project has been funded by the E4LIFE International Ph.D. Fellowship Program offered by Amrita Vishwa Vidyapeetham. I extend my gratitude to the Amrita Live-in-Labs® academic program for providing all the support.
REFERENCES
[1] Yang J, Liu S, Su H, Tian Y. Driving assistance system based on data fusion of multisource sensors for autonomous unmanned ground vehicles. Computer Networks. 2021 Jun 19;192:108053.
[2] Jiang P, Ergu D, Liu F, Cai Y, Ma B. A Review of Yolo algorithm developments. Procedia Computer Science. 2022 Jan 1;199:1066-73.
[3] L. Jiao et al., "A Survey of Deep Learning-Based Object Detection," in IEEE Access, vol. 7, pp. 128837-128868, 2019, doi: 10.1109/ACCESS.2019.2939201.
[4] Lohia A, Kadam KD, Joshi RR, Bongale AM. Bibliometric analysis of one-stage and two-stage object detection. Libr. Philos. Pract. 2021 Feb 1;4910:34.
[5] Lone, Zubair Ahmad, and Alwyn Roshan Pais. "Object detection in hyperspectral images." Digital Signal Processing (2022): 103752.
[6] Saha, Dhritiman, and Annamalai Manickavasagan. "Machine learning techniques for analysis of hyperspectral images to determine quality of food products: A review." Current Research in Food Science 4 (2021): 28-44.
[7] Amudhan, A.N., Vrajesh, S.R., Sudheer, A.P. and Lijiya, A., 2022. RFSOD: a lightweight single-stage detector for real-time embedded applications to detect small-size objects. Journal of Real-Time Image Processing, 19(1), pp.133-146.
[8] R. Ippalapally, Mudumba, S. Harsha, Adkay, M., and Nandi Vardhan H. R., "Object Detection Using Thermal Imaging", in 2020 IEEE 17th India Council International Conference (INDICON), New Delhi, India, 2020.
[9] Manne, Ravi, and Sneha Chowdary Kantheti. "Green IoT Towards Environmentally Friendly, Sustainable and Revolutionized Farming." Green Internet of Things and Machine Learning: Towards a Smart Sustainable World (2021): 113-139.
[10] Ramesh, Maneesha Vinodini, Rekha Prabha, Hemalatha Thirugnanam, Aryadevi Remanidevi Devidas, Dhanesh Raj, Sruthy Anand, and Rahul Krishnan Pathinarupothi. "Achieving sustainability through smart city applications: protocols, systems and solutions using IoT and wireless sensor network." CSI Transactions on ICT 8 (2020): 213-230.
[11] Cao, D., Chen, Z. & Gao, L. An improved object detection algorithm based on multi-scaled and deformable convolutional neural networks. Hum. Cent. Comput. Inf. Sci. 10, 14 (2020). https://doi.org/10.1186/s13673-020-00219-9
[12] Redmon J, Farhadi A. Yolov3: An incremental improvement. arXiv preprint arXiv:1804.02767. 2018 Apr
[13] Redmon J, Farhadi A. YOLO9000: better, faster, stronger. In Proceedings of the IEEE conference on computer vision and pattern recognition 2017 (pp. 7263-7271).
[14] Zhang Z. Improved adam optimizer for deep neural networks. In 2018 IEEE/ACM 26th international symposium on quality of service (IWQoS) 2018 Jun 4 (pp. 1-2). IEEE.
[15] Aburaed, Nour, Mina Alsaad, Saeed Al Mansoori, and Hussain Al-Ahmad. "A Study on the Autonomous Detection of Impact Craters." In Artificial Neural Networks in Pattern Recognition: 10th IAPR TC3 Workshop, ANNPR 2022, Dubai, United Arab Emirates, November 24–26, 2022, Proceedings, pp. 181-194. Cham: Springer International Publishing, 2022.
[16] Subbiah, Uma, D. Kavin Kumar, Senthil Kumar Thangavel, and Latha Parameswaran. "An extensive study and comparison of the various approaches to object detection using deep learning." In 2020 International Conference on Smart Electronics and Communication (ICOSEC), pp. 183-194. IEEE, 2020.
[17] K. K. T R, S. Thiruvikkraman, G. R, N. A and K. R, "Evaluating the Scalability of a Multi-Object Detector Trained with Multiple Datasets," 2021 5th International Conference on Intelligent Computing and Control Systems (ICICCS), Madurai, India, 2021, pp. 1359-1366, doi: 10.1109/ICICCS51141.2021.9432350.
[18] Allaparthi, Sree Roja Rani, and G. Jeyakumar. "An Investigational Study on Ensemble Learning Approaches to Solve Object Detection Problems in Computer Vision." Mathematical Statistician and Engineering Applications 71, no. 3s (2022): 399-412.
[19] Ramesh, Gowtham, Senthilkumar Mathi, Sini Raj Pulari, and Vidya Krishnamoorthy. "An automated vision-based method to detect elephants for mitigation of human-elephant conflicts." In 2017 International conference on advances in computing, communications and informatics (ICACCI), pp. 2284-2288. IEEE, 2017.
[20] https://github.com/ultralytics/ultralytics