Object Tracking in Crowd Environment Using Deep Learning
Object Tracking in Crowd Environment Using Deep Learning
https://doi.org/10.22214/ijraset.2022.44530
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 10 Issue VI June 2022- Available at www.ijraset.com
Abstract: Object tracking is fetching a primary set of object identification, allocating a unique ID to each one of them, and then
ensuing each of the objects as they move around frames in a movie while safe keeping the ID assignment. Person tracking is a
difficult task in video surveillance. In recent years, many computer vision, deep and machine learning have been developed.
Convolutional Neural Networks (CNN) are transforming target tracking. The project's purpose is to recognize and track images
utilizing object identification techniques such as Region based Convolutional Neural Networks (RCNN), Faster RCNN, Single
Shot Detector (SSD), and You Only Look Once (YOLO). Faster-RCNN and SSD have superior accuracy among them, whereas
YOLO performs better when speed is prioritized above accuracy. Deep learning combines SSD and Mobile Nets for efficient
identification and tracking implementation. This technique detects objects quickly and efficiently without sacrificing
performance. Convolutionally default boxex are passed over several feature maps. If a detected object matches one of the object
classifiers during prediction, a score is produced..
Keywords: Computer vision, Convolutional-Neural-Networks (CNN), Deep Learning, Mobile Net, You-Only-Look-Once
(YOLO).
I. INTRODUCTION
Real-time object recognition and tracking for Object Detection has been the focus of much investigation in current years due to its
close alliance with video analysis and image elucidation [1]. Fabricated characteristic and superficial formable construction are the
foundation of conventional object detection systems [2]. Combining a variety of low-level visual features with high-level context
provided by object detectors and location categorization, they can be effortlessly put out of action. With swift advancements in deep
learning, more influential tools capable of semantic learning, higher level and in depth features are being advance to solve problems
that infestation current systems [3]. As far as of network architecture, training techniques, and enhancement functions, these models
act in a unexpected way. In this review, we study deep learning-based object identification architectures. Our investigation begins
with a short history of deep learning and its most delegate instrument, the Convolutional Neural Network (CNN).
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 | 2989
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 10 Issue VI June 2022- Available at www.ijraset.com
C. Faster R-CNN
To find regional recommendations, the two algorithms above (RCNN and Fast RCNN) use selective search. Selective search is a
sluggish and tedious activity that corrupts network execution.
D. MANet
Target detection has been a hotbed in computer vision for many years because it is a fundamentally difficult problem. The motive
and goal of target identification is to see if an image contains examples of a particular type of object. Target detection returns the
spatial coordinates and spatial degree of component cases on the off chance that there is an item to be recognized in a given picture
(for example in view of jumping box utilization).
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 | 2990
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 10 Issue VI June 2022- Available at www.ijraset.com
To predict the height, width, centre and class of objects, YOLO uses a single bounding box regression. The probability that an
object appears in the bounding box is shown in the graph above.
Initially, the image is distributed into matrix cells. Every framework cell predicts a B jumping box and gives their confidence
opinion. The cell predicts the class likelihood and decides the class for each object.
For example, you can perceive objects of not less than three categories: Car, Canine, and Bicycle. All expectations are made at the
same time utilizing a solitary convolutional brain organization.
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 | 2991
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 10 Issue VI June 2022- Available at www.ijraset.com
Intersections on the merger ensure that the forecasted bounding box is even to the actual box of the object. This action eliminates
extraneous bounding boxes that don't correspond to the object's attributes (like level and width). The last recognition consists of a
unique bounding box that exactly matches the object.
For instance, a pink bouncing box encompasses a vehicle and a yellow jumping box encompasses a bike. The blue bouncing box
was utilized to underline the canine.
V. RESULTS
In this chapter we will discuss about the results
1) Commands to Execute
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 | 2992
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 10 Issue VI June 2022- Available at www.ijraset.com
In the Fig 6.2, that the model successfully identified the objects in a video displaying news, thereby it is evident that this model can
be integrated with CCTV software and can be used in complex applications to improve the security of people's lives. Irrespective of
the video quality available, the model is able to identify the objects and returns the list at the top left corner and also highlights the
objects with a rectangular box with the confidence intervals being displayed
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 | 2993
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 10 Issue VI June 2022- Available at www.ijraset.com
In the Fig 6.4, the objects are clumsy, moreover the suitcases were kept together indistinguishably. yet, the model was able to
identify each suitcase separately and also the trucks at the corner of the image
The end detection consists of distinctive set of bounding boxes which are displayed around the detected objects with confidence
value for the given inputs.
REFERENCES
[1] J. Kang, S. Tariq, H. Oh and S. S. Woo, "A Survey of Deep Learning-Based Object Detection Methods and Datasets for Overhead Imagery," in IEEE Access,
vol. 10, pp. 20118-20134, 2022, doi: 10.1109/ACCESS.2022.3149052.
[2] Asra Aslam, Edward Curry,A Survey on Object Detection for the Internet of Multimedia Things (IoMT) using Deep Learning and Event-based Middleware:
Approaches, Challenges, and Future Directions,Image and Vision Computing, Volume 106, 2021, 104095, ISSN 0262-8856,
https://doi.org/10.1016/j.imavis.2020.104095
[3] N. Kumaran and U. S. Reddy, "Object detection and tracking in crowd environment — A review," 2017 International Conference on Inventive Computing and
Informatics (ICICI), 2017, pp. 777-782, doi: 10.1109/ICICI.2017.8365242.
[4] C. P. Papageorgiou, M. Oren and T. Poggio, "A general framework for object detection," Sixth International Conference on Computer Vision (IEEE Cat.
No.98CH36271), 1998, pp. 555-562, doi 10.1109/ICCV.1998.710772.
[5] C. Tang, Y. Feng, X. Yang, C. Zheng and Y. Zhou, "The Object Detection Based on Deep Learning," 2017 4th International Conference on Information
Science and Control Engineering (ICISCE), 2017, pp. 723-728, doi: 10.1109/ICISCE.2017.156.
[6] X. Zhou, W. Gong, W. Fu and F. Du, "Application of deep learning in object detection," 2017 IEEE/ACIS 16th International Conference on Computer and
Information Science (ICIS), 2017, pp. 631-634, doi: 10.1109/ICIS.2017.7960069.
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 | 2994