Base Paper (YOLO)
Base Paper (YOLO)
YOLO ALGORITHM
DEPARTMENT OF CSE
CMR ENGINEERING COLLEGE, HYDERABAD
ABSTRACT - The Objective is to detect of objects using You Only Look Once
(YOLO) approach. This method has several advantages as compared to other
object detection algorithms. In other algorithms like Convolutional Neural
Network, Fast Convolutional Neural Network the algorithm will not look at the
image completely but in YOLO the algorithm looks the image completely by
predicting the bounding boxes using convolutional network and the class
probabilities for these boxes and detects the image faster as compared to other
algorithms.This project aims to harness YOLO’s efficiency and accuracy to
address the need for timely and precise object detection in various scenarios
Keywords:
Convolutional Neural Network, Fast-Convolutional Neural Network, Bounding
Boxes, YOLO.
I. INTRODUCTION
The main aim of Object Detection is to detect all the objects that are present
in that particular image. The important thing in object detection is to make
system detect the object by making it to learn how to do that which is
nothing but training the dataset and allowing system to detect the object by
itself. The searching or recognition process in a real time scenario is so
tough. So far, no effective solution has been found. Though there is a lot of
work going on, the methods developed till now aren’t efficient, require long
training time, are not suitable for real time application, and are not scalable
to large numbers of classes. Object detection is relatively simpler if the
machine is looking for detecting one particular object. However, recognizing
all the objects inherently requires the skill to differentiate one object from
the other, though they may be of the same type.
Such a problem is very difficult for machines, if they don’t know about the
various possibilities of objects. When anyone goes to any hospital or to a
Traffic controlling monitor or self driving cars all they do is to detect the
objects for the best performances. For example in a hospital when the
doctor sees the scan report they will be able to tell whether there is a virus
in our body or not. So we decided to make a REAL TIME OBJECT DETECTION
System. Sometime it’s very tough to negotiate between different Object
detection techniques. In image classification we assign the class label to an
image, In Object localization we draw the bounding boxes around the
objects. Detecting the Object is tough because we combine both the tasks
where we include both the bounding box around the each object and then
we assign the class label to an image, all this comes under the category of
Image
In Object detection it first detects the location and scale of an input objects
instance if such things are present in an image. So the objective of the Object
detector is to find any number of the object belongs to particular class
irrespective of class type objects, location, size in the input image
Limited Functionality: Without object counting, the system can only detect
and draw bounding boxes around objects.
Accuracy and False Positives: SSD, like any object detection model, can have
false positives and false negatives. Without counting and object identification
it's challenging to assess and mitigate these errors effectively.
No Object Identification: The system can detect objects but doesn't identify
them and you won't get information about the specific classes or labels of
detected objects.
Convolution Layers
V. RESULTS
The image which we passed for input will be divided into the same number
of the grids as in the training dataset. For every grid the output value will be
3x3x19.16 values will be the same as in the prediction model.
YOLO is already known for its speed, but there's always room for improving
accuracy. Future work could focus on refining the algorithm to better detect
objects in various conditions such as low light, occlusions, or different
viewpoints. While YOLO is primarily designed for object detection, integrating
it with algorithms for object tracking could enable more sophisticated
applications such as video surveillance, traffic monitoring, and crowd
analysis. Optimizing energy efficiency could extend its applicability to
battery-powered devices. Making YOLO more accurate at recognizing objects,
especially in different situations like when it's dark or when objects are
partially hidden.
VIII. REFERENCES
[1] Joseph Redmon, Santosh Divvala, Ross Girshick, "You Only Look Once:
Unified, Real-Time Object Detection`",The IEEE Conference on Computer
Vision and Pattern Recognition(CVPR), 2016, pp. 779-788
[7] Shaoqing Ren, Kaiming He, Ross Girshick, Jian Sun, ""Faster R-CNN:
Towards Real-Time Object Detection with Region Proposal Networks",
Published in Advances in Neural Information Processing Systems 28 (NIPS
2015).
[9] Jifeng Dai, Yi Li, Kaiming He, Jian Sun, "R-FCN: Object Detection via Region-
based Fully Convolutional Networks", published in: Advances in Neural
Information Processing Systems 29 (NIPS 2016).