Object Detection

This thesis compares two object detection algorithms, YOLO and RCNN, focusing on their configurations, performance, and accuracy. It emphasizes the importance of object detection for the autonomy of devices like smartphones and robots, driven by advancements in machine learning and deep learning. The project aims to evaluate the algorithms' effectiveness in detecting, classifying, and tracking multiple objects in images and videos.

Uploaded by

minsetpaing.11

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

6 views11 pages

Object Detection

Uploaded by

minsetpaing.11

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 11

ABSTRACT

The ability to recognize objects is born with human and animals. Humans and animals
can recognize objects without much effort. Object recognition is part of their daily lives and they
don’t even notice about it. The ability to recognize and classify objects for computers is called
object detection. Object detection is a key ability for many computers, smartphones and robots.
Many deep learning algorithms have made object detection to progress greatly in many
directions. This thesis focuses on comparison of object detection using two algorithms, YOLO
and RCNN. The configurations, performance and accuracy will be compared and discussed.
CHAPTER 1

INTRODUCTION

1.1 Motivation

As technologies have been made significantly advanced progress in the recent years, people
wanted their devices and gadgets to be automated, starting from smartphones, robots to self-
driving cars. When making devices to be autonomous, using scripts, programs or sensors
cannot satisfy the needs due to the fact that both of them will work as the way of how they’re
programmed by the programmer. Devices needed intelligence to make decisions or classify
items. As machine learning and deep learning researchers and practitioners have contributed
to the field of artificial intelligence, devices can recognize objects in images, classify music
from audio files and predict the prices and stock shares. Intelligence for smartphones,
machines, computers and robots to make them more and more autonomous and independent
of human supervision is a sustain dream for the mankind. Many science-fiction movies have
shown robots that do domestic work, providing healthcare, fight in battlegrounds and
companioning humans.

A robot cannot be intelligent and independent if it cannot see and adapt to the
surrounding environment. Engineers and scientist implemented image recognition
technologies inside the intelligence robots. It must also be able to recognize people’s faces,
determine which object to pick up, drop objects at the required place or give them to people,
avoid the objects that are obstacles in its path and ability to understand human language. The
key ability for a robot or computer is object detection. Scientists and researchers have
contributed several algorithms to carry out object detection.

1.2 Purpose and Scope

The purpose of the thesis is to compare the algorithms in detection, classification and
tracking the objects. According to the need for detecting objects, the goal of the thesis project
is to identify multiple objects in the image or video using two algorithms, YOLO and RCNN.
Once the development of the project is finished, there will be measurements and evaluations
in terms of configurations, performance and accuracy of detecting objects.

1.3 Development

As mentioned earlier, detection algorithms will be implemented using Python, open

source interpreted programming language. Python has many libraries that supports
machine learning and scientific computations such as NumPy, TensorFlow, PyTorch,
Scikit Learn and Matplotlib. When an image is feed as input to the neural network, the
detected objects will be shown with inside a rectangle with their respective color and
label text on the rectangle. If the input is video, the same process will be carried out for
every single frame.

Cars and pedestrians detected in an image

Chair, monitor and plant detected in an image
Elephants and zebras detected in an image.

CHAPTER 2

THEORY

2.1 Neural Networks

A neural network is inspired

from the networks of neurons found inside
brains of humans and animals. Neural
Networks can do signal processing,
predicting, regression, classification and
clustering. A neuron is a single processing unit inside the
neural network. Neurons connect to each other with coefficients bounded
with coefficients called weights and additional values called bias. This mathematical
framework is one of the most used in the artificial intelligence.
A simple neural network with input layer, a hidden layer and output layer.

2.2 YOLO

Existing detection algorithms from the last decade make use of classifiers to perform
detection. To detect an object, they take a classifier for the object and calculate its probabilities
and confidence values at different locations in an image.

More recent approaches like RCNN use region proposal technics to generate bounding
boxes in the image that is being classified to run a classifier on the bounding boxes. After
classification, a method called post-processing is used to improve the quality of the bounding
boxes, eliminate nearby duplicate detections. These algorithms are slow, resource-hungry and
difficult to optimize because each individual component must be trained separately.
YOLO reframes object detection as a single regression problem, straight from image
pixels to bounding box coordinates and class probabilities. Using YOLO, you only look once at
an image to predict what objects are in the image and location of the objects in the image. YOLO
is amazingly simple a simultaneously predicts multiple bounding boxes and class probabilities
for those boxes. YOLO trains on full images and directly optimizes detection performance. This
unified model has several benefits over traditional methods of object detection.

Detecting dog, bike and vehicle with YOLO, each color showing the class of objects
An example of convolutional neural network

The detection system divides the input image into a S × S grid. If the center of an object falls
into a grid cell, that grid cell is responsible for detecting that object. Each grid cell predicts B
bounding boxes and confidence scores for those boxes. These confidence scores reflect how
confident the model is that the box contains an object and also how accurate it thinks the box
is that it predicts. If no object exists in that cell, the confidence scores should be zero.
Otherwise, the confidence score should be equal to the intersection over union (IOU)
between the predicted box and the ground truth. Each bounding box consists of 5 predictions:
x, y, w, h, and confidence. The (x, y) coordinates represent the center of the box relative to
the bounds of the grid cell. The width and height are predicted relative to the whole image.
Finally, the confidence prediction represents the IOU between the predicted box and any
ground truth box.

Each grid cell also predicts C conditional class probabilities, Pr (Classi | Object). These
probabilities are conditioned on the grid cell containing an object. We only predict one set of
class probabilities per grid cell, regardless of the number of boxes B. At test time we multiply
the conditional class probabilities and the individual box confidence predictions
YOLO detecting a bird, bounding box(red), grid cells(green) and x, y, w, h values

IoU Formula
Ground truth box and predicted box while detecting a stop sign
Accuracy of YOLO depending on IoU

Opcrf Template For School Heads 04152019174309
100% (1)
Opcrf Template For School Heads 04152019174309
28 pages
YOLO V3 ML Project
No ratings yet
YOLO V3 ML Project
15 pages
Detection and Content Retrieval of Object in An Image Using YOLO
No ratings yet
Detection and Content Retrieval of Object in An Image Using YOLO
8 pages
Final Synopsis1
No ratings yet
Final Synopsis1
10 pages
Yolo Paper
No ratings yet
Yolo Paper
10 pages
You Only Look Once - Unified, Real-Time Object Detection
No ratings yet
You Only Look Once - Unified, Real-Time Object Detection
10 pages
Team 10
No ratings yet
Team 10
20 pages
Object Detection Using Yolo
No ratings yet
Object Detection Using Yolo
42 pages
Presentation1 FINAL 1
No ratings yet
Presentation1 FINAL 1
11 pages
Deep Learning For Object Detection - 131124
No ratings yet
Deep Learning For Object Detection - 131124
35 pages
Project
100% (1)
Project
30 pages
Deep Learning YOLOv2
No ratings yet
Deep Learning YOLOv2
3 pages
YOLO-LITE: A Real-Time Object Detection Algorithm Optimized For Non-GPU Computers
No ratings yet
YOLO-LITE: A Real-Time Object Detection Algorithm Optimized For Non-GPU Computers
8 pages
Design of A Real-Time Object Detection Prototype S
No ratings yet
Design of A Real-Time Object Detection Prototype S
6 pages
Yolo Algorithm
No ratings yet
Yolo Algorithm
37 pages
Yolopdf
No ratings yet
Yolopdf
10 pages
YOLO Based Detection and Classification of Objects in Video Records
No ratings yet
YOLO Based Detection and Classification of Objects in Video Records
5 pages
Synopsis - Internship - Group-53
No ratings yet
Synopsis - Internship - Group-53
8 pages
10 - CPU Based YOLO A Real Time Object Detection Algorithm
No ratings yet
10 - CPU Based YOLO A Real Time Object Detection Algorithm
4 pages
Finish Presentation
No ratings yet
Finish Presentation
56 pages
Report 34
No ratings yet
Report 34
26 pages
Overview of YOLO ObjectDetectionAlgorithm
No ratings yet
Overview of YOLO ObjectDetectionAlgorithm
7 pages
YOLO-LITE: A Real-Time Object Detection Algorithm Optimized For Non-GPU Computers
No ratings yet
YOLO-LITE: A Real-Time Object Detection Algorithm Optimized For Non-GPU Computers
8 pages
Unified Real-Time Object Detection
No ratings yet
Unified Real-Time Object Detection
36 pages
Object Detection Using Yolo Algorithm-1
No ratings yet
Object Detection Using Yolo Algorithm-1
9 pages
MJEER-Volume 30-Issue 1 - Page 52-57
No ratings yet
MJEER-Volume 30-Issue 1 - Page 52-57
6 pages
Base Paper (YOLO)
No ratings yet
Base Paper (YOLO)
6 pages
Yolo
No ratings yet
Yolo
10 pages
Yolo
No ratings yet
Yolo
10 pages
(IJCST-V8I3P4) :sakshi Gupta, Dr. T. Uma Devi
No ratings yet
(IJCST-V8I3P4) :sakshi Gupta, Dr. T. Uma Devi
5 pages
Yolov 3
No ratings yet
Yolov 3
42 pages
Incremental Training For Image Classification of Unseen Objects
No ratings yet
Incremental Training For Image Classification of Unseen Objects
19 pages
Ref 14
No ratings yet
Ref 14
5 pages
Object Detection Presentation
100% (3)
Object Detection Presentation
28 pages
YOLO Object Detection Explained - A Beginner's Guide - DataCamp
No ratings yet
YOLO Object Detection Explained - A Beginner's Guide - DataCamp
14 pages
Project Report (Group 9)
No ratings yet
Project Report (Group 9)
20 pages
Ex No 06
No ratings yet
Ex No 06
4 pages
YOLO Algorithm Implementation For Real Time Object Detection and Tracking-1
No ratings yet
YOLO Algorithm Implementation For Real Time Object Detection and Tracking-1
6 pages
Result - 9 - 19 - 2023, 7 - 14 - 17 AM
No ratings yet
Result - 9 - 19 - 2023, 7 - 14 - 17 AM
43 pages
Ijramt V3 I5 11
No ratings yet
Ijramt V3 I5 11
3 pages
Thesis (2) Removed
No ratings yet
Thesis (2) Removed
34 pages
Object Detection Document
No ratings yet
Object Detection Document
4 pages
IRJET Smart Traffic Control System Using
No ratings yet
IRJET Smart Traffic Control System Using
4 pages
Seminar 201202175023
No ratings yet
Seminar 201202175023
16 pages
Enhancing Surveillance Systems With YOLO Algorithm For Real-Time Object Detection and Tracking
No ratings yet
Enhancing Surveillance Systems With YOLO Algorithm For Real-Time Object Detection and Tracking
4 pages
BIOMETRICS
No ratings yet
BIOMETRICS
18 pages
"Object Detection With Yolo": A Seminar On
No ratings yet
"Object Detection With Yolo": A Seminar On
14 pages
Paper 45
No ratings yet
Paper 45
7 pages
Object Detection Using Image Processing
No ratings yet
Object Detection Using Image Processing
17 pages
Real Time American Sign Language Detection Using Yolo-V9: ETIT-KIT, Germany ETIT-KIT, Germany IIIT at ETIT-KIT, Germany
No ratings yet
Real Time American Sign Language Detection Using Yolo-V9: ETIT-KIT, Germany ETIT-KIT, Germany IIIT at ETIT-KIT, Germany
11 pages
Analytical Study On Object Detection Using Yolo Algorithm
No ratings yet
Analytical Study On Object Detection Using Yolo Algorithm
3 pages
Yolo Vs RCNN
No ratings yet
Yolo Vs RCNN
5 pages
Signature Object Detection Based On YOLOv3
No ratings yet
Signature Object Detection Based On YOLOv3
4 pages
EdgeYOLO AnEdge-Real-Time Object Detector
No ratings yet
EdgeYOLO AnEdge-Real-Time Object Detector
7 pages
Object Detection With Artificial Intelligence Yo Lo Application
No ratings yet
Object Detection With Artificial Intelligence Yo Lo Application
19 pages
Efficient Object Detection With YOLO A C
No ratings yet
Efficient Object Detection With YOLO A C
13 pages
723 Seminar Report
No ratings yet
723 Seminar Report
24 pages
Object Detection
No ratings yet
Object Detection
13 pages
Board of Examiners
No ratings yet
Board of Examiners
1 page
Future: Light
No ratings yet
Future: Light
1 page
KZP Cover
No ratings yet
KZP Cover
1 page
CD - Course Syllabus Example
No ratings yet
CD - Course Syllabus Example
5 pages
Assessment in Education
No ratings yet
Assessment in Education
22 pages
Small Group Teaching
No ratings yet
Small Group Teaching
15 pages
CD - Module Handbook Example
No ratings yet
CD - Module Handbook Example
1 page
COLREGS Myanamar Translation
No ratings yet
COLREGS Myanamar Translation
34 pages
Photominutes CD August18-19-2015 HoChiMinh
No ratings yet
Photominutes CD August18-19-2015 HoChiMinh
9 pages
Sara Sepulcri Assessment A
No ratings yet
Sara Sepulcri Assessment A
10 pages
NT Invoicing - Report Invoice Template
0% (1)
NT Invoicing - Report Invoice Template
1 page
MSC 482
No ratings yet
MSC 482
5 pages
Accelerated Learning Integrated by Discovery Learning in History Course: How Z Generation Learn
No ratings yet
Accelerated Learning Integrated by Discovery Learning in History Course: How Z Generation Learn
13 pages
Self Evaluation Wheel Professional Standards For Teachers
No ratings yet
Self Evaluation Wheel Professional Standards For Teachers
1 page
Mark Marking Matters
No ratings yet
Mark Marking Matters
52 pages
Action Plan in Mathematics
No ratings yet
Action Plan in Mathematics
2 pages
BP-XII-English Core (301) - All Sets
No ratings yet
BP-XII-English Core (301) - All Sets
2 pages
Direct Instruction Lesson Plan: Day 5 of Unit On College Admission Essays
No ratings yet
Direct Instruction Lesson Plan: Day 5 of Unit On College Admission Essays
3 pages
M5a1-Edc 257 Mini-Lesson and Reflection
No ratings yet
M5a1-Edc 257 Mini-Lesson and Reflection
3 pages
Social Psychology 20073-20075
No ratings yet
Social Psychology 20073-20075
18 pages
Group 17 "Lived Experiences of Non-Degree Daycare Workers in Tacloban City"
No ratings yet
Group 17 "Lived Experiences of Non-Degree Daycare Workers in Tacloban City"
27 pages
Less Is More B2 1st Edition Sarah Frazier Download
No ratings yet
Less Is More B2 1st Edition Sarah Frazier Download
60 pages
Creating Effective Powerpoint Presentations Facilitators Guide
No ratings yet
Creating Effective Powerpoint Presentations Facilitators Guide
10 pages
Instructions and Rubric For Case Study MOG1010
No ratings yet
Instructions and Rubric For Case Study MOG1010
10 pages
Chapter 7 Humanist Approaches To Learning
No ratings yet
Chapter 7 Humanist Approaches To Learning
51 pages
Distinguished Club Program and Club Success Plan
No ratings yet
Distinguished Club Program and Club Success Plan
38 pages
Juan Perez - Resume
No ratings yet
Juan Perez - Resume
1 page
Blooms Taxonomy
100% (11)
Blooms Taxonomy
4 pages
Musical Plays (Semi-Detailed)
No ratings yet
Musical Plays (Semi-Detailed)
4 pages
Developing Students' Writing Skill Through Peer and Teacher Correction: An Action Research
No ratings yet
Developing Students' Writing Skill Through Peer and Teacher Correction: An Action Research
13 pages
The Impact of Facilities On Student'S Academic Achievement: November 2019
No ratings yet
The Impact of Facilities On Student'S Academic Achievement: November 2019
14 pages
Curriculum Vitae: Personal Data
No ratings yet
Curriculum Vitae: Personal Data
4 pages
CV - Utari Amilia Anhar - Universitas Sumatera Utara
No ratings yet
CV - Utari Amilia Anhar - Universitas Sumatera Utara
2 pages
Technology Grade 7 T1 Lesson 1
No ratings yet
Technology Grade 7 T1 Lesson 1
2 pages
Discussion 1 Lesson 1
No ratings yet
Discussion 1 Lesson 1
2 pages
Classroom Orientation
No ratings yet
Classroom Orientation
8 pages
Co Po Mapping Bda With Justiificaton
No ratings yet
Co Po Mapping Bda With Justiificaton
4 pages
Teacher S Individual Plan For Professional Development Ippd
No ratings yet
Teacher S Individual Plan For Professional Development Ippd
26 pages
Eapp Activities 2025
No ratings yet
Eapp Activities 2025
7 pages
Cot 1 2023
No ratings yet
Cot 1 2023
4 pages
Self Efficacy Scale For Filipinos
No ratings yet
Self Efficacy Scale For Filipinos
14 pages