Project Detecto!: A Real-Time Object Detection Model

The document discusses Project DetectO!, a real-time object detection model. The model aims to incorporate state-of-the-art object detection techniques to achieve high accuracy with real-time performance. It trains a neural network on a challenging publicly available dataset used in an annual object detection challenge. The resulting system is fast and accurate, making it suitable for applications requiring object detection. Template matching is used to detect objects by performing normalized cross-correlations between template images of training objects and new images. Filtering and blob analysis techniques are also discussed as early vision processing steps.

Uploaded by

ANURAG V NAIR

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

263 views3 pages

Project Detecto!: A Real-Time Object Detection Model

Uploaded by

ANURAG V NAIR

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

1

Project DetectO!: A real-time object detection

model
Afeeza Ali, Anurag, Anudeep Palliyath, Anjana K, and Under the guidance of Prof. Arvind Naik

Abstract—Efficient and accurate object detection has been an Appearance-based object recognition systems are currently
important topic in the advancement of computer vision systems. the most successful approach for dealing with 3D recognition
With the advent of deep learning techniques, the accuracy for of arbitrary objects in the presence of clutter and occlusion.
object detection has increased drastically. The project aims to
incorporate state-of-the-art technique for object detection with For appearance-based models, only the appearance is used,
the goal of achieving high accuracy with a real-time performance. which is usually captured by different two-dimensional views
A major challenge in many of the object detection systems is the of the object-of-interest. Based on the applied features these
dependency on other computer vision techniques for helping the methods can be sub-divided into two main classes, i.e., local
deep learning based approach, which leads to slow and non- and global approaches. A local feature is a property of an
optimal performance. In this project, the network is trained on
the most challenging publicly available dataset, on which a object image (object) located on a single point or small region. It is
detection challenge is conducted annually. The resulting system a single piece of information describing a rather simple, but
is fast and accurate, thus aiding those applications which require ideally distinctive property of the object’s projection to the
object detection. camera (image of the object). Examples for local features of
an object are, e.g., the colour, (mean) gradient or (mean) gray
value of a pixel or small region. For object recognition tasks
I. I NTRODUCTION
the local feature should be invariant to illumination changes,
The Object Detection and Recognition system in images is noise, scale changes and changes in viewing direction, but,
the technique which mainly aims to detect the multiple objects in general, this cannot be reached due to the simpleness of
from various types of images. It also recognizes the images the features itself. Thus, several features of a single point
after performing the detection. Object detection is a computer or distinguished region in various forms are combined and a
technology related to computer vision and image processing more complex description of the image usually referred to as
that deals with detecting instances of semantic objects of a descriptor is obtained. A distinguished region is a connected
certain class (such as humans, buildings, or cars) in digital part of an image showing a significant and interesting image
images and videos. Well-researched domains of object detec- property. It is usually determined by the application of a region
tion include face detection and pedestrian detection. Object of interest detector to the image.
detection has applications in many areas of computer vision, 1.3 Proposed System
including image retrieval and video surveillance. Object recog- Template matching is a technique in digital image pro-
nition is an important task in image processing and computer cessing for finding small parts of an image which match a
vision. It is concerned with determining the identity of an template image. It can be used in manufacturing as a part
object being observed in an image from a set of known tags. of quality control, a way to navigate a mobile robot, or as a
Humans can recognize any object in the real world easily way to detect edges in images. Template matching is a simple
without any efforts; on contrary machines by itself cannot task of performing a normalized cross-correlation between a
recognize objects. template image (object in training set) and a new image. For
1.1 Problem Statement matching the template with the data image, different iterations
Many problems in computer vision were saturating on their of geometrical parameters (such as scale, rotation etc) are
accuracy before a decade. However, with the rise of deep applied and the required image is found. This method is
learning techniques, the accuracy of these problems drastically normally implemented by first picking out a part of the search
improved. One of the major problems was that of image image to use as a template.
classification, which is defined as predicting the class of 1.4 Objective
the image. A slightly complicated problem is that of image A well known application of object detection is face de-
localization, where the image contains a single object and tection, that is used in almost all the mobile cameras. A
the system should predict the class of the location of the more generalized (multi-class) application can be used in
object in the image (a bounding box around the object). The autonomous driving where a variety of objects need to be
more complicated problem (this project), of object detection detected. Also it has a important role to play in surveillance
involves both classification and localization. In this case, the systems. These systems can be integrated with other tasks such
input to the system will be a image, and the output will be a as pose estimation where the first stage in the pipeline is to
bounding box corresponding to all the objects in the image, detect the object, and then the second stage will be to estimate
along with the class of object in each box. pose in the detected region. It can be used for tracking objects
1.2 Existing System and thus can be used in robotics and medical applications.
2

Thus this problem serves a multitude of applications. When an image is acquired by a camera or other imaging
Flow of the model: system, often the vision system for which it is intended is
unable to use it directly. The image may be corrupted by
random variations in intensity, variations in illumination, or
poor contrast that must be dealt with in the early stages of
vision processing.
Filtering is a technique for modifying or enhancing an im-
age. For example, you can filter an image to emphasize certain
features or remove other features. Image processing operations
implemented with filtering include smoothing, sharpening, and
edge enhancement.
Filtering is a neighborhood operation, in which the value of
any given pixel in the output image is determined by applying
some algorithm to the values of the pixels in the neighborhood
of the corresponding input pixel. A pixel’s neighborhood is
some set of pixels, defined by their locations relative to that
pixel. (See Neighborhood or Block Processing: An Overview
for a general discussion of neighborhood operations.) Linear
filtering is filtering in which the value of an output pixel is
a linear combination of the values of the pixels in the input
pixel’s neighborhood.
Blob Analysis:
Blob Analysis is a fundamental technique of machine vision
based on analysis of consistent image regions. As such it
is a tool of choice for applications in which the objects
being inspected are clearly discernible from the background.
Diverse set of Blob Analysis methods allows to create tailored
solutions for a wide range of visual inspection problems.
Main advantages of this technique include high flexi-
bility and excellent performance. Its limitations are: clear
background-foreground relation requirement (see Template
Matching for an alternative) and pixel-precision (see 1D Edge
Detection for an alternative).
Fig. 1. Flow diagram Blob detection methods are aimed at detecting regions in
a digital image that differ in properties, such as brightness or
color, compared to surrounding regions. Informally, a blob is
Capture Video:
a region of an image in which some properties are constant
The real time video is captured using the webcame and
or approximately constant; all the points in a blob can be
the frames are thus extracted. In order to capture the video
considered in some sense to be similar to each other. The
openCV library is used as a feature of Python.
most common method for blob detection is convolution.
Feature extraction starts from an initial set of measured data
and builds derived values (features) intended to be informative
and non-redundant, facilitating the subsequent learning and II. C ONCLUSION
generalization steps, and in some cases leading to better human In this paper we propose a model that can efficiently
interpretations. detect realtime object using a live video recording device.
Feature extraction is a dimensionality reduction process, Using machine learning techniques to image classifications
where an initial set of raw variables is reduced to more man- and counting we detect and classify the categorical object.
ageable groups (features) for processing, while still accurately REFERENCES:
and completely describing the original data set. [1] Dong, C., Loy, C.C., He, K. and Tang, X., 2014,
When the input data to an algorithm is too large to be September. Learning a deep convolutional network for image
processed and it is suspected to be redundant then it can super-resolution. In European conference on computer vision
be transformed into a reduced set of features (also named a (pp. 184-199).
feature vector). Determining a subset of the initial features [2] Ignatov, A., Kobyshev, N., Timofte, R., Vanhoey, K. and
is called feature selection. The selected features are expected Van Gool, L., 2017, October. DSLR-quality photos on mobile
to contain the relevant information from the input data, so devices with deep convolutional networks. In the IEEE Int.
that the desired task can be performed by using this reduced Conf. on Computer Vision (ICCV)
representation instead of the complete initial data. [3] Vinyals, O., Toshev, A., Bengio, S. and Erhan, D.,
Image filtering: 2015. Show and tell: A neural image caption generator. In
3

Proceedings of the IEEE conference on computer vision and

pattern recognition (pp. 3156-3164)
[4]Karpathy, A. and Fei-Fei, L., 2015. Deep visual-semantic
alignments for generating image descriptions. In Proceedings
of the IEEE conference on computer vision and pattern recog-
nition (pp. 3128-3137).

(MCQ) Data
No ratings yet
(MCQ) Data
8 pages
SAP Analytics Cloud
No ratings yet
SAP Analytics Cloud
2 pages
Theories of Second Language Acquisition
100% (2)
Theories of Second Language Acquisition
69 pages
External PPT 18
No ratings yet
External PPT 18
29 pages
Control System For DC Motor Drive System: (MATLAB/Simulink)
100% (1)
Control System For DC Motor Drive System: (MATLAB/Simulink)
54 pages
Object Detection - Deep Learning: Jamia Hamdard
No ratings yet
Object Detection - Deep Learning: Jamia Hamdard
26 pages
Vibration Analysis in Bearings For Failure Prevent
No ratings yet
Vibration Analysis in Bearings For Failure Prevent
17 pages
Chapter 6 Basic Control Theory
0% (1)
Chapter 6 Basic Control Theory
94 pages
KNN Algorithm - PPT (Autosaved)
0% (1)
KNN Algorithm - PPT (Autosaved)
8 pages
Drowsiness Detection Using Opencv Final
No ratings yet
Drowsiness Detection Using Opencv Final
83 pages
Non Tech Data Analytics Roadmap 1689017100
No ratings yet
Non Tech Data Analytics Roadmap 1689017100
10 pages
FIFA 18 - Data Analysis: - Harsh Takrani - Pranay Lulla
No ratings yet
FIFA 18 - Data Analysis: - Harsh Takrani - Pranay Lulla
16 pages
Audio Classification
No ratings yet
Audio Classification
1 page
Systems and Transfer Function
No ratings yet
Systems and Transfer Function
48 pages
Tree Data Structure
No ratings yet
Tree Data Structure
5 pages
TRANSFORMER
No ratings yet
TRANSFORMER
29 pages
Skilldzire Report PDF
0% (1)
Skilldzire Report PDF
37 pages
Understanding Conversational Systems
100% (3)
Understanding Conversational Systems
2 pages
Final Report
No ratings yet
Final Report
51 pages
Python Project of Gender and Age Detection With OpenCV
No ratings yet
Python Project of Gender and Age Detection With OpenCV
8 pages
Anomaly Detection 2
No ratings yet
Anomaly Detection 2
8 pages
Object Detection Using Yolo
No ratings yet
Object Detection Using Yolo
42 pages
02 K-Means
No ratings yet
02 K-Means
25 pages
Image Processing Based Facial Emotion Recognition: A Project Report On
No ratings yet
Image Processing Based Facial Emotion Recognition: A Project Report On
39 pages
Emotion Detection
No ratings yet
Emotion Detection
17 pages
Image Recognition Using CNN
0% (1)
Image Recognition Using CNN
12 pages
Canonical Variate Analysis (CVA) For Closed-Loop Identification
No ratings yet
Canonical Variate Analysis (CVA) For Closed-Loop Identification
17 pages
Object Detection and Identification A Project Report: November 2019
No ratings yet
Object Detection and Identification A Project Report: November 2019
45 pages
Object Detection and Identification A Project Report: November 2019
No ratings yet
Object Detection and Identification A Project Report: November 2019
45 pages
Bahan Ajar Pemodelan Dan Identifikasi Sistem PDF
No ratings yet
Bahan Ajar Pemodelan Dan Identifikasi Sistem PDF
5 pages
OBJECT DETECTION IN AUTONOMOUS VEHICLES USING CNN Report FINAL
No ratings yet
OBJECT DETECTION IN AUTONOMOUS VEHICLES USING CNN Report FINAL
62 pages
Object Detection
0% (1)
Object Detection
49 pages
Ch1 Part 1
No ratings yet
Ch1 Part 1
41 pages
Fruit Old
No ratings yet
Fruit Old
37 pages
Bachelor of Technology in Computer Science and Engineering: A Project Report
100% (1)
Bachelor of Technology in Computer Science and Engineering: A Project Report
47 pages
b3 Plant Leaf Disease Detection
No ratings yet
b3 Plant Leaf Disease Detection
62 pages
EC-350 AI and Decision Support Systems: Week 1 Dr. Arslan Shaukat
No ratings yet
EC-350 AI and Decision Support Systems: Week 1 Dr. Arslan Shaukat
21 pages
Lo1:-Identify Customer Needs: Instruction Sheet Learning Guide #60
No ratings yet
Lo1:-Identify Customer Needs: Instruction Sheet Learning Guide #60
41 pages
Chem Eng 3P4 Assignment 6 DUE 5:00pm Fri 04/01/11
No ratings yet
Chem Eng 3P4 Assignment 6 DUE 5:00pm Fri 04/01/11
4 pages
Python and Machine Learning: A Practical Training Report On
No ratings yet
Python and Machine Learning: A Practical Training Report On
65 pages
Object Detection
No ratings yet
Object Detection
73 pages
Eye Blink Detection: Integrated - Master of Computer Applications
100% (1)
Eye Blink Detection: Integrated - Master of Computer Applications
34 pages
Ai For Everyone Notes
No ratings yet
Ai For Everyone Notes
7 pages
ECE 5th Sem Syllabus
0% (1)
ECE 5th Sem Syllabus
84 pages
Semisupervised Autoencoder For Sentiment Analysis12059-55631-1-PB
No ratings yet
Semisupervised Autoencoder For Sentiment Analysis12059-55631-1-PB
7 pages
Human Activity Recognition
No ratings yet
Human Activity Recognition
40 pages
Theoretical and Practical Analysis On CNN, MTCNN and Caps-Net Base Face Recognition and Detection PDF
No ratings yet
Theoretical and Practical Analysis On CNN, MTCNN and Caps-Net Base Face Recognition and Detection PDF
35 pages
Classification of Fruits and Detection of Disease Using CNN: Bachelor of Engineering IN Information Technology
No ratings yet
Classification of Fruits and Detection of Disease Using CNN: Bachelor of Engineering IN Information Technology
65 pages
Report Digital Image Processing On Edge Detection of Image
100% (2)
Report Digital Image Processing On Edge Detection of Image
15 pages
Program Controller PDF
No ratings yet
Program Controller PDF
1 page
IBM Pre - C1000-136 63q-DEMO
No ratings yet
IBM Pre - C1000-136 63q-DEMO
19 pages
A Facial Expression Recognition System A PDF
No ratings yet
A Facial Expression Recognition System A PDF
45 pages
Big Data Environment
No ratings yet
Big Data Environment
23 pages
Face Recognition Using CNN
No ratings yet
Face Recognition Using CNN
17 pages
Roo Project
No ratings yet
Roo Project
16 pages
Colour Detection
No ratings yet
Colour Detection
6 pages
Phishing Websites Features Classification Based On Extreme Learning Machine
No ratings yet
Phishing Websites Features Classification Based On Extreme Learning Machine
1 page
Object Detector For Blind Person
No ratings yet
Object Detector For Blind Person
20 pages
Nimbalkar Sandesh Seminar PPT Final
No ratings yet
Nimbalkar Sandesh Seminar PPT Final
20 pages
SR22804211151
No ratings yet
SR22804211151
8 pages
Klasifikasi Image Processing Pada Citra Warna Daun Padi Menggunakan Metode Convolutional Neural Network
No ratings yet
Klasifikasi Image Processing Pada Citra Warna Daun Padi Menggunakan Metode Convolutional Neural Network
12 pages
Object Detection and Recognition System (Using TensorFlow)
No ratings yet
Object Detection and Recognition System (Using TensorFlow)
8 pages
Virtual Mirror - A Hassle Free Approach To The Use of Trial Room
No ratings yet
Virtual Mirror - A Hassle Free Approach To The Use of Trial Room
38 pages
Sign Language Recognition Using Deep Learning
No ratings yet
Sign Language Recognition Using Deep Learning
6 pages
AE - IEEE - REPORT - 01fe20bei040
No ratings yet
AE - IEEE - REPORT - 01fe20bei040
5 pages
Age and Gender Detection
No ratings yet
Age and Gender Detection
13 pages
Traffic Signal Annunciator: Government College of Engineering, Jalgaon 425002
No ratings yet
Traffic Signal Annunciator: Government College of Engineering, Jalgaon 425002
32 pages
Seminar On Deep CNN
No ratings yet
Seminar On Deep CNN
36 pages
Major 2 Report
No ratings yet
Major 2 Report
41 pages
Face Recognition Based Attendance Management System
No ratings yet
Face Recognition Based Attendance Management System
5 pages
First Review PDF
No ratings yet
First Review PDF
36 pages
Ab5 PDF
No ratings yet
Ab5 PDF
93 pages
Project Report PDF
No ratings yet
Project Report PDF
29 pages
Skin Cancer Detection Using Convolutional Neural Network
No ratings yet
Skin Cancer Detection Using Convolutional Neural Network
8 pages
General Framework For Object Detection
No ratings yet
General Framework For Object Detection
9 pages
NLP Sentiment Analysis On Movie Reviews With Toxic Comment Detection
No ratings yet
NLP Sentiment Analysis On Movie Reviews With Toxic Comment Detection
33 pages
Image Analysis - Pattern Recognition - Pattern Patterns Represent Knowledge
No ratings yet
Image Analysis - Pattern Recognition - Pattern Patterns Represent Knowledge
22 pages
A Comprehensive Study of Camouflaged Object Detection Using Deep Learning
No ratings yet
A Comprehensive Study of Camouflaged Object Detection Using Deep Learning
8 pages
Real Time Face Detection
No ratings yet
Real Time Face Detection
70 pages
Computer Vision Module Application For Finding A Target in A Live Camera
No ratings yet
Computer Vision Module Application For Finding A Target in A Live Camera
8 pages
A Study On Real Time Object Detection Using Deep Learning IJERTV11IS050269
No ratings yet
A Study On Real Time Object Detection Using Deep Learning IJERTV11IS050269
7 pages
Detection and Classification of Dental Caries in X-Ray Images Using Deep Neural Networks
No ratings yet
Detection and Classification of Dental Caries in X-Ray Images Using Deep Neural Networks
5 pages
Face Recognition Report PDF
No ratings yet
Face Recognition Report PDF
16 pages
Object Detection Using Faster R-CNN Deep Learning: Trainfasterrcnnobjectdetector
No ratings yet
Object Detection Using Faster R-CNN Deep Learning: Trainfasterrcnnobjectdetector
9 pages
Feature Matching in Iris Recognition System Using MATLAB
No ratings yet
Feature Matching in Iris Recognition System Using MATLAB
10 pages
Face Recognition System
No ratings yet
Face Recognition System
32 pages
Project Detecto!: A Real-Time Object Detection Model
No ratings yet
Project Detecto!: A Real-Time Object Detection Model
3 pages
Facial K: Dynamic Selfie Filters Using ML
No ratings yet
Facial K: Dynamic Selfie Filters Using ML
10 pages
Vehicle Detection and Tracking
No ratings yet
Vehicle Detection and Tracking
11 pages
Satellite Image Classification With Deep Learning Survey
No ratings yet
Satellite Image Classification With Deep Learning Survey
5 pages

Project Detecto!: A Real-Time Object Detection Model

Uploaded by

Project Detecto!: A Real-Time Object Detection Model

Uploaded by

1

Project DetectO!: A real-time object detection

Proceedings of the IEEE conference on computer vision and

You might also like