0% found this document useful (0 votes)
30 views3 pages

Vitamin Deficiency Detection (Base Paper)

Uploaded by

elavarasan77new
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
30 views3 pages

Vitamin Deficiency Detection (Base Paper)

Uploaded by

elavarasan77new
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 3

REAL TIME OBJECT DETECTION USING

DEEP LEARNING

ABSTRACT
Real time object detection is a vast, vibrant and complex area of computer vision. Ifthere is a
single object to be detected in an image, it is known as Image Localization and ifthere are
multiple objects in an image, then it is Object Detection. This detects the semanticobjects of a
class in digital images and videos. The applications of real time object detectioninclude
tracking objects, video surveillance, pedestrian detection, people counting, self-driving cars,
face detection, ball tracking in sports and many more. Convolution NeuralNetworks is a
representative tool of Deep learning to detect objects using OpenCV(Opensource Computer
Vision), which is a library of programming functions mainly aimed at realtime computer
vision.
Keywords: Computer vision, Deep Learning, Convolution Neural Networks.

I. INTRODUCTION
Object detection is a technology to detect various objects in digital images and videos too. It
is mainly helpful within the self- driving cars, face detection, etc., where the objects are to be
continuously monitored. The algorithm or the technique involved for object detection during
this project is Convolutional Neural Networks which is a class of Deep learning. This uses
MobileNet SSD technique during which MobileNet is a neural network used for image
classification and recognition whereas SSD is a framework that is used to realize the
multibox detector. The mixture of both MobileNet and SSD can do object detection. The
main advantage or purpose of choosing Deep learning is that we do not need to do feature
extraction from data as compared to machine learning. The Haar-like trait play a crucial role
in detecting the objects in a picture. They scan the entire picture starting from the top left and
compares every small box with the trained data. In this way, even small-detailed objects
present within the images are identified.

III. METHODOLOGY
Deep learning, a subset of machine learning which in turn is a subset of artificial intelligence
(AI) has networks capable of learning things from the data that is unstructured or unlabeled.
The approach utilized in this project is Convolutional Neural Networks (CNN). It uses the
Haar-cascade classifiers which help us in the detection of objects.

1. CNN:

The Convolutional Neural Network, or CNN is a type of neural network designed for
image processing but can handle 1D and 3D data. The key component is the convolutional
layer, where the network gets its name. This layer performs "convolutions"—a linear
operation involving multiplication of input data by a set of weights (filters/kernels).In CNNs,
a smaller filter moves over the input, performing element-wise multiplication between the
filter and sections of the input, then summing the results into a single value. This process
repeats systematically over the input, left to right, top to bottom, allowing the filter to extract
features from different regions.

2. Open CV:
Open CV stands for open source computer vision. it's a group of libraries in Python. it's a
tool by which we will be able to manipulate the pictures , like image scaling, etc. This
supports and helps us in developing real time computing applications. It mainly concentrates
and targets on image processing, video capture and analysis. It includes several features like
face detection and also object detection. Currently OpenCV supports differing types of
programming languages like C++, Python, Java etc., and it's available on various platforms
including Windows, Linux, OS X, Android etc.
3.Training the Data Set:
The data set is typically the gathering of knowledge . the info set could also be collection
of images or alphabets or numbers or documents and files too. the info set we used for the
thing detection is that the collection of images of all the objects that are to be identified.
Several different images of every and each object is typically present within the data set. If
there are more number of images like each object within the datasets then the accuracy are
often improved. The important thing that's to be remembered is that the info within the data
set must be labelled. there'll be actually 3 data set. they're the training data set, the validation
dataset and therefore the other one is testing data set. The training data set will usually
contains around 85-90% of the entire labelled data. This training dataset are going to be
training our machine and therefore the model is obtained by training the info set. The
validation data set consists of around 5-10% of the entire labelled data. this is often used for
the validation purpose. the opposite data set is that the testing dataset and it's wont to test the
performance of our machine.
4. Developing a real time object detector:

To develop a real-time object detector using deep learning and OpenCV, you’ll need to
access the webcam efficiently and apply object detection to each frame and process the Each
frame, treated as an image, is given as input to the model. The model identifies objects,
surrounds them with boxes, and labels them. we’ll only view the processed output video, not
the raw input.
Block Diagram of Proposed System

You might also like