real object detection system using yolov3 images
ABSTRACT
1. Limited Memory.
3. Time consuming
PROPOSED SYSTEM
The proposed system provides an optimized architecture for detecting objects concurrently on
multiple cameras. It uses YOLOv3 for concurrent real-time object detection on a single GPU
server with a multi-threaded architecture. YOLOv3 is built on the Darknet-53 backbone, which
has 53 convolutional layers trained on the ImageNet dataset. For the detection task, 53
additional layers are stacked onto this backbone, giving a 106-layer fully convolutional
network. The proposed model is designed to spot even tiny objects in an image and is able to
recognize 80 different object classes in a single image. Our purpose is to provide an
optimized architecture that significantly decreases memory usage.
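The multi-threaded layout described above can be sketched as follows. This is a minimal sketch under stated assumptions, not the project's actual code: each camera gets a lightweight producer thread, and a single consumer thread owns the detector, so only one copy of the model has to sit in GPU memory. The names fake_detector, camera_worker, detection_worker and run_pipeline are hypothetical, and fake_detector merely stands in for a YOLOv3 forward pass.

```python
import queue
import threading


def fake_detector(frame):
    # Hypothetical stand-in for a YOLOv3 forward pass; returns a list of labels.
    return [f"object-in-{frame}"]


def camera_worker(camera_id, frames, frame_queue):
    # Producer: a camera thread only captures frames and enqueues them.
    for frame in frames:
        frame_queue.put((camera_id, frame))


def detection_worker(frame_queue, results, detector):
    # Consumer: the only thread that touches the model, so the model is
    # loaded once regardless of how many cameras are attached.
    while True:
        item = frame_queue.get()
        if item is None:  # sentinel: all cameras have finished
            break
        camera_id, frame = item
        results.setdefault(camera_id, []).append(detector(frame))


def run_pipeline(camera_frames, detector=fake_detector):
    # A bounded queue keeps memory usage flat if cameras outpace the detector.
    frame_queue = queue.Queue(maxsize=32)
    results = {}
    consumer = threading.Thread(
        target=detection_worker, args=(frame_queue, results, detector)
    )
    consumer.start()
    producers = [
        threading.Thread(target=camera_worker, args=(cid, frames, frame_queue))
        for cid, frames in camera_frames.items()
    ]
    for p in producers:
        p.start()
    for p in producers:
        p.join()
    frame_queue.put(None)  # tell the consumer to stop
    consumer.join()
    return results


if __name__ == "__main__":
    print(run_pipeline({"cam0": ["f0", "f1"], "cam1": ["g0"]}))
```

In a real deployment the frame lists would be replaced by live capture loops and fake_detector by the loaded network. The bounded queue is one possible way to trade latency for a fixed memory ceiling.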
2) Analyze
This module detects faces and reduces the noise present in complicated movie scenes. It
defines graph editing operations based on the noise analysis and then designs an edit cost
function to improve performance.
3) Testing Dataset
After the training and analysis phases, the testing phase checks whether the detection
algorithm, trained on the earlier data, works properly on a given image. We first highlight
the importance of the learning strategy, given the difficulty of training detectors, and then
introduce the optimization techniques for both the training and testing stages in detail.
Finally, we review some real-world applications of object detection, including face
detection, pedestrian detection, logo detection and video analysis. During testing, images are
resized to different scales and passed through multiple detectors, and the detection results
are merged.
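The source does not say how the per-scale results are merged. One common choice is non-maximum suppression (NMS), sketched below as an assumption rather than the project's confirmed method. Boxes from each scale are first mapped back to original-image coordinates. Overlapping boxes of the same label are then reduced to the highest-scoring one. The function names, the (x1, y1, x2, y2) box layout and the 0.5 IoU threshold are illustrative choices.

```python
def iou(a, b):
    # Intersection over union of two boxes given as (x1, y1, x2, y2).
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def merge_detections(per_scale, iou_threshold=0.5):
    # per_scale: list of (scale, detections), where each detection is
    # (box, score, label) expressed in the resized image's coordinates.
    candidates = []
    for scale, detections in per_scale:
        for box, score, label in detections:
            # Undo the resize so boxes from every scale are comparable.
            original_box = tuple(c / scale for c in box)
            candidates.append((original_box, score, label))

    # Greedy NMS: visit boxes from highest to lowest score and keep a box
    # only if it does not overlap an already kept box of the same label.
    candidates.sort(key=lambda d: d[1], reverse=True)
    kept = []
    for box, score, label in candidates:
        if all(label != k[2] or iou(box, k[0]) < iou_threshold for k in kept):
            kept.append((box, score, label))
    return kept


if __name__ == "__main__":
    merged = merge_detections([
        (1.0, [((10, 10, 50, 50), 0.9, "person")]),
        (2.0, [((22, 22, 100, 100), 0.8, "person"),
               ((200, 200, 240, 240), 0.7, "dog")]),
    ])
    for detection in merged:
        print(detection)
```

In the usage example, the person found at scale 2.0 maps back to nearly the same box as the one found at scale 1.0, so only the higher-scoring copy survives, while the dog is kept.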
4) Detection
This module detects the faces of the movie characters using the cv2 (OpenCV) and facenet
libraries. Once both libraries are installed, the project reads the images from the specified
path, splits them into regions and compares each region with the trained data prepared in the
pre-processing step. The facenet library is used to load the model from the given directory
and to obtain the RGB colors of the divided regions in order to match them. When this process
completes, the module reports the objects present in the given image or video. It obtains
the precise localization of objects; 5 million object instances are annotated, and 200 K of
the 330 K pictures are labelled.
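The comparison step can be illustrated with a small sketch. It assumes that each region and each trained face has already been reduced to a numeric feature vector (FaceNet-style models produce such embeddings, and closer vectors indicate more similar faces). The facenet library's own calls are not shown in the source, so none are used here. The names match_face and known, the short example vectors and the distance threshold are all hypothetical.

```python
import math


def euclidean(a, b):
    # Straight-line distance between two equal-length feature vectors.
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def match_face(embedding, known, threshold=1.0):
    # Compare one region's vector with every trained vector and return the
    # closest name, or None if even the best match is too far away.
    best_name, best_distance = None, float("inf")
    for name, reference in known.items():
        distance = euclidean(embedding, reference)
        if distance < best_distance:
            best_name, best_distance = name, distance
    if best_distance <= threshold:
        return best_name, best_distance
    return None, best_distance


if __name__ == "__main__":
    # Hypothetical trained data: one vector per known character.
    known = {"alice": [0.0, 0.0], "bob": [1.0, 1.0]}
    print(match_face([0.1, 0.0], known))
    print(match_face([5.0, 5.0], known))
```

The threshold is what lets the module report "no match" for a face that was never trained. Its value would need tuning on real data.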
The hardware specifications below were used on both the server and client machines during
development.
SOFTWARE CONFIGURATION
The software specifications below were used on both the server and client machines during
development.
SERVER
Operating System : Windows 7
Technology Used : Python
Database : MySQL
Database Connectivity : Native Connectivity
Web Server : Django
Browser : Chrome
CLIENT
Operating System : Windows 7
Browser : Chrome