
Journal of Computer and Communications, 2018, 6, 98-107

http://www.scirp.org/journal/jcc
ISSN Online: 2327-5227
ISSN Print: 2327-5219

A Vehicle Detection Method for Aerial Image Based on YOLO

Junyan Lu1*, Chi Ma2, Li Li2, Xiaoyan Xing2, Yong Zhang2, Zhigang Wang2, Jiuwei Xu2

1Chang Guang Satellite Technology Co., Ltd., Changchun, China
2Department of Transportation of Jilin Province, Changchun, China
How to cite this paper: Lu, J.Y., Ma, C., Li, L., Xing, X.Y., Zhang, Y., Wang, Z.G. and Xu, J.W. (2018) A Vehicle Detection Method for Aerial Image Based on YOLO. Journal of Computer and Communications, 6, 98-107. https://doi.org/10.4236/jcc.2018.611009

Received: August 29, 2018
Accepted: November 12, 2018
Published: November 19, 2018

Abstract

With the application of UAVs in intelligent transportation systems, vehicle detection for aerial images has become a key engineering technology with academic research significance. In this paper, a vehicle detection method for aerial images based on the YOLO deep learning algorithm is presented. The method integrates an aerial image dataset suitable for YOLO training by processing three public aerial image datasets. Experiments show that the trained model performs well on unknown aerial images, especially for small objects, rotating objects, and compact, dense objects, while meeting real-time requirements.

Keywords
Vehicle Detection, Aerial Image, YOLO, VEDAI, COWC, DOTA

1. Introduction
In recent years, with the rapid development of information technology, intelligent transportation systems have become an important means of modern traffic management and an inevitable trend. As a key technology of intelligent transportation systems, vehicle detection is the basis for realizing many important functions [1], such as the measurement and statistics of traffic parameters (e.g., traffic flow and density), vehicle location and tracking, and traffic data mining.
At the same time, as UAVs (Unmanned Aerial Vehicles) have matured technologically and gained market popularity, their lightweight, flexible, and inexpensive nature gives UAV aerial photography a great advantage in applications such as traffic information collection and traffic emergency response.


In summary, vehicle detection for aerial images plays an important role in engineering applications. In addition, the technology draws on machine vision, artificial intelligence, image processing, and other disciplines, making it a typical application of interdisciplinary research; it therefore also has important academic research significance.
Based on the YOLO deep learning algorithm and three public aerial image datasets, this paper presents a vehicle detection method for aerial images.

2. Related Work
The vehicle detection methods commonly proposed by domestic and foreign scholars fall mainly into three categories: based on motion information, based on features, and based on template matching. Cheng and others used background subtraction and registration to detect moving vehicles [2], and Azevedo and others detected vehicles in aerial images with a median-based background difference method [3]. These two methods achieve moving-object detection; however, because aerial video features complex scenes and diverse objects, they cannot achieve the desired accuracy for vehicle detection, and false and missed detections remain serious. Sivaraman and others combined Haar features with AdaBoost to implement vehicle detection on highways [4], and Tehrani and others proposed a vehicle detection method based on HOG features and SVM for urban roads [5]. These two methods improve detection accuracy, but since traditional machine learning methods only support training on small amounts of data, they still fall short in handling vehicle diversity.
In recent years, with advances in computer hardware, especially GPU technology, deep learning algorithms have developed rapidly for problems in pattern recognition and image processing, and they are more efficient and precise than traditional algorithms. Therefore, this paper uses a deep learning algorithm, YOLO, to achieve vehicle detection.

3. YOLO Deep Learning Object Detection Algorithm


YOLO, proposed by Joseph Redmon and others in 2015 [6], is a real-time object detection system based on CNNs (Convolutional Neural Networks). At CVPR (the Conference on Computer Vision and Pattern Recognition) 2017, Joseph Redmon and Ali Farhadi released YOLO v2, which improved the algorithm's accuracy and speed [7]. In April 2018, they proposed the latest version, YOLO v3, which further improved object detection performance [8]. This section introduces the basic principles of the YOLO algorithm, following its version history.

3.1. YOLO v1
1) Basic idea


YOLO divides the input image into S × S grid cells. If the center of an object's GT (Ground Truth) box falls into a grid cell, that cell is responsible for detecting the object (a toy sketch of this assignment follows). YOLO's innovation is that it reforms the region-proposal detection framework: the R-CNN series must first generate region proposals and then complete classification and regression within them, and the overlap between region proposals causes much repeated work. YOLO instead predicts the bounding boxes (bboxes) of the objects contained in all grid cells, their confidence scores, and the class probability vectors all at once, solving the problem in a single pass.
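To make the grid-assignment idea concrete, here is a minimal illustrative sketch (not the authors' code; S = 7 is YOLO v1's default grid size):

```python
# Illustrative sketch: find the S x S grid cell responsible for an object,
# i.e., the cell that contains the center of its ground-truth box.
def responsible_cell(cx, cy, img_w, img_h, S=7):
    """cx, cy: GT center in pixels; returns the (row, col) of the grid cell."""
    col = min(int(cx / img_w * S), S - 1)
    row = min(int(cy / img_h * S), S - 1)
    return row, col

# An object centered at (100, 300) in a 448 x 448 input falls in cell (4, 1).
print(responsible_cell(100, 300, 448, 448))  # -> (4, 1)
```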
2) Network structure
The YOLO network borrows from GoogLeNet; the difference is that YOLO simply uses a 1 × 1 convolutional layer (for cross-channel information integration) followed by a 3 × 3 convolutional layer in place of the Inception module (a rough sketch of this pair follows). The YOLO v1 network consists of 24 convolutional layers and 2 fully connected layers, as shown in Figure 1.
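As a rough illustration of this design choice, such a pair can be written as the following PyTorch sketch (channel counts are hypothetical, not taken from the actual YOLO v1 configuration):

```python
# Hypothetical sketch of the 1 x 1 + 3 x 3 convolution pair that YOLO v1
# uses in place of the Inception module; channel counts are illustrative.
import torch.nn as nn

conv_pair = nn.Sequential(
    nn.Conv2d(512, 256, kernel_size=1),             # 1 x 1: cross-channel integration
    nn.LeakyReLU(0.1),
    nn.Conv2d(256, 512, kernel_size=3, padding=1),  # 3 x 3: spatial feature extraction
    nn.LeakyReLU(0.1),
)
```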

3.2. YOLO v2
Compared with region-proposal-based methods such as Fast R-CNN, YOLO v1 has a larger localization error and a lower recall rate. The main improvements of YOLO v2 therefore target recall and localization ability, and include:
1) BN (Batch Normalization)
BN has been a popular training technique since 2015. By adding a BN layer after each layer, each batch of data is normalized to a space with zero mean and unit variance, which prevents vanishing and exploding gradients and makes the network converge faster (see the sketch below).
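The normalization step can be sketched in a few lines of NumPy (a real BN layer also learns a per-channel scale and shift, omitted here):

```python
# Minimal sketch of batch normalization: normalize each feature over the
# batch to zero mean and unit variance (learned scale/shift omitted).
import numpy as np

def batch_norm(x, eps=1e-5):
    mean = x.mean(axis=0)
    var = x.var(axis=0)
    return (x - mean) / np.sqrt(var + eps)

batch = 3.0 * np.random.randn(64, 128) + 5.0  # activations with shifted statistics
normed = batch_norm(batch)
print(normed.mean().round(6), normed.std().round(6))  # ~0.0 and ~1.0
```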
2) Anchor boxes
In YOLO v1, fully connected layers after the convolutional layers predict the bbox coordinates directly. Following the idea of Faster R-CNN, YOLO v2 removes the fully connected layers and adds anchor boxes, which effectively improves recall.
3) Multi-scale training

Figure 1. YOLO v1 network structure.


The input image size of the YOLO v1 training network is fixed, whereas YOLO v2 randomly adjusts the input image size every 10 batches during training, so that the model detects well on multi-scale input images at test time (a sketch follows).
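A sketch of this multi-scale scheme (the size range and the stride of 32, the network's downsampling factor, follow the YOLO v2 paper; treat the exact values as assumptions):

```python
# Sketch of YOLO v2-style multi-scale training: periodically re-draw the
# input resolution from multiples of 32, the network's downsampling stride.
import random

def pick_input_size(low=320, high=608, stride=32):
    return random.randrange(low, high + 1, stride)

print([pick_input_size() for _ in range(3)])  # e.g. [352, 608, 416]
```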

3.3. YOLO v3
The YOLO v3 model is much more complex than YOLO v2, and it performs excellently on small objects as well as compact, dense, or highly overlapping objects. The main improvements include:
1) Loss
YOLO v3 replaces the softmax loss of YOLO v2 with a logistic loss. When the predicted object classes are complex, and especially when the dataset contains many overlapping labels, logistic regression is more effective.
2) Anchors
YOLO v3 uses nine anchors instead of YOLO v2's five, which improves the IoU.
3) Detection
YOLO v2 predicts detections at only one scale, while YOLO v3 predicts at three scales, which greatly improves the detection of small objects.
4) Backbone
YOLO v3 replaces YOLO v2's Darknet-19 network with Darknet-53, deepening the network and improving object detection accuracy.
This paper uses the latest YOLO v3 model to achieve vehicle detection for aerial images.

4. Public Datasets for YOLO Training


Classifiers trained on conventional datasets perform poorly on aerial images, because aerial images have the following special characteristics:
1) Scale diversity
UAV shooting heights range from tens of meters to kilometers, resulting in a wide range of sizes for similar objects on the ground.
2) Perspective specificity
Aerial images are basically high-altitude, overhead views, while most conventional datasets use ground-level perspectives.
3) Small objects
Objects in aerial images generally occupy only a few dozen or even a few pixels, so they carry little information.
4) Multidirectionality
Aerial images are taken from a bird's-eye view, so object orientations are arbitrary (whereas objects in conventional datasets tend to have fixed orientations; pedestrians, for example, are generally upright).
5) High background complexity
Aerial images have a large field of view (usually covering a few square kilometers) and may contain a variety of backgrounds, which strongly interferes with object detection.
For the above reasons, it is often difficult to train an ideal classifier for aerial object detection on conventional datasets, so a specialized aerial image dataset is needed. In this paper, three public aerial image datasets are processed and combined into a new aerial image dataset suitable for YOLO training. This section introduces the three datasets.

4.1. VEDAI Dataset


The VEDAI (Vehicle Detection in Aerial Imagery) dataset was made by Sebastien Razakarivony and Frederic Jurie of the University of Caen [9]; its original material comes from the public Utah AGRC database. The raw images have 4 uncompressed color channels (three visible color channels and one near-infrared channel). The authors first split the original large-field satellite images into 1024 × 1024 pixel JPEG images, then created a visible-color-channel dataset and a near-infrared-channel dataset, and finally downsampled both to 512 × 512 pixels, so VEDAI contains 4 subsets. In this paper, only the first subset of VEDAI (1024 × 1024, RGB 3 channels) is used. All images in VEDAI share the same shooting height, and the GSD (Ground Sampling Distance) of the 1024 × 1024 images is 12.5 cmpp (cm per pixel). VEDAI contains a total of 1250 images and is manually annotated with nine vehicle classes ("plane", "boat", "camping car", "car", "pick-up", "tractor", "truck", "van", and "other"), 2950 samples in total. The annotation of each sample includes: sample class, the GT's center point coordinates, direction, and the coordinates of the GT's 4 corners.

4.2. COWC Dataset


The COWC (Cars Overhead with Context) dataset was made by T. Nathan Mundhenk and others of Lawrence Livermore National Laboratory [10]; its original material comes from six public websites. COWC contains a total of 53 images in TIFF format, with sizes between 2000 × 2000 and 19,000 × 19,000 pixels. The images cover six geographic locations, namely Toronto (Canada), Selwyn (New Zealand), Potsdam and Vaihingen (Germany), and Columbus and Utah (United States); the Vaihingen and Columbus images are grayscale, while the others are in RGB color. The GSD of the images is 15 cmpp, so vehicle sizes are basically between 24 and 48 pixels. COWC is manually annotated with one class of positive samples ("car", 32,716 samples) as well as four classes of negative samples that are easily confused with vehicles ("boats", "trailers", "bushes", and "A/C units", 58,247 samples). The annotation of each sample includes: sample class and the GT's center point coordinates.

4.3. DOTA
DOTA (Dataset for Object detection in Aerial images) is an aerial image dataset made by Xia Guisong of Wuhan University, Bai Xiang of Huazhong University of Science and Technology, and others [11]. To eliminate the deviation caused by different sensors, its original material comes from multiple platforms (such as Google Earth). DOTA is characterized by multiple sensors and multiple resolutions, meaning that the GSDs of its images are diverse. DOTA contains a total of 2806 images of about 4000 × 4000 pixels and is manually annotated with 15 sample classes ("plane", "ship", "storage tank", "baseball diamond", "tennis court", "swimming pool", "ground track field", "harbor", "bridge", "large vehicle", "small vehicle", "helicopter", "roundabout", "soccer ball field", and "basketball court"), 188,282 samples in total. The annotation of each sample includes: sample class and the coordinates of the GT's 4 corners (with the top-left corner as the starting point, arranged in clockwise order).

5. A Vehicle Detection Method for Aerial Image Based on YOLO

In this paper, we first process and integrate the above three public aerial image datasets, and then modify the network parameters of the YOLO algorithm appropriately to train a model, yielding a vehicle detection method for aerial images. The specific steps are as follows.

5.1. Make Standard Datasets for YOLO Training


The standard dataset for YOLO training consists of two parts: images and labels, where images are in JPEG format and labels are txt documents. Labels and images are in one-to-one correspondence, and each label file records the annotations of the samples in the corresponding image, one sample per line, in the format:

class x y w h

where (x, y) are the GT's center point coordinates and (w, h) its width and height, all normalized by the image size (a small illustrative sketch follows Table 1). Since the input dimension of the YOLO v3 training network is 416 × 416 × 3, the images used for training should not be too large; otherwise, sample characteristics may be seriously lost after resizing. The basic information of the three public aerial image datasets described in Section 4 is shown in Table 1.

Table 1. The basic information of the three public aerial image datasets.

Dataset | Image Format | Images | Classes | Image size | Annotations
VEDAI | JPEG | 1250 | 9 | 1024 × 1024 | sample class, GT's center point coordinates, direction, coordinates of GT's 4 corners
COWC | TIFF | 53 | 4 | 2000 × 2000 to 19,000 × 19,000 | sample class, GT's center point coordinates
DOTA | JPEG | 2806 | 15 | about 4000 × 4000 | sample class, coordinates of GT's 4 corners


We process the three datasets separately, as follows (a conversion sketch in code follows these steps).

1) VEDAI
a) The image size is suitable and does not need to be processed;
b) Delete the annotations of the three classes "plane", "boat", and "other" from the labels;
c) Delete the "direction" field from the annotations;
d) From the coordinates of the GT's 4 corners, calculate the width and height:

w = xmax − xmin, h = ymax − ymin (1)

2) COWC
a) Delete the grayscale images;
b) Delete the annotations of the negative samples, leaving only the positive class "car";
c) Split the COWC images into 416 × 416 tiles and convert them to JPEG format. When splitting, the sample center point coordinates are converted accordingly to keep their positions correct in the new images; remaining regions smaller than 416 × 416 are padded with black.
d) Based on the GSD of COWC, the vehicle size in the images is assumed to be a uniform 48 × 48 pixels; therefore,

w = h = 48/416 = 0.115384615384615… (2)
3) DOTA
a) Delete the annotations of all classes except "large vehicle" and "small vehicle", and unify those two classes as "car";
b) Split the DOTA images into 1024 × 1024 tiles. When splitting, the coordinates of the GT's 4 corners are converted accordingly to keep their positions correct in the new images; remaining regions smaller than 1024 × 1024 are abandoned.
c) Center point coordinates:

x = (xmax + xmin)/2, y = (ymax + ymin)/2 (3)

d) Width and height:

w = xmax − xmin, h = ymax − ymin (4)
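The corner-to-center conversion of Equations (1), (3), and (4) can be sketched as follows (an illustrative snippet, not the authors' preprocessing code; the tile-offset handling models the splitting described above and is an assumption):

```python
# Illustrative sketch of Equations (1), (3), and (4): convert a GT given by
# its 4 corner points into a normalized YOLO (x, y, w, h) annotation.
# The tile offset (tile_x0, tile_y0) is an assumed model of the splitting step.
def corners_to_yolo(corners, tile_x0, tile_y0, tile_size):
    """corners: [(x1, y1), ..., (x4, y4)] in source-image pixel coordinates."""
    xs = [x - tile_x0 for x, _ in corners]  # shift into tile coordinates
    ys = [y - tile_y0 for _, y in corners]
    x_min, x_max = min(xs), max(xs)
    y_min, y_max = min(ys), max(ys)
    x = (x_max + x_min) / 2 / tile_size     # Eq. (3), normalized
    y = (y_max + y_min) / 2 / tile_size
    w = (x_max - x_min) / tile_size         # Eqs. (1) and (4), normalized
    h = (y_max - y_min) / tile_size
    return x, y, w, h

# A vehicle inside the 1024 x 1024 tile whose top-left source corner is (2048, 1024):
print(corners_to_yolo([(2100, 1100), (2140, 1100), (2140, 1120), (2100, 1120)],
                      2048, 1024, 1024))
```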

After processing, the information of the new datasets is shown in Table 2.

Table 2. The processed datasets information.

Dataset | Image Format | Images | Classes | Image size | Annotations
VEDAI | JPEG | 1250 | 6 | 1024 × 1024 | (class, x, y, w, h)
COWC | JPEG | 4944 | 1 | 416 × 416 | (class, x, y, w, h)
DOTA | JPEG | 14,348 | 1 | 1024 × 1024 | (class, x, y, w, h)
Total | JPEG | 20,542 | 6 | 416 × 416, 1024 × 1024 | (class, x, y, w, h)

5.2. Configure Network Parameters for YOLO Training

1) Batch size
Use the YOLO v3 default parameter batch_size = 64.


2) Number of iterations
The dataset contains a total of 20,542 images, so one epoch requires 20,542 / 64 ≈ 320 iterations. YOLO training defaults to 160 epochs, so the total number of iterations is 160 × 320 = 51,200 (see the snippet after this list).
3) Learning rate
The initial learning rate is 0.001; it is divided by 10 after 60 epochs, and divided by 10 once again after 90 epochs.
4) Number of filters in the last layer of the network
filters = (classes + 5) × 3 = (6 + 5) × 3 = 33
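These parameter calculations can be double-checked in a few lines (values as stated above; this is a sanity check, not configuration code from the paper):

```python
# Sanity check of the training-schedule arithmetic described above.
images, batch_size, epochs, num_classes = 20542, 64, 160, 6

iters_per_epoch = images // batch_size      # ~320 iterations per epoch
total_iters = epochs * iters_per_epoch      # 51,200 iterations in total
last_layer_filters = (num_classes + 5) * 3  # 33 filters in the last conv layer

print(iters_per_epoch, total_iters, last_layer_filters)  # 320 51200 33
```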

6. Experimental Results
In this paper, we use an NVIDIA TITAN X graphics card for training. The training takes about 60 hours. The test results of the trained model are shown in Table 3.
The detection results of the trained model on unknown images are shown in Figure 2 (the original images are from the Internet; please inform us of any infringement).
Figure 2 (left) shows that the model detects small objects well. The vehicles in Figure 2 (middle) are mostly rotated rather than horizontal or vertical, and the test result shows that the model performs well on rotating objects; notably, the leftmost vehicle in the image blends closely into the background and might be missed by manual inspection, yet the model detects it correctly. Figure 2 (right) shows that the model is outstanding at detecting compact and dense objects: more than 95% of the vehicles are correctly detected, except for those in the shadow at the far left.

Figure 2. Trained model test results on unknown images.

Table 3. Test results of the trained model.

Indicator | mAP | Recall | IoU | FPS
Value | 76.7% | 92.1% | 82.3% | 55


7. Conclusion
In this paper, a vehicle detection method for aerial images based on the YOLO deep learning algorithm is presented. The method integrates an aerial image dataset suitable for YOLO training by processing three public datasets. The trained model gives good test results, especially for small objects, rotating objects, and compact, dense objects, and meets real-time requirements. Next, we will integrate more public aerial image datasets to increase the number and diversity of training samples, and at the same time optimize the YOLO algorithm to further improve detection accuracy.

Conflicts of Interest
The authors declare no conflicts of interest regarding the publication of this paper.

References
[1] Qiu, Y. (2014) Video-Based Vehicle Detection in Intelligent Transportation System.
Master Thesis, Jilin University, China.
[2] Cheng, P., Zhou, G. and Zheng, Z. (2009) Detecting and Counting Vehicles from Small Low-Cost UAV Images. Proceedings of ASPRS 2009 Annual Conference, Baltimore, 1-7.
[3] Azevedo, C.L., Cardoso, J.L., Ben-Akiva, M., Costeira, J.P. and Marques, M. (2014) Automatic Vehicle Trajectory Extraction by Aerial Remote Sensing. Procedia - Social and Behavioral Sciences, 111, 849-858. https://doi.org/10.1016/j.sbspro.2014.01.119
[4] Sivaraman, S. and Trivedi, M.M. (2010) A General Active-Learning Framework for On-Road Vehicle Recognition and Tracking. IEEE Transactions on Intelligent Transportation Systems, 11, 267-276. https://doi.org/10.1109/TITS.2010.2040177
[5] Tehrani, H., Akihiro, T., Mita, S. and Mcallester, D.A. (2012) On-Road Multivehicle Tracking Using Deformable Object Model and Particle Filter with Improved Likelihood Estimation. IEEE Transactions on Intelligent Transportation Systems, 13, 748-758. https://doi.org/10.1109/TITS.2012.2187894
[6] Redmon, J., Divvala, S., Girshick, R. and Farhadi, A. (2016) You Only Look Once: Unified, Real-Time Object Detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, 27-30 June 2016, 779-788. https://doi.org/10.1109/CVPR.2016.91
[7] Redmon, J. and Farhadi, A. (2017) YOLO9000: Better, Faster, Stronger. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, 21-26 July 2017, 6517-6525. https://doi.org/10.1109/CVPR.2017.690
[8] Redmon, J. and Farhadi, A. (2018) YOLOv3: An Incremental Improvement. arXiv:1804.02767 [cs.CV], Unpublished.
[9] Razakarivony, S. and Jurie, F. (2015) Vehicle Detection in Aerial Imagery: A Small Target Detection Benchmark. Journal of Visual Communication and Image Representation, 34, 187-203. https://doi.org/10.1016/j.jvcir.2015.11.002
[10] Mundhenk, T.N., Konjevod, G., Sakla, W.A. and Boakye, K. (2016) A Large Contextual Dataset for Classification, Detection and Counting of Cars with Deep Learning. Proceedings of the European Conference on Computer Vision, Springer, 2016, 785-800.
[11] Xia, G.S., Bai, X., Ding, J., Zhu, Z., Belongie, S., Luo, J.L., et al. (2018) DOTA: A Large-Scale Dataset for Object Detection in Aerial Images. arXiv:1711.10398 [cs.CV], Unpublished.
