0% found this document useful (0 votes)

5 views7 pages

Paper 3

This paper presents a deep learning approach for automatic pothole detection using four models: YOLO V3, SSD, HOG with SVM, and Faster R-CNN. The study demonstrates that YOLO V3 outperforms the other models in terms of speed and accuracy for pothole detection. The methodology includes data preparation, model training, and performance analysis, highlighting the importance of accurate pothole detection for road safety and efficiency.

Uploaded by

Logeshwar Balasubramanian

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

5 views7 pages

Paper 3

Uploaded by

Logeshwar Balasubramanian

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

2020 IEEE Sixth International Conference on Big Data Computing Service and Applications (BigDataService)

A Deep Learning Approach for Street Pothole

Detection
Ping Ping1; Xiaohui Yang1; Zeyu Gao 2
1. College of Computer and Information, Hohai University, Nanjing, China
2. College of Engineering, San Jose State University, San Jose, USA

e-mail: [email protected]

Abstract—Potholes are a structural damage to the road efficient in detecting the cracks and uneven surfaces on the
with hollow which can cause severe traffic accidents and road. Zhang L, et al. proposed to use a deep CNN model
impact road efficiency. In this paper, we propose an efficient along with some sensors for automatic crack detection[8].
pothole detection system using deep learning algorithms which This model can learn the features without any feature
can detect potholes on the road automatically. Four models are extraction processes automatically. Tedeschi A, et al.
trained and tested with preprocessed dataset, including YOLO developed a real-time pothole detection system for Android
V3, SSD, HOG with SVM and Faster R-CNN. In the phase one, devices[9].
initial images with potholes and non-potholes are collected and
labeled. In the phase two, the four models are trained and In this paper, we propose a solution aimed to use
tested for the accuracy and loss comparison with the processed machine learning and artificial intelligence algorithms to
image dataset. Finally, the accuracy and performance of all create an accurate and efficient pothole detection system.
four models are analyzed. The experimental results show that Four modern deep learning models are trained to see which
the YOLO V3 model performs best for its faster and more model or ensemble of models produces the best results,
reliable detection results. including Yolo V3(You Only Look Once) Algorithm,
SSD(Single Shot Detector) Algorithm, HOG(Histogram of
Keywords—YOLO, Deep learning, Pothole detection, CNN, Oriented Gradients) with Support Vector Machine and Faster
SVM. R-CNN. Our pothole detection schema consists of two parts:
(1) data preparation and (2) predict potholes in images using
I. INTRODUCTION machine learning models. In the first part, the subsets of all
A pothole is a structure failure in a road surface. It cannot available data that relate to our schema are selected,
be ignored since it may cause severe traffic accidents and containing training datasets and test datasets, positive and
impact road efficiency. The 2006 Asian Development Bank negative images. As positive image indicates that there exists
(ADB) study showed that about half of these paved roads are a pothole in the street and negative image indicates there are
in a poor condition. All developed countries almost have the no potholes. We need label images to generate these datasets
similar problem. Potholes are formed by the terrible weather and then convert image file to train.record which will be used
and heavy vehicles movement. The most important step to by the model as input. In the second part, the prepared data is
maintain the road condition is to detect potholes with high fed to deep learning models which will predict the potholes
accuracy[1,2]. Recent years, a lot of studies have been to do training and predictions. Finally, from the
conducted to detect pothole in the road automatically. Lin J, experimentally results, we can see that YOLO V3
et al. proposed to use SVM(Support Vector Machine) for outperforms best in terms of speed, and it is also decent in
pothole detection[3]. The image region was extracted based terms of accuracy for all object sizes. From the aspect of
on the histogram of the image and simple kernel SVM was time consuming, YOLO V3 still has superb performance.
used to locate the pothole. The target was well recognizable SSD has quite high accuracy but it is slower compared to
using this method. CNN ( Convolutional Neural Network) other models. HOG model has mediocre performance in both
based deep learning is used to classify the potholes and accuracy and speed. Faster R-CNN has best performance in
cracks based on the images. A model was built using CNN accuracy but this model needs more computing power and
which was not influenced by the noise due to incorrect training time.
illumination and shadows[4]. Hiroya Maeda, et al. [5]
developed a system to detect the road damage using CNN The rest of the paper is organized as follows: Section II
methods on the images taken by phones. They gathered a presents data pre-process relevant to the pothole detection
huge dataset for pothole detection and applied deep learning solution. Section III describes the proposed solution with
algorithms to solve the problem. The accuracy and speed of four trained models. In Section IV, the simulation results and
the road damage detection system was approving. Some performance analysis are given. Finally, the conclusion is
other researchers have employed binary classification [6] given in Section V.
based on deep neural networks to classify the road images II. FUNDAMENTAL KNOWLEDGE AND PRELIMINARIES
whether they belong to normal road images or the ones with
This section presents the process of prepared training
pothole. The features of the images need to be fed to the
dataset and test dataset in a statistics format, including
system before it can perform the classification. A new neural
classes, sizes, media types in each cycle.
model Crack-net [7] was proposed for detecting the cracks
on the road. The difference with other neural models was that The dataset we chosen is created by Electrical and
pooling layers are not included. This method was very Electronic Department, Stellenbosch University in 2015. The

978-1-7281-7022-0/20/$31.00 ©2020 IEEE 198

DOI 10.1109/BigDataService49289.2020.00039

Authorized licensed use limited to: Cornell University Library. Downloaded on August 29,2020 at 16:31:47 UTC from IEEE Xplore. Restrictions apply.
entire dataset consists of two different sets, one was have created a dataset of 2036 images. Out of the total
considered to be simple and the other more complex. The dataset, 1384 images are training data while the remaining
dataset is collected by clicking pictures on smart phones by 652 images will be used for test data. The detailed statistics
setting it up on the dashboard of a car. These datasets do is listed in table 1.
share some files and there are a few instances where two TABLE 1
different images would have the same name. Therefore,
appropriate measures need to be taken if the data is DETAILED STATISTICS OF THE ENTIRE DATASET
combined into one larger dataset. Every folder contains 2
subfolders which contain the training data and test data. Index name Description
Furthermore, the training data folder is divided into 2 more Image size 72dpi*72dpi
Total categories 2
such subfolders namely positive data which contains the
Total dataset size 2036
pictures of roads with potholes and negative data which Training dataset size 1384
consists of pictures of roads with no potholes. Figure.1 Test dataset size 652
shows examples of training data which contain positives and
negatives in the dataset and figure.2 shows examples of test B. Data Validation
data. The models will be tested on 652 images to derive the
loss and accuracy of the model for the test images. The loss
and accuracy will be compared to finally choose the result
model. The models we employed are explained in detail in
the next section.
III. THE PROPOSED POTHOLE DETECTION SOLUTION
A. You Only Look Once(YOLO) Algorithm
YOLO is an object detection algorithm which is popular
for detecting objects in images. This algorithm uses single
neural network to predict the vector of the bounding boxes
and potholes[10,11]. It works by splitting images into a grid
with size of ShS. Every cell in the grid can predict N
Figure 1 Examples of training data possible bounding boxes and the level of probability(i.e.
confidence score) of it being the object which in our case is a
pothole. This will give us S h S h N boxes. Figure.3
demonstrates the architecture of YOLO. Most of these boxes
will have a quite low probability, that’s why the algorithm
proceeds to delete the boxes that are below a certain
threshold of minimum probability. The rest of the bounding
boxes are then moved towards a non-max suppression to
remove all the duplicate objects. This paper uses YOLO V3,
the training of this model is done on full images and
Figure 2 Examples of test data probability of the class in the bounding boxes. This method
has a lot of benefits than the original methods for object
A. Data Preparation detection. The YOLO V3 model is very fast. A complex
To create the training data, we need labeled images as pipeline is not needed because YOLO V3 works on object
labeled images contain the position and name of the object to detection as a regression problem. The neural network needs
be classified in the model. The images are labeled by to be run on the new image whenever we need to make
creating a rectangular bounding box around the object predictions. Using the GPU, the 45fps are run and on faster
manually on all the training images. Finding the exact version runs with 150fps. This implies that real time video
position of these bounding box for all the training images can also be processed with latency as less as 25ms. YOLO
could be a tedious task. To overcome this, we will use an V3 looks at the image as a overall package before detecting
image labeling tool like Labelme or LabelImg. This tool and making predictions.
makes labeling the potholes easier as an object can be
labeled by just dragging a line across a pothole. Below are
the steps that were performed for data preparation:
Step 1:Generate dataset using LabelImg which converts
JPEG image file to XML with pothole labeled.
Step 2:Convert the XML file to CSV records which has Bounding boxes + confidence

image details.
Step 3:Convert this csv file to train.record which will be
S×S grid on input Final detections
used by the model as input.
Once all the images are labeled, a .xml is created for
every image which contains the top-left and bottom-right Class probability map
coordinates of the bounding box. These coordinates are then
fed to the model which will predict the potholes position. We Figure 3 YOLO V3 architecture

199

Authorized licensed use limited to: Cornell University Library. Downloaded on August 29,2020 at 16:31:47 UTC from IEEE Xplore. Restrictions apply.
Unlike the other methods like sliding window, YOLO V3 Location Loss (LL) : It is a parameter to measure how
looks at the entire image during the training as well as during far are the predicted bounded boxes from the actual bounding
the predictions. This allows the model to have all the boxes of the object.
contextual information about the classes of objects. If we
look at CNN, it sometimes classifies the background as an C. Histogram of Oriented Gradients(HOG) with Support
object because it is not able to see the picture as a whole. It is Vector Machine
unable to get larger context out of the picture. So the error The shape of the object is an important feature to
rate of YOLO V3 in terms of background errors is half of distinguish any object. HOG is an algorithm for feature
what is for CNN model. YOLO V3 learns the general extraction which distinguishes objects on the basis of their
structure of the object rather than cramming the exact shape. shapes. The histograms are calculated for each gradient
Due to this reason, the YOLO V3 can make good predictions orientation of the picture. Each image has different colors
on the natural photos which do not all have exact same shape and the intensity of colors vary in all of them. Gradient
of the object. This feature allows YOLO V3 to outperform orientation is the directional change in the color, intensity
many other superior algorithms. and other properties of the image. Given below are the steps
of how feature extraction is done using HOG:
B. Single Shot Detector(SSD) Algorithm
SSD which stands for single shot detector is another Step 1:Resize the image to a smaller size and keep all the
algorithm used in object detection. It is based on simple features preserved. This is needed so that the code could run
neural networks where the nodes do not form cycle, rather faster. The opencv function resize() can be used to achieve
the information only moves in forward direction. This this.
algorithm creates bounding boxes of fixed size and gives a Step 2:Convert the image to some particular colorspace
score to decide the presence of the object in the box. Next is where some specific information can be extracted. There are
the non-maximum compression step where the bounding box many different colorspaces like RGB, YUV, LUV, etc. For
which has the maximum overlap gets the highest score and example, we can vary the lighting and saturation of the
produces the detection of the object[12]. The localization and image and then train the system using those images to
classification are done as one single step.j It is similar to identify objects under shadow. This is usually done by HLS
YOLO in the sense that this model also divides the image color scheme.
into grids of equal sizes. Figure.4 below shows the
architecture of how SSD works and how it goes through the Step 3:Use function numpy.histogram() to create color
process of detecting an object. histogram of the image. It is the most important step as
histogram contains all the feature information.
Step 4:Use function hog() to achieve Histogram of
Oriented Gradients(HOG). Figure.4 below is an instance of
how HOG visualizes an image.

Figure 3 YOLO architecture

The main task of SSD is to match labels with default

boxes of different aspects as dashed rectangles. A number is
associated with every element of the feature map. Something
is considered a match if the IOU(intersection over union)
value of any default box crosses 0.5. The 4 by 4 box is
matched with the object dog and 8 by 8 box is matched with
the object cat. Objects are identified with the help of 6
Figure 4 HOG feature of a car
different feature maps. As seen in the figure.3 above, SSD
architecture does not consider the fully connected layers After the feature extraction is completed, we use
while using the VGG-16 architecture. We use VGG-16 SVM(Support Vector Machine) to train the classifier. SVM
because it provides high performance in classification of the performs classification by finding the hyperplane that
images. We add auxiliary convolutional layers to extract the maximizes the margin between two classes[3].
features in different scales and to reduce the size of input in
the consecutive layers. Loss function(LF) can be calculated D. Faster R_CNN
by Eq.(1), Alpha is the parameter used to balance the CNN is a deep learning algorithm which takes in an input
contribution of the location loss function. image, assigns importance to various aspects/objects in the
image and is able to differentiate one from the other. There
LF = CL + alpha * LL (1)
are very few pre-processing steps needed for CNN as
Two important components are put together by the loss opposed to other machine learning classification algorithms.
function (LF) which are : The convolutional network is able to successfully capture the
spatial and temporal dependencies in an image by applying
Confidence Loss (CL) : It is a parameter which defines relevant filters.
how confident is the network of the objectness of the
bounded box.

200

Authorized licensed use limited to: Cornell University Library. Downloaded on August 29,2020 at 16:31:47 UTC from IEEE Xplore. Restrictions apply.
Figure 5 CNN architecture

Figure.5 shows the process of the CNN algorithm. The The following morphological operations are applied on
first step is to provide an image. Then, parameters are chosen, images for size calculation:
padding is added, and filters are applied to that image. Next,
convolution is performed on the image. Pooling is also
performed in order to reduce the numbers of parameters.
Additional convolutional layers can be added if needed.
Then, the next step is to flatten the output and feed it to the
fully connected layer. The last step is to output the class[4,5].
Similar to CNN, Faster R-CNN is also used for
classification models. There are two main networks in R-
CNN. One is RPN, which is used to generate region
proposals. Another is a network that uses the region
proposals to detect objects. We choose to use R-CNN
because it uses RPN to generate fixed set regions and anchor
boxes for object detection. Furthermore, it has no Figure 6 Flow Chart of size calculation process
requirement for extensive data augmentation. Faster R-CNN
has faster speed. Edge Detection: the set of processes to identify points in
the image where the change in the brightness is sharp and
The Faster R-CNN model we used has 10 layers: 3 not continuous. These points are put together into a set of
convolutional layers, 3 max-pooling layers and 4 fully lines called edges.
connected layers. The purpose of the Convolutional Layers is
to reduce the dimensionality of the image for faster Dilation: remove the extra unwanted edges from the gray
processing and less complexity. The function of Pooling scale image. Figure.7 shows the process of dilation.
layer is to reduce the size of the image or amount of input
parameters for the next layer. The final detection network
takes input from both the previous layers and generate the
final bounding boxes and classify the images. This layer
consists of 4 fully connected layers. We need to give image
as an input to the CNN , then to SVM(Support vector
Machine) which helps in predicting the class for each
bounding box or region. Next, we need to optimize the
bounding boxes by training each bounding box separately. Figure 7 Dilation process
We need to handle differences in the image scale and the Erosion: shrink the objects in the gray scale image. The
aspect ratio, due to which CNN includes the concept of process of erosion is shown in figure.8 and the original
anchor boxes. There are 3 different sizes of the anchor boxes : image is figure.9.
128h128, 256h256 and 512h512. For the aspect ratio,
three different ratios are used: 1:1, 2:1, 1:2. This allows 9
possible boxes at each location which can also be named as
background or the object.
E. Size Calculation of Potholes
We convert the image to the gray scale image to get rid
of the noise. Some unwanted edges are also created in
process which is due to the shadows of trees and insufficient Figure 8 Erosion process
light. Other vehicles can also result in those extra edges.
One of the major problems in size calculation is that there
are many unconnected sharp edges and noise. These extra
edges can be removed by dilation process. Figure.6
demonstrates the process of the size calculation of potholes.
Figure 9 Original image

201

Authorized licensed use limited to: Cornell University Library. Downloaded on August 29,2020 at 16:31:47 UTC from IEEE Xplore. Restrictions apply.
Thresholding: the process to identify whether an edge is From the plot of loss Figure.13(a), we can see that the
present at a particular point in the image or not. If the model SSD has comparable performance on both train and
threshold is low, there will be more edges. test datasets (labeled test). If these parallel plots start to
depart consistently, it might be a sign to stop training at an
Closing: increase the boundary of bright regions in the earlier epoch. From the plot of accuracy Figure.13(b), we can
image without destroying the original shape. see SSD model could probably be trained a little more as the
After the bounding boxes are predicted, the cropped trend for accuracy on both datasets is still rising for the last
image of the pothole is then converted to a black and white few epochs. We can also see that the model has not yet over-
image as shown. The depth of potholes is defined by the learned the training dataset, showing comparable skill on
maximum number of black pixels in the vertical direction both datasets.
and is calculated by running a loop across all the columns
and finding the number of black pixels for each column. A
lot of white noise can be seen in figure.10 which could result
in miscalculation of the pothole. To avoid this, we used
several morphological techniques like closing techniques and
experimented with various kernel sizes and iterations as
shown above. The best result was obtained by using a 9 by 9
kernel with one iteration which is shown in figure.11.
Finding the actual size and depth of the potholes was the
most challenging task since even the potholes with same size
would appear differently according to the distance between
them and the camera. That is to say, the potholes that are
close to the camera would appear bigger against the potholes
that are at some distance from the camera. To overcome this (a) SSD loss vs epoch
issue we’ll be using Multiple Regression to predict the actual
height, width and depth of the pothole by providing the
calculated values(in pixels) as inputs.

Figure 10 After black and white conversion

(b) SSD accuracy vs epoch
Figure 13 SSD loss and accuracy

Figure.14 demonstrates the relevance between accuracy

and dataset size for HOG model. We can see the accuracy of
HOG is rising with the increase of dataset size. The accuracy
of SVM when we have to classify two classes is very high.
Also our dataset is not large enough, so we need an
Figure 11 Closing process algorithm which performs well with smaller datasets.
IV. SIMULATION RESULTS
A. Experimental results
This section shows the experimental results of four deep
learning models. Figure.12~15 are the results of YOLO V3,
SSD, HOG with support vector machine and Faster R-CNN
models based on dataset we chosen respectively.
As shown in the figure.12, YOLO V3 does not accurately
detect the entire pothole dimensions. The reason that YOLO
V3 did not perform well is because the model is known to
have trouble detecting small objects in an image. Figure 14 Accuracy vs dataset size for HOG model

Figure.15 shows the relevance between accuracy and

dataset size for Faster R-CNN model. Comparing with other
models, this model performs better in accuracy. However,
Faster R-CNN needs more computing power and its training
time is more than other models’ as it has two different
networks which need to be trained.

Figure 12 YOLO V3 pothole detection test image

202

Authorized licensed use limited to: Cornell University Library. Downloaded on August 29,2020 at 16:31:47 UTC from IEEE Xplore. Restrictions apply.
datasets from training. The results of the different models are
compared in the table below:
Table 2
TIME TAKEN TO TRAIN DIFFERENT MODELS

Size YOLO V3 SSD HOG Faster R-CNN

200 Images 3 hours 4 hours 2 hours 2 hours
650 Images 4 hours 5 hours 3.5 hours 2.5 hours
5.5
850 Images 4 hours 4 hours 3 hours
hours
1000 Images 4 hours 6 hours - 3.5 hours
Figure 15 Accuracy vs dataset size for Faster R-CNN model 1100 Images 4.5 hours - - 4 hours
8.5
B. Comparison of results of different models 1500 Images 5 hours
hours
- 6 hours
The SSD model has many feature layers along with the Table 3
base network which gives it higher accuracy but slower COMPARISON OF ACCURACY OF DIFFERENT MODELS
speed comparing to the YOLO V3 model. So if we talk about
speed then YOLO V3 outperforms both SSD and Faster R- Size YOLO V3 SSD HOG Faster R-CNN
CNN model. SSD has high accuracy but it is slow compared 200 Images 53% 47% 24% 72%
to other models like YOLO V3. Figure.16 gives the visual 650 Images 67% 59% 25% 71%
understanding of time vs accuracy trade-off: YOLO V3 is 850 Images 65% 55% 27% 67%
the fastest but has the lowest accuracy, SSD is balanced as it 1000 Images 69% 59% - 69%
has good speed and is very accurate. CNNs more are a little 1100 Images 73% - - 60%
less accurate than SSD and YOLO V3 but they are very slow 1500 Images 82% 80% - 74%
compared to YOLO V3 and faster than SSD. From the C. Model selection and justification
figure.17 which shows the performances of different models
based on the size of the pothole, we can see a very different We implemented the machine learning and deep learning
trend. If the size of the potholes are large then the models of YOLO V3, SSD, HOG and Faster R-CNN. Finally,
performance of SSD is similar to that of YOLO V3. The we have selected the YOLO V3 model as the best model for
difference in accuracy increases as the object size becomes our pothole detection system for the following reasons.
smaller. For the smaller objects, the performance of the YOLO V3 is the fastest model amongst the four models
YOLO V3 is the best and next comes SSD and then CNN. we chose and due to computational limitations, we needed a
model which has fast speed. Also, YOLO V3 is pretty decent
in terms of accuracy for all object sizes. It has good

<2/29
performance for large objects and is decent event for small
66' overlapping objects. From the aspect of time consuming, the
$FFXUDF\

GPU time required to train the YOLO V3 model was not too

high and was manageable when the model was trained on
HPC (High Performance Computing). Furthermore, we

evaluated the performance of the YOLO V3 model on a set
of 1000 road images, the learning rate for YOLO V3 was
0.01. F-1 score measures accuracy using the statistics
K K K K
7LPH precision and recall values. Precision is the ratio of true
positives to all the predicted positives. Recall is the ratio of
true positives to all the actual positives. The biggest
Figure 16 Time vs accuracy of different algorithms
advantage of the YOLO V3 algorithm is its superb speed. It
&11 can be used in real time as the processing speed is as fast as
66' 45 frames per second. The YOLO V3 we used has improved
<2/29
average precision so even the accuracy for detection for the

small objects improved greatly which was a big drawback
$FFXUDF\

with YOLO earlier. With the improvement in MAP, there

was a significant decrease in the localization errors. Due to
the addition of new feature pyramid, the predictions greatly
improved when the image was at different scale or aspect

6PDOO 0HGLXP /DUJH ratio.
2EMHFWVL]H
V. CONCLUSION AND SUGGESTION FOR FUTURE WORK
Figure 17 Accuracy vs size of the objects This paper proposes an efficient pothole detection system
Table 2 and table 3 show the mean average precision, using deep learning algorithms which can detect potholes on
frames per second and the GPU time needed by different the road with only a camera attached to the dash of a car and
models. It is clear that the training time needed for the SSD an internet connection. Four models are trained and tested
model is the highest. From comparing each model’s testing with preprocessed dataset, including YOLO V3, SSD, HOG
accuracy result, we can tell that adding more images actually with SVM and Faster R-CNN. We select available data and
increases the accuracy since the model will learn more then convert labeled image file to train.record which will be
used as input by the models. Hyper parameters is tuned for

203

Authorized licensed use limited to: Cornell University Library. Downloaded on August 29,2020 at 16:31:47 UTC from IEEE Xplore. Restrictions apply.
all four models and size calculation of potholes is considered [6] Bray J, Verma B, Li X, et al. A neural network based technique for
for more accurate detection results. Comparing the results of automatic classification of road cracks[C]//The 2006 IEEE
International Joint Conference on Neural Network Proceedings. IEEE,
all four models, the YOLO V3 model performed best with 2006: 907-912.
accuracy of 82%. The future work direction includes [7] Allen Zhang, Kelvin C. P. Wang, Baoxian Li, Enhui Yang, Xianxing
extending the detection object to broken drains and manhole Dai, Yi Peng, Yue Fei, Yang Liu, Joshua Q. Li, Cheng Chen.
covers and using images taken from moving vehicles in a Automated Pixel - Level Pavement Crack Detection on 3D Asphalt
realistic scenario. Surfaces Using a Deep-Learning Network, Computer-Aided Civil and
Infrastructure Engineering, vol. 00, pp. 1-15, 2017.
REFERENCES [8] Zhang L, Yang F, Zhang Y D, et al. Road crack detection using deep
convolutional neural network[C]//2016 IEEE international conference
[1] Anon, (2019). [online] Available at: https://www.pothole.info/the- on image processing (ICIP). IEEE, 2016: 3708-3712.
facts/
[9] Tedeschi A, Benedetto F. A real-time automatic pavement crack and
[2] Potholes Dataset, Google Drive. [Online]. Available at:
pothole recognition system for mobile Android-based devices[J].
https://drive.google.com/drive/folders/1vUmCvdW32lMrhsMbXdM Advanced Engineering Informatics, 2017, 32: 11-25.
WeLcEzOcuy.
[10] Chablani M. YOLO—You only look once, real time object detection
[3] Lin J, Liu Y. Potholes detection based on SVM in the pavement explained[J]. Towards Data Science [online].[cit. 2019-04-25].
distress image[C]//2010 Ninth International Symposium on Dostupné z: https://towardsdatascience. com/yolo-you-only-look-
Distributed Computing and Applications to Business, Engineering once-real-time-object-detection-explained-492dc9230006.
and Science. IEEE, 2010: 544-547.
[11] M. Hollemans, Real-time object detection with YOLO. [Online].
[4] Cha Y J, Choi W, Büyüköztürk O. Deep learning-based crack damage Available: http://machinethink.net/blog/object-detection-with-yolo/.
detection using convolutional neural networks[J]. Computer-Aided [Accessed: 13-Mar-2019].
Civil and Infrastructure Engineering, 2017, 32(5): 361-378.
[12] Liu W, Anguelov D, Erhan D, et al. Ssd: Single shot multibox
[5] Maeda H, Sekimoto Y, Seto T, et al. Road damage detection using
detector[C]//European conference on computer vision. Springer,
deep neural networks with images captured through a smartphone[J]. Cham, 2016: 21-37.
arXiv preprint arXiv:1801.09454, 2018.

204

Authorized licensed use limited to: Cornell University Library. Downloaded on August 29,2020 at 16:31:47 UTC from IEEE Xplore. Restrictions apply.

Mu Tybsc CS Syllabus 2023
No ratings yet
Mu Tybsc CS Syllabus 2023
63 pages
Pothole 2023
No ratings yet
Pothole 2023
12 pages
Pothole Detection Using Machine Learning
No ratings yet
Pothole Detection Using Machine Learning
6 pages
Chitale Et Al (2020)
No ratings yet
Chitale Et Al (2020)
6 pages
Paper Icmlas 2025
No ratings yet
Paper Icmlas 2025
5 pages
Drone Based Potholes Detection Using Machine Learning On Various Edge AI Devices in Real-Time
No ratings yet
Drone Based Potholes Detection Using Machine Learning On Various Edge AI Devices in Real-Time
5 pages
Real Time Pothole Detection
No ratings yet
Real Time Pothole Detection
6 pages
CPaper 12
No ratings yet
CPaper 12
6 pages
Pothole Detection Model For Road Safety Using Computer Vision and Machine Learning
No ratings yet
Pothole Detection Model For Road Safety Using Computer Vision and Machine Learning
8 pages
Ijsred V8i2p63
No ratings yet
Ijsred V8i2p63
6 pages
Classification of Different Size of Potholes Based
No ratings yet
Classification of Different Size of Potholes Based
25 pages
Application of Various YOLO Models For Computer Vi
No ratings yet
Application of Various YOLO Models For Computer Vi
14 pages
DeepBus: Machine Learning Based Real Time Pothole Detection System For Smart Transportation Using IoT
No ratings yet
DeepBus: Machine Learning Based Real Time Pothole Detection System For Smart Transportation Using IoT
6 pages
Pothole Detection and Dimension Estimation by Deep
No ratings yet
Pothole Detection and Dimension Estimation by Deep
14 pages
Pothole Severity Prediction Using Monocular Depth (3) (1) - 2
No ratings yet
Pothole Severity Prediction Using Monocular Depth (3) (1) - 2
15 pages
Heiwins Paper
No ratings yet
Heiwins Paper
5 pages
Image-Based Pothole Detection Using Multi-Scale Feature Network and Risk Assessment
No ratings yet
Image-Based Pothole Detection Using Multi-Scale Feature Network and Risk Assessment
22 pages
Classification of Paved Road and Unpaved Road
No ratings yet
Classification of Paved Road and Unpaved Road
5 pages
Pothole Detection Using Machine Learning
No ratings yet
Pothole Detection Using Machine Learning
5 pages
5 Object & Potholes Detection To Control Car Speed Using IOT Final Report
No ratings yet
5 Object & Potholes Detection To Control Car Speed Using IOT Final Report
49 pages
Result Paper
No ratings yet
Result Paper
8 pages
Intelligent Pothole Detection and Road Condition A
No ratings yet
Intelligent Pothole Detection and Road Condition A
8 pages
Rastogi 2020
No ratings yet
Rastogi 2020
6 pages
Pothole Detection and Geological Mapping Through Aerial Vehicle
No ratings yet
Pothole Detection and Geological Mapping Through Aerial Vehicle
6 pages
Real Time Detection of Road Anomalies: Group Members Guide
No ratings yet
Real Time Detection of Road Anomalies: Group Members Guide
12 pages
SURVEY ON POTHOLE DETECTION AND Complaint Management System Using Deep Learning
No ratings yet
SURVEY ON POTHOLE DETECTION AND Complaint Management System Using Deep Learning
3 pages
A Deep Learning-Based Pothole Detection System Using Unmanned Aerial Vehicle Images
No ratings yet
A Deep Learning-Based Pothole Detection System Using Unmanned Aerial Vehicle Images
31 pages
EasyChair Preprint 9151
No ratings yet
EasyChair Preprint 9151
13 pages
Smart System For Potholes Detection Using Computer Vision With Transfer Learning
No ratings yet
Smart System For Potholes Detection Using Computer Vision With Transfer Learning
9 pages
IEEE
No ratings yet
IEEE
5 pages
Tracking of Potholes and Measurement of Noise and Illumination Level in Roadways
No ratings yet
Tracking of Potholes and Measurement of Noise and Illumination Level in Roadways
6 pages
Kuch Bhi
No ratings yet
Kuch Bhi
5 pages
EDI Finall
No ratings yet
EDI Finall
15 pages
Ae - 01fe20bei007
No ratings yet
Ae - 01fe20bei007
4 pages
PotholePaper (1) - 1
No ratings yet
PotholePaper (1) - 1
5 pages
Project Feasibility Presentation Ver2
No ratings yet
Project Feasibility Presentation Ver2
13 pages
Detection of Road Cracks and Potholes Using IOT Device
No ratings yet
Detection of Road Cracks and Potholes Using IOT Device
6 pages
Real-Time Pothole Detection System: A Deep Learning Approach With SSD
No ratings yet
Real-Time Pothole Detection System: A Deep Learning Approach With SSD
9 pages
Focus (Pathole Detection
No ratings yet
Focus (Pathole Detection
7 pages
Paper 6
No ratings yet
Paper 6
6 pages
Fin Irjmets1683721244
No ratings yet
Fin Irjmets1683721244
8 pages
Sensors: An Automated Machine-Learning Approach For Road Pothole Detection Using Smartphone Sensor Data
No ratings yet
Sensors: An Automated Machine-Learning Approach For Road Pothole Detection Using Smartphone Sensor Data
23 pages
Jimaging 10 00227
No ratings yet
Jimaging 10 00227
22 pages
Deep Learning Enhanced Feature Extraction of Potholes Using Vision and Lidar Data For Road Maintenance
No ratings yet
Deep Learning Enhanced Feature Extraction of Potholes Using Vision and Lidar Data For Road Maintenance
9 pages
POT-YOLO Real-Time Road Potholes Detection Using Edge Segmentation-Based Yolo V8 Network
No ratings yet
POT-YOLO Real-Time Road Potholes Detection Using Edge Segmentation-Based Yolo V8 Network
8 pages
Jpeodx Pveng-1194
No ratings yet
Jpeodx Pveng-1194
18 pages
Pitfree: Pot-Holes Detection On Indian Roads Using Mobile Sensors
No ratings yet
Pitfree: Pot-Holes Detection On Indian Roads Using Mobile Sensors
6 pages
Applsci 11 03725
No ratings yet
Applsci 11 03725
16 pages
Bridging The Day-Night Gap in Pothole Detection Using Generative Models and Deep Learning-Based Object Detection
No ratings yet
Bridging The Day-Night Gap in Pothole Detection Using Generative Models and Deep Learning-Based Object Detection
4 pages
Expert Systems With Applications: Oche Alexander Egaji, Gareth Evans, Mark Graham Griffiths, Gregory Islas
No ratings yet
Expert Systems With Applications: Oche Alexander Egaji, Gareth Evans, Mark Graham Griffiths, Gregory Islas
7 pages
Android Pothole Detection System Using Deep Learning
No ratings yet
Android Pothole Detection System Using Deep Learning
3 pages
1 Sustainable Road Pothole Detection A Crowdsourcing Based MultiSensors Fusion ApproachSustainability Switzerland
No ratings yet
1 Sustainable Road Pothole Detection A Crowdsourcing Based MultiSensors Fusion ApproachSustainability Switzerland
23 pages
Smart IOT Based Pothole Detection and Filling System
No ratings yet
Smart IOT Based Pothole Detection and Filling System
6 pages
Review 3 Report
No ratings yet
Review 3 Report
17 pages
ProjectProposal TresMarias
No ratings yet
ProjectProposal TresMarias
3 pages
Pothole Detection Method
No ratings yet
Pothole Detection Method
6 pages
Analysis and Improvements On Current Pothole Detection Techniques
No ratings yet
Analysis and Improvements On Current Pothole Detection Techniques
4 pages
Yolov8 and Point Cloud Fusion For Enhanced Road Pothole Detection and Quantification
No ratings yet
Yolov8 and Point Cloud Fusion For Enhanced Road Pothole Detection and Quantification
13 pages
HEYOLOv5s Efficient Road Defect Detection Networkentropy 25 01280
No ratings yet
HEYOLOv5s Efficient Road Defect Detection Networkentropy 25 01280
16 pages
Booysen Det
No ratings yet
Booysen Det
10 pages
Machine Learning - Advanced Concepts
From Everand
Machine Learning - Advanced Concepts
Derrick Mwiti
No ratings yet
Active Learning For Data Streams A Survey
No ratings yet
Active Learning For Data Streams A Survey
48 pages
Plant Disease Detection and Classification Using Machine Learning Algorithm
No ratings yet
Plant Disease Detection and Classification Using Machine Learning Algorithm
6 pages
1694600937-Unit2.5 Support Vector Machine CU 2.0
No ratings yet
1694600937-Unit2.5 Support Vector Machine CU 2.0
26 pages
Detection of Rooftop Regions in Rural Areas Using Support Vector Machine
No ratings yet
Detection of Rooftop Regions in Rural Areas Using Support Vector Machine
5 pages
BTAIML10 Major Project Report
No ratings yet
BTAIML10 Major Project Report
25 pages
Sobrang Pahirap Na Sa Buhay
No ratings yet
Sobrang Pahirap Na Sa Buhay
31 pages
Slides Chap5 KernelMethods
No ratings yet
Slides Chap5 KernelMethods
24 pages
Predictive Model For Diabetes Using Machine Learning
No ratings yet
Predictive Model For Diabetes Using Machine Learning
38 pages
Twitter Data Preprocessing For Spam Detection: Myungsook Klassen
No ratings yet
Twitter Data Preprocessing For Spam Detection: Myungsook Klassen
6 pages
1 s2.0 S0208521620300590 Main
No ratings yet
1 s2.0 S0208521620300590 Main
8 pages
Master Thesis Support Vector Machine
100% (3)
Master Thesis Support Vector Machine
5 pages
Teaching Data Science Through Storytelling
No ratings yet
Teaching Data Science Through Storytelling
15 pages
Historical and Modern Features For Buddha Statue Classification
No ratings yet
Historical and Modern Features For Buddha Statue Classification
8 pages
Cybersecurity of Autonomous Vehicles A Systematic Literature Review of Adversarial Attacks and Defense Models
No ratings yet
Cybersecurity of Autonomous Vehicles A Systematic Literature Review of Adversarial Attacks and Defense Models
21 pages
Unit - 2 ML
No ratings yet
Unit - 2 ML
32 pages
Handbook of HydroInformatics: Volume I: Classic Soft-Computing Techniques 1st Edition - Ebook PDF PDF Download
100% (11)
Handbook of HydroInformatics: Volume I: Classic Soft-Computing Techniques 1st Edition - Ebook PDF PDF Download
81 pages
Machine Learning With Python
No ratings yet
Machine Learning With Python
3 pages
Medical Image Analysis
No ratings yet
Medical Image Analysis
9 pages
Lavender Presentation Main One
No ratings yet
Lavender Presentation Main One
21 pages
IITG Credit Linked DS
No ratings yet
IITG Credit Linked DS
10 pages
Lung Cancer Project
No ratings yet
Lung Cancer Project
92 pages
Autism BASE
No ratings yet
Autism BASE
9 pages
WIREs Data Min Knowl - 2020 - Wang - Knowledge Discovery From Remote Sensing Images A Review
No ratings yet
WIREs Data Min Knowl - 2020 - Wang - Knowledge Discovery From Remote Sensing Images A Review
31 pages
FAM 101 Fundamentals of Analytics Modeling
No ratings yet
FAM 101 Fundamentals of Analytics Modeling
47 pages
1 s2.0 S092702562400048X Main
No ratings yet
1 s2.0 S092702562400048X Main
13 pages
I Wander
No ratings yet
I Wander
4 pages
Integration of Facial Thermography in EEG-based Classification of ASD IJAC - 2020
No ratings yet
Integration of Facial Thermography in EEG-based Classification of ASD IJAC - 2020
18 pages
4.machine Learning-Enabled Identification of New Medium To High Entropy Alloys With Solid Solution Phases
No ratings yet
4.machine Learning-Enabled Identification of New Medium To High Entropy Alloys With Solid Solution Phases
9 pages
Proceedings of The International Conference On Signal Networks, Computing and Systems
No ratings yet
Proceedings of The International Conference On Signal Networks, Computing and Systems
336 pages

Paper 3

Uploaded by

Paper 3

Uploaded by

2020 IEEE Sixth International Conference on Big Data Computing Service and Applications (BigDataService)

A Deep Learning Approach for Street Pothole

978-1-7281-7022-0/20/$31.00 ©2020 IEEE 198

Figure 3 YOLO architecture

The main task of SSD is to match labels with default

Figure 10 After black and white conversion

Figure.14 demonstrates the relevance between accuracy

Figure.15 shows the relevance between accuracy and

Figure 12 YOLO V3 pothole detection test image

Size YOLO V3 SSD HOG Faster R-CNN

with YOLO earlier. With the improvement in MAP, there

You might also like