A Deep Learning-Based Pothole Detection System Using Unmanned Aerial Vehicle Images
A Deep Learning-Based Pothole Detection System Using Unmanned Aerial Vehicle Images
Article
PDS-UAV: A Deep Learning-Based Pothole Detection System
Using Unmanned Aerial Vehicle Images
Ohoud Alzamzami 1, * , Amal Babour 2, * , Waad Baalawi 1 and Lama Al Khuzayem 1
Abstract: Smart cities utilize advanced technologies to enhance quality of life by improving urban
services, infrastructure, and environmental sustainability. Effective pothole detection and repair
strategies are essential for improving quality of life as they directly impact the comfort and safety
of road users. In addition to causing serious harm to residents’ lives, potholes can also cause costly
vehicle damage. In this study, a pothole detection system utilizing unmanned aerial vehicles, called
PDS-UAV, is developed. The system aids in automatically detecting potholes using deep learning
techniques and managing their status and repairs. In addition, it allows road users to view an overlay
of the detected potholes on the maps based on their selected route, enabling them to avoid the
potholes and increase their safety on the roads. Two data collection methods were used, an interview
and a questionnaire, to gather data from the target system users. Based on the data analysis, the
system’s requirements, design, and implementation were completed. For the pothole detection, a
deep learning model using YOLOv8 was developed, which achieved an overall performance of 95%,
98%, and 92% for F1 score, precision, and recall, respectively. Different types of testing has been
performed on the target users to ensure the system’s validity, effectiveness, and ease of use, including
unit testing, integration testing, and usability testing. As a future work, more features will be added
to the system in addition to improving the deep learning model accuracy.
Citation: Alzamzami, O.; Babour, A.;
Baalawi, W.; Al Khuzayem, L.
Keywords: smart city; intelligent transportation systems; pothole detection; deep learning; YOLOv8
PDS-UAV: A Deep Learning-Based
Pothole Detection System Using
Unmanned Aerial Vehicle Images.
Sustainability 2024, 16, 9168. https://
doi.org/10.3390/su16219168 1. Introduction
Improving the quality of life is one of the main priorities for modern urban develop-
Academic Editors: Jun Qin and
Hong Tang
ment. Maintaining the comfort and safety of drivers and passengers on the roads is crucial
for improving life quality. By 2030, almost 60% of the world’s population is predicted to
Received: 17 July 2024 reside in cities. Given that millions of people depend on the road infrastructure for their
Revised: 14 October 2024 daily transportation, road safety is critically important. This implies that the demand for
Accepted: 15 October 2024 transportation would surpass the capacity of most present transportation systems, placing
Published: 22 October 2024
a high load on the road infrastructure in these cities [1].
Ensuring that roads are maintained in good condition is critical for the safety and
comfort of citizens. However, poor surface quality, aging infrastructure, heavy traffic,
Copyright: © 2024 by the authors.
and natural events such as earthquakes and heavy rainfall can lead to significant road
Licensee MDPI, Basel, Switzerland. damage, including potholes and cracks [2]. Besides being detrimental to comfort and
This article is an open access article leading to costly vehicle damages, potholes and other road anomalies can be very danger-
distributed under the terms and ous, potentially causing accidents with serious injuries or fatalities. Thus, addressing this
conditions of the Creative Commons problem in a timely manner is essential for safeguarding drivers and passengers.
Attribution (CC BY) license (https:// In Saudi Arabia, road traffic accidents account for about 5% of all fatalities [3]. Over the
creativecommons.org/licenses/by/ past decade, the rate of road deaths has risen from 17.4 to 24 per 100,000 people [3]. Addi-
4.0/). tionally, Saudi Arabia experiences 19 fatalities and 4 injuries due to automobile accidents
every hour, underscoring the significant public health risk posed by these incidents [3].
One of the primary elements leading to accidents and fatalities is the road infrastructure,
which should be designed and maintained to eliminate or decrease hazards for all road
users [1].
The current system used in Saudi Arabia for pothole detection is managed by the Saudi
municipalities. The system requires that employees at the road maintenance department
manually scan the roads, inspect the potholes, estimate the danger, and report the detected
potholes. Thus, the reporting is subjective, relying on the employees’ expertise, and it is also
opportunistic, depending on the potholes encountered during inspections. Additionally,
this task can be labor-intensive, requiring many employees to cover extensive urban areas.
It is particularly challenging in congested streets, posing a significant threat to the safety
of the employees. Alternatively, drivers or citizens can report potholes. However, this
depends on their willingness to take pictures of the potholes and upload them using a web
portal or mobile application, a task that can be time-consuming and inconvenient for some
drivers [4]. Moreover, the current system does not include any mechanism for informing
drivers about nearby potholes.
Several research approaches have been proposed to automatically analyze road con-
ditions for identifying potholes and other road anomalies [5]. These approaches can
be categorized into two main categories, vision-based, and vibration-based [6]. While
vibration-based approaches for road condition monitoring are cost-effective, easier to in-
stall, and more reliable under various environmental conditions, vision-based systems
provide extensive and contextual information on road conditions. This information can
lead to more comprehensive maintenance strategies and enhanced road safety.
This paper proposes a system that automates pothole detection using deep learning.
The proposed system, PDS-UAV, uses unmanned aerial vehicles (UAVs), also known as
drones, to capture images of potholes, providing a wide sensing range compared with
drivers’ smartphones or in-vehicle cameras. Images are then uploaded to the system for
analysis and classification using a deep neural network detection algorithm, specifically
You Only Look Once (YOLO), known for its accuracy and speed. Thus, the proposed
system allows for easier pothole detection without relying on the chance of encountering
potholes or individuals’ perceptions and viewpoints.
The main contribution of this paper is the development of a pothole detection and
management system that utilizes a deep learning model for detecting potholes from images
captured by UAVs. Initially, a deep learning model for pothole detection is trained using a
publicly available dataset. The model is then tested using a dataset collected for Jeddah
city in Saudi Arabia to confirm its accuracy and reliability in identifying potholes by using
cross-city testing as a method for cross-domain generalization. Additionally, two web
applications are developed: one for the road maintenance employees (RMEs) to manage
the repairs of the detected potholes, and another is used to inform drivers about nearby
potholes based on their routes by utilizing map annotation.
The remainder of the paper is organized as follows: Section 2 reviews various ap-
proaches proposed in the literature for pothole detection using smartphones, in-vehicle
cameras, or UAV images. Section 3 introduces the proposed PDS-UAV system. Section 4
elaborates on the development methodology of PDS-UAV. The PDS-UAV deep learning
model for pothole detection is presented in Section 5. Section 6 covers the testing of the
PDS-UAV system. Section 7 presents the discussion, and Section 8 concludes the paper.
2. Related Work
Different vision-based approaches have been proposed in the literature for pothole
detection. These approaches can be classified based on the technology used for capturing
images into approaches that use UAV cameras and approaches that use smartphones or
in-vehicle cameras, which are discussed in the following subsections. Table 1 compares the
reviewed pothole detection approaches. Approaches that utilize UAV images are depicted
Sustainability 2024, 16, 9168 3 of 30
at the beginning of the table, and other approaches, using smartphones or in-vehicle
cameras, are presented in the second part of the table.
abnormalities. The results of this study demonstrated that the error rate of the proposed
algorithm was reduced by approximately 13%.
In [15], the authors proposed an approach for pothole detection using entropy thresh-
olding segmentation combined with a sigmoid calibration function embedded in a mul-
tilayer perceptron (MLP) neural network. The used dataset includes a total of 500 sam-
pled pixels of a pothole surface. The method achieved the highest area under the curve
(AUC) value of 0.71, indicating improved accuracy in pothole detection compared with
other approaches.
In [16], a multi-agent system architecture that utilizes drone images for pavement
monitoring was built. The authors used the YOLOv4-tiny algorithm because of its time and
cost effectiveness, which makes it suitable for real-time detection. A drone camera was used
to collect a dataset of 1362 images. The drone flew in public locations with a speed range
of 15–25 km/h and at a height of 50–70 m. The authors considered three different types
of users: drone pilots, road maintenance agents, and road monitoring agents. The system
showed good performance with a precision of 96.54% and a mAP of 98.45%, as reported by
the best experiments’ results.
In [17], four supervised learning algorithms, K-nearest neighbor (KNN), support vec-
tor machine (SVM), artificial neural network (ANN), and random forest (RF), were used to
detect potholes and cracks on the roads from high-resolution UAV images. Additionally,
to extract potholes and cracks from images, a multi-resolution segmentation (MS) algo-
rithm was used. Furthermore, the gray-level co-occurrence Matrix (GLCM) was also used
to measure the variations between the distressed and non-distressed areas (damage-free
pavement). The dataset consisted of 1430 sample images, containing three different classes:
221 images of potholes, 678 images of cracks, and 531 images of undamaged roads. The run-
ning times of all four algorithms on the same PC were recorded to compare their perfor-
mance. The overall accuracy of the KNN, SVM, ANN, and RF algorithms were 98.81%,
98.95%, 98.81%, and 98.46%, respectively.
The authors in [21] used an edge detection classification method to detect potholes
from images captured using a smartphone camera. The method included three phases,
which are: preprocessing of images, extraction of features for road damage, and classifica-
tion of road damage. To reduce the computation cost, RGB (red, green, and blue) images
were converted to gray scale. The YOLO algorithm was used for the detection and classifi-
cation phases. When there is only one pothole in the images of the dataset, the results were
77.86% for accuracy and 83.45% for precision. However, when there are multiple potholes
in the images or potholes in addition to other road anomalies, the results of the average
accuracy of the model is about 74% and the average precision is about 75%.
A prototype for pothole detection and intelligent driving behaviors in autonomous
vehicles was proposed in [22]. The dataset in this study consisted of 783 images captured
by smartphone and Pi cameras installed on vehicles. The proposed system consisted of
three modules: a module for detecting potholes, a module for data processing, and an
autonomous-vehicle-system module. A convolutional neural network (CNN) model was
used for pothole detection. The model performance achieved values of 99.02% and 98.03%
for accuracy and precision, respectively.
In [23], a large-scale dataset of 18,345 road damage images was used. The images
were captured from a smartphone camera mounted on a vehicle. They were obtained from
several public datasets and augmented with crowdsourced images. Different object detec-
tion models were trained, which were: MobileNet, RetinaNet, and local binary patterns
(LBPs) object detectors. The RetinaNet model is an AI-based object detection model used
on smartphones, which was improved to suit the problem of road damage detection. It
achieved the highest accuracy among the other compared models with a percentage of
98.23%. The authors concluded that their improved RetinaNet model outperformed the
other models because it is more efficient and takes less memory, which is good for small
devices like smartphones.
The authors in [24] used a smartphone camera installed inside a vehicle to build a
low-cost and large-scale dataset of 9053 images. The collected images contained different
weather and lightning conditions. The images were divided into different types of damages,
and they were annotated manually. The single-shot multibox detector (SSD) was used to
train the model by utilizing two frameworks: Inception V2 and SSD MobileNet. These
frameworks can run on limited computational resource devices, such as smartphones,
while maintaining acceptable accuracy. The best results for the class containing the potholes
were obtained for the SSD MobileNet algorithm with an accuracy of 95% and a precision
of 99%. However, the pothole class in this study includes other road damages, which are
rutting, bump, and separation.
In summary, the reviewed pothole detection approaches utilize machine learning
or deep learning for pothole detection from either UAV images or smartphones and in-
vehicle cameras. The proposed system, PDS-UAV, advances pothole detection using a
state-of-the-art deep learning model, particularly YOLOv8, for accurate identification
from UAV images. YOLOv8 architecture includes multiple convolutional layers that
automatically identify relevant features such as edges, textures, and shapes associated
with potholes. It is known for its speed and accuracy, crucial for timely maintenance and
safety precautions. Using cross-city testing, the ability of the proposed system model to be
generalized across domains is demonstrated. This highlights the model’s high accuracy
on datasets from different regions, setting it apart from prior approaches that typically
focused on localized datasets. Additionally, the proposed system includes a web-based
application for monitoring, reporting, and viewing detected potholes, helping to streamline
the process for road maintenance employees and road users. Thus, PDS-UAV not only
enhances detection accuracy but also offers significant operational benefits by automating
data collection and analysis, reducing the need for manual inspections and increasing
overall road safety for drivers and passengers.
Sustainability 2024, 16, 9168 6 of 30
The system is operated as follows: the UAV captures video of the road surface,
and frames (images) are extracted from this video. Then, the extracted images are uploaded
to the system through the RME’s web application. The deep learning model performs the
classification for pothole detection from the uploaded images. Classified images of potholes
are stored in the system’s database along with their locations and relevant information.
RMEs retrieve detected potholes’ information and view them on a map. Additionally, they
can update the status of these potholes based on the progress of their repair. On the other
hand, RUs retrieve potholes’ information and locations based on their routes by overlaying
Sustainability 2024, 16, 9168 8 of 30
the pothole locations on the map and displaying information about the number of detected
potholes on their routes, thereby allowing RUs to avoid these potholes and preventing their
impact on safety or vehicle condition.
Responding to the question about having a new system that detects potholes using
AI and UAV images, the interviewee revealed that incorporating such a system would be
highly beneficial. It was explained that such a system would aid employees by allowing
them to cover larger areas of the road network quickly and safely, especially in big cities
and congested main streets. This would accelerate pothole reporting and repairs. Moreover,
using AI would help to minimize human errors associated with the current manual system.
Furthermore, the interviewee expressed strong support for adding a feature that allows
road users to view the locations of detected potholes in order to avoid them. A mobile
interface for locating potholes would be particularly advantageous for road users, providing
contextual information to enhance their travel safety.
The survey data reveals that potholes are a significant concern for drivers, with 88.2%
reporting that potholes cause damage to their vehicles and 99% believing that potholes
can lead to accidents. Nearly 80% of the participants had recently encountered potholes,
and 83.2% expressed interest in an application to view nearby potholes. Additionally, 75%
of participants felt that they would benefit from such an application, with 59% preferring
a mobile interface and suggesting features like ease of use, the ability to comment on
potholes, warning signs, and reporting capabilities to the municipality.
Sustainability 2024, 16, 9168 10 of 30
Sign Up
Login
Allow location
access
Insert pothole
image and location Annotate route with View potholes on the
<<extend>> Road User
detected potholes route
(Driver or Walker or Bicyclist)
View potholes
location on the map
Road
Maintenance
Department
Employee
Update
(Authorized User)
pothole status
Logout
Image
acquisition Retrieve the pothole
image and location
Authentication
Image
preprocessing yes Modify pothole status
check
Image
classification
classification
valid
Pothole
detected
invalid
no yes view potholes on the
map
Modify
potholes
status
no
Logout
In Figure 4, the user is asked to allow location access. If the user allows location access,
the pothole locations are retrieved from the database. The RUs enter their starting and
destination points. The system then displays a map showing the users’ route annotated
with all the detected potholes.
UAV Pilot
Control
Images/Location
Detect
Store
The RMEs log in and retrieve pothole information from the database, which can be
viewed on the map or in the list view. In addition, they update the pothole statuses based
on their repairs. The application controller enables the communication between the web
app GUI and the database, as seen in Figure 6.
Road Maintenance
Employee Log in
Update
Alternative
Update
[if pothole
repaired]
Figure 7 illustrates the sequence diagram for the RUs. The user begins by being asked
to allow their location and entering the starting and destination points. If the user allows
location access, the pothole locations are retrieved from the database. After retrieving the
pothole locations, the system informs the user about detected potholes by annotating the
map with the potholes’ locations.
Alternative
Map Annotation
[if pothole]
The welcome page shown in Figure 8 appears when the users enter the website.
On this page, multiple options are displayed for the user to choose from. The user can view
the system’s goals, sign up as RME, or use the system services as a road user.
On the sign-in page, shown in Figure 10, the RMEs enter their already registered
e-mails and passwords to gain access to their accounts and use the application’s services,
which include adding new potholes information and updating the status of stored potholes.
If the entered email and password match a registered user in Firestore, the user is authenti-
Sustainability 2024, 16, 9168 15 of 30
cated and granted access to the application. Otherwise, the user is denied from accessing
the system.
Figure 11 shows the main home page of the RUs web application. The upper part
shown in Figure 11a includes statistics about the detected potholes, categorizing them into
those that are waiting for repair, in-progress, or already repaired. Additionally, it features
a map displaying the locations of detected potholes. This map is implemented using the
Google Maps application programming interface (API), which is integrated into the system.
To embed a Google map on the website, a JavaScript is used to interact with the Google
Maps API. The API provides the necessary functions and methods to display the map,
add markers, handle user interactions, and customize the map’s appearance and behavior.
In addition, CSS is used to style the map container and ensure it fits correctly within the
website’s layout. Potholes’ coordinates are retrieved from Firestore and represented using
markers on the map. Users can update pothole information by clicking on each marker.
This interactive functionality enable users to easily manage and monitor pothole statuses
directly from the home page.
The lower part of the home page, shown in Figure 11b, presents an HTML table,
displaying the information about potholes currently stored in the Firestore database. This
table helps present pothole information in a structured and organized manner, facilitating
easy access and updates by the RMEs. The table includes details such as the pothole’s
location, status, and relevant notes, allowing RMEs to efficiently manage and monitor the
progress of pothole repairs. JavaScript is used to retrieve the potholes’ information from
Firestore and create a table dynamically based on the retrieved data. This combined view
of the map and table ensures that users have a comprehensive overview of the potholes
and their statuses, enhancing the functionality and user experience of the home page.
Figure 12 shows the interface for adding new pothole information. RMEs add new
pothole information by entering the coordinates and district and uploading an image.
When the user enters the pothole information and presses the predict button to classify
the image using the deep learning model, the classified image will appear as shown in
Figure 13, with the pothole location marked on that image. The detected pothole informa-
tion is stored in the database for further retrieval and processing. These potholes are then
displayed on the map using markers and are presented in the potholes table for easy access
and management.
Sustainability 2024, 16, 9168 16 of 30
In Figure 14, the pothole update page is shown. On this page, the user can change
the pothole information. This page allows RMEs to update the details of existing potholes,
including coordinates, district, image, and status. When the user clicks the update button,
the potholes updates are saved to the database.
Figure 14. Updating the pothole status page of the web application.
Sustainability 2024, 16, 9168 18 of 30
and images were taken from a distance of 60 m above the ground. The UAV flight time was
34 min.
Cross-domain generalization [25], specifically cross-city testing [26,27], is utilized to
assess how well the developed deep learning model, trained on the Spain dataset [16],
performs on detecting potholes from unseen images collected from Jeddah, Saudi Arabia.
Thus, in this study, the testing data were collected from Saudi Arabia using a DJI Mini
3 Pro quadcopter UAV, which is manufactured by DJI (Da-Jiang Innovations), a Chinese
technology company based in Shenzhen, Guangdong. The DJI Mini 3 Pro quadcopter
features 4K video capture resolution and an embedded GPS in the remote controller. Two
UAV trips were conducted to collect the necessary images. Each trip lasted about 30 min.
The total number of collected images was 59. Figure 16 shows a sample of the dataset
collected from Saudi Arabia versus the dataset collected from Spain.
Figure 16. Pothole image samples: (a) Jeddah, Saudi Arabia. (b) Spain.
consisted of 111 images. Figure 18 illustrates various augmented versions of the same
original image.
Figure 18. Augmentation: (a) Original image. (b) Noise, stretching, and flipping. (c) Noise and flipping.
versions of YOLO. The backbone is a pre-trained CNN responsible for extracting useful
features from the input image. The neck is another component that connects the backbone
to the head. It merges the feature maps using path aggregation blocks like the feature
pyramid network (FPN) and then passes them onto the head. The head is responsible for
classifying objects and predicting bounding boxes based on the features provided by the
backbone and neck.
The detection process is illustrated using a series of steps as shown in Figure 20.
The detection process begins by taking the input image and passing it through the fea-
ture extraction phase, which enhances detection efficiency by reducing data redundancy.
The CNN layers are responsible for detecting potholes by computing the probability of an
image containing a pothole based on the learned patterns from the training dataset. Each
layer contributes to the classification decision. The final decision of the deep learning algo-
rithm confirms the detection. Ultimately, the output classifies an image as either containing
a pothole or not.
The publicly available dataset discussed in the previous section was used to train the
deep learning model, while the collected and annotated dataset from Jeddah, Saudi Arabia,
was used for testing with the aim of cross-domain generalization. Different versions of
the YOLO algorithm, YOLOv4-tiny, YOLOv5, and YOLOv8, were evaluated to determine
which model produced the best results. The deep learning model for PDA-UAV was
developed using Python on Google Colab.
Figure 21. Random sample for the deep learning model testing results.
The various tested versions of the YOLO algorithm produced acceptable results.
Table 3 outlines the experiments’ results with varying epoch number in each experiment.
Confusion matrix metrics, including precision, recall, and F1 score, were utilized to evaluate
the performance of the model. These metrics were based on counts of true positives (TPs),
true negatives (TNs), false positives (FPs), and false negatives (FNs). TP represents the
correct detection of potholes when there is a pothole, FP represents the detection of a
pothole when there is no pothole, FN represents no detection of a pothole when there is a
pothole, and TN represents the correct non-detection of potholes when there is no pothole.
Precision, reflecting the accuracy of detected potholes, is calculated using Equation (1).
Recall, which assesses the model’s ability to identify potholes across various road segments,
is computed using Equation (2). The F1 score, which balances precision and recall, is
calculated using Equation (3).
TP
Precision = , (1)
TP + FP
TP
Recall = , (2)
TP + FN
2 × Precision × Recall
F1 = , (3)
Precision + Recall
Based on the results presented in Table 3, YOLOv8 yielded the best performance with
a precision of 98%, recall of 92%, and F1 score of 95%. Therefore, it was selected to be
deployed in the proposed PDS-UAV system. The confusion matrix shows the correla-
tion between the predicted and actual classes using YOLOv8 for the different number of
epochs, with zero representing potholes and one representing non-potholes, as depicted in
Figure 22.
6. PDS-UAV Testing
To ensure the quality and effectiveness of the PDS-UAV system, different types of
software testing were utilized, which include unit testing, integration testing, and usability
testing. Each of the performed testing techniques is discussed in detail in the following
subsections to ensure that every feature in the application works effectively and that the
system has achieved its goals.
Test ID Test Case Description Input Data Expected Results Actual Results Pass or Fail
Tests if the user enters an Web app will show this
Email:
1 existing email on the message: As expected Pass
[email protected]
sign-up page “email-already-in-use”
Test if the user inputs Web app will show this
2 incomplete information for Email: Not provided message: “Enter valid As expected Pass
signing up email”
Web app will show this
Test if the user inputs
3 Email: samaher@gmail message: “Invalid email As expected Pass
wrong information
address”
Web app will show this
message: “Password must
Test if the user enters a
4 Password: 12345 be composed of characters As expected Pass
wrong password
and numbers and must be
at least 6 characters”
Web app will show this
Test if the user enters a
message: “the name
5 wrong first name or last First Name: samaher3 As expected Pass
should only contain
name format
alphabets”
Web app will show this
Test if the user enters a
6 phone number: 55703456 message: “Phone number As expected Pass
wrong phone number
must be 9 numbers“
Test ID Test Case Description Input Data Expected Results Actual Results Pass or Fail
Test if the user enters the Web app will show this
wrong location message: “the district
1 District: 12345 As expected Pass
information for the should only start with
porthole alphabets”
Web app will show this
Test if the user enters a
2 Longitude: Not Provided message: “Longitude Field As expected Pass
valid longitude
is required”
Web app will show this
Test if the user enters a
3 Latitude: 0.26333 message: “Please enter a As expected Pass
valid Latitude
valid Latitude”
Web app will show this
Test if the user uploaded a
4 No Image is uploaded message: “Please upload As expected Pass
pothole image
the pothole image”
Test ID Test Case Objective Test Case Description Expected Results Actual Results Pass or Fail
If the user clicks on the
Test the link between
sign-up on the welcoming The sign-up page will be
1 welcoming page and As expected Pass
page, the sign-up page will shown.
sign-up page.
be displayed.
If the user clicks on log-in
Test the link between
on the welcoming page, The log-in page will be
2 welcoming page and As expected Pass
the log-in page will be shown.
log-in page.
displayed.
If the user enters the
Test the link between the The home page will be
correct email and
3 log-in page and home displayed if the entered As expected Pass
password, the home page
page. credentials are correct
will be displayed.
The map is displayed in
Test the interface link The map with clickable
the center with markers
4 between the homepage potholes markers will be As expected Pass
showing detected
and the map. displayed
potholes.
If the user clicks on a The user will be directed
Test the interface link
specific marker on the to the pothole update page
5 between the marker on the As expected Pass
map, the pothole update for the selected marker on
map and the update page.
page will be displayed. the map.
Test the interface link If the user clicks on upload
between the homepage pothole on the homepage, The user will be directed
6 As expected Pass
and the pothole upload the upload pothole page to the pothole upload
page. will be displayed.
If the user fills in pothole
Test the interface link information, click on The user will be directed
between the pothole upload, and if the upload to the home page if
7 As expected Pass
upload page and the is successful, uploading the pothole
homepage. the homepage will be information is successful.
displayed.
If the user uploads a
Test the interface link The model prediction
pothole image and then
between the pothole result of the uploaded
8 clicks ’Predict,’ the As expected Pass
upload page and the pothole image will be
prediction results will be
model prediction page. displayed.
displayed.
1. Sign up: The user signs up to the system by filling in the required information, such
as first name, last name, email, password, and phone number.
2. Sign in: The user signs in to the system using email and password.
3. Add new pothole: The user fills in the required information to add a new pothole in the
add new pothole page. Information includes latitude, longitude, district, and image of
the pothole, where the needed information, coordinates of the pothole, and its image
are provided to the user before starting the test.
4. Update pothole status with ID number 1 to repaired: The user selects the edit option
from the potholes table at the bottom of the home page and updates the pothole status
to “repaired” in the update pothole status page.
5. Sign out: The user signs out of the system by pressing the sign-out button displayed
in the header.
6. View potholes: The user enters the starting and destination points of the intended
route and presses on the show route button to view detected potholes on the se-
lected route.
During the usability testing, the task completion time for each participant was recorded.
Additionally, participants were asked to rate the easiness of each performed task on a scale
from 1 to 5, provide comments on the task, and specify any aspects they found confusing.
This comprehensive approach ensures that quantitative and qualitative data are collected,
allowing for a thorough analysis of usability and user satisfaction.
To ensure task success in the usability study, clear criteria should be established based
on two parameters: completion within the specified time range and achieving a satisfaction
score. Task success criteria are defined as follows:
• Time range completion: Participants must complete each task within the specified time
range. The targeted average times of each task were set in the planning of usability
testing based on a pilot test performed with a small sample of representative users.
The target times were set as follows: sign-up task = 120 s–180 s, sign-in task = 30 s–60 s,
add new pothole task = 90 s–120 s, update pothole status with id number 1 to repaired
= 100 s–130 s, view potholes on a route = 90 s–120 s, sign out = 5 s–20 s.
• Rating of task easiness: Participants must rate the easiness of the task as at least 4 out
of 5 on a satisfaction scale.
Table 7 shows the time taken by each participant to complete each task, the rating of
each task’s easiness, and the average of times and rates for each task.
Update
Participant Add New View
Sign Up Sign In Pothole Sign Out Task Rating
No. Pothole Potholes
Status
1 120 s 45 s 90 s 90 s 5s 80 s 4.8
2 240 s 60 s 120 s 50 s 8s 110 s 4.4
3 180 s 31 s 182 s 95 s 15 s 112 s 3.8
4 64 s 15 s 53 s 75 s 2s 133 s 4.0
5 103 s 42 s 133 s 82 s 11 s 97 s 4.7
Average 141.4 s 38.6 s 115.6 s 78.4 s 8.2 s 106.4 s 4.3
In general, the average time taken by the five participants to complete each task was
within the targeted time. Additionally, the results show that the participants’ average rating
of task easiness was 4.3 out of 5, indicating overall satisfaction with the system’s ease of
use. Overall, the usability testing showed that PDS-UAV’s design was easy to use and
was well accepted by its target users. Some participants also provided further comments
and suggestions, such as placing the pothole table on the same screen as the pothole map,
Sustainability 2024, 16, 9168 27 of 30
clarifying the pothole status on the ‘update’ pothole screen, and providing the interface in
other languages, such as Array. Based on the users’ comments, the system was updated.
For example, in the update pothole status page, the confusion about the current status field
of a pothole was solved by moving the drop-box of status to a more recognizable place on
the screen. Other users’ comments will be addressed in future work.
7. Discussion
The detection and maintenance of potholes are crucial for ensuring road safety and
enhancing the quality of life for drivers and passengers. Traditional methods of pothole
detection often rely on manual reporting by drivers or municipal employees, which can
be inefficient and unreliable. This study introduces an innovative automated system for
pothole detection using UAVs and AI. The proposed PDS-UAV system uses UAVs to capture
images of road surfaces, then utilizes a deep neural network to analyze the captured images
and detect potholes.
Previous research has explored various methods for pothole detection, including the
use of smartphone cameras and in-vehicle cameras [23,24]. While useful, these methods
often suffer from limited coverage and require manual intervention. In contrast, UAVs offer
a broader sensing range, allowing for more extensive coverage and the ability to detect
potholes in areas that are difficult to access by conventional means without endangering
human lives. Thus, several recent studies in the literature have considered UAV images
for pothole detection [12–17]. Some of the UAV approaches for pothole detection utilized
machine learning techniques [12,17], while others utilized deep learning techniques with or
without image processing [13–16]. The image processing techniques were utilized to reduce
the error rate and increase the model’s ability to distinguish between classes [14,15]. Other
approaches used deep learning for pothole detection. For instance, a multi-agent system
using the YOLOv4-tiny algorithm was employed for real-time pothole detection using UAV
images with accuracy rates of 96.54% and a precision of 98.45% [16]. Similarly, UAV images
were also used to improve road damage detection efficiency and accuracy by employing
multiple YOLO versions (YOLOv4, YOLOv5, YOLOv5 with transformer, and YOLOv7),
achieving precision rates of 26.86%, 59.9%, 65.7%, and 73.2%, respectively [13].
The PDS-UAV system extends the efforts of automating pothole detection by incorpo-
rating YOLO deep learning algorithm. Different variants of the YOLO model were tested
for pothole detection in the proposed PDS-UAV system. The models were trained using a
publicly available dataset collected from Spain and tested on a proprietary dataset collected
from Jeddah, Saudi Arabia, for cross-domain generalization. The examined models in-
cluded YOLOv4-tiny, YOLOv5, and YOLOv8. The best precision percentages for the models
were achieved when the number of epochs was equal to 100 and the percentages were 82%,
82%, and 98%, respectively. Due to its significantly higher detection precision, YOLOv8
was selected as the best model for the system to increase the reliability and accuracy of
pothole identification.
In [16], the same dataset of the present study was employed to train the model
exclusively on YOLOv4-tiny. In contrast, the present study utilized the same dataset to
train three models: YOLOv4-tiny, YOLOv5, and YOLOv8, enabling a comparative analysis
across different versions of the YOLO model. Although both studies used the same dataset,
the YOLOv4-tiny model in [16] achieved higher precision than this study (98.45% versus
82%). This discrepancy could be due to the model in [16] being optimized and tested
on a domain-specific dataset, which likely improved its performance. In contrast, this
study tested the models on a dataset from a different domain (country), introducing more
variability and possibly making the detection task more challenging. Another notable
factor is that the authors in [16] did not specify the number of training epochs, whereas
in the current study, the number of epochs ranged between 10 and 100 for each model.
A longer or more fine-tuned training process may have contributed to the higher precision
in [16]. While increasing the number of epochs can improve the model performance, it also
raises the risk of overfitting, where the model performs well on training data but poorly on
Sustainability 2024, 16, 9168 28 of 30
unseen data. To prevent this, the number of epochs in the present study was deliberately
kept between 10 and 100, balancing training time and ensuring better generalization to
unseen data.
The implementation of the PDS-UAV system demonstrated several key findings.
The YOLOv8 model demonstrated high effectiveness in detecting potholes from UAV-
captured images, achieving an F1 score of 95%, a precision of 98%, and a recall of 92%,
underscoring its accuracy and efficiency. Usability testing with road maintenance employ-
ees and road users revealed a high average satisfaction rating of 4.3 out of 5, reflecting
the system’s ease of use and acceptance among target users. The system’s operational
benefits were also notable. It effectively informs drivers of nearby potholes on their routes
by overlaying the detected potholes’ locations on the map.
Automating pothole identification using UAV images has a significant impact on
infrastructure management, transportation efficiency, and road safety, benefiting vari-
ous stakeholders, including road users, road maintenance employees, and city planners.
For road users, such as drivers, bikers, and daily commuters, the technology offers safer
routes by providing information on pothole locations, allowing them to avoid hazardous
areas and reducing the risk of accidents and vehicle damage. Maintenance employees gain
from enhanced safety due to reduced exposure to traffic and the elimination of manual
inspections, which also cuts labor costs and allows for more efficient repairs. The sys-
tem’s continuous monitoring capabilities ensure proactive maintenance, preventing costly
road damage and enabling timely interventions. For city planners, the PDS-UAV system
provides valuable data to prioritize road repair projects and allocate resources efficiently.
Its integration with traffic management and emergency services enhances urban mobility
and safety by maintaining critical routes in good condition. Furthermore, the precise
data supports informed decision-making and smart city initiatives, contributing to bet-
ter infrastructure management and environmental sustainability. Utilizing an advanced
detection algorithm, YOLOv8, enhances the system’s reliability and accuracy, making it
an essential tool for improving road maintenance and ensuring safer and more efficient
transportation networks.
Despite the promising results of the proposed PDS-UAV system, several limitations
can be highlighted. First, the performance of the YOLOv8 model relies heavily on the
quality and diversity of the training dataset. In addition, potholes can vary depending on
regions, road surfaces, lighting conditions, and weather, impacting the model’s detection
accuracy. Second, while there are many benefits to using UAVs for data collection, there are
also shortcomings. Deploying UAVs for pothole detection faces operational limitations such
as battery life, which could result in restricted flight length (usually 30–40 min). In addition,
the effects of bad weather conditions, such as heavy rain, strong winds, or low visibility,
can limit UAV operations. These UAV-related variables may impact the dependability and
consistency of data gathering across long periods or wide geographic areas. Third, the use
of UAVs in urban contexts is restricted by stringent regulatory frameworks designed to
ensure safety and privacy, making it time-consuming to obtain appropriate approvals and
manage the danger of accidents with other airborne objects or structures. Addressing
these limitations through future research and development will be crucial for enhancing
the robustness, scalability, and practical applicability of the PDS-UAV system in diverse
urban environments.
Future work will include developing a real-time system by integrating the system
into the UAVs. In addition, it will focus on improving the website’s functionality and
conducting further usability testing with a larger user base. Also, enabling a bilingual
interface for the application using English and Arabic will make the web application more
accessible to target users. Moreover, including real-time alerts for RUs based on their
current location, and using banners and sounds would enhance their safety. The alerts
should consider the safe distance that separates the RUs from the pothole based on their
mobility mode (walking, driving, or cycling) so that they can take proper action at the right
time to prevent the impact of these potholes on their safety. Furthermore, investigating the
Sustainability 2024, 16, 9168 29 of 30
use of UAVs for other forms of infrastructure monitoring, such as traffic monitoring and
control, might broaden the technology’s applications.
8. Conclusions
Smart cities use modern technology to enhance urban services, infrastructure, and en-
vironmental sustainability. Ensuring the safety and comfort of drivers and passengers is
crucial for improving quality of life. This study aimed to develop a system using a deep
learning model to detect potholes from UAV images. Data were collected from target users
via interviews with a RME from Jeddah’s IT department and a questionnaire distributed
to road users. Two web applications were developed for RMEs and RUs. The deep learn-
ing model, trained on a public dataset of UAV pothole images, was tested on a dataset
collected in Jeddah, Saudi Arabia. The YOLOv8 model achieved 95% on the F1 score, 98%
on precision, and 92% on recall. Unit testing validated individual system functions while
integration testing ensured that all components worked together, and the deep learning
model was integrated with the RME web application. Usability testing with representative
users showed overall satisfaction with the system. In practice, the proposed PDS-UAV
system can significantly reduce road maintenance costs by automating detection and mon-
itoring processes, thereby reducing the need for frequent and risky manual inspection.
Furthermore, the system’s ability to deliver timely information to drivers about pothole
locations improves traffic safety, potentially reducing accidents caused by hazardous road
conditions. Future research will focus on developing a real-time system for UAVs, improv-
ing the website’s functionalities, and conducting further usability testing with a larger
user base.
References
1. World Health Organization Global Status Report on Road Safety 2023; Technical Report; World Health Organization: Geneva,
Switzerland, 2023.
2. Ranyal, E.; Sadhu, A.; Jain, K. Road condition monitoring using smart sensing and artificial intelligence: A review. Sensors 2022,
22, 3044. [CrossRef] [PubMed]
3. Alenezi, E.Z.; AlQahtani, A.M.; Althunayan, S.F.; Alanazi, A.S.; Aldosari, A.O.; Alharbi, A.M.; Alanazi, S.T.; Alanazi, S.S.S.;
Tubayqi, H.G.A.; Taheri, T.A. Prevalence and determinants of road traffic accidents in Saudi Arabia: A systematic review. Cureus
2023, 15, e51205 . [CrossRef] [PubMed]
4. HemaMalini, B.; Padesur, A.; Manoj, V.; Shet, A. Detection of Potholes on Roads using a Drone. EAI Endorsed Trans. Energy Web
2021, 9, e4.
5. Rathee, M.; Bačić, B.; Doborjeh, M. Automated Road Defect and Anomaly Detection for Traffic Safety: A Systematic Review.
Sensors 2023, 23, 5656. [CrossRef]
6. Ozoglu, F.; Gökgöz, T. Detection of Road Potholes by Applying Convolutional Neural Network Method Based on Road Vibration
Data. Sensors 2023, 23, 9023. [CrossRef]
Sustainability 2024, 16, 9168 30 of 30
7. Butilă, E.V.; Boboc, R.G. Urban traffic monitoring and analysis using unmanned aerial vehicles (UAVs): A systematic literature
review. Remote Sens. 2022, 14, 620. [CrossRef]
8. Byun, S.; Shin, I.K.; Moon, J.; Kang, J.; Choi, S.I. Road traffic monitoring from UAV images using deep learning networks. Remote
Sens. 2021, 13, 4027. [CrossRef]
9. Dewan, R.; Rahman, K.F. A Survey on Applications of Unmanned Aerial Vehicles (UAVs). In Recent Innovations in Computing;
Springer: Berlin/Heidelberg, Germany, 2022; pp. 95–110.
10. Feitosa, I.; Santos, B.; Almeida, P.G. Pavement Inspection in Transport Infrastructures Using Unmanned Aerial Vehicles (UAVs).
Sustainability 2024, 16, 2207. [CrossRef]
11. Liang, H.; Lee, S.C.; Bae, W.; Kim, J.; Seo, S. Towards UAVs in construction: Advancements, challenges, and future directions for
monitoring and inspection. Drones 2023, 7, 202. [CrossRef]
12. Ibrahim, H.B.; Salah, M.; Zarzoura, F.; El-Mewafi, M. Smart monitoring of road pavement deformations from UAV images by
using machine learning. Innov. Infrastruct. Solut. 2024, 9, 1–18. [CrossRef]
13. Silva, L.A.; Leithardt, V.R.Q.; Batista, V.F.L.; Villarrubia González, G.; De Paz Santana, J.F. Automated Road Damage Detection
Using UAV Images and Deep Learning Techniques. IEEE Access 2023, 11, 62918–62931. [CrossRef]
14. Kim, S.; Seo, D.; Jeon, S. Improvement of Tiny Object Segmentation Accuracy in Aerial Images for Asphalt Pavement Pothole
Detection. Sensors 2023, 23, 5851. [CrossRef] [PubMed]
15. Nomqupu, S.; Sali, A.; Nyamugama, A.; Ndou, N. Integrating Sigmoid Calibration Function into Entropy Thresholding
Segmentation for Enhanced Recognition of Potholes Imaged Using a UAV Multispectral Sensor. Appl. Sci. 2024, 14, 2670.
[CrossRef]
16. Silva, L.A.; Sanchez San Blas, H.; Peral García, D.; Sales Mendes, A.; Villarubia González, G. An architectural multi-agent system
for a pavement monitoring system with pothole recognition in UAV images. Sensors 2020, 20, 6205. [CrossRef]
17. Pan, Y.; Zhang, X.; Sun, M.; Zhao, Q. Object-based and supervised detection of potholes and cracks from the pavement images
acquired by UAV. Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2017, 42, 209–217. [CrossRef]
18. Xin, H.; Ye, Y.; Na, X.; Hu, H.; Wang, G.; Wu, C.; Hu, S. Sustainable Road Pothole Detection: A Crowdsourcing Based Multi-Sensors
Fusion Approach. Sustainability 2023, 15, 6610. [CrossRef]
19. Opara, J.N.; Thein, A.B.B.; Izumi, S.; Yasuhara, H.; Chun, P.J. Defect Detection on Asphalt Pavement by Deep Learning. Geomate J.
2021, 21, 87–94.
20. Park, S.S.; Tran, V.T.; Lee, D.E. Application of various yolo models for computer vision-based real-time pothole detection. Appl.
Sci. 2021, 11, 11229. [CrossRef]
21. Baek, J.W.; Chung, K. Pothole classification model using edge detection in road image. Appl. Sci. 2020, 10, 6662. [CrossRef]
22. Dewangan, D.K.; Sahu, S.P. PotNet: Pothole detection for autonomous vehicle system using convolutional neural network.
Electron. Lett. 2021, 57, 53–56. [CrossRef]
23. Ochoa-Ruiz, G.; Angulo-Murillo, A.A.; Ochoa-Zezzatti, A.; Aguilar-Lobo, L.M.; Vega-Fernández, J.A.; Natraj, S. An asphalt
damage dataset and detection system based on retinanet for road conditions assessment. Appl. Sci. 2020, 10, 3974. [CrossRef]
24. Maeda, H.; Sekimoto, Y.; Seto, T.; Kashiyama, T.; Omata, H. Road damage detection and classification using deep neural networks
with smartphone images. Comput.-Aided Civ. Infrastruct. Eng. 2018, 33, 1127–1141. [CrossRef]
25. Wang, J.; Lan, C.; Liu, C.; Ouyang, Y.; Qin, T.; Lu, W.; Chen, Y.; Zeng, W.; Yu, P.S. Generalizing to Unseen Domains: A Survey on
Domain Generalization. IEEE Trans. Knowl. Data Eng. 2023, 35, 8052–8072. [CrossRef]
26. Saremi, F.; Abdelzaher, T. Combining Map-Based Inference and Crowd-Sensing for Detecting Traffic Regulators. In Proceedings
of the 2015 IEEE 12th International Conference on Mobile Ad Hoc and Sensor Systems, Dallas, TX, USA, 19–22 October 2015;
pp. 145–153. [CrossRef]
27. Zourlidou, S.; Sester, M. Traffic Regulator Detection and Identification from Crowdsourced Data—A Systematic Literature
Review. ISPRS Int. J. Geo-Inf. 2019, 8, 491. [CrossRef]
28. Hao, X.; Liu, L.; Yang, R.; Yin, L.; Zhang, L.; Li, X. A review of data augmentation methods of remote sensing image target
recognition. Remote Sens. 2023, 15, 827. [CrossRef]
29. Theophilus, S. Roboflow: Converting Annotations for Object Detection. 3 September 2021. Available online: https://medium.com/
analytics-vidhya/converting-annotations-for-object-detection-using-roboflow-5d0760bd5871 (accessed on 5 December 2022).
30. Redmon, J.; Divvala, S.; Girshick, R.; Farhadi, A. You Only Look Once: Unified, Real-Time Object Detection. In Proceedings of
the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual
author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to
people or property resulting from any ideas, methods, instructions or products referred to in the content.
Reproduced with permission of copyright owner. Further reproduction
prohibited without permission.