CCD Image-Based Pixel-Level Identification Model For Pavement Cracks Under Complex Noises Using Artificial Intelligence
ABSTRACT Existing manual detection methods have limitations, particularly for pavement cracks in complex backgrounds: low recognition accuracy, a high misjudgment rate, and long inspection times. To overcome these problems, artificial intelligence technology and charge-coupled device (CCD) imaging technology are combined to construct an automatic identification method for hidden pavement cracks under complex background interference. First, the classic semantic segmentation model U-Net is selected as the base model, and the lightweight MobileNet network replaces the parameter-heavy encoder of U-Net, which makes the model lightweight and improves the segmentation of pavement cracks. On this basis, the Atrous Channel Pyramid Attention (ACPA) mechanism is introduced into U-Net to further improve its ability to capture contextual information and to focus selectively on relevant features. A pavement crack data set containing diverse crack types and background noise is used to evaluate the effectiveness and scope of application of the developed model. Quantitative evaluation shows that, on the test set, the developed model achieves a precision of 88.84%, a recall of 89.76%, an accuracy of 98.87%, and an IoU of 89.95%. The comparison and ablation experiments indicate that replacing the encoder of U-Net with the lightweight MobileNet network effectively yields a lightweight model, while the ACPA module effectively performs multi-scale and long-distance cross-channel interaction, helps suppress useless features and strengthen useful ones, and helps the network learn stronger feature representations of hidden pavement crack areas.
INDEX TERMS Pavement disease, machine vision, deep learning, damage assessment, feature extraction.
Compared with manual detection, this automatic detection technology has significantly improved accuracy, but the detection vehicles are expensive and suffer from challenging issues due to the complex road background [6]. Moreover, pavement cracks are mostly linear in shape, so they occupy a relatively small proportion of image pixels, and crack identification is more challenging than the identification of other types of defects, especially in complex environments with a lot of background noise [7].

In the past few decades, with the rapid development of artificial intelligence (AI), various intelligent algorithms have been applied to the identification and detection of road surface defects [8], [9], [10], [11], [12]. Machine learning-based algorithms represented by artificial neural networks and support vector machines are often used to learn the intrinsic knowledge association of pavement crack data [13], [14]. However, these methods rely on traditional manual feature extraction, and their performance is limited in scenes with complex road backgrounds, uneven pixel distribution, and noise interference. In addition, the cumbersome feature extraction process slows the overall inference of the model, making it difficult to meet real-time detection requirements.

In recent years, deep learning methods have been researched and partially applied in the fields of computational photography, image recognition, and automatic driving, and have achieved remarkable results [15], [16]. The use of deep learning methods for CCD image data mining and effective extraction has become a hot research topic in the field of road surface management. For example, Liu et al. [17] proposed an automated pavement crack detection and segmentation method using two-step convolutional neural networks. Yang et al. [18] proposed a feature pyramid and hierarchical boosting network for pavement crack detection. Fan et al. [19] utilized the parallel ResNet to develop a high-performance pavement crack detection and measurement model. In related work, Fan et al. [20] proposed an ensemble method using deep convolutional neural networks for automatic pavement crack detection and measurement. The effective implementation of the above research shows that deep learning is an effective method for identifying hidden pavement cracks. However, although the aforementioned deep learning-based networks achieve good performance, they still suffer from insufficient feature extraction capability, specifically for long-range contextual information, which is crucial for tiny crack detection. A small receptive field can easily lead to broken or false-positive crack regions under complex light and shadow conditions. In addition, the balance between network inference efficiency and inference accuracy needs further study.

To obtain accurate segmentation results for pavement cracks under complicated background and noise interference, the developed network needs to obtain high-quality semantic features and sufficient spatial detail information. The network generally performs a downsampling operation on the feature map to obtain high-quality semantic features. This operation not only reduces the resolution of the feature map and the amount of calculation, but also helps to increase the network receptive field. However, in the process of downsampling the feature map, spatial detail information is lost. To address the above-mentioned problems, this study utilizes the U-shaped network architecture (U-Net) as the base network. To realize real-time detection of pavement cracks at the pixel level, the mobile network MobileNet is introduced to replace the backbone of the conventional U-Net. Through skip connections, the improved U-Net integrates the encoder feature maps into the decoder at similar resolutions. To further mitigate the problem of sample loss when gradually increasing the spatial resolution, the Atrous Channel Pyramid Attention (ACPA) module is introduced into U-Net to improve the integrity and accuracy of identifying micro-cracks in complex backgrounds. On this basis, the model loss function is improved to address the serious imbalance between crack and background samples caused by tiny cracks.

The main contributions of this study are as follows.
1) Replacing the backbone network with the lightweight MobileNet network maintains high detection accuracy while greatly reducing the size of the model weight file and the crack segmentation time.
2) The ACPA module can effectively perform multi-scale and long-distance cross-channel interaction, help suppress useless features, strengthen useful features, and help the network learn stronger feature representations of pavement crack areas, while having fewer parameters.
3) The comparative experiments on the pavement engineering crack data set show that the constructed method has higher inference accuracy and efficiency than other benchmark methods, indicating that the model has achieved an effective balance between inference accuracy and efficiency.

The rest of this paper is organized as follows. Section II introduces the architecture of the proposed network and the mathematical principles of each part. Section III describes the implementation process and the dataset sources. Section IV describes the experimental results and feasibility analysis. Lastly, the conclusions are provided in the final part of this paper.

II. METHODOLOGY
In this section, the overall architecture of the developed network is first presented to give a workflow. Then, the theory behind the components of the model is further elaborated. The specific content is as follows.
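As a hedged illustration of the encoder-decoder behaviour outlined in the introduction, the following NumPy sketch shows how downsampling reduces feature-map resolution, why upsampling alone cannot restore the lost spatial detail, and how a skip connection supplies that detail again by concatenating the encoder feature map. The function names, the 2x2 max pooling, the nearest-neighbour upsampling, and the tensor sizes are illustrative assumptions and are not taken from the developed network.

```python
import numpy as np


def max_pool_2x2(x):
    """Downsample a (C, H, W) feature map by a factor of 2 using 2x2 max pooling."""
    c, h, w = x.shape
    if h % 2 or w % 2:
        raise ValueError("height and width must be even")
    # Split each spatial axis into (blocks, 2) and take the maximum inside each block.
    return x.reshape(c, h // 2, 2, w // 2, 2).max(axis=(2, 4))


def upsample_nearest_2x(x):
    """Upsample a (C, H, W) feature map by a factor of 2 (nearest neighbour)."""
    return x.repeat(2, axis=1).repeat(2, axis=2)


def skip_concat(decoder_feat, encoder_feat):
    """Fuse decoder and encoder features of equal resolution along the channel axis."""
    if decoder_feat.shape[1:] != encoder_feat.shape[1:]:
        raise ValueError("spatial sizes must match for a skip connection")
    return np.concatenate([decoder_feat, encoder_feat], axis=0)


rng = np.random.default_rng(0)
encoder_feat = rng.random((8, 64, 64))        # hypothetical encoder feature map
bottleneck = max_pool_2x2(encoder_feat)       # (8, 32, 32): a quarter of the positions
decoder_feat = upsample_nearest_2x(bottleneck)  # (8, 64, 64): size restored, detail not
fused = skip_concat(decoder_feat, encoder_feat)  # (16, 64, 64): detail re-injected

detail_lost = not np.array_equal(decoder_feat, encoder_feat)
print(bottleneck.shape, decoder_feat.shape, fused.shape, detail_lost)
```

In this toy setting the upsampled map has the original size but differs from the encoder map, which is the loss of spatial detail that the skip connections are meant to compensate for.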
89734 VOLUME 11, 2023
F. Song et al.: CCD Image-Based Pixel-Level Identification Model for Pavement Cracks
C. ATROUS CHANNEL PYRAMID ATTENTION
Because of the complex environmental conditions of the road, such as landmarks, oil stains, and water stains, breaks and discontinuities in the hidden cracks detected by visual methods easily cause misjudgment of subsequent results. To alleviate this problem, ACPA is introduced to enhance the model's identification and segmentation of small target objects. ACPA is an attention mechanism used in computer vision tasks, particularly in the field of semantic segmentation [22]. It is designed to capture long-range dependencies and enhance the feature representations in convolutional neural networks (CNNs). Semantic segmentation involves assigning a class label to each pixel in an image, which requires understanding the context and relationships between different regions. ACPA helps in capturing such
improves the network’s ability to capture the global information of cracks.
FIGURE 4. The architecture diagram of the ACPA.
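The internal structure of the ACPA module is not spelled out in this passage, so the sketch below does not attempt to reproduce it. It only illustrates the atrous (dilated) convolution that the module's name refers to: inserting gaps between kernel taps enlarges the receptive field without adding weights. The 1-D setting and the function names `effective_kernel_size` and `atrous_conv1d` are assumptions made for illustration.

```python
import numpy as np


def effective_kernel_size(kernel_size, dilation):
    """Span covered by a kernel whose taps are `dilation` samples apart."""
    return kernel_size + (kernel_size - 1) * (dilation - 1)


def atrous_conv1d(signal, kernel, dilation=1):
    """Valid-mode 1-D cross-correlation with dilated (atrous) kernel taps."""
    signal = np.asarray(signal, dtype=float)
    kernel = np.asarray(kernel, dtype=float)
    span = effective_kernel_size(len(kernel), dilation)
    out_len = len(signal) - span + 1
    if out_len <= 0:
        raise ValueError("signal is shorter than the dilated kernel span")
    out = np.zeros(out_len)
    # Each tap reads the signal at an offset of i * dilation samples.
    for i, weight in enumerate(kernel):
        start = i * dilation
        out += weight * signal[start:start + out_len]
    return out


signal = np.arange(10)
kernel = [1.0, 1.0, 1.0]
for rate in (1, 2, 4):
    # Same three weights, but the covered span grows with the dilation rate.
    print(rate, effective_kernel_size(len(kernel), rate),
          atrous_conv1d(signal, kernel, rate))
```

With three weights, dilation rates of 1, 2, and 4 cover spans of 3, 5, and 9 samples, which is the sense in which a pyramid of rates can gather context at several scales for thin, elongated structures such as cracks.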
yi is the real label of the i-th pixel (the background pixel label is 0, the crack pixel label is 1); p(yi) is the probability that the network predicts that the i-th pixel is the crack pixel label.
that there are significant differences in the shape of pavement cracks in the data set, with inconsistent crack widths and different bifurcation types.
The Dice loss is a suitable loss function for training image segmentation models because it encourages the model to produce segmentation masks that align well with the ground truth masks [24]. It is defined as 1 - Dice_coefficient, so minimizing the Dice loss maximizes the overlap between the predicted and ground truth segmentation masks. Using the Dice loss can therefore improve the model’s ability to capture fine details and handle class imbalance.
$$\mathrm{Dice\_coefficient} = \frac{2|X \cap Y|}{|X| + |Y|} \tag{4}$$
where |X ∩ Y| represents the number of elements in the intersection of sets X and Y, and |X| and |Y| represent the numbers of elements in X and Y, respectively. For the segmentation task, X and Y represent the ground truth mask and the predicted mask, respectively.
FIGURE 5. Different types of pavement cracks.
Since the relationship between Dice loss and Dice coefficient is: LossDice = 1 - Dice_coefficient, the formula for Dice Loss can be explained as follows.
$$\mathrm{Loss}_{\mathrm{Dice}} = 1 - \frac{2|X \cap Y|}{|X| + |Y|} \tag{5}$$
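A minimal sketch of Eqs. (4) and (5) in NumPy is given below, assuming binary (0/1) masks for which the element-wise product counts the intersection. The function names and the small smoothing constant `eps`, which avoids division by zero when both masks are empty, are assumptions added for illustration and are not part of the formulas above.

```python
import numpy as np


def dice_coefficient(pred, target, eps=1e-7):
    """Dice coefficient 2|X ∩ Y| / (|X| + |Y|) for binary masks (Eq. 4)."""
    pred = np.asarray(pred, dtype=float)
    target = np.asarray(target, dtype=float)
    intersection = np.sum(pred * target)   # |X ∩ Y| for 0/1 masks
    total = np.sum(pred) + np.sum(target)  # |X| + |Y|
    # eps is an assumed smoothing term, not part of Eq. (4).
    return (2.0 * intersection + eps) / (total + eps)


def dice_loss(pred, target, eps=1e-7):
    """Dice loss, defined as 1 - Dice coefficient (Eq. 5)."""
    return 1.0 - dice_coefficient(pred, target, eps)


# Toy masks: 1 marks a crack pixel, 0 marks background.
target = np.array([[0, 1, 1],
                   [0, 0, 1]])
pred = np.array([[0, 1, 0],
                 [0, 1, 1]])

# Two overlapping pixels, three crack pixels in each mask: Dice = 4 / 6.
print(round(dice_loss(pred, target), 4))
```

Because the many background pixels that both masks agree on do not enter the numerator or the denominator, the value is driven by the crack pixels alone, which is why this loss is less sensitive to the crack/background imbalance than a per-pixel average.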
hydraulic concrete structures. It can be seen from the figure that the proposed method has achieved good performance on different types of cracks in the test set, and the results of neural network segmentation and crack identification are basically consistent.

FIGURE 9 shows the pavement crack identification effect of the developed method under real noise interference. It can be inferred from the results that the constructed model can accurately identify pavement cracks even in the presence of significant noise contamination (including paint, cigarette butts, and wooden sticks). It can also be seen that the identified geometric profile is consistent with the real labeling results, indicating the effectiveness of the developed method.

V. CONCLUSION
Pavement cracking is a latent and harmful pavement disease that has attracted great attention from the engineering management community. If it is not dealt with properly, it will seriously threaten driving safety and cause serious accidents. The existing manual detection methods suffer from low efficiency, long inspection times, and poor accuracy, and it is difficult for them to meet the needs of large-scale urban pavement crack diagnosis. Based on this, this study combines artificial intelligence methods and CCD image technology to propose an automatic identification method for hidden cracks in pavement with high detection performance. A series of experimental comparisons and multi-angle verifications show that the proposed method can still achieve higher detection accuracy and performance than other benchmark methods under complex road noise interference, which illustrates the feasibility of the idea.

A. LIMITATION
However, this study also has some limitations, which need to be further explained. First of all, the method developed in this study takes pavement cracks as the research object to verify the feasibility of the method. It should be extended and applied to the identification of different types of pavement defects, including potholes, depressions, etc. The proposed method can be further combined with unmanned aerial photography remote sensing technology to realize real-time, efficient, and large-scale automatic identification and diagnosis of hidden cracks in the pavement, to improve the efficiency and automation of pavement management. Also, laser scanning and ground radar photography can be combined to study the 3D reconstruction of pavement cracks and improve the understanding of pavement diseases.

REFERENCES
[1] W. Cao, Q. Liu, and Z. He, “Review of pavement defect detection methods,” IEEE Access, vol. 8, pp. 14531–14544, 2020, doi: 10.1109/ACCESS.2020.2966881.
[2] A. Zhang, K. C. P. Wang, B. Li, E. Yang, X. Dai, Y. Peng, Y. Fei, Y. Liu, J. Q. Li, and C. Chen, “Automated pixel-level pavement crack detection on 3D asphalt surfaces using a deep-learning network,” Comput.-Aided Civil Infrastruct. Eng., vol. 32, no. 10, pp. 805–819, Oct. 2017, doi: 10.1111/mice.12297.
[3] F. Guo, Y. Qian, Y. Wu, Z. Leng, and H. Yu, “Automatic railroad track components inspection using real-time instance segmentation,” Comput.-Aided Civil Infrastruct. Eng., vol. 36, no. 3, pp. 362–377, Mar. 2021, doi: 10.1111/mice.12625.
[4] L. Zhao, Y. Wu, X. Luo, and Y. Yuan, “Automatic defect detection of pavement diseases,” Remote Sens., vol. 14, no. 19, p. 4836, Sep. 2022, doi: 10.3390/rs14194836.
[5] C. Chen, S. Chandra, Y. Han, and H. Seo, “Deep learning-based thermal image analysis for pavement defect detection and classification considering complex pavement conditions,” Remote Sens., vol. 14, no. 1, p. 106, Dec. 2021, doi: 10.3390/rs14010106.
[6] L. Pei, Z. Sun, L. Xiao, W. Li, J. Sun, and H. Zhang, “Virtual generation of pavement crack images based on improved deep convolutional generative adversarial network,” Eng. Appl. Artif. Intell., vol. 104, Sep. 2021, Art. no. 104376, doi: 10.1016/j.engappai.2021.104376.
[7] L. Deng, A. Zhang, J. Guo, and Y. Liu, “An integrated method for road crack segmentation and surface feature quantification under complex backgrounds,” Remote Sens., vol. 15, no. 6, p. 1530, Mar. 2023, doi: 10.3390/rs15061530.
[8] Y. Shi, L. Cui, Z. Qi, F. Meng, and Z. Chen, “Automatic road crack detection using random structured forests,” IEEE Trans. Intell. Transp. Syst., vol. 17, no. 12, pp. 3434–3445, Dec. 2016, doi: 10.1109/TITS.2016.2552248.
[9] T. Rateke and A. von Wangenheim, “Road surface detection and differentiation considering surface damages,” Auton. Robots, vol. 45, no. 2, pp. 299–312, Feb. 2021, doi: 10.1007/s10514-020-09964-3.
[10] H. Maeda, T. Kashiyama, Y. Sekimoto, T. Seto, and H. Omata, “Generative adversarial network for road damage detection,” Comput.-Aided Civil Infrastruct. Eng., vol. 36, no. 1, pp. 47–60, Jan. 2021, doi: 10.1111/mice.12561.
[11] O. D. Adeniji, D. B. Adekeye, S. A. Ajagbe, A. O. Adesina, Y. J. Oguns, and M. A. Oladipupo, “Development of DDoS attack detection approach in software defined network using support vector machine classifier,” in Proc. ICPCSN. Singapore: Springer, 2022, pp. 319–331.
[12] S. A. Ajagbe and M. O. Adigun, “Deep learning techniques for detection and prediction of pandemic diseases: A systematic literature review,” Multimedia Tools Appl., vol. 6, pp. 1–35, 2023, doi: 10.1007/s11042-023-15805-z.
[13] N. Aravind, S. Nagajothi, and S. Elavenil, “Machine learning model for predicting the crack detection and pattern recognition of geopolymer concrete beams,” Construct. Building Mater., vol. 297, Aug. 2021, Art. no. 123785, doi: 10.1016/j.conbuildmat.2021.123785.
[14] A. Malekloo, E. Ozer, M. Alhamaydeh, and M. Girolami, “Machine learning and structural health monitoring overview with emerging technology and high-dimensional data source highlights,” Struct. Health Monit., vol. 21, no. 4, pp. 1906–1955, 2021, doi: 10.1177/14759217211036880.
[15] J. Huyan, W. Li, S. Tighe, Z. Xu, and J. Zhai, “CrackU-net: A novel deep convolutional neural network for pixelwise pavement crack detection,” Struct. Control Health Monitor., vol. 27, no. 8, pp. 1–19, Aug. 2020, doi: 10.1002/stc.2551.
[16] Y. Wu, Y. Qin, Y. Qian, F. Guo, Z. Wang, and L. Jia, “Hybrid deep learning architecture for rail surface segmentation and surface defect detection,” Comput.-Aided Civil Infrastruct. Eng., vol. 37, no. 2, pp. 227–244, Feb. 2022, doi: 10.1111/mice.12710.
[17] J. Liu, X. Yang, S. Lau, X. Wang, S. Luo, V. C. Lee, and L. Ding, “Automated pavement crack detection and segmentation based on two-step convolutional neural network,” Comput.-Aided Civil Infrastruct. Eng., vol. 35, no. 11, pp. 1291–1305, Nov. 2020, doi: 10.1111/mice.12622.
[18] F. Yang, L. Zhang, S. Yu, D. Prokhorov, X. Mei, and H. Ling, “Feature pyramid and hierarchical boosting network for pavement crack detection,” IEEE Trans. Intell. Transp. Syst., vol. 21, no. 4, pp. 1525–1535, Apr. 2020, doi: 10.1109/TITS.2019.2910595.
[19] Z. Fan, H. Lin, C. Li, J. Su, S. Bruno, and G. Loprencipe, “Use of parallel ResNet for high-performance pavement crack detection and measurement,” Sustainability, vol. 14, no. 3, p. 1825, Feb. 2022, doi: 10.3390/su14031825.
[20] Z. Fan et al., “Ensemble of deep convolutional neural networks for automatic pavement crack detection and measurement,” Coatings, vol. 10, no. 2, p. 152, 2020.
[21] J. Zhang, X. Yang, W. Li, S. Zhang, and Y. Jia, “Automatic detection of moisture damages in asphalt pavements from GPR data with deep CNN and IRS method,” Autom. Construct., vol. 113, May 2020, Art. no. 103119, doi: 10.1016/j.autcon.2020.103119.
[22] O. Ronneberger, P. Fischer, and T. Brox, “U-Net: Convolutional networks for biomedical image segmentation,” in Proc. 18th Int. Conf. Med. Image Comput. Comput.-Assist. Intervent. (MICCAI), Munich, Germany, Oct. 2015, pp. 234–241.
[23] S. Feroz and S. A. Dabous, “UAV-based remote sensing applications for bridge condition assessment,” Remote Sens., vol. 13, no. 9, p. 1809, May 2021, doi: 10.3390/rs13091809.
[24] X. Li, X. Sun, Y. Meng, J. Liang, F. Wu, and J. Li, “Dice loss for data-imbalanced NLP tasks,” in Proc. 58th Annu. Meeting Assoc. Comput. Linguistics, 2020, pp. 465–476, doi: 10.18653/v1/2020.acl-main.45.
[25] S. Guan, A. A. Khan, S. Sikdar, and P. V. Chitnis, “Fully dense UNet for 2-D sparse photoacoustic tomography artifact removal,” IEEE J. Biomed. Health Informat., vol. 24, no. 2, pp. 568–576, Feb. 2020.
[26] V. Badrinarayanan, A. Kendall, and R. Cipolla, “SegNet: A deep convolutional encoder-decoder architecture for image segmentation,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 39, no. 12, pp. 2481–2495, Dec. 2017.
[27] C. Liu, L.-C. Chen, F. Schroff, H. Adam, W. Hua, A. L. Yuille, and L. Fei-Fei, “Auto-DeepLab: Hierarchical neural architecture search for semantic image segmentation,” in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit. (CVPR), Jun. 2019, pp. 82–92.
[28] L. Yan, D. Liu, Q. Xiang, Y. Luo, T. Wang, D. Wu, H. Chen, Y. Zhang, and Q. Li, “PSP net-based automatic segmentation network model for prostate magnetic resonance imaging,” Comput. Methods Programs Biomed., vol. 207, Aug. 2021, Art. no. 106211, doi: 10.1016/j.cmpb.2021.106211.

FEI SONG received the master’s degree in technology of computer application from the Nanjing University of Finance and Economics, Nanjing, China, in 2014. She is currently a Lecturer with Jiangsu Open University. Her research interests include research and application of artificial intelligence and deep learning technology.

YU ZOU received the M.E.E. degree in electronics and communication engineering from the Nanjing University of Information Science and Technology, Nanjing, China, in 2019. He is currently pursuing the Ph.D. degree in artificial intelligence with the Nanjing University of Information Science and Technology. He is an Administrative Staff member of Jiangsu Open University. His research interests include research and application of artificial intelligence and deep learning technology.

WENSHA SHAO received the M.E. degree in technology of computer application from the Nanjing University of Finance and Economics, China, in 2014. She is currently the Manager with Jiangsu Open University. Her research interests include research and application of artificial intelligence and deep learning technology.

XIAOYUAN XU received the M.B.A. degree from Nanjing University, Nanjing, China, in 2007. She is currently an Associate Professor with Jiangsu Open University. Her research interests include research and application of data processing and cloud computing.