Article
A Method for Extracting Joints on Mountain Tunnel Faces Based
on Mask R-CNN Image Segmentation Algorithm
Honglei Qiao 1 , Xinan Yang 1 , Zuquan Liang 1 , Yu Liu 1 , Zhifan Ge 1 and Jian Zhou 2,3, *
1 The Key Laboratory of Road and Traffic Engineering, Ministry of Education, Tongji University,
Shanghai 201804, China; [email protected] (H.Q.); [email protected] (X.Y.)
2 Department of Civil Engineering, Hangzhou City University, Hangzhou 310015, China
3 Key Laboratory of Safe Construction and Intelligent Maintenance for Urban Shield Tunnels of Zhejiang
Province, Hangzhou City University, Hangzhou 310015, China
* Correspondence: [email protected]
Abstract: The accurate distribution of joints on the tunnel face is crucial for assessing the stability and
safety of surrounding rock during tunnel construction. This paper introduces the Mask R-CNN image
segmentation algorithm, a state-of-the-art deep learning model, to achieve efficient and accurate
identification and extraction of joints on tunnel face images. First, digital images of tunnel faces were
captured and stitched, resulting in 286 complete images suitable for analysis. Then, the joints on
the tunnel face were extracted using traditional image processing algorithms, the commonly used
U-net image segmentation model, and the Mask R-CNN image segmentation model introduced in
this paper to address the lack of recognition accuracy. Finally, the extraction results obtained by the
three methods were compared. The comparison results show that the joint extraction method based
on the Mask R-CNN image segmentation deep learning model introduced in this paper achieved
the best joint extraction effect with a Dice similarity coefficient of 87.48%, outperforming traditional
methods and the U-net model, which scored 60.59% and 75.36%, respectively, realizing accurate and
efficient acquisition of tunnel face rock joints. These findings suggest that the Mask R-CNN model
can be effectively implemented in real-time monitoring systems for tunnel construction projects.
Keywords: mountain tunnel; tunnel construction safety; rock mass joints; image processing; deep learning; Dice similarity coefficient

1. Introduction

The development degree of joints on the tunnel face reflects rock mass integrity, which is crucial for dynamic evaluation during tunnel construction. Currently, joint development descriptions rely on hand-drawn records and qualitative judgments, limiting accuracy and efficiency. Digital image capturing is primarily used for records, yet these images contain valuable rock mass information. If an effective method can be established to digitally extract and obtain complete rock mass information from tunnel face images, it would significantly improve the efficiency of real-time dynamic grading of the surrounding rock on construction sites.

Early methods, such as manual counting, were inefficient and prone to human error. Ross-Brown and Atkinson [1] first used camera images for rock mass characterization, while subsequent studies introduced digital image processing techniques [2] for better accuracy and efficiency. Advancements in image processing, such as Fourier and Hough transforms [3,4], grayscale elevation methods [5], and structural analysis techniques [6], have improved joint extraction but still rely heavily on manual intervention and experience. Recent studies [7,8] developed algorithms to overcome these limitations but faced challenges in complex environments. However, traditional image processing methods heavily rely on experience, and their processing effectiveness still needs improvement.
Figure 1. Image acquisition equipment: (a) EOS 6D Mark II camera; (b) 50 mm f/1.4 lens; (c) DSLR tripod.
2.1.2. Principles for Selecting Light Sources

To improve image quality under low-light conditions, various lighting equipment such as flashes, reflector lamps, and mechanical equipment light sources are used. Each has its advantages and limitations.
Flash is the most direct complementary light source for digital cameras. It produces a strong lighting effect at the moment of exposure. However, in the dusty environment of tunnel engineering, it can easily cause diffuse reflection off dust particles, which affects the imaging quality. Reflector lamps are a type of spotlight with a wide lighting range and a stable light source. They provide better lighting for the tunnel working surface but are less portable and require a power supply that may be inconvenient at the construction site. Mechanical equipment at the tunnel construction site, such as loaders, wet spray trucks, and dump trucks, generally has lighting systems. These light sources have wide coverage and stable illumination, which can provide a good lighting effect for the tunnel working surface. Although they may be interfered with by mechanical shadows, these shadows can be effectively avoided in practice. Therefore, in this study, the mechanical equipment shown in Figure 2 is selected for lighting and fill light.
Figure 2. Tunnel-lined platform car light source.
2.1.3. Partitioned Shooting Plan and Timing for Tunnel Face Photography

To obtain high-quality images, the camera should be placed 10–20 m in front of the tunnel face, perpendicular to it. The tunnel face is divided into sections to ensure comprehensive coverage.
Various construction activities can obstruct the tunnel face and complicate photography. During drilling and charging, the drilling jumbo and the tunnel-lined platform car can block the view. During mucking, rubble covers the tunnel face, and the high dust concentration makes photographing difficult. During the installation of steel arches and shotcrete application, the tunnel-lined platform car can again block the tunnel face, and the shotcrete process reduces visibility, affecting photo quality. Therefore, the optimal times for photography are after mucking and before installing steel arches, avoiding the adverse interferences shown in Figure 3 and ensuring clear visibility.
Figure 3. Adverse interferences in tunnel face photography: (a) tunnel-lined platform car shadow; (b) tunnel-lined platform obstruction; (c) rubble obstruction; (d) shotcrete coverage. (The red boxes mark where the tunnel face is obscured.)
This article relies on the Luanchuan–Lushi Expressway Tunnel in Henan and the Hangzhou–Wenzhou Railway Tunnel in Zhejiang (shown in Figure 4), where the tunnel face area is generally less than 100 square meters. Considering the onsite shooting conditions, the shooting plan shown in Figure 5 is adopted: the tunnel face is divided into six sections, and the camera is placed 10 m in front of the face. The optimal time for photography is after mucking and before installing steel arches. During this period, uniform lighting can be provided using the tunnel-lined platform car light source, improving the lighting quality for photographing the tunnel face.

Figure 4. Map of tunnel locations.
Figure 5. Onsite digital image shooting plan. (Each number corresponds to one part of the divided tunnel face.)

2.2. Stitching and Fusion of Partitioned Photography Images

After obtaining the six partitioned photographic images of the tunnel face, stitching them together to form a complete and clear tunnel face image is a prerequisite for the next step of joint extraction.

When taking the images, to ensure complete coverage of each partition, the area covered by each partition image is often slightly larger than the actual corresponding partition area. This inevitably leads to overlapping images in adjacent regions, making it impossible to achieve image stitching through simple positioning. The following example illustrates the stitching and fusion algorithm for partitioned images of the tunnel face, using the right arch foot region and the floor region of a tunnel face as examples.
2.2.1. Image Stitching of Tunnel Work Face Partitions

As shown in Figure 6, the blue part of the floor partition image overlaps with the red part of the right arch foot partition image. This overlapping area needs to be stitched together.

Figure 6. Image overlapping area.
The image stitching process uses the SURF (Speeded-Up Robust Features) algorithm, which performs faster compared to other algorithms [33]. First, the Hessian matrix of the image is calculated according to Equation (1):

$$H(f(x,y)) = \begin{bmatrix} \dfrac{\partial^2 f}{\partial x^2} & \dfrac{\partial^2 f}{\partial x\,\partial y} \\ \dfrac{\partial^2 f}{\partial x\,\partial y} & \dfrac{\partial^2 f}{\partial y^2} \end{bmatrix} \quad (1)$$

where H(·) is the Hessian matrix, and f(x, y) is the color value at the image coordinates (x, y).

Next, the determinant of the Hessian matrix is calculated using Equation (2) to obtain the local extremum points of the pixels, which are used as the SURF feature points of the image:

$$\det(H) = \frac{\partial^2 f}{\partial x^2}\,\frac{\partial^2 f}{\partial y^2} - \left(\frac{\partial^2 f}{\partial x\,\partial y}\right)^2 \quad (2)$$

where H is the Hessian matrix, and f is the color value at the image coordinates (x, y).

After obtaining the feature points of the reference image and the matching image, the similarity of the feature points is calculated using the Euclidean distance criterion shown in Equation (3):

$$l = \sqrt{\sum_{i=1}^{n}\bigl(X_1(i) - X_2(i)\bigr)^2} \quad (3)$$

where l is the distance between the two points, n is the dimension of the feature points, X1 is the descriptor vector of the feature point in the reference image, and X2 is the descriptor vector of the feature point in the matching image.

When the distance is less than the set threshold (found to be optimally between 0.6 and 0.8), the two feature points are considered successfully matched.
Figure 7. Unnatural edges in image stitching. (As the yellow box marks.)
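Before these seams can be removed, the partition images must first be registered from the matched feature points described above. The following is a minimal sketch of that SURF detection and matching step, assuming an opencv-contrib build that still ships SURF; the file names, Hessian threshold, and the 0.7 distance cutoff are illustrative assumptions rather than values fixed by this paper.

```python
import cv2

# Two adjacent partition images (placeholder file names), loaded in grayscale.
ref = cv2.imread("floor_partition.jpg", cv2.IMREAD_GRAYSCALE)
mov = cv2.imread("right_arch_foot_partition.jpg", cv2.IMREAD_GRAYSCALE)

# SURF keypoints are local extrema of the Hessian determinant (Equations (1) and (2)).
surf = cv2.xfeatures2d.SURF_create(hessianThreshold=400)
kp1, des1 = surf.detectAndCompute(ref, None)
kp2, des2 = surf.detectAndCompute(mov, None)

# Brute-force matching with the Euclidean distance of Equation (3).
matcher = cv2.BFMatcher(cv2.NORM_L2)
matches = matcher.match(des1, des2)

# Keep pairs whose descriptor distance is below the threshold (0.6-0.8 in this paper).
good = [m for m in matches if m.distance < 0.7]
print(f"{len(good)} of {len(matches)} feature pairs matched successfully")
```

The retained pairs can then be passed to a robust homography estimator (for example, cv2.findHomography with RANSAC) to align the overlapping regions before fusion.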
To eliminate these unnatural seams, the overlapping region of the registered images is fused by weighted averaging, as given in Equation (4):

$$X(a,b) = \begin{cases} x_1(a,b), & (a,b) \in x_1 \\ (1-\gamma)\,x_1(a,b) + \gamma\,x_2(a,b), & (a,b) \in x_1 \cap x_2 \\ x_2(a,b), & (a,b) \in x_2 \end{cases} \quad (4)$$

where x1 and x2 are the images to be stitched, X is the stitched image, and γ is the weighting factor, γ = wd/w ∈ (0, 1); w is the horizontal width of the overlapping part of the stitched images, and wd is the horizontal distance of a pixel in the overlapping part from the start of the overlapping section.

The final complete tunnel work face fusion image is compared with the full tunnel work face captured image in Figure 8. Stitched and fused images significantly improve quality and restore geological structure information, laying a solid foundation for subsequent joint extraction (see Figure 8).
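A minimal NumPy sketch of the linear-ramp fusion in Equation (4), assuming the two partition images are already registered so that their overlap occupies the last columns of the left image and the first columns of the right image; the function name and overlap width are illustrative.

```python
import numpy as np

def fuse_overlap(x1: np.ndarray, x2: np.ndarray, overlap: int) -> np.ndarray:
    """Blend two horizontally adjacent, registered images over an `overlap`-pixel strip.

    Implements Equation (4): gamma = wd / w rises linearly from 0 to 1 across the
    overlapping section, so the output transitions smoothly from x1 to x2.
    """
    h, w1 = x1.shape[:2]
    w2 = x2.shape[1]
    out = np.zeros((h, w1 + w2 - overlap, *x1.shape[2:]), dtype=np.float64)

    out[:, : w1 - overlap] = x1[:, : w1 - overlap]        # pixels belonging only to x1
    out[:, w1:] = x2[:, overlap:]                         # pixels belonging only to x2

    gamma = np.linspace(0.0, 1.0, overlap).reshape(1, overlap)
    if x1.ndim == 3:                                      # add a channel axis for color images
        gamma = gamma[..., None]
    out[:, w1 - overlap : w1] = (1.0 - gamma) * x1[:, w1 - overlap :] + gamma * x2[:, :overlap]
    return out.astype(x1.dtype)
```

For two 512-pixel-wide strips with a 60-pixel overlap, fuse_overlap(left, right, 60) returns a 964-pixel-wide image without the hard seam shown in Figure 7.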
3. Joint Extraction from Tunnel Face Based on Traditional Image Processing Methods

This section employs traditional computer image processing methods for joint extraction from tunnel face images. The main process includes grayscale processing, spatial filtering, image binarization, morphological processing, noise removal, and finally, outputting the joint extraction image of the tunnel face.

The following demonstrates the image processing procedure using the complete tunnel face image obtained through stitching and fusion in Section 2.2, as shown in Figure 8b.

3.1. Grayscale Processing

Grayscale processing reduces image dimensions, facilitating feature extraction by converting RGB images to grayscale using Equation (5). The result is shown in Figure 9.

Gray = (R + G + B)/3    (5)

where Gray is the calculated grayscale value of the pixel, R is the red component value of the pixel, G is the green component value of the pixel, and B is the blue component value of the pixel.

3.2. Spatial Filtering

Spatial filtering is performed with a bilateral filter, in which ω(x, y, k, l) is the weighting coefficient for the neighboring pixel (k, l) centered at the point (x, y). This coefficient is determined by the product of the spatial kernel and the range kernel, with the expression given by Equation (7):

$$\omega(x,y,k,l) = \exp\!\left(-\frac{(x-k)^2 + (y-l)^2}{2\sigma_d^2} - \frac{\lVert f(x,y) - f(k,l)\rVert^2}{2\sigma_r^2}\right) \quad (7)$$

where σd is the filter radius of the spatial domain kernel, and σr is the filter radius of the range domain kernel.

The effect of the tunnel face image after bilateral filtering is shown in Figure 10.

Figure 10. Bilateral filtering effect on tunnel face image.
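A minimal OpenCV sketch of the grayscale conversion in Equation (5) and the bilateral filtering of Equation (7); the kernel diameter and the two sigma values are illustrative assumptions, not parameters reported in the paper.

```python
import cv2
import numpy as np

face = cv2.imread("tunnel_face_fused.jpg")        # stitched tunnel face image (placeholder name)

# Equation (5): equal-weight average of the three color channels.
gray = np.mean(face.astype(np.float32), axis=2).astype(np.uint8)

# Equation (7): the bilateral filter weights each neighbor by a spatial Gaussian (sigma_d)
# and a range Gaussian on intensity differences (sigma_r), smoothing noise while
# preserving joint edges.
smoothed = cv2.bilateralFilter(gray, d=9, sigmaColor=50, sigmaSpace=7)

cv2.imwrite("tunnel_face_bilateral.jpg", smoothed)
```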
Figure 13. Schematic diagram of morphological processing.

After applying morphological processing to the joints with breakpoints shown in Figure 12a, the result is as shown in Figure 12b. It can be seen that morphological processing connects pixels in joints by applying dilation and erosion operations, effectively addressing disconnected points due to lighting or filling materials.

3.5. Noise Removal

As shown in Figure 14a, after morphological processing of the joints, a large number of noise pixels and significant pixel interference from the surrounding rock still exist in the image. Noise removal is required to address these issues.
Figure 14. Image noise removal process: (a) image with noise points; (b) surrounding rock area removed; (c) small noise points removed; (d) non-joint areas removed and contours added.

Noise removal involves eliminating large surrounding rock areas (as shown in Figure 14b), small noise points (as shown in Figure 14c), and non-joint areas (as shown in Figure 15) through region-growing algorithms and geometric shape analysis.

Figure 15. Comparison of non-joint and joint areas.

After removing the non-joint areas and importing the tunnel contour curve, the final recorded structure of the tunnel face is obtained, as shown in Figure 14d.
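A minimal sketch of the post-processing described above, assuming a binarized joint image as input: morphological closing (dilation followed by erosion) reconnects broken joint pixels, and connected-component analysis with a simple elongation rule stands in for the region-growing and geometric shape analysis used to discard noise and surrounding-rock regions. The structuring-element size, area limits, and elongation threshold are illustrative assumptions.

```python
import cv2
import numpy as np

binary = cv2.imread("joints_binary.png", cv2.IMREAD_GRAYSCALE)   # binarized joint image (placeholder)

# Closing bridges breakpoints caused by lighting or filling materials.
kernel = cv2.getStructuringElement(cv2.MORPH_RECT, (5, 5))
closed = cv2.morphologyEx(binary, cv2.MORPH_CLOSE, kernel)

# Label connected regions and keep only elongated, joint-like ones.
num, labels, stats, _ = cv2.connectedComponentsWithStats(closed, connectivity=8)
cleaned = np.zeros_like(closed)
for i in range(1, num):                                   # label 0 is the background
    x, y, w, h, area = stats[i]
    elongation = max(w, h) / max(1, min(w, h))
    if 50 <= area <= 50000 and elongation > 3:            # drop small noise and blocky rock areas
        cleaned[labels == i] = 255

cv2.imwrite("joints_cleaned.png", cleaned)
```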
4. Joint Extraction on Tunnel Faces Based on Image Segmentation Neural Network Models

As can be seen from the tunnel face structure catalog obtained in Section 3, traditional image processing methods for extracting joints are generally ineffective, involve substantial manual intervention, have a complex processing workflow, and result in some loss of joint information. This makes it difficult to meet the requirements for quick and accurate identification of joints on mountain tunnel engineering faces. To address this, recent image segmentation algorithms have been introduced to achieve more intelligent and accurate extraction of face joints. In this section, based on digital image samples of the face obtained through onsite shooting and stitching, the U-Net convolutional neural network algorithm and the Mask R-CNN convolutional neural network algorithm are employed for learning and recognition extraction. The extraction results are then analyzed and compared.

4.1. Data Collection, Annotation, and Augmentation

4.1.1. Onsite Data Collection

In image recognition, the dataset is the foundation for training and evaluation, and selecting an appropriate dataset is crucial for the algorithm's performance and accuracy. The onsite tunnel face image collection was carried out as described in Section 2.1, and the collected partitioned digital images were stitched and fused using the algorithm described in Section 2.2. The onsite collection resulted in 1,716 partitioned photographs, which were stitched into 286 complete images.

4.1.2. Data Annotation

Data annotation with the interactive segmentation annotation software EISeg 1.1.1 (Efficient Interactive Segmentation 1.1.1) (shown in Figure 16) is crucial for accurately marking joint areas and facilitating precise model training. The effect of segmentation is shown in Figure 17.

Figure 16. Main interface view of EISeg annotation software.
Figure 18. Dataset augmentation operations: (a) original image; (b) left–right flip; (c) up–down flip; (d) rotation; (e) translation. (The orange line is added later to indicate the orientation of the picture.)

4.2. Joint Extraction of Tunnel Face Based on U-Net Deep Learning Architecture

4.2.1. U-Net Convolutional Neural Network Architecture

The U-Net convolutional neural network was proposed in 2015 [21] and has achieved good results in the field of medical image cell segmentation. The U-Net network, suitable for joint extraction, classifies all pixels in an image. Its convolutional neural network structure is shown in Figure 19.

Figure 19. U-Net convolutional neural network architecture.

4.2.2. U-Net Convolutional Neural Network Parameter Selection

The input image size of this convolutional neural network is 512 × 512. After four down-sampling and four up-sampling processes, the output image size remains 512 × 512, the same as the input. The U-Net uses a 3 × 3 convolution kernel, the ReLU activation function, and 2 × 2 max pooling for down-sampling.

(1) Convolution layer parameter selection

The convolution kernel size is 3 × 3, and its convolution processing principle is shown in Figure 20. The sliding step of the convolution kernel is 1. To ensure that the image size after convolution remains consistent with the original image, the original image needs to be padded with a value of 0. The number of output image channels depends on the number of convolution kernels in the convolution layer.

(2) Activation function selection

Figure 21. ReLU function graph.

The ReLU function is chosen for several reasons. Firstly, it increases the non-linearity of the network, which is essential for learning complex patterns. Secondly, it improves computational speed due to its simple mathematical operation. Lastly, unlike the sigmoid function, ReLU does not suffer from the vanishing gradient problem. The vanishing gradient problem occurs when gradients used for updating neural network weights diminish, making training ineffective. ReLU avoids this issue by allowing gradients to flow through the network without significant diminishment, making it particularly suitable for large-scale convolution operations. This justification highlights why ReLU is preferred in deep learning applications, particularly in convolutional neural networks (CNNs).

(3) Pooling method selection

Pooling is a down-sampling method that can reduce the image size and help prevent overfitting. There are two main types of pooling: max pooling and average pooling. The principle of max pooling is shown in Figure 22, where the maximum pixel value within the neighborhood is taken as the center pixel value.

Figure 22. Max pooling diagram.

The principle of average pooling is shown in Figure 23, where the average pixel value within the neighborhood is taken as the center pixel value.

Figure 23. Average pooling diagram.

This paper introduces the U-Net network to identify the structural information of the tunnel face. To maximize the distinction between structural information and background information, a 2 × 2 max pooling method is used for image down-sampling.

4.2.3. Analysis of U-Net Convolutional Neural Network for Tunnel Face Joint Extraction

The 8580 sample images were divided into training, validation, and test sets in a 60%, 20%, and 20% ratio. The preprocessed dataset was input into the U-Net convolutional neural network, written in PyTorch using Python 3.7, for training. By calculating the loss and accuracy, the network parameters were iteratively updated to minimize the loss on the validation set. Once the minimum loss value stabilizes, the model converges.

Under this U-Net convolutional neural network structure, the changes in the loss function and accuracy for the training and validation sets are shown in Figures 24 and 25, respectively.

Figure 24. Changes in loss values for training and validation sets.

Figure 25. Changes in accuracy for training and validation sets.

As shown in Figures 24 and 25, the U-Net achieved an accuracy of 82.2% on the training set and 82.6% on the validation set, stabilizing at epoch 29.

The trained U-Net convolutional neural network was used to test the test set. A comparison of randomly selected predicted images and their corresponding labeled images is shown in Figure 26.

Figure 26. Comparison of U-Net prediction results: (a) original image; (b) labeled image; (c) U-Net predicted image.
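A minimal PyTorch sketch of the U-Net configuration described in Section 4.2.2 (3 × 3 convolutions with zero padding, ReLU activations, 2 × 2 max pooling, and matching up-sampling) together with one pixel-wise training step; only a single encoder/decoder level is shown, and the channel widths, loss, and optimizer are illustrative assumptions rather than the authors' exact training code.

```python
import torch
import torch.nn as nn

def double_conv(cin, cout):
    # Two 3x3 convolutions with stride 1 and zero padding keep the spatial size unchanged.
    return nn.Sequential(
        nn.Conv2d(cin, cout, kernel_size=3, padding=1), nn.ReLU(inplace=True),
        nn.Conv2d(cout, cout, kernel_size=3, padding=1), nn.ReLU(inplace=True),
    )

class TinyUNet(nn.Module):
    """One-level U-Net: encoder, 2x2 max-pooled bottleneck, up-sampling, skip connection."""
    def __init__(self):
        super().__init__()
        self.enc = double_conv(3, 32)
        self.pool = nn.MaxPool2d(2)                      # 2x2 max pooling for down-sampling
        self.mid = double_conv(32, 64)
        self.up = nn.ConvTranspose2d(64, 32, kernel_size=2, stride=2)
        self.dec = double_conv(64, 32)
        self.head = nn.Conv2d(32, 1, kernel_size=1)      # per-pixel joint/background logit

    def forward(self, x):
        e = self.enc(x)                                  # 512x512 feature maps
        m = self.mid(self.pool(e))                       # 256x256 feature maps
        u = self.up(m)                                   # back to 512x512
        return self.head(self.dec(torch.cat([u, e], dim=1)))

model = TinyUNet()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
criterion = nn.BCEWithLogitsLoss()                       # pixel-by-pixel classification loss

images = torch.rand(2, 3, 512, 512)                      # stand-in batch of face images
masks = torch.randint(0, 2, (2, 1, 512, 512)).float()    # stand-in binary joint labels

optimizer.zero_grad()
loss = criterion(model(images), masks)
loss.backward()
optimizer.step()
print(f"training loss: {loss.item():.4f}")
```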
As seen in Figure 26, despite good overall segmentation, the U-Net struggled with 'rough edges' and smaller targets.

Comparing the predicted groups, it is evident that when the segmented target occupies a larger proportion of the total image, the overall segmentation effect is better. However, in the fourth and fifth groups, where the segmented target occupies a smaller proportion of the total image, non-target regions appear after segmentation. This issue is related to the principle of U-Net, which calculates classification loss pixel by pixel. When the target segmentation object occupies a small portion of the entire image, the iterative loss value can easily drop very low, making it difficult for the target to be fully segmented.

To address the shortcomings of semantic segmentation methods such as the U-Net neural network, the author uses an instance segmentation algorithm that combines object detection and semantic segmentation, Mask R-CNN, to extract joints. This approach allows for precise segmentation of object edges based on bounding boxes from object detection, achieving more accurate segmentation results.
4.3. Joint Extraction of Tunnel Face Based on Mask R-CNN Deep Learning Architecture

4.3.1. Mask R-CNN Convolutional Neural Network Architecture

The Mask R-CNN convolutional neural network [26] was proposed by He et al. in 2017. It adds an FCN (Fully Convolutional Network) structure to the Faster R-CNN network, achieving precise segmentation while detecting objects. Its network architecture is shown in Figure 27.

Figure 27. Mask R-CNN network architecture.

As shown in Figure 27, the Mask R-CNN combines object detection and instance segmentation, using ResNet and FPN for feature extraction and ROIAlign for accurate pooling.

4.3.2. Mask R-CNN Convolutional Neural Network Parameter Selection

The input image size for this convolutional neural network is 512 × 512. After feature map generation, region proposal, and region extraction, the final output is an image of size 512 × 512 with a mask overlay, class labels, and target region positions. This study uses ResNet101 + FPN for the backbone and ROIAlign for pooling, enhancing small object recognition accuracy.
(1) Backbone architecture parameter selection
The backbone architecture of the Mask R-CNN convolutional neural network consists
of ResNet + FPN. The commonly used configurations are ResNet50 + FPN and ResNet101
+ FPN. The network structures of ResNet50 and ResNet101 are compared in Table 1.
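A minimal torchvision sketch of a Mask R-CNN built on a ResNet101 + FPN backbone with two classes (background and joint); the builder functions shown are torchvision's generic ones, and the pretrained flag (named weights= in newer torchvision releases), as well as the toy target, are illustrative assumptions rather than the authors' exact configuration.

```python
import torch
from torchvision.models.detection import MaskRCNN
from torchvision.models.detection.backbone_utils import resnet_fpn_backbone

# ResNet101 + FPN backbone; the Mask R-CNN heads pool region features with ROIAlign.
backbone = resnet_fpn_backbone("resnet101", pretrained=True)
model = MaskRCNN(backbone, num_classes=2)                   # background + joint ("fissure")

# In training mode the model returns a loss dictionary whose classification,
# bounding-box, and mask terms correspond to the components summed in Equation (11).
model.train()
images = [torch.rand(3, 512, 512)]
targets = [{
    "boxes": torch.tensor([[50.0, 60.0, 200.0, 180.0]]),    # one illustrative joint box
    "labels": torch.tensor([1]),
    "masks": torch.zeros(1, 512, 512, dtype=torch.uint8),   # illustrative (empty) mask
}]
losses = model(images, targets)
total_loss = sum(losses.values())
print(sorted(losses.keys()))
```

At inference time, calling model.eval() and passing only images returns, for each image, the predicted boxes, class labels, confidence scores, and soft masks, matching the detection-plus-mask output described above.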
Figure 28. Bilinear interpolation effect.

4.3.3. Analysis of Mask R-CNN Convolutional Neural Network for Tunnel Face Joint Extraction

In this experiment, the dataset is the same as in Section 4.1.3. The 8580 sample images of size 512 × 512 are divided into training and test sets in an 80% to 20% ratio. The preprocessed dataset is input into the Mask R-CNN convolutional neural network for training. The initial learning rate is set to 0.001, and the maximum number of iterations (max epochs) is set to 200. The classification loss (loss_cls), localization loss (loss_bbox), segmentation loss (loss_mask), and total loss are calculated, as shown in Equation (11):

loss = loss_cls + loss_bbox + loss_mask    (11)

When the loss function reaches its minimum value and stabilizes, the model converges. The changes in the loss functions are shown in Figure 29.

Figure 29. Changes in loss values.

As shown in Figure 29, the Mask R-CNN achieved stable loss values at epoch 35, with localization loss lower than classification and segmentation losses. The trained Mask R-CNN convolutional neural network is then used to test the test set. The comparison of the five groups of predicted images with their corresponding labeled images, as in Section 4.1.3, is shown in Figure 30.
(a) Original Image (b) Labeled Image (c) Object Detection (d) Object Mask
Figure 30. Comparison of Mask R-CNN prediction results. (The red boxes in subfigure (c) are the identified joints).
As shown in Figure 30, after classification, bounding box selection, and mask calcula-
tion, the Mask R-CNN network prediction results achieve good joint segmentation effects
compared to the annotated results of the original images. Additionally, comparing the
prediction results of the U-Net network shows that the joint segmentation effect is not
affected by the proportion of the segmentation target in the image. Both the overall and
local details are accurately segmented.
4.4. Comparison of Tunnel Face Joint Recognition Effect and Acquisition of Joint
Morphology Parameters
Figure 31 presents the prediction results of five test sample images after traditional
image processing, the U-Net convolutional neural network, and the Mask R-CNN convolu-
tional neural network.
(a) Original Image (b) Annotated image (c) Image Processing (d) U-Net (e) Mask R-CNN
Figure 31. Comparison of prediction results.
From Figure 31, it is evident that overall, all three image segmentation methods achieve certain joint segmentation effects. Specifically, the Mask R-CNN convolutional neural network demonstrates the best segmentation results, followed by the U-Net convolutional neural network, and traditional image processing shows the least effective results. To further quantitatively compare the segmentation effectiveness of these three methods, appropriate metrics will be selected for subsequent comparative analysis.
(1) Dice similarity coefficient
Dice = 2TP/(2TP + FN + FP) (12)
where TP (true positives) denotes samples predicted as positive that are actually positive; FN (false negatives) denotes samples predicted as negative that are actually positive; and FP (false positives) denotes samples predicted as positive that are actually negative.
(2) Precision
Precision represents the proportion of predicted positive samples that are actually positive, calculated as shown in Equation (13).
Precision = TP/(TP + FP) (13)
(3) Recall
Recall represents the proportion of actual positive samples that are predicted correctly, calculated as shown in Equation (14).
Recall = TP/(TP + FN) (14)
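To make Equations (12)–(14) concrete, the short NumPy sketch below computes the three metrics from a pair of binary masks; the toy 4 × 4 masks are illustrative only and are not data from this study.

# Hypothetical metric computation for binary segmentation masks (not the authors' code).
import numpy as np

def segmentation_metrics(pred, gt):
    """Dice, Precision and Recall for binary masks, following Equations (12)-(14)."""
    pred = pred.astype(bool)
    gt = gt.astype(bool)
    tp = np.logical_and(pred, gt).sum()    # predicted joint pixels that are joints
    fp = np.logical_and(pred, ~gt).sum()   # predicted joint pixels that are background
    fn = np.logical_and(~pred, gt).sum()   # joint pixels that were missed
    return {
        "Dice": 2 * tp / (2 * tp + fn + fp),
        "Precision": tp / (tp + fp),
        "Recall": tp / (tp + fn),
    }

# Toy 4 x 4 example: Dice ~ 0.667, Precision = 1.0, Recall = 0.5.
gt = np.array([[0, 1, 1, 0]] * 4)
pred = np.array([[0, 1, 0, 0]] * 4)
print(segmentation_metrics(pred, gt))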
Table 2. Comparison of joint segmentation effects. (The bolded portion is the highest value.)

Method                         Dice (%)   Precision (%)   Recall (%)
Traditional image processing   60.59      57.31           70.03
U-Net                          75.36      70.85           85.58
Mask R-CNN                     87.48      89.74           84.73
As shown in Table 2, the Dice similarity coefficient, Precision, and Recall of traditional
image processing are 60.59%, 57.31%, and 70.03%, respectively. For the U-Net network, the
Dice similarity coefficient, Precision, and Recall are 75.36%, 70.85%, and 85.58%, respectively.
For the Mask R-CNN network, these values are 87.48%, 89.74%, and 84.73%, respectively.
Among the three metrics, the Dice similarity coefficient most accurately reflects the
true effect of target segmentation. Although the Recall value of U-Net is slightly higher
than that of Mask R-CNN, its Precision is significantly lower, indicating that the U-Net
network's segmentation results are rougher and contain more non-target pixels. Comprehensive comparison and analysis show that the Mask R-CNN network has the best segmentation effect on the tunnel face joints.
5. Conclusions
Based on the digital images of the tunnel face obtained through sectional shooting,
this paper obtained complete and clear images of the tunnel face through image stitching
and fusion algorithms. Then, the tunnel face joint information was extracted using three
methods: traditional image processing, U-Net convolutional neural network, and Mask
R-CNN convolutional neural network. The extraction effects were compared, and the main
conclusions are as follows:
(1) Using the SURF algorithm and weighted fusion, sectional images of the tunnel face
were stitched into complete, high-clarity images suitable for deep learning algorithms.
(2) Traditional image processing methods, including grayscale processing, spatial filter-
ing, binarization, morphological processing, and noise removal, produced suboptimal
results with a Dice similarity coefficient of 60.59%. These methods are inefficient,
involve significant manual intervention, and lose joint information, making them
unsuitable for tunnel engineering applications.
(3) The U-Net convolutional neural network achieved relatively good segmentation
results with a Dice similarity coefficient of 75.36%. However, it lacked precision and
lost target details, indicating room for improvement.
(4) The Mask R-CNN model excelled in both overall and detailed segmentation, achieving
a Dice similarity coefficient of 87.48%. This model demonstrated efficient and accurate
extraction of tunnel face joints, outperforming traditional and U-Net methods.
Author Contributions: Data curation, Y.L.; formal analysis, H.Q.; funding acquisition, X.Y.; investiga-
tion, H.Q.; methodology, H.Q.; project administration, X.Y.; resources, Y.L.; software, H.Q. and Z.L.;
supervision, J.Z.; validation, H.Q.; visualization, Z.G.; writing—original draft, H.Q.; writing—review
& editing, J.Z. All authors have read and agreed to the published version of the manuscript.
Funding: This research received no external funding.
Institutional Review Board Statement: This study did not require ethical approval.
Informed Consent Statement: Informed consent was obtained from all subjects involved in the study.
Data Availability Statement: The data presented in this study are available on request from the
corresponding author.
Conflicts of Interest: The authors declare no conflict of interest.
References
1. Ross-Brown, D.M.; Atkinson, K. Terrestrial photogrammetry in open-pits: 1-description and use of the Phototheodolite in mine
surveying. Inst. Min. Metall. 1972, 81, 7–11.
2. Huang, S.L.; Speck, R.C. Digital image processing for rock joint surface studies. Photogramm. Eng. Remote Sens. 1988, 54, 395–400.
3. Krishnan, R.; Sommer, H.J. Estimation of Rock Face Stability; The Pennsylvania State University: University Park, PA, USA, 1994.
4. Fitton, N.; Cox, S. Optimising the application of the Hough transform for automatic feature extraction from geoscientific images.
Comput. Geosci. 1998, 24, 933–951. [CrossRef]
5. Reid, T.R.; Harrison, J.P. A semi-automated methodology for discontinuity trace detection in digital images of rock mass exposures.
Int. J. Rock Mech. Min. Sci. 2000, 37, 1–5. [CrossRef]
6. Holden, E.-J.; Dentith, M.; Kovesi, P. Towards the automated analysis of regional aeromagnetic data to identify regions prospective
for gold deposits. Comput. Geosci. 2008, 34, 1505–1513. [CrossRef]
7. Liu, C.; Wang, B.; Shi, B.; Tang, C. Analytic method of morphological parameters of cracks for rock and soil based on image
processing and recognition. Chin. J. Geotech. Eng. 2008, 30, 1383–1388.
8. Chen, B.; Wang, Y.; Wang, H.; Zhu, C.; Fu, J. Identification of tunnel surrounding rock joint and fracture based on SLIC super
pixel segmentation and combination. J. Highw. Transp. Res. Dev. 2022, 39, 139–146.
9. Jung, S.Y.; Lee, S.K.; Park, C.I.; Cho, S.Y.; Yu, J.H. A method for detecting concrete cracks using deep-learning and image
processing. J. Archit. Inst. Korea Struct. Constr. 2019, 35, 163–170.
10. Bhowmick, S.; Nagarajaiah, S.; Veeraraghavan, A. Vision and deep learning-based algorithms to detect and quantify cracks on
concrete surfaces from UAV videos. Sensors 2020, 20, 6299. [CrossRef] [PubMed]
11. Yu, Y.; Rashidi, M.; Samali, B.; Yousefi, A.M.; Wang, W. Multi-image-feature-based hierarchical concrete crack identification
framework using optimized SVM multi-classifiers and D-S fusion algorithm for bridge structures. Remote Sens. 2021, 13, 240.
[CrossRef]
12. Zhao, S.; Zhang, D.; Xue, Y.; Zhou, M.; Huang, H. A deep learning-based approach for refined crack evaluation from shield
tunnel lining images. Autom. Constr. 2021, 132, 103934. [CrossRef]
13. Dang, L.M.; Wang, H.; Li, Y.; Park, Y.; Oh, C.; Nguyen, T.N.; Moon, H. Automatic tunnel lining crack evaluation and measurement
using deep learning. Tunn. Undergr. Space Technol. 2022, 124, 104472. [CrossRef]
14. Zhou, Z.; Zhang, J.; Gong, C. Hybrid semantic segmentation for tunnel lining cracks based on Swin Transformer and convolutional
neural network. Comput.-Aided Civ. Infrastruct. Eng. 2023, 38, 2491–2510. [CrossRef]
15. Song, F.; Liu, B.; Yuan, G.X. Pixel-level crack identification for bridge concrete structures using unmanned aerial vehicle
photography and deep learning. Struct. Control. Health Monit. 2024, 2024, 1299095. [CrossRef]
16. Wang, F.; Chen, T.; Gai, M. A dual-tree-complex wavelet transform-based infrared and visible image fusion technique and its
application in tunnel crack detection. Appl. Sci. 2024, 14, 114. [CrossRef]
17. Liu, H.X.; Li, W.S.; Zha, Z.Y.; Jiang, W.J.; Xu, T. Method for surrounding rock mass classification of highway tunnels based on
deep learning technology. Chin. J. Geotech. Eng. 2018, 40, 1809–1817.
18. Chen, J.; Zhou, M.; Huang, H.; Zhang, D.; Peng, Z. Automated extraction and evaluation of fracture trace maps from rock tunnel
face images via deep learning. Int. J. Rock Mech. Min. Sci. 2021, 142, 104745. [CrossRef]
19. Lee, Y.-K.; Kim, J.; Choi, C.-S.; Song, J.-J. Semi-automatic calculation of joint trace length from digital images based on deep
learning and data structuring techniques. Int. J. Rock Mech. Min. Sci. 2022, 149, 104981. [CrossRef]
20. Peng, L.; Wang, H.; Zhou, C.; Hu, F.; Tian, X.; Hongtai, Z. Research on intelligent detection and segmentation of rock joints based
on deep learning. Adv. Civ. Eng. 2024, 2024, 8810092. [CrossRef]
21. Ronneberger, O.; Fischer, P.; Brox, T. U-Net: Convolutional networks for biomedical image segmentation. In Proceedings of the
18th International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), Munich, Germany, 5–9 October 2015.
22. Li, G.; Ma, B.; He, S.; Ren, X.; Liu, Q. Automatic tunnel crack detection based on U-Net and a convolutional neural network with
alternately updated clique. Sensors 2020, 20, 717. [CrossRef]
23. Chang, H.; Rao, Z.; Zhao, Y.; Li, Y. Research on tunnel crack segmentation algorithm based on improved U-Net network. Comput.
Eng. Appl. 2021, 57, 215–222.
24. Zhao, S.; Zhang, G.; Zhang, D.; Tan, D.; Huang, H. A hybrid attention deep learning network for refined segmentation of cracks
from shield tunnel lining images. J. Rock Mech. Geotech. Eng. 2023, 15, 3105–3117. [CrossRef]
25. Shi, Y.; Ballesio, M.; Johansen, K.; Trentman, D.; Huang, Y.; McCabe, M.F.; Bruhn, R.; Schuster, G. Semi-universal geo-crack
detection by machine learning. Front. Earth Sci. 2023, 11, 1073211. [CrossRef]
26. He, K.; Gkioxari, G.; Dollár, P.; Girshick, R. Mask R-CNN. In Proceedings of the 16th IEEE International Conference on Computer
Vision (ICCV), Venice, Italy, 22–29 October 2017.
27. Lin, Z.; Ji, K.F.; Leng, X.G.; Kuang, G. Squeeze and excitation rank faster R-CNN for ship detection in SAR images. IEEE Geosci.
Remote Sens. Lett. 2019, 16, 751–755. [CrossRef]
28. Yu, Y.; Zhang, K.L.; Yang, L.; Zhang, D. Fruit detection for strawberry harvesting robot in non-structural environment based on
Mask-RCNN. Comput. Electron. Agric. 2019, 163, 104846. [CrossRef]
29. Jia, W.; Tian, Y.; Luo, R.; Zhang, Z.; Lian, J.; Zheng, Y. Detection and segmentation of overlapped fruits based on optimized mask
R-CNN application in apple harvesting robot. Comput. Electron. Agric. 2020, 172, 105380. [CrossRef]
30. Hao, Z.; Lin, L.; Post, C.J.; Mikhailova, E.A.; Li, M.; Chen, Y.; Yu, K.; Liu, J. Automated tree-crown and height detection in a young
forest plantation using mask region-based convolutional neural network (Mask R-CNN). ISPRS J. Photogramm. Remote Sens. 2021,
178, 112–123. [CrossRef]
31. Xu, X.Y.; Zhao, M.; Shi, P.X.; Ren, R.; He, X.; Wei, X.; Yang, H. Crack detection and comparison study based on faster R-CNN and
mask R-CNN. Sensors 2022, 22, 1215. [CrossRef] [PubMed]
32. Qin, J.; Zhang, Y.; Zhou, H.; Yu, F.; Sun, B.; Wang, Q. Protein crystal instance segmentation based on Mask R-CNN. Crystals 2021,
11, 157. [CrossRef]
33. Bay, H.; Tuytelaars, T.; van Gool, L. SURF: Speeded up Robust Features; Springer: Berlin/Heidelberg, Germany, 2006.
34. Otsu, N. Threshold selection method from gray-level histograms. IEEE Trans. Syst. Man Cybern. 1979, 9, 62–66. [CrossRef]
35. Girshick, R.; Donahue, J.; Darrell, T.; Malik, J. Rich feature hierarchies for accurate object detection and semantic segmentation. In
Proceedings of the 27th IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Columbus, OH, USA, 23–28
June 2014.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual
author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to
people or property resulting from any ideas, methods, instructions or products referred to in the content.