0% found this document useful (0 votes)
27 views11 pages

FaultSeg3D - Using Synthetic Data Sets To Train An End-To-End Convolutional Neural Network For 3D Seismic Fault Segmentation

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
27 views11 pages

FaultSeg3D - Using Synthetic Data Sets To Train An End-To-End Convolutional Neural Network For 3D Seismic Fault Segmentation

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 11

GEOPHYSICS, VOL. 84, NO. 3 (MAY-JUNE 2019); P. IM35–IM45, 11 FIGS.

10.1190/GEO2018-0646.1
Downloaded 04/24/19 to 128.83.63.20. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/

FaultSeg3D: Using synthetic data sets to train an end-to-end convolutional


neural network for 3D seismic fault segmentation

Xinming Wu1, Luming Liang2, Yunzhi Shi3, and Sergey Fomel3

ABSTRACT which are shown to be sufficient to train a good fault segmentation


network. Because a binary fault image is highly imbalanced be-
Delineating faults from seismic images is a key step for seismic tween zeros (nonfault) and ones (fault), we use a class-balanced
structural interpretation, reservoir characterization, and well place- binary cross-entropy loss function to adjust the imbalance so that
ment. In conventional methods, faults are considered as seismic the network is not trained or converged to predict only zeros. After
reflection discontinuities and are detected by calculating attributes training with only the synthetic data sets, the network automati-
that estimate reflection continuities or discontinuities. We con- cally learns to calculate rich and proper features that are important
sider fault detection as a binary image segmentation problem of for fault detection. Multiple field examples indicate that the neural
labeling a 3D seismic image with ones on faults and zeros else- network (trained by only synthetic data sets) can predict faults
where. We have performed an efficient image-to-image fault seg- from 3D seismic images much more accurately and efficiently
mentation using a supervised fully convolutional neural network. than conventional methods. With a TITAN Xp GPU, the training
To train the network, we automatically create 200 3D synthetic processing takes approximately 2 h and predicting faults in a
seismic images and corresponding binary fault labeling images, 128 × 128 × 128 seismic volume takes only milliseconds.

INTRODUCTION in computing seismic coherence to enhance faults while suppressing


the stratigraphic features. Similarly, some other authors (Bakker, 2002;
Faults are typically recognized as lateral reflection discontinuities Hale, 2009; Wu, 2017) apply smoothing in directions perpendicular to
in a 3D seismic image. Based on this observation, numerous meth- seismic reflections in computing coherence or semblance by assuming
ods have been proposed to detect faults by calculating attributes of that faults are typically normal to reflections. However, faults are sel-
measuring seismic reflection continuity such as semblance (Marfurt dom vertical or are not necessarily perpendicular to seismic reflec-
et al., 1998) and coherency (Marfurt et al., 1999; Li and Lu, 2014; tions. Therefore, some authors (Hale, 2013; Wu and Hale, 2016)
Wu, 2017), or reflection discontinuity such as variance (Van Bem- propose smoothing the numerator and denominator of the semblance
mel and Pepper, 2000; Randen et al., 2001) and gradient magnitude along fault strikes and dips to compute the fault-oriented semblance or
(Aqrawi and Boe, 2011). These seismic attributes, however, can be fault likelihood. However, calculating fault-oriented semblance is
sensitive to noise and stratigraphic features, which also correspond computationally more expensive than the previous attributes because
to reflection discontinuities in a seismic image. This means that it requires scanning over all possible combinations of fault strikes and
measuring seismic reflection continuity or discontinuity alone is in- dips to find the maximum fault likelihoods.
sufficient to detect faults (Hale, 2013). Some fault detection methods start with some initial fault attrib-
Faults are typically more vertically aligned, whereas stratigraphic utes and further enhance them by smoothing the attributes along
features mostly extend laterally. Based on this observation, Gerszten- fault strikes and dips (Neff et al., 2000; Cohen et al., 2006; Wu
korn and Marfurt (1999) suggest using vertically elongated windows and Zhu, 2017). These methods also need to smooth the fault

Manuscript received by the Editor 10 September 2018; revised manuscript received 21 December 2018; published ahead of production 12 February 2019;
published online 16 April 2019.
1
University of Science and Technology of China, Laboratory of Seismology and Physics of Earth’s Interior, School of Earth and Space Sciences, Hefei, China
and The University of Texas at Austin, Bureau of Economic Geology, Austin, Texas, USA. E-mail: [email protected].
2
Microsoft Applied Science Group, Redmond, Washington 98052, USA. E-mail: [email protected].
3
The University of Texas at Austin, Bureau of Economic Geology, Austin, Texas, USA. E-mail: [email protected]; [email protected].
© 2019 Society of Exploration Geophysicists. All rights reserved.

IM35
IM36 Wu et al.

attributes over all possible combinations of fault strikes and dips to binary fault segmentation. We simplify the original U-Net by reduc-
obtain the best enhanced fault features. Similarly, some authors ing the number of convolutional layers and features at each layer,
(Pedersen et al., 2002, 2003) propose to enhance fault features which significantly saves graphics processing unit (GPU) memory
along paths of “artificial ants” by assuming that the paths follow and computational time but still preserves high performance in our
faults. Wu and Fomel (2018) propose an efficient method to extract 3D fault detection tasks. Considering a fault binary image is highly
optimal surfaces following maximum fault attributes and use these biased with mostly zeros but only very limited ones on the faults,
Downloaded 04/24/19 to 128.83.63.20. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/

optimal surfaces to vote for enhanced fault images of fault proba- we use a balanced cross-entropy loss function for optimizing the
bilities, strikes, and dips. parameters of our CNN model. To train and validate the neural net-
Recently, some convolutional-neural-network (CNN) methods work, we design a workflow to automatically generate 3D synthetic
have been introduced to detect faults by pixel-wise fault classifica- seismic and corresponding fault images. In this workflow, the seis-
tion (fault or nonfault) with multiple seismic attributes (Huang et al., mic folding and faulting structures, wavelet peak frequencies, and
2017; Di et al., 2018; Guitton, 2018; Guo et al., 2018; Zhao and noise are defined by a set of parameters, and each parameter can be
Mukhopadhyay, 2018). Wu et al. (2018) use a CNN-based pixel- chosen from some predefined range. By randomly choosing a com-
wise classification method to not only predict the fault probability bination of these parameters within the predefined ranges, we are
but also estimate the fault orientations at the same time. These meth- able to generate numerous unique seismic images and correspond-
ods need to choose a local window or cube to make fault prediction ing fault labeling images. We train and validate, respectively, by
at every image pixel, which is computationally highly expensive, using 200 and 20 pairs of synthetic seismic and fault images, which
especially in 3D fault detection. In this paper, we consider the fault turned to be sufficient to train a good CNN model for our fault
detection as a more efficient end-to-end binary image segmentation detection tasks. Although trained by using only synthetic seismic data
problem by using CNNs. Image segmentation has been well-studied sets, our CNN model can work much better and more efficiently than
in computer science, and multiple powerful CNN architectures the conventional methods for 3D fault detection in field seismic data
(e.g., Girshick et al., 2014; Ren et al., 2015; Ronneberger et al., sets that are recorded at totally different surveys. By using a TITAN
2015; Xie and Tu, 2015; Badrinarayanan et al., 2017; He et al., Xp GUP, our CNN model takes less than 5 min to predict faults in a
2017) have been proposed to obtain superior segmentation results. large seismic volume with 450 × 1950 × 1200 samples.
In this paper, we use an efficient end-to-end CNN, simplified
from U-Net (Ronneberger et al., 2015), to perform our task of 3D TRAINING DATA SETS
Training and validating a CNN model often
requires a large amount of images and correspond-
ing labels. Manually labeling or interpreting faults
in a 3D seismic image could be extremely time
consuming and highly subjective. In addition,
inaccurate manual interpretation, including misla-
beled and unlabeled faults, may mislead the learn-
ing process. To avoid these problems, we propose
an effective and efficient way to create synthetic
seismic images and corresponding fault labels for
training and validating our CNN model.

Synthetic seismic and fault images


The workflow that we use to create synthetic
seismic and fault images is similar to the one
used for generating synthetic data sets by Wu
and Hale (2016). In this workflow, we first gen-
erate a 1D horizontal reflectivity model rðx; y; zÞ
(Figure 1a) with a sequence of random values
that are in the range of ½−1; 1. We then create
some folding structures in the reflectivity model
by vertically shearing the model. We define
the folding structures by using the following
function:

1.5z X
k¼N ðx−ck Þ2 þðy−dk Þ2
Figure 1. The workflow of creating 3D synthetic training data sets. We first generate a 2σ 2
horizontal reflectivity model (a) with a sequence of random numbers between −1 and 1. s1 ðx; y; zÞ ¼ a0 þ be k ;
We then add some folding structures (b) to the model by vertically shearing the model and zmax k¼1 k
the shearing shifts are defined by a combination of several 2D Gaussian functions. We also (1)
add some planar shearing (c) to the model to increase the complexity of folding structures.
We further add planar faulting to the model to obtain a folded and faulted reflectivity
model (d). We finally convolve the reflectivity model with a Ricker wavelet to obtain a which combines with multiple 2D Gaussian func-
synthetic seismic image (e) and add some random noise to obtain a final image (f). tions and a linear-scale function 1.5z∕zmax . The
CNN for 3D fault segmentation IM37

combination of 2D Gaussian functions yields laterally varying fold- of 3D unique seismic images and corresponding fault labeling im-
ing structures, whereas the linear-scale function damps the folding ages. We can actually generate much more unique training data sets,
vertically from below to above. In this equation, each combination but we found that 200 pairs of images are sufficient to train a pretty
of the parameters a0 ; bk ; ck ; dk , and σ k yields some specific spatially good neural network for fault segmentation. Using the same work-
varying folding structures in the model. By randomly choosing each of flow, we also automatically generated 20 pairs of seismic and fault
the parameters from the predefined ranges, we are able to create nu- labeling images for the validation.
Downloaded 04/24/19 to 128.83.63.20. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/

merous models with unique structures. With the shift map s1 ðx; y; zÞ,
we use a sinc interpolation to vertically shift the original reflectivity Data augmentation
model rðx; y; zÞ to obtain a folded model rðx; y; z þ s1 ðx; y; zÞÞ
as shown in Figure 1b. To further increase the complexity of the Creating unique training seismic and fault labeling images, as
structures in the model, we also add some planar shearing defined discussed above, is crucial to successfully train a fault segmentation
as follows: neural network. Data augmentation during the training is also help-
ful to increase the diversity of the data sets and to prevent the neural
s2 ðx; y; zÞ ¼ e0 þ fx þ gy; (2) network from learning irrelevant patterns. We apply simple data
augmentations including vertical flip and rotation around the
where the shearing shifts are laterally planar while being vertically vertical time or depth axis. To avoid interpolation and artifacts near
invariant. The parameters e0 ; f, and g, again, are randomly chosen boundaries, we rotate the seismic and fault labeling volumes by
from some predefined ranges. By sequentially applying the planar
shifts s2 ðx; y; zÞ to the previously folded model rðx; y; z þ s1
ðx; y; zÞÞ, we obtain a new reflectivity model rðx; y; z þ s1 þ s2 Þ
as shown in Figure 1c.
After obtaining a folded reflectivity model, we then add planar
faulting in the model. Although all the faults are planar, the fault
orientations (dip and strike) and displacements of the faults are
all different from each other. The fault displacements on each fault
are allowed to be spatially varying along the directions of fault
strike and dip. The common patterns of fault displacement distri-
bution have been discussed by some authors (Muraoka and Kamata,
1983; Mansfield and Cartwright, 1996; Stewart, 2001). In generat-
ing faults in our synthetic models, we define the fault displacement
distributions as a Gaussian function or linear function. In the case of
Gaussian distribution, the fault displacements decrease from the
fault center in all directions along the fault plane. In the other case
of linear distribution, the fault displacements linearly increase (nor-
mal fault) or decrease (reverse fault) in the fault dip direction along
the fault plane. The maximum fault displacement for each fault is
randomly chosen in the range between 0 and 40 samples. From our
experience, the images with more faults are more effective than
those with fewer faults to train a CNN for fault segmentation.
We therefore add more than five faults within a training image with
the size of 128 × 128 × 128. However, these faults should not be too
close to each other as shown in Figure 1d, in which we have added
six planar faults.
After creating a folded and faulted reflectivity model (Figure 1d),
we finally convolve this model with a Ricker wavelet to obtain a 3D
seismic image shown in Figure 1e. The peak frequency of the wave-
let is also randomly chosen from a predefined range. Note that we
convolve the reflectivity model with a wavelet after (not before)
creating the folding and faulting in the model because the convo-
lution will blur the sharp discontinuities near faults and therefore
make the faults look more realistic. To further improve the realism
of the synthetic seismic image, we also add some random noise to
the image as shown in Figure 1f. From this noisy image (Figure 1f),
we crop a final training seismic image (Figure 2a) with the size of
128 × 128 × 128 to avoid the artifacts near the boundaries. Figure 2b
shows the corresponding binary fault labeling image, in which the
faults are labeled by ones at two pixels adjacent to the faults from
Figure 2. (a) A final synthetic seismic image is cropped from the
the hanging wall and footwall sides. larger image in Figure 1f. (b) The corresponding true fault image
By using this workflow, we randomly choose parameters of fold- (with labeling ones on faults and zeros elsewhere) is overlaid with
ing, faulting, wavelet peak frequency, and noise to obtain 200 pairs the cropped seismic image.
IM38 Wu et al.

only three options of 90°, 180°, and 270°. Because our input tional layers followed by a ReLU activation. Different from the
seismic and fault labeling volumes are 128 × 128 × 128 cubes, original U-Net architecture, we do not include a 2 × 2 × 2 “up-con-
the flip and rotation will preserve the image size without needing volution” layer after each upsampling as in the original expansion
interpolation or extrapolation. Note that we do not want to rotate the path. The upsampling operation is implemented by using the func-
seismic and fault volumes around the inline or crossline axis be- tion UpSampling3D defined in Keras (Chollet, 2015). The final out-
cause it will yield vertical seismic structures and flat faults, which put layer is a 1 × 1 × 1 convolutional layer with a sigmoid activation
Downloaded 04/24/19 to 128.83.63.20. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/

are geologically unrealistic.

FAULT SEGMENTATION BY CNN


We consider 3D fault detection as an image segmentation prob-
lem of labeling ones on faults, whereas zeros elsewhere in a 3D
seismic image. We achieve such fault segmentation by using a sim-
plified version of U-Net, an end-to-end fully CNN, which was first
proposed by Ronneberger et al. (2015) for biomedical image seg-
mentation, and then it was widely used for many other image seg-
mentation problems. In 3D seismic images, the distribution of fault
samples and nonfault samples is typically highly imbalanced; there-
fore, we use a balanced binary cross-entropy loss to optimize the
parameters of the network as discussed by Xie and Tu (2015).

CNN architecture
We started our research on fault segmentation by using the origi-
nal U-Net architecture (Ronneberger et al., 2015), which turned out
to be more complicated than necessary for our problem of fault de-
tection. We reduce the convolutional layers and features at each
layer to save memory and computation but still preserve good per-
formance in fault detection.
The finally simplified U-Net that we use for 3D fault detection is
shown in Figure 3, in which an input 3D seismic image is fed to a
network that contains a contracting path (left side) and an expansive
path (right side) as in the original U-Net architecture. In the left
contracting path, each step contains two 3 × 3 × 3 convolutional
layers followed by a ReLU activation and a 2 × 2 × 2 max pooling
operation with stride 2 for downsampling. We double the number of
features after each step. Every step in the right expansion path
Figure 4. (a) The training and validation accuracy increase with
contains a 2 × 2 × 2 upsampling operation, a concatenation with epochs, whereas (b) the training and validation loss decrease with
features from the left contracting path, and two 3 × 3 × 3 convolu- epochs.

Figure 3. A simplified end-to-end convolutional neural network (U-Net) for 3D fault detection.
CNN for 3D fault segmentation IM39

to map each 16C feature vector to a probability value in the output 1), the first term measures the prediction errors at the image pixels
fault probability map, which has the same size as the input seismic labeled by ones, whereas the second term measures the prediction
image. This simplified U-Net architecture totally consists of 15 errors at the pixels labeled by zeros.
convolutional layers, reduced from 23 convolutional layers in the This loss function works well for binary segmentation of
original U-Net architecture. The number of features at these convo- common images, in which the distribution of zero/nonzero samples
lutional layers is also significantly reduced from the original is more or less balanced. This loss function, however, is not suitable
Downloaded 04/24/19 to 128.83.63.20. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/

architecture. to measure the errors of fault segmentation, in which more than


90% of the samples are nonfault samples (labeled by zeros),
Balanced cross-entropy loss
The following binary cross-entropy loss func-
tion is widely used in the binary segmentation of
a common image:

X
i¼N
L¼− yi logðpi Þ
i¼0

X
i¼N
− ð1 − yi Þ logð1 − pi Þ; (3)
i¼0

where N denotes the number of pixels in the in-


put 3D seismic image. The term yi represents the
true binary labels and pi represents the prediction
probabilities ð0 < pi < 1Þ computed from the Figure 6. (a) Precision-recall and (b) ROC curves are used to evaluate the eight fault
sigmoid activation in the last convolutional layer. detections on the synthetic validation volume in Figure 5. Our CNN fault detection
Because the true labels yi are binary values (0 or method performs significantly better than the other methods.

Figure 5. Fault detection on the synthetic validation volume (Figure 2a) by using (a-g) seven commonly used methods and (h) our CNN
method. Compared with the methods of (a) C3 (Gersztenkorn and Marfurt, 1999), (b) C2 (Marfurt et al., 1999), (c) planarity (Hale, 2009),
(d) structure-oriented planarity (Wu, 2017), (e) structure-oriented semblance (Hale, 2009), (f) the fault likelihood (Hale, 2013; Wu and Hale,
2016), and (g) optimal surface voting (Wu and Fomel, 2018) methods perform better fault detections. (h) Our CNN method achieves the best
performance in obtaining an accurate, clean, and complete fault detection.
IM40 Wu et al.

whereas the fault samples (labeled by ones) are very limited. If we The validation data set contains another 20 pairs of such synthetic
train the neural network using this loss function, the network can seismic and fault images, which are not used in the training data set.
easily converge to the wrong direction and make zero predictions Considering the amplitude values of different real seismic images
everywhere because zero prediction is a good solution to this loss can be much different from each other, we normalize all the training
function in the fault segmentation problem. seismic images, each image is subtracted by its mean value and
To solve this problem, we use the following balanced cross-en- divided by its standard deviation.
Downloaded 04/24/19 to 128.83.63.20. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/

tropy loss function as discussed by Xie and Tu (2015): The size of each 3D seismic or fault image is 128 × 128 × 128.
We choose this relatively small size because the memory of our
X
i¼N X
i¼N
GPU is limited to 12 GB. We would suggest to choose a larger size
L ¼ −β yi logðpi Þ − ð1 − βÞ ð1 − yi Þ logð1 − pi Þ; if the GPU memory is allowed. We feed the 3D seismic images to
i¼0 i¼0
the neural network in batches and each batch contains four images,
(4) which consist of an original image and the same image rotated
P around the vertical time/depth axis by 90°, 180°, and 270°. We did
where β ¼ i¼N i¼0 ð1 − yi Þ∕N represents the ratio between nonfault
not try a larger batch size, again, because of the GPU memory limi-
pixels and the total image pixels, whereas 1 − β denotes the ratio of
tation. We use the Adam method (Kingma and Ba, 2014) to opti-
fault pixels in the 3D seismic image.
mize the network parameters and set the learning rate to be 0.0001.
We train the network with 25 epochs, and all the 200 training im-
Training and validation
ages are processed at each epoch. As shown in Figure 4, the training
We train the CNN by using 200 pairs of synthetic 3D seismic and and validation accuracies gradually increase to 95%, whereas the
fault images that are automatically created as in Figures 1 and 2. training and validation loss converges to 0.01 after 25 epochs.

Figure 7. (a) A 3D seismic image is displayed with faults that are detected by using (b) the trained CNN model, (c) fault likelihood, and
(d) thinned fault likelihood.
CNN for 3D fault segmentation IM41

To verify the CNN model trained with 25 epochs, we apply this activation in the last convolutional layer. Although trained by
trained model together with another seven commonly used fault de- only synthetic data sets, this CNN model works pretty well to pro-
tection methods to the synthetic seismic volume (Figure 2a), which vide a clean and accurate prediction of faults in this field seismic
was not included in the training data sets. Figure 5a–5h shows image. In this CNN fault probability image, most fault features have
all eight fault detection results that are, respectively, computed by very high probabilities (close to 1) and only very limited noisy fea-
using the methods of C3 (Gersztenkorn and Marfurt, 1999), C2 tures are observed. Although we added only planar faults in the
Downloaded 04/24/19 to 128.83.63.20. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/

(Marfurt et al., 1999), planarity (Hale, 2009), structure-oriented lin- training data sets, the networks actually learn to detect curved faults
earity (Wu, 2017), structure-oriented semblance (Hale, 2009), fault in the field seismic image as shown on the time slice in Figure 7b.
likelihood (Hale, 2013; Wu and Hale, 2016), optimal surface voting Figure 7c and 7d, respectively, shows the fault likelihood attribute
(Wu and Fomel, 2018), and our CNN-based segmentation. The in- (Hale, 2013; Wu and Hale, 2016) before and after thinning. The
put for the optimal surface voting method is the planarity volume thinned fault likelihood (Figure 7d) works fine to highlight the
(Figure 5c), and the input for all the other methods is the amplitude faults within this seismic image. However, a lot of more noisy
volume (Figure 2a). Compared with the first five methods, the fault
likelihood and optimal surface voting methods provide better fault
detections in which the fault features are less noisy and can be more
continuously tracked. Our CNN method achieves the best perfor-
mance in computing an accurate, clean, and complete fault detec-
tion, which is most consistent with the true fault labeling shown in
Figure 2b.
To quantitatively evaluate the fault detection methods, we further
calculate the precision-recall (Martin et al., 2004) and receiver-
operating-characteristic (ROC) (Provost et al., 1998) curves shown
in Figure 6. From the precision-recall curves (Figure 6a), we can
clearly observe that our CNN method (the red curve in Figure 6a)
provides the highest precision for all choices of recall. The preci-
sions of the fault likelihood (the orange curve in Figure 6a) and
optimal surface voting (the magenta curve in Figure 6a) methods
are relatively lower than the CNN method, but they are higher than
the other five methods. The ROC curves in Figure 6b provide similar
evaluations of the methods. In the next section, we will use the same
CNN model (trained by only synthetic data sets) to four field seismic
images that are acquired at different surveys. In this highly strict pre-
cision evaluation, the fault detections are expected to perfectly match
the true fault labels with the thickness of only two samples. However,
all of the methods should have higher precision if we consider that
each fault is a thicker zone and all fault detections within the zone are
good enough.

Applications
It might not be surprising that the CNN model, trained by synthetic
data sets, works well to detect faults in a synthetic seismic image
(Figure 5h) that is also created by using the same workflow for creat-
ing the training data sets. We further test the same CNN model on
four field seismic images that are acquired at different surveys. To be
consistent with the synthetic training seismic images, each of the field
seismic images is subtracted by its mean value and divided by its
standard deviation to obtain a consistently normalized image. We
compare our fault prediction results with the thinned fault likelihood
(Hale, 2013; Wu and Hale, 2016), which is a superior attribute (better
than most of the conventional attributes [Figures 5 and 6]) for fault
detection.
The first 3D seismic volume in Figure 7a is a subset (128 [ver-
tical] × 384 [inline] × 512 [crossline] samples) extracted from the
Netherlands off-shore F3 block seismic data, which is graciously
provided by the Dutch government through TNO and dGB Earth
Sciences. Multioriented faults are apparent within this 3D seismic
volume. Figure 7b shows the fault probability image predicted Figure 8. (a) A 3D seismic image is displayed with faults that are
by using the trained CNN model. The color in this fault image detected by using (b) the trained CNN model and (c) thinned fault
represents the fault probability, which is computed by the sigmoid likelihood.
IM42 Wu et al.

features are observed than in the CNN fault probability image from the previous synthetic and field examples, the faults in this
(Figure 7b). In addition, as denoted by the yellow arrows on seismic image are not apparent as sharp reflection discontinuities.
the inline slice (Figure 7d), the fault-oriented smoothing in calcu- Instead, the faults are imaged like reflections in this 3D seismic im-
lating the fault likelihood actually extends the fault features beyond age as shown in Figure 8a. However, the CNN model still works
the top of true faults. In addition, the fault likelihood is computed pretty well to detect the faults shown in Figure 8b, which means
from the semblance/coherence of seismic reflections, which can be that the network wisely learned to predict faults by not detecting
Downloaded 04/24/19 to 128.83.63.20. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/

sensitive to noisy reflections (the red features on the crossline in sharp discontinuities or edges. Figure 8c shows the thinned fault
Figure 7d) but insensitive to the faults with small fault displace- likelihoods that are noisier than the CNN fault probabilities as
ments (like those faults denoted by white arrows in Figure 7d). shown on the horizontal slice.
However, the trained CNN model is more robust to noise and can The third 3D seismic image shown in Figure 9 is a subset (210
better measure the probability of faults with small displacements. [vertical] × 600 [inline] × 825 [crossline] samples) extracted from a
The second 3D seismic image shown in Figure 8a is graciously larger seismic reflection volume that is acquired across the Costa
provided by Clyde Petroleum Plc. through Paradigm. Different Rica margin, northwest of the Osa Peninsula to image the fault

Figure 9. Faults are detected in a complicated 3D example by using (a-c) the trained CNN model and (d-f) thinned fault likelihoods.
CNN for 3D fault segmentation IM43

properties in the subduction zone. Multiple sets of closely spaced which can detect most faults, but the fault features are much noisier
faults are apparent in this 3D seismic volume as discussed by Bangs than the CNN fault probabilities. In addition, many of the faults are
et al. (2015). The fault detection in this example is more challenging mislabeled, especially in areas where the seismic structures are
than the previous ones because the faults are very close to each noisy.
other, the reflection structures are not well-imaged, and the image Figure 10 shows the fourth larger seismic volume (450 [vertical]
is pretty noisy. Figure 9a–9c shows the CNN fault probabilities at × 1950 [inline] × 1200 [crossline] samples) that is acquired at the
Downloaded 04/24/19 to 128.83.63.20. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/

different slices. We observe that most faults are clearly labeled in Campos Basin, offshore Brazil. This image shows that the sedi-
this CNN fault probability images, and these faults can be contin- ments are heavily faulted due to the salt bodies at the bottom of
uously tracked by following the probability features. Multiple sets the volume. The CNN fault probabilities shown in Figure 10a and
of faults striking in different directions can be clearly observed on 10b clearly and accurately label numerous closely spaced faults in
the horizontal slice in these CNN fault probability images. Fig- this seismic volume. The faulting patterns are clearly visible on
ure 9d–9f shows the thinned fault likelihoods at the same slices, the time slices of the CNN fault probability image. To be able to

Figure 10. A 3D seismic image overlaid with the


CNN fault probabilities at different slices ((a) and
(b)), where most of the faults are clearly and ac-
curately labelled.
IM44 Wu et al.
Downloaded 04/24/19 to 128.83.63.20. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/

Figure 11. Two subvolumes of (a and c) the seismic amplitude and (b and d) CNN fault probability are extracted from the full volumes in
Figure 10.

better visualize the fault detection in this example, we further display fault likelihoods for the same volume, however, required approxi-
two subvolumes of seismic amplitude and CNN fault probabilities mately 1.5 h when using a workstation with 32 cores.
(Figure 11b and 11d), in which most of the faults are clearly and In our training and validating data sets, we avoid including thrust
accurately labeled except some subtle faults. The horizontal slices and listric faults with low dip angles. These faults often appear
in Figure 11b and 11d, respectively, display clear patterns of polygo- strong reflection features in a seismic image other than reflection
nal and radial faults that may be associated with salt diapirs (Rowan discontinuities as the faults discussed in this paper. Therefore, all
et al., 1999; Carruthers, 2012). the conventional fault detection methods, based on measuring reflec-
In addition to the above field examples, we also applied the same tion discontinuity or continuity, often fail to detect the thrust and lis-
trained CNN model to two other 3D seismic images Kerry-3D and tric faults. However, our CNN-based method has potential to
Opunake-3D, which are provided on the SEG Wiki website. The fault successfully detect these faults by training another specific model,
segmentation results are clean and accurate as shown in the SEG which is actually what we will focus on in future research.
Wiki website (Wu, 2018a, 2018b, 2019).
In summary, although the CNN model is trained by using only
CONCLUSION
200 synthetic seismic images, it works pretty well to detect faults in
3D field seismic volumes that are recorded at totally different surveys. We have discussed an end-to-end CNN to efficiently detect faults
In addition, the 3D fault prediction using the trained CNN model is from 3D seismic images, in which the fault detection is considered
highly efficient. By using one TITAN Xp GPU, computing the large as a binary segmentation problem. This neural network is simplified
CNN fault probability volume (450 [vertical] × 1950 [inline] × 1200 from the originally more complicated U-Net to save GPU memory
[crossline] samples) in Figure 10 takes less than 3 min. Computing and computational time (for training and prediction) but still
CNN for 3D fault segmentation IM45

preserve the high performance for fault detection. Because the dis- Huang, L., X. Dong, and T. E. Clee, 2017, A scalable deep learning platform
for identifying geologic features from seismic attributes: The Leading
tribution of fault and nonfault samples is heavily biased, we use a Edge, 36, 249–256, doi: 10.1190/tle36030249.1.
balanced loss function to optimize the CNN model parameters. We Kingma, D. P., and J. Ba, 2014, Adam: A method for stochastic optimiza-
train the neural network by using only 200 pairs of 3D synthetic tion: CoRR, abs/1412.6980.
Li, F., and W. Lu, 2014, Coherence attribute at different spectral scales: In-
seismic and fault volumes, which are all automatically generated terpretation, 2, no. 1, SA99–SA106, doi: 10.1190/INT-2013-0089.1.
by randomly adding folding, faulting, and noise in the volumes. Mansfield, C., and J. Cartwright, 1996, High resolution fault displacement
Downloaded 04/24/19 to 128.83.63.20. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/

mapping from three-dimensional seismic data: Evidence for dip linkage


Although trained by using only synthetic data sets, the neural net- during fault growth: Journal of Structural Geology, 18, 249–263, doi: 10
work can accurately detect faults from 3D field seismic volumes .1016/S0191-8141(96)80048-4.
that are acquired at totally different surveys. Although we add only Marfurt, K. J., R. L. Kirlin, S. L. Farmer, and M. S. Bahorich, 1998, 3-D
seismic attributes using a semblance-based coherency algorithm: Geo-
planar faults in the training data sets, the network actually learns to physics, 63, 1150–1165, doi: 10.1190/1.1444415.
detect curved faults in the field seismic images. Marfurt, K. J., V. Sudhaker, A. Gersztenkorn, K. D. Crawford, and S. E.
Nissen, 1999, Coherency calculations in the presence of structural dip:
Geophysics, 64, 104–111, doi: 10.1190/1.1444508.
ACKNOWLEDGMENT Martin, D. R., C. C. Fowlkes, and J. Malik, 2004, Learning to detect natural
image boundaries using local brightness, color, and texture cues: IEEE
This research is financially supported by the sponsors of the Transactions on Pattern Analysis and Machine Intelligence, 26, 530–549,
doi: 10.1109/TPAMI.2004.1273918.
Texas Consortium for Computation Seismology (TCCS). The 1st, Muraoka, H., and H. Kamata, 1983, Displacement distribution along minor
3rd and 4th authors gratefully acknowledge the support of NVIDIA fault traces: Journal of Structural Geology, 5, 483–495, doi: 10.1016/
0191-8141(83)90054-8.
Corporation with the donation of the Titan Xp GPU used for this Neff, D. B., J. R. Grismore, and W. A. Lucas, 2000, Automated seismic fault
research. detection and picking: U.S. Patent 6, 018, 498.
Pedersen, S. I., T. Randen, L. Sonneland, and Ø. Steen, 2002, Automatic
fault extraction using artificial ants: 72nd Annual International Meeting,
DATA AND MATERIALS AVAILABILITY SEG, Expanded Abstracts, 512–515, doi: 10.1190/1.1817297.
Pedersen, S. I., T. Skov, A. Hetlelid, P. Fayemendy, T. Randen, and L.
Data and source code associated with this research are available Sønneland, 2003, New paradigm of fault interpretation: 73rd Annual
and can be accessed via the following URL: https://github.com/ International Meeting, SEG, Expanded Abstracts, 350–353, doi: 10
.1190/1.1817918.
xinwucwp/faultSeg. Provost, F. J., T. Fawcett, and R. Kohavi, 1998, The case against accuracy
estimation for comparing induction algorithms: Proceedings of the 15th
REFERENCES International Conference on Machine Learning, 445–453.
Randen, T., S. I. Pedersen, and L. Sønneland, 2001, Automatic extraction of
Aqrawi, A. A., and T. H. Boe, 2011, Improved fault segmentation using a dip fault surfaces from three-dimensional seismic data: 81st Annual Interna-
guided and modified 3D Sobel filter: 81st Annual International Meeting, tional Meeting, SEG, Expanded Abstracts, 551–554, doi: 10.1190/1
SEG, Expanded Abstracts, 999–1003, doi: 10.1190/1.3628241. .1816675.
Badrinarayanan, V., A. Kendall, and R. Cipolla, 2017, Segnet: A deep con- Ren, S., K. He, R. Girshick, and J. Sun, 2015, Faster R-CNN: Towards real-
volutional encoder-decoder architecture for image segmentation: IEEE time object detection with region proposal networks: Advances in Neural
Transactions on Pattern Analysis and Machine Intelligence, 39, 2481– Information Processing Systems, 91–99.
2495, doi: 10.1109/TPAMI.2016.2644615. Ronneberger, O., P. Fischer, and T. Brox, 2015, U-Net: Convolutional
Bakker, P., 2002, Image structure analysis for seismic interpretation: Ph.D. networks for biomedical image segmentation: International Conference
thesis, Delft University of Technology. on Medical Image Computing and Computer-Assisted Intervention,
Bangs, N. L., K. D. McIntosh, E. A. Silver, J. W. Kluesner, and C. R. Ra- 234–241.
nero, 2015, Fluid accumulation along the Costa Rica subduction thrust Rowan, M. G., M. P. Jackson, and B. D. Trudgill, 1999, Salt-related fault
and development of the seismogenic zone: Journal of Geophysical Re- families and fault welds in the Northern Gulf of Mexico: AAPG Bulletin,
search: Solid Earth, 120, 67–86, doi: 10.1002/2014JB011265. 83, 1454–1484.
Carruthers, T., 2012, Interaction of polygonal fault systems with salt diapirs: Stewart, S., 2001, Displacement distributions on extensional faults: Impli-
Ph.D. thesis, Cardiff University. cations for fault stretch, linkage, and seal: AAPG Bulletin, 85, 587–599.
Chollet, F., 2015, Keras, https://github.com/fchollet/keras, accessed Septem- Van Bemmel, P. P., and R. E. Pepper, 2000, Seismic signal processing
ber 2018. method and apparatus for generating a cube of variance values: U.S.
Cohen, I., N. Coult, and A. A. Vassiliou, 2006, Detection and extraction of Patent 6,151,555.
fault surfaces in 3D seismic data: Geophysics, 71, no. 4, P21–P27, doi: 10 Wu, X., 2017, Directional structure-tensor based coherence to detect seismic
.1190/1.2215357. faults and channels: Geophysics, 82, no. 2, A13–A17, doi: 10.1190/
Di, H., M. Shafiq, and G. AlRegib, 2018, Patch-level MLP classification for geo2016-0473.1.
improved fault detection: 88th Annual International Meeting, SEG, Ex- Wu, X., 2018a, Kerry-3D, https://wiki.seg.org/wiki/Kerry-3D, accessed
panded Abstracts, 2211–2215, doi: 10.1190/segam2018-2996921.1. March 2019.
Gersztenkorn, A., and K. J. Marfurt, 1999, Eigenstructure-based coherence Wu, X., 2018b, Opunake-3D, https://wiki.seg.org/wiki/Opunake-3D, ac-
computations as an aid to 3-D structural and stratigraphic mapping: Geo- cessed March 2019
physics, 64, 1468–1479, doi: 10.1190/1.1444651. Wu, X., 2019, GitHub repository, https://github.com/xinwucwp/faultSeg,
Girshick, R., J. Donahue, T. Darrell, and J. Malik, 2014, Rich feature accessed March 2019
hierarchies for accurate object detection and semantic segmentation: Pro- Wu, X., and S. Fomel, 2018, Automatic fault interpretation with optimal
ceedings of the IEEE Conference on Computer Vision and Pattern Rec- surface voting: Geophysics, 83, no. 5, O67–O82, doi: 10.1190/
ognition, 580–587. geo2018-0115.1.
Guitton, A., 2018, 3D convolutional neural networks for fault interpretation: Wu, X., and D. Hale, 2016, 3D seismic image processing for faults: Geo-
80th Annual International Conference and Exhibition, EAGE, Extended physics, 81, no. 2, IM1–IM11, doi: 10.1190/geo2015-0380.1.
Abstracts, doi: 10.3997/2214-4609.201800732. Wu, X., Y. Shi, S. Fomel, and L. Liang, 2018, Convolutional neural
Guo, B., L. Li, and Y. Luo, 2018, A new method for automatic seismic networks for fault interpretation in seismic images: 88th Annual Interna-
fault detection using convolutional neural network: 88th Annual Inter- tional Meeting, SEG, Expanded Abstracts, 1946–1950, doi: 10.1190/
national Meeting, SEG, Expanded Abstracts, 1951–1955, doi: 10 segam2018-2995341.1.
.1190/segam2018-2995894.1. Wu, X., and Z. Zhu, 2017, Methods to enhance seismic faults and construct
Hale, D., 2009, Structure-oriented smoothing and semblance: CWP Report fault surfaces: Computers and Geosciences, 107, 37–48, doi: 10.1016/j
635. .cageo.2017.06.015.
Hale, D., 2013, Methods to compute fault images, extract fault surfaces, and Xie, S., and Z. Tu, 2015, Holistically-nested edge detection: Proceedings of
estimate fault throws from 3D seismic images: Geophysics, 78, no. 2, the IEEE International Conference on Computer Vision, 1395–1403.
O33–O43, doi: 10.1190/geo2012-0331.1. Zhao, T., and P. Mukhopadhyay, 2018, A fault-detection workflow using
He, K., G. Gkioxari, P. Dollár, and R. Girshick, 2017, Mask R-CNN: deep learning and image processing: 88th Annual International Meeting,
Proceedings of the IEEE International Conference on Computer Vision, SEG, Expanded Abstracts, 1966–1970, doi: 10.1190/segam2018-
2980–2988. 2997005.1.

You might also like