Article
Machine Learning Modelling and Feature
Engineering in Seismology Experiment
Michail Nikolaevich Brykov 1 , Ivan Petryshynets 2 , Catalin Iulian Pruncu 3,4, * ,
Vasily Georgievich Efremenko 5 , Danil Yurievich Pimenov 6 , Khaled Giasin 7 ,
Serhii Anatolievich Sylenko 1 and Szymon Wojciechowski 8
1 Zaporizhzhia Polytechnic National University, 69063 Zaporizhzhia, Ukraine; [email protected] (M.N.B.);
[email protected] (S.A.S.)
2 Institute of Materials Research, Slovak Academy of Sciences, 04001 Košice, Slovakia; [email protected]
3 Mechanical Engineering, Imperial College London, Exhibition Rd., London SW7 2AZ, UK
4 Mechanical Engineering, School of Engineering, University of Birmingham, Birmingham B15 2TT, UK
5 Pryazovskyi State Technical University, Physics Department, 87555 Mariupol, Ukraine;
[email protected]
6 Department of Automated Mechanical Engineering, South Ural State University, Lenin Prosp. 76,
454080 Chelyabinsk, Russia; [email protected]
7 School of Mechanical and Design Engineering, University of Portsmouth, Portsmouth PO1 3DJ, UK;
[email protected]
8 Faculty of Mechanical Engineering, Poznan University of Technology, Piotrowo 3, 60–965 Poznan, Poland;
[email protected]
* Correspondence: [email protected]; Tel.: +44-(0)7745-1331-58
Received: 1 July 2020; Accepted: 27 July 2020; Published: 29 July 2020
Abstract: This article discusses machine learning modelling using a dataset provided by
the LANL (Los Alamos National Laboratory) earthquake prediction competition hosted by Kaggle.
The data were obtained from a laboratory stick-slip friction experiment that mimics real earthquakes.
Digitized acoustic signals were recorded against time to failure of a granular layer compressed between
steel plates. In this work, machine learning was employed to develop models that could predict
earthquakes. The aim is to highlight the importance and potential applicability of machine learning
in seismology. The XGBoost algorithm was used for modelling, combined with 6-fold cross-validation
and the mean absolute error (MAE) metric for model quality estimation. The backward feature
elimination technique was used, followed by the forward feature construction approach, to find the best
combination of features. The advantage of this feature engineering method is that it enables the best
subset to be found from a relatively large set of features in a relatively short time. It was confirmed
that the proper combination of statistical characteristics describing acoustic data can be used for
effective prediction of time to failure. Additionally, statistical features based on the autocorrelation
of acoustic data can also be used for further improvement of model quality. A total of 48 statistical
features were considered. The best subset was determined as having 10 features. Its corresponding
MAE was 1.913 s, which was stable to the third decimal point. The presented results can be used to
develop artificial intelligence algorithms devoted to earthquake prediction.
1. Introduction
In recent years, artificial intelligence has been extensively used to solve problems in different
fields of human or natural activities. Artificial intelligence methods are widely used in a variety
2.1. Data
A laboratory experiment that closely mimics real earthquakes is described in [12]. The main idea
of the modelling technique is the slow relative motion of rigid, usually steel, plates pressed against
each other and separated by a thin granular layer. This granular layer mimics the contact surface of
the layer between tectonic plates in which rocks are located. A laboratory quake machine reproduces
the stick-slip motion of conjunct plates; acoustic emission from the granular gauge interlayer and contact
stress values are constantly recorded against the time that remains to failure of the granular layer.
These periodic failures are accompanied by a drastic increase in acoustic emission and a drop in contact
stress, and are considered to be analogous to real earthquakes [26]. It is recognized that the greater the
drop in stress, the more intense the ground motions during real earthquakes [27]. Because a material
emits acoustic signals in the course of work and especially before failure, a similar approach may be
used for predicting not only real earthquakes but also other types of failures in nature and industry,
such as landslides, avalanches, and failure of machine parts [12].
For modelling purposes, a training dataset provided by the LANL Earthquake Prediction
Competition hosted by Kaggle.com [25] was used. The dataset comprised a two-column csv file
in which acoustic data (AD) values derived from the acoustic signal from the laboratory machine gauge
layer were recorded against time to failure (TTF). The experimentation consisted of repeated cycles of
model “earthquakes” (EQs).
The training dataset contained the record of 16 full cycles between earthquakes. The length of the
cycles varied from 7 to 16 s. The data for incomplete cycles were at the head and the tail of the training
dataset. This is because the training dataset was cut from another bigger file of records. The goal was
to build a model that can predict TTF for a given piece of recorded AD consisting of 150,000 entries.
The length of the time window for 150 K pieces of AD was approximately 0.04 s and therefore it may
be considered a single time spot.
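For readers who wish to reproduce this setup, the sketch below shows one way to stream the competition file and cut it into 150 K pieces. It is a minimal sketch assuming the Kaggle file layout (a train.csv with acoustic_data and time_to_failure columns), not the exact code used in this study.

```python
import pandas as pd

SEGMENT = 150_000  # one 150 K piece of AD, spanning roughly 0.04 s

# Stream the two-column training file in segment-sized chunks so the
# large csv never has to be held in memory at once.
reader = pd.read_csv(
    "train.csv",  # assumed name of the LANL competition training file
    dtype={"acoustic_data": "int16", "time_to_failure": "float64"},
    chunksize=SEGMENT,
)

for i, piece in enumerate(reader):
    ad = piece["acoustic_data"].to_numpy()    # 150,000 AD values
    ttf = piece["time_to_failure"].iloc[-1]   # TTF label for this piece
    # ... derive features from `ad` and store them against `ttf` ...
```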
2.2. Methods
The training dataset was split into 17 files, each containing an AD and TTF for one separate cycle.
The first part of the work was carried out using a file for the longest cycle, i.e., the 8th piece in the
training dataset. The first TTF in this cycle was 16.1074 s. Figure 1 shows several examples of 150 K
pieces of AD for this cycle.
The beginning of the cycle is covered by the first 150 K piece (Figure 1a). Data provided in
Figure 1b,c illustrates the gradual increase of spikes of AD during the seismic cycle. The piece which
contains the EQ event is shown at different levels of magnification in Figure 1d,e. The role of these
spikes is discussed further below. In the final stage of work, modelling was performed using all
datasets provided by [25].
The XGBoost library providing the gradient boosted trees approach [28] was used for modelling.
It is currently agreed that this technique leads to the best performance compared to other modelling
algorithms [29]. For example, XGBoost was used to determine the dominant frequency of an eruptive
tremor of the volcano Piton de la Fournaise [30]. XGBoost stands for eXtreme Gradient Boosting.
This algorithm implements an ensemble of decision trees and uses gradient boosting to build models
more accurate than the single decision tree or random forest approaches.
Model quality was assessed by mean absolute error (MAE) in 6-fold cross-validation (CV).
Cross-Validation is used to estimate model quality by splitting the training dataset into n-folds. One of
the folds is used for model validation, while the rest n−1 folds are used for modelling. Modeling is
repeated n times, so every fold is used once as a validation dataset. MAE is one of the metrics used
to estimate model accuracy: it is the mean absolute difference between the target value predicted by the
model (TTF in our case) and the actual value from the validation dataset. Python 3.7 and the necessary
libraries, such as pandas, sklearn, and xgboost,
were employed to carry out the study.
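A minimal sketch of this evaluation loop is shown below; the hyperparameters are illustrative placeholders, not the tuned values from the study.

```python
from sklearn.model_selection import KFold, cross_val_score
from xgboost import XGBRegressor

def cv_mae(X, y, n_splits=6, seed=0):
    """Mean absolute error of an XGBoost regressor under n-fold cross-validation."""
    model = XGBRegressor(n_estimators=200, learning_rate=0.1, random_state=seed)
    folds = KFold(n_splits=n_splits, shuffle=True, random_state=seed)
    # sklearn reports negated MAE so that "greater is better"; flip the sign back.
    scores = cross_val_score(model, X, y, cv=folds, scoring="neg_mean_absolute_error")
    return float(-scores.mean())
```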
The main aim of this study was to find an appropriate set of features derived from AD that gives
the least MAE in CV. A detailed approach to feature engineering is discussed further below. Since the
speed of data processing is critical for the detection and early warning of earthquakes [15], the goal
of this work was to determine the feature(s) that are not only useful for building ML models with
acceptable accuracy but also enable relatively rapid processing of real-time data.
Figure 1. Plot of acoustic data in the 8th “earthquake” cycle from the dataset provided by [25]:
(a)—the beginning of the seismic cycle; (b,c)—middle parts of the seismic cycle; (d,e)—part of the
seismic cycle with the earthquake (EQ) event at different levels of magnification. Arrows are explained
in the text.
3. Feature Engineering
The following key approach was used for feature engineering. In the first step, it is assumed that
the distribution of AD is the source of useful features. This assumption is based on “common sense”
suggestions, observation of changes in AD distribution over time (see Figure 1), and also on results
published in related works [12,26].
It is evident that stick-slip failure (see arrow 3 on Figure 1) is preceded by a number of spikes of
AD (see arrow 2 and similar symbols on Figure 1). These spikes appear as a result of micro failure
events and may predict TTF [12,13]. Generally, the shorter the TTF the more frequent the AD spikes.
Hence, the statistical characteristics of AD may serve as features for modelling.
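A sketch of such a feature extractor is given below. The statistic names follow the list in Table 1, but the function and key names are illustrative assumptions rather than the study's actual code.

```python
import numpy as np
import pandas as pd

def ad_statistics(ad: np.ndarray) -> dict:
    """The 16 statistical characteristics of one 150 K piece of AD, plus the
    auxiliary maximum/minimum used only to locate the EQ row (see below)."""
    s = pd.Series(ad)
    mode = s.mode().iloc[0]
    feats = {
        "mean": ad.mean(),
        "std": ad.std(),
        "std_over_mean": ad.std() / ad.mean(),
        "skewness": s.skew(),
        "kurtosis": s.kurt(),
        "mode": mode,
        "n_mode": int((s == mode).sum()),  # number of mode appearances
        "max": ad.max(),
        "min": ad.min(),
    }
    for q in (1, 5, 10, 25, 50, 75, 90, 95, 99):
        feats[f"p{q}"] = np.percentile(ad, q)
    return feats
```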
Figure 2. Some of the features plotted against time to failure (TTF) for the 8th “earthquake” cycle:
(a)—maximums and minimums; (b)—number of mode appearance; (c,d)—99th, 95th, 5th, and
1st percentiles.
In order to increase the model accuracy, all tail rows which correspond to the period after the EQ
should be deleted from the database of statistical features. Another rationale for deleting data after
an EQ is that the main goal of modelling using data from laboratory EQs is to identify features that
would be useful to predict real EQs. It is obvious that in reality only data before an EQ would be used
for prediction. Any data after an EQ has neither a logical nor practical sense for prediction of that
particular EQ.
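The cleanup itself is short once the EQ row is located; a sketch, assuming a per-cycle feature table `feats` (a pandas DataFrame, one row per 150 K piece) that records the “max” and “min” of each piece as in the extractor above:

```python
def drop_after_eq(feats):
    """Drop all feature rows that follow the piece containing the EQ event.

    The EQ piece carries the extreme AD amplitudes of the cycle, so it is
    found from the recorded per-piece maximum and minimum."""
    eq_row = (feats["max"] - feats["min"]).idxmax()
    return feats.loc[:eq_row]  # label-based slice keeps the EQ row itself
```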
Due to the development of modelling tools such as Python and appropriate libraries, training a
model can be performed rapidly using only several lines of code. The major challenge in training any
model is determining which features should be used.
In the current work, the final selection of features was based on the building of different models
to compare MAEs and picking the best combination of features that gives the lowest MAE. However,
according to the well-known curse of dimensionality, the total number of possible combinations of
features increases far faster than the number of features in the set (Figure 3).
Figure 3. The total number of possible combinations versus the number of features.
For example, four features in the set give fifteen possible combinations; 7 features give 127; 10 give
1023; 15 give 32,767; 16 give 65,535; 18 give 262,143; and so on. The “brute force” (BF) method of
feature engineering involves sequential modelling and CV score calculation for each combination of
features and picking the combination with the least MAE. This method guarantees that the best
combination of features is determined. However, BF is time consuming if the number of features
exceeds some threshold. In general, the higher the number of features analyzed, the greater the time
required to solve the model. For example, a set of 43 features [13] gives a total of 8.796 × 10^12
combinations, which would require a significant amount of time to find the best combination.
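These counts follow from the fact that n features admit 2^n − 1 non-empty subsets. A brute-force search over them can be sketched as follows, with `score` standing for any CV-MAE evaluator such as the hypothetical `cv_mae` helper shown earlier:

```python
from itertools import combinations

def brute_force(features, score):
    """Score every non-empty subset of `features` and return the best one."""
    best_subset, best_mae = None, float("inf")
    for k in range(1, len(features) + 1):
        for subset in combinations(features, k):
            mae = score(list(subset))   # e.g. cv_mae on these columns only
            if mae < best_mae:
                best_subset, best_mae = subset, mae
    return best_subset, best_mae        # 2**n - 1 evaluations in total
```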
In the paper [13], only the two best features were chosen from 43 for prediction of TTF. This means
that most of the features are either excessive or not suitable for modelling. Therefore, it can be
concluded that the first step in feature engineering is to exclude all non-significant features from the
set. Every feature excluded can significantly decrease the total number of combinations to examine
during the BF approach. In our case, excluding only two features decreases the number of combinations
from 262,143 to 65,535 (16 features instead of 18).
The “maximum” and “minimum” features can be excluded based on the following reasoning:
The maximum value of AD in a 150 K piece is equivalent to the 100th percentile value. This study
uses the “99th percentile” feature which is close to the 100th percentile; therefore the 100th percentile
(i.e., “maximum”) is superfluous and can be excluded. Similarly, the “minimum” feature is equivalent
to the “0 percentile” and can be excluded as the “1st percentile” feature has already been considered.
The only reason for calculating “maximum” and “minimum” features is because they are needed for
correct identification of the 150 K piece which contains the EQ. It also helps to correctly delete tail rows
containing AD after the EQ. After excluding “maximum” and “minimum” features, 16 features remain
in the set, giving a total of 65,535 possible combinations.
The backward feature elimination technique (BFE) was employed for reducing the number of
features. The rationale behind using this method in the current study is that, if there is a total of n
features in a set, then there are n possible combinations of (n−1) features in the subset. Assuming that
the vast majority of features are either bad or neutral for model quality, it is highly probable that the
MAE for the model—which uses all n features—would be bigger than the least MAE for n models
using (n−1) features. If so, then only one model is needed such that it uses all n features, and MAE_n is
then calculated; thereafter, n models are required, each of which uses one of the possible subsets of
(n−1) features. It is also important to choose the subset which results in the least MAE_(n−1). If a full set
of n features contains at least one feature that is bad or excessive for the model, then MAE_n would be
greater than or equal to the least MAE_(n−1). This bad or excessive feature should be absent in the subset
that generates the model with the least MAE_(n−1). Thus, this feature can be excluded from the set of
features. BFE takes about a minute in semiautomatic mode to exclude one feature and can be fully
automated if necessary. BFE is consistently used to reduce the number of features from n to about 10.
Thereafter, the straight BF method is used to find the combination of features that gives a model with
the least MAE.
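One pass of this procedure can be sketched as follows (again with `score` as a CV-MAE evaluator; the early stop mirrors the MAE_n versus least MAE_(n−1) comparison described above):

```python
def backward_feature_elimination(features, score, target_size=10):
    """Drop features one at a time while doing so does not increase the CV MAE."""
    features = list(features)
    while len(features) > target_size:
        full_mae = score(features)
        # MAE of every subset that omits exactly one feature.
        trials = {f: score([g for g in features if g != f]) for f in features}
        candidate = min(trials, key=trials.get)
        if trials[candidate] > full_mae:
            break              # every remaining feature helps; stop eliminating
        features.remove(candidate)
    return features
```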
The next important point to consider is how many CV cycles are necessary for every step of
the work. Each single CV cycle returns the mean MAE for only six calculations in total. Therefore,
the resulting MAE varies at the second decimal point from one run to the other. In order to decrease the
variance of MAE the number of repetitions (cycles) of CV should be increased. Two cycles (CV-2) were
used for BFE and 500 cycles (CV-500) were used in the final calculation of MAE for the best combination
of features. Using CV-500 enables MAEs that are stable to the third decimal point to be obtained.
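With sklearn, repeating the 6-fold split is a one-line change; a sketch of the CV-2/CV-500 scoring, with illustrative defaults:

```python
from sklearn.model_selection import RepeatedKFold, cross_val_score
from xgboost import XGBRegressor

def repeated_cv_mae(X, y, cycles=2):
    """MAE averaged over `cycles` repetitions of 6-fold CV (CV-2, CV-500, ...)."""
    folds = RepeatedKFold(n_splits=6, n_repeats=cycles, random_state=0)
    scores = cross_val_score(XGBRegressor(), X, y, cv=folds,
                             scoring="neg_mean_absolute_error")
    return float(-scores.mean())
```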
Features other than AD may also be useful for TTF prediction. It may be seen that sudden spikes
in the signal are presented in the AD–TTF diagram (see Figure 1a–d). The lower the TTF, the more
often spikes occur. As stated in [12], these spikes appear due to micro shifts in the gauge layer.
Figure 4a shows a short portion of 1000 values for the AD–TTF diagram corresponding to arrow
1 on Figure 1a. This portion of data contains no spikes in the AD and the AD distribution seems to
be random. Figure 5a shows a short portion of 1000 values for the AD–TTF diagram corresponding
to arrow 2 (see Figure 1a); that is, the beginning of the first significant spike observed in the AD.
The AD distribution, in this case, seems to be more or less periodic with a gradual increase of random
constituents. Spikes in AD are characterized not only by an increase in AD amplitude but also by the
grade of AD periodicity.
This grade of AD periodicity may be assessed by the autocorrelation coefficient (AC). Figure 6
shows several first steps for calculating the AC for 13 consequent AD values corresponding to arrow
2 in Figure 1a. These values are {39,66,92,102,103,90,62,29,−2,−31,−53,−73,−83}. Each AC value (red
numbers in Figure 6) is calculated for a given sequence of AD that is duplicated against itself and
shifted by one position. For example, after the first shift there are two sequences of 12 numbers that
overlap: {39,66,92,102,103,90,62,29,−2,−31,−53,−73} and {66,92,102,103,90,62,29,−2,−31,−53,−73,−83}.
These overlapping sequences are included in a black rectangle under the words “Shift number: 1”.
Calculating the usual correlation coefficient for this pair of sequences gives 0.9559. This is the first
value of AC. After the second shift, only 11 numbers remain in each overlapped sequence (see black
rectangle under “Shift number: 2”). This pair of sequences gives a correlation coefficient of 0.8350.
This is the second value of AC. Further shifts give new values of AC. Figure 6 shows five consequent
shifts, but the number of shifts may be arbitrary; the only limitation is the length of the initial sequence.
In our case, we use a sliding window of 1000 AD values; therefore, quite a large number of shifts may
be used.
A total of 98 shifts were used to calculate the ACs for every position of the “sliding window”
containing 1000 values of AD. The results from the calculation of the AC for the AD in Figures 4a
and 5a are shown in Figures 4b and 5b, respectively.
Figure 4. (a)—1000 K window of acoustic data from arrow 1 (see Figure 1); (b)—corresponding
autocorrelation coefficients.
Figure 5. (a)—1000 K window of acoustic data from arrow 2 (see Figure 1); (b)—corresponding
autocorrelation coefficients.
It can be observed that a highly aperiodic AD (see Figure 4a) produces an AC which varies in a
very narrow range of about ±0.1 (Figure 4b). In contrast, a highly periodic AD (Figure 5a) produces
an AC which varies in a wide range of about ±0.8 (Figure 5b). If AD is aperiodic it means that no
spikes are present in AD inside the sliding window. The greater the AC amplitude (see Figure 5b),
the more periodic AD. Therefore, high values of AC amplitude mean that the sliding window
contains spikes of AD.
Figure 6. Several first calculations of autocorrelation coefficient (AC) for 13 consequent acoustic data
(AD) values corresponding to arrow 2 in Figure 1.
An additional sign of AD periodicity is the first value of AC (see arrows in Figures 4b and 5b).
The more periodic the AD, the higher the first value of AC.
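This procedure is easy to reproduce with NumPy. The sketch below mirrors the shifting scheme on the 13-value example from the text and then derives the two window-level summaries introduced above; the names `ac_sequence` and `ac_first_and_ampl` are assumptions, and AC_ampl is taken here to be the peak-to-peak range of the AC sequence.

```python
import numpy as np

def ac_sequence(window: np.ndarray, n_shifts: int = 98) -> np.ndarray:
    """Correlation of the window with itself shifted by 1..n_shifts positions."""
    return np.array([np.corrcoef(window[:-k], window[k:])[0, 1]
                     for k in range(1, n_shifts + 1)])

# The 13 AD values from arrow 2: the first two ACs are ~0.9559 and ~0.8350.
demo = np.array([39, 66, 92, 102, 103, 90, 62, 29, -2, -31, -53, -73, -83])
print(ac_sequence(demo, n_shifts=5).round(4))

def ac_first_and_ampl(window: np.ndarray):
    """AC_first and AC_ampl for one 1000-value sliding window."""
    ac = ac_sequence(window)
    return ac[0], ac.max() - ac.min()
```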
These observations can be checked on AD for an “earthquake” (see arrow 3 in Figure 1d,e and
Figure 7). Figure 7a represents AD for a whole EQ event, and Figure 7b contains AD for a sliding
window starting at the position indicated by arrow 4. It can be noted that the AD in Figure 7b are less
periodic than those in Figure 5a. In accordance with this difference in periodicity, the first value of AC
in Figure 7c (0.3406) is less than that in Figure 5b (0.7283). The amplitude of AC in Figure 7c is also
less than that in Figure 5b.
Figure 7. (a): “earthquake” event, see arrow 3 in Figure 1; (b): 1000 K window of acoustic data from
arrow 4; (c): corresponding autocorrelation coefficients.
The frequency of AD oscillation during a spike was considered as an additional feature which can
be used for modelling. However, the comparison of AD of early and late spikes (see arrows 2 and 5 in
Figure 1) shows that the periods T of oscillation for both cases are approximately equal (Figure 8).
Therefore, the frequency of AD oscillation during a spike was not used for modelling.
This way, three major parameters were used for modelling: acoustic data (AD); the first value of AC
in every “sliding window” (AC_first); and the amplitude of AC in every “sliding window” (AC_ampl).
Each 150 K piece of AD contains 150 sliding windows and, therefore, 150 values of AC_first and
150 values of AC_ampl. Because 16 statistics were calculated for each of the three parameters,
the overall number of features considered was 48.
Figure 8. Comparison of frequency of AD splashes far from (a) and near (b) the “earthquake”:
(a)—first 200 AD values in the dashed window in Figure 5a; (b)—first 200 AD values from arrow 5 in
Figure 1.
These features were calculated for every separate seismic cycle in the database provided by [25].
For every portion containing 150 K of AD, the features were calculated and recorded with the
corresponding TTF in a separate file of features. Since the TTF change during 150 K of AD was just
0.04 s, the TTF value is considered a constant during any given 150 K piece. The last value of TTF in
the 150 K piece was used as this constant time. Maximum and minimum values of AD were also
recorded; they allowed the row that contained the EQ event to be located. All rows after the row with
the EQ event were deleted for the reasons explained above. It should be noted that in all files the TTF
for the EQ event was approximately the same (near 0.3 s).
Finally, all separate files with features were merged into one database.
4. Results and Discussion
Figure 9 shows the best MAE obtained for any given number of features in the subset of features.
Three sequences of dots represent three sets of features: 16 features for AD only; 32 features for
AD+AC_first; and 48 features for AD+AC_first+AC_ampl. Each sequence of dots is a function “MAE
vs. number of features” for the corresponding set of features.
It is evident that in each of the three sets of features there are good, excessive (neutral), and bad
features. During BFE, bad features were gradually removed from the subsets of features, and the
“MAE vs. number of features” functions slowly decreased for each of the three sets with the decreasing
number of features. As the number of features reached 10, the BF method was used to find the best
combination of features that gave the least MAE. It can be seen that MAE decreases significantly for
each of the three functions in the early stages of the increasing number of features during the BF stage.
This means that useful features are gradually added to the subsets. After a certain number of features,
MAE stabilizes and minimums are reached. Addition of more features does not lead to a decrease of
MAE, so these additional features are excessive (neutral) for modelling.
The results show that all three parameters—AD, AC_first, and AC_ampl—have potential for
predicting TTF. Even statistical features derived solely from AD result in an MAE at the level of 1.93 s
if chosen in the optimal combination. Addition of AC_first and AC_ampl gradually reduces the best
MAEs to 1.92 and 1.91 s, respectively.
tectonic tremors. One of the possible directions of future research may be incorporating autocorrelation
of acoustic (tectonic) data into existing algorithms of artificial intelligence in the field of seismology.
Table 1. Optimal subsets of features and corresponding mean absolute errors (MAEs).
Statistic Feature    AD    AD+AC_First (AD, AC_First)    AD+AC_First+AC_Ampl (AD, AC_First, AC_Ampl)
mean + + +
standard deviation +
(standard deviation)/(mean) +
skewness
kurtosis
mode
number of mode appearance + + + + +
percentiles:
1st + +
5th +
10th
25th + +
50th +
75th + + +
90th +
95th + +
99th
MAE (CV-500) 1.9343 1.9210 1.9130
5. Conclusions
Periodic spiking of acoustic data is a phenomenon that can be used on its own to determine TTF
during laboratory earthquake tests. Only certain combinations of statistical characteristics lead to a
model with optimal performance.
The backward feature elimination approach in combination with the brute force method can be
successfully used to find this optimal combination even for relatively big sets of candidate features.
The backward feature elimination stage allows the number of features to be significantly reduced;
therefore, subsequent brute force attempts can be used to select the combination of features that yields
the model with the least MAE.
Three major parameters useful for predicting TTF were determined as follows: distribution
of acoustic data, the first values of autocorrelation coefficients in 1000 K sliding windows, and the
amplitudes of these autocorrelation coefficients. A total of 48 statistical features were derived from
these three parameters. The best combination of statistical features allowed a model with a mean
absolute error of 1.913 s to be obtained.
The autocorrelation of acoustic data series is an important parameter. It provides additional
information about the grade of acoustic data periodicity. The greater the amplitude and the first value
of the autocorrelation coefficient sequence, the more periodic the acoustic data. High periodicity
means that spikes of acoustic data are present that serve as the precursors of laboratory earthquakes.
Calculation of autocorrelation coefficients can be used as a valuable operation in artificial intelligence
systems deployed for real earthquake prediction.
Author Contributions: Conceptualization, M.N.B., I.P.; Methodology, M.N.B.; Software, M.N.B.; Validation,
M.N.B.; Analysis, M.N.B., S.A.S.; Investigation M.N.B.; Resources, M.N.B., S.A.S.; Data Curation, M.N.B., S.A.S.;
Writing-Original Draft Preparation M.N.B., V.G.E., D.Y.P., K.G.; Writing-Review & Editing, M.N.B., C.I.P., V.G.E.,
D.Y.P., K.G., S.W.; Visualization, M.N.B.; Supervision, M.N.B., I.P., V.G.E., D.Y.P., S.W.; Project Administration,
M.N.B., I.P.; Funding Acquisition, M.N.B., I.P., C.I.P. All authors have read and agreed to the published version of
the manuscript.
Funding: This work was financially supported by SAIA (Slovak Academic Information Agency).
Nomenclature
AC Autocorrelation Coefficient
AC_ampl The amplitude of AC on sliding window
AC_first The first value of AC on sliding window
AD Acoustic Data
BF Brute Force method
BFE Backward Feature Elimination
CV Cross-Validation Algorithm
EQ Earthquake
MAE Mean Absolute Error
ML Machine Learning
TTF Time To Failure
References
1. Ranjan, J.; Patra, K.; Szalay, T.; Mia, M.; Gupta, M.K.; Song, Q.; Krolczyk, G.; Chudy, R.; Pashnyov, V.A.;
Pimenov, D.Y. Artificial Intelligence-Based Hole Quality Prediction in Micro-Drilling Using Multiple Sensors.
Sensors 2020, 20, 885. [CrossRef] [PubMed]
2. Lo, C.-C.; Lee, C.-H.; Huang, W.-C. Prognosis of Bearing and Gear Wears Using Convolutional Neural
Network with Hybrid Loss Function. Sensors 2020, 20, 3539. [CrossRef]
3. Bustillo, A.; Pimenov, D.Y.; Matuszewski, M.; Mikolajczyk, T. Using artificial intelligence models for the
prediction of surface wear based on surface isotropy levels. Robot. Comput. Integr. Manuf. 2018, 53, 215–227.
[CrossRef]
4. Juez-Gil, M.; Erdakov, I.N.; Bustillo, A.; Pimenov, D.Y. A regression-tree multilayer-perceptron hybrid
strategy for the prediction of ore crushing-plate lifetimes. J. Adv. Res. 2019, 18, 173–184. [CrossRef] [PubMed]
5. Jordan, M.I.; Mitchell, T.M. Machine learning: Trends, perspectives, and prospects. Science 2015, 349, 255–260.
[CrossRef]
6. International Handbook of Earthquake & Engineering Seismology, Part A. Available online:
https://books.google.com/books?hl=zh-CN&lr=&id=aFNKqnC2E-sC&oi=fnd&pg=PP1&dq=
International+Handbook+of+Earthquake+%26+Engineering+Seismology&ots=8NToos8_M0&sig=
sULBiycejgotrjKhg741Wr98RfA#v=onepage&q=International%20Handbook%20of%20Earthquake%20%
26%20Engineering%20Seismology&f=false (accessed on 29 July 2020).
7. Gates, A.E.; Ritchie, D. Encyclopedia of Earthquakes and Volcanoes, 3rd ed.; Infobase Publishing: New York, NY,
USA, 2006; ISBN 0-8160-6302-8.
8. Tapia-Hernandez, E.; Reddy, E.A.; Oros-Avilés, L.J. Earthquake predictions and scientific forecast: Dangers
and opportunities for a technical and anthropological perspective. Earth Sci. Res. J. 2019, 23, 309–315.
[CrossRef]
9. Verma, M.; Bansal, B.K. Seismic hazard assessment and mitigation in India: An overview. Int. J. Earth Sci.
2013, 102, 1203–1218. [CrossRef]
10. Kumar, N.; Kumar, P.; Chauhan, V.; Hazarika, D. Variable anelastic attenuation and site effect in estimating
source parameters of various major earthquakes including Mw 7.8 Nepal and Mw 7.5 Hindu kush earthquake
by using far-field strong-motion data. Int. J. Earth Sci. 2017, 106, 2371–2386. [CrossRef]
11. Riguzzi, F.; Tan, H.; Shen, C. Surface volume and gravity changes due to significant earthquakes occurred in
central Italy from 2009 to 2016. Int. J. Earth Sci. 2019, 108, 2047–2056. [CrossRef]
12. Rouet-Leduc, B.; Hulbert, C.; Lubbers, N.; Barros, K.; Humphreys, C.J.; Johnson, P.A. Machine learning
predicts laboratory earthquakes. Geophys. Res. Lett. 2017, 44, 9276–9282. [CrossRef]
13. Bolton, D.C.; Shokouhi, P.; Rouet-Leduc, B.; Hulbert, C.; Rivière, J.; Marone, C.; Johnson, P.A. Characterizing
Acoustic Signals and Searching for Precursors during the Laboratory Seismic Cycle Using Unsupervised
Machine Learning. Seism. Res. Lett. 2019, 90, 1088–1098. [CrossRef]
14. Corbi, F.; Sandri, L.; Bedford, J.; Funiciello, F.; Brizzi, S.; Rosenau, M.; Lallemand, S. Machine learning can
predict the timing and size of analog earthquakes. Geophys. Res. Lett. 2019, 46, 1303–1311. [CrossRef]
Sensors 2020, 20, 4228 14 of 14
15. Kong, Q.; Trugman, D.T.; Ross, Z.E.; Bianco, M.J.; Meade, B.J.; Gerstoft, P. Machine learning in seismology:
Turning data into insights. Seism. Res. Lett. 2019, 90, 3–14. [CrossRef]
16. Bergen, K.J.; Chen, T.; Li, Z. Preface to the Focus Section on Machine Learning in Seismology. Seism. Res. Lett.
2019, 90, 477–480. [CrossRef]
17. Asencio–Cortés, G.; Morales–Esteban, A.; Shang, X.; Martínez–Álvarez, F. Earthquake prediction in California
using regression algorithms and cloud-based big data infrastructure. Comput. Geosci. 2018, 115, 198–210.
[CrossRef]
18. Bergen, K.J.; Johnson, P.A.; de Hoop, M.V.; Beroza, G.C. Machine learning for data-driven discovery in solid
Earth geoscience. Science 2019, 363, 6433. [CrossRef]
19. Florido, E.; Asencio–Cortés, G.; Aznarte, J.L.; Rubio-Escudero, C.; Martínez–Álvarez, F. A novel tree-based
algorithm to discover seismic patterns in earthquake catalogs. Comput. Geosci. 2018, 15, 96–104. [CrossRef]
20. Kong, Q.; Allen, R.M.; Schreier, L.; Kwon, Y.W. MyShake: A smartphone seismic network for earthquake
early warning and beyond. Sci. Adv. 2016, 2, e1501055. [CrossRef]
21. Li, Z.; Meier, M.-A.; Hauksson, E.; Zhan, Z.; Andrews, J. Machine learning seismic wave discrimination:
Application to earthquake early warning. Geophysic. Res. Lett. 2018, 45, 4773–4779. [CrossRef]
22. Ochoa, L.H.; Niño, L.F.; Vargas, C.A. Support vector machines applied to fast determination of the
geographical coordinates of earthquakes. The case of El Rosal seismological station, Bogotá-Colombia.
DYNA 2019, 86, 230–237. [CrossRef]
23. Ochoa-Gutierrez, L.H.; Vargas-Jimenez, C.A.; Niño-Vásquez, L.F. Fast estimation of earthquake arrival
azimuth using a single seismological station and machine learning techniques. Earth Sci. Res. J. 2019, 23,
103–109. [CrossRef]
24. Ross, Z.E.; Yue, Y.; Meier, M.-A.; Hauksson, E.; Heaton, T.H. PhaseLink: A deep learning approach to seismic
phase association. J. Geophys. Res. Solid Earth 2019, 124, 856–869. [CrossRef]
25. LANL Earthquake Prediction. 2019. Available online: https://www.kaggle.com/c/LANL-Earthquake-
Prediction (accessed on 15 March 2020).
26. Rouet-Leduc, B.; Hulbert, C.; Bolton, D.C.; Ren, C.X.; Riviere, J.; Marone, C.; Guyer, R.A.; Johnson, P.A.
Estimating Fault Friction From Seismic Signals in the Laboratory. Geophys. Res. Lett. 2019, 45, 1321–1329.
[CrossRef]
27. Baltay, A.; Ide, S.; Prieto, G.; Beroza, G. Variability in earthquake stress drop and apparent stress.
Geophys. Res. Lett. 2011, 38, 6. [CrossRef]
28. Chen, T.; Guestrin, C. XGBoost: A Scalable Tree Boosting System. In Proceedings of the 22nd International
Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 9 August 2016; pp. 785–794.
[CrossRef]
29. Hulbert, C.; Rouet-Leduc, B.; Johnson, P.A.; Ren, C.X.; Riviere, J.; Bolton, D.C.; Marone, C. Similarity of fast
and slow earthquakes illuminated by machine learning. Nat. Geosci. 2019, 12, 69–74. [CrossRef]
30. Ren, C.X.; Peltier, A.; Ferrazzini, V.; Rouet-Leduc, B.; Johnson, P.A.; Brenguier, F. Machine learning reveals
the seismic signature of eruptive behavior at Piton de la Fournaise volcano. Geophys. Res. Lett. 2020, 47,
e2019GL085523. [CrossRef]
31. Ren, C.; Dorostkar, O.; Rouet-Leduc, B.; Carmeliet, J. Machine Learning Reveals the State of Intermittent
Frictional Dynamics in a Sheared Granular Fault. Geophys. Res. Lett. 2019, 46, 7395–7403. [CrossRef]
32. Yin, L.; Andrews, J.; Heaton, T. Reducing process delays for real-time earthquake parameter
estimation–An application of KD tree to large databases for Earthquake Early Warning. Comput. Geosci.
2018, 114, 22–29. [CrossRef]
33. Rouet-Leduc, B.; Hulbert, C.; McBrearty, I.W.; Johnson, P.A. Probing slow earthquakes with deep learning.
Geophys. Res. Lett. 2020, 47, e2019GL08587. [CrossRef]
© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access
article distributed under the terms and conditions of the Creative Commons Attribution
(CC BY) license (http://creativecommons.org/licenses/by/4.0/).