A Case Study On Advanced CPT Data Interpretation: From Stratification To Soil Parameters
A Case Study On Advanced CPT Data Interpretation: From Stratification To Soil Parameters
A Case Study On Advanced CPT Data Interpretation: From Stratification To Soil Parameters
net/publication/379219532
CITATIONS READS
3 483
4 authors:
All content following this page was uploaded by Andreas-Nizar Granitzer on 23 March 2024.
ORIGINAL PAPER
Abstract The cone penetration test (CPT) is consid- determine constitutive model parameters from in-situ
ered as one of the most reliable in-situ tests and has tests using a graph-based methodology. In the present
found numerous applications in the geotechnical engi- work, the developed automated parameter determina-
neering field. Traditional CPT interpretation includes, tion framework is applied to evaluate the soil param-
but are not limited to the identification of the soil eters of one selected soil layer identified from the
stratification and the determination of soil parameters. CPT interpretations. Potential lines of research in the
This paper presents a case study concerning a test site context of CPT interpretation are explored through-
located in Salzburg, Austria, in which we focus on the out this work and may serve as valuable reference in
interpretation of CPTs from different perspectives. future research.
The manuscript is divided into three main sections
dealing with three different aspects of CPT interpreta- Keywords CPT · Machine learning · Parameter
tion, namely stratification, ground variability and soil determination · Stratification
parameters. The first strategy introduces a machine
learning based stratification identification strategy to
detect soil layer boundaries from CPT measurements. 1 Introduction
A comparison with reference solutions demonstrates
relative merits of this approach to classical filter Since its introduction to the geotechnical engineer-
algorithms based on empirical CPT classifications. ing field the cone penetration test (CPT) has been
The second strategy introduces an intuitive approach applied to numerous engineering tasks, such as
to evaluate the ground variability. This is achieved the estimation of bearing capacity, design of shal-
by calculating the level of fluctuation on the basis low and deep foundations, walls and embankments,
of CPT measurements and could be used as a data- as well as the assessment of ground improvement
driven decision-making tool for the improved design measures and the site resistance to liquefaction
of CPT investigation layouts. The third strategy is (Umar et al. 2018; Kumar et al. 2021). Neverthe-
embedded in an ongoing research project that aims to less, estimating soil parameters and site charac-
terization may be regarded as the main purposes
I. Marzouk (*) · A.-N. Granitzer · S. Rauter · for carrying out CPTs (Lunne et al. 1997; Schnaid
F. Tschuchnigg 2009; Niazi 2022). Essentially, these tasks are
Institute of Soil Mechanics, Foundation Engineering non-trivial and often carried out considering
and Computational Geotechnics, Graz University
empirical interpretation approaches and simplified
of Technology, 8010 Graz, Austria
e-mail: [email protected] assumptions that may have limited applicability in
Vol.: (0123456789)
13
Geotech Geol Eng
Fig. 1 Test site located in Salzburg, Austria. Note: the position of different tests is measured from point (0,0) located in the lower
left corner of the figure
Vol:. (1234567890)
13
Geotech Geol Eng
sand-silt mixtures (from 0 to 4 m) and sandy gravel of CPTs with the aim to generate ground models is
(from 4 to 7 m); (ii) sand-silt alterations; (iii) clayey frequently bound to the subjective rationale of the
silt. engineers involved.
In recent years, numerous ML techniques have
been applied to geotechnical engineering problems,
3 Machine Learning Based Strata Identification such as soil liquefaction, the estimation of ultimate
capacity of driven piles and prediction of tunnel
The generation of a site specific ground model that boring machine performance (Kumar et al. 2021;
describes the type and extent of the soil layers as well Samui 2012; Fattahi and Babanouri 2017); addi-
as the subsoil properties is one of the fundamental tional geotechnical application cases of ML models
processes in geotechnical engineering. In traditional are presented in Gomes Correia et al. (2013) and
approaches to this task, the results of available soil Ebid (2021). To reduce subjective influences on the
investigations, such as trenches and core drillings, ground model and ensure a robust and comprehensi-
are usually combined with soundings. Subsequently, ble basis for the planning and execution of geotechni-
individual soil layers and their properties are deter- cal structures, a data-driven strategy allowing for the
mined based on engineering experience. The mini- automatic derivation of ground models (that is, solely
mum number of required core drillings and soundings based on investigation data) is pursued. The under-
is specified in respective standards (e.g., ÖNORM lying database was published by the Institute of Soil
2017). This process requires from the engineers Mechanics, Foundation Engineering and Computa-
involved not only broad expertise in the interpretation tional Geotechnics at Graz University of Technology
of subsurface investigations, but often also specific (TU Graz) in 2021 (Oberhollenzer et al. 2021b).
knowledge of regional characteristics. In many cases,
the latter are only partially substantiated by empirical 3.1 Theoretical Considerations
interpretation schemes, such as Soil Behaviour Type
diagrams (e.g., Robertson 2009, 2010, 2016) or ana- The main objective of the CPT data interpretation
lytical correlations. Consequently, the interpretation strategy presented in this section is to automatically
Vol.: (0123456789)
13
Geotech Geol Eng
Vol:. (1234567890)
13
Geotech Geol Eng
Table 1 Selected metrics Target Feature Mean Standard deviation Min Max
describing database detailed
in Rauter and Tschuchnigg Oberhollenzer_classes Depth (m) 13.22 10.58 0.01 75.92
(2021)
qc (MPa) 5.34 8.41 −8.61 101.73
fs (MPa) 54.76 70.40 −99.90 1591.40
Rf (%) 2.49 38.16 −100.00 22,000.00
𝜎v (kPa) 251.12 200.94 0.19 1442.48
u0 (kPa) 122.54 103.23 0.00 744.48
𝜎 v ′ (kPa) 128.58 99.69 0.09 697.70
Soil behavior types Depth (m) 12.40 10.43 0.01 103.00
(SBT, SBTn, ModS- qc (MPa) 5.57 8.48 −8.61 122.90
BTn)
fs (MPa) 64.56 254.20 −100.00 47,436.00
Rf (%) 2.70 43.32 −100.00 30,000.00
𝜎v (kPa) 235.68 198.21 0.19 1957.00
u0 (kPa) 114.86 101.48 0.00 1010.43
𝜎 v ′ (kPa) 120.83 98.62 0.09 946.57
the results obtained with the original "Oberhollenzer_ parametric studies with the aim to ensure practica-
classes" (0.75). ble classification results. However, it is explicitly
stated that the optimization of these parameters is
3.4 Machine Learning Methodology part of ongoing research.
The classification scheme described in the above
In analogy to CPT data interpretation strategies paragraph serves as basis for the automatic determi-
based on “Oberhollenzer classes” or Soil Behaviour nation of soil layer boundaries. The density distribu-
Type Charts, noise may have a detrimental effect on tions of the respective categories are calculated and
the ML-based prediction of EN classes. In order to displayed on a chart using the Kernel Density Esti-
overcome this problem in the course of the soil layer mation (KDE) model employed in the scikit-learn
boundary detection a novel classification approach library (Pedregosa et al. 2021). In the present case, a
has been developed. Subsequently, key aspects of the "Gaussian" kernel with a bandwidth of 0.9 was found
underlying workflow are presented. suitable for this task. The dominant distribution in
As a first step, the CPT measurements are clas- each case describes the predominant category; a soil
sified using the ML model. Based on the predicted layer boundary is therefore recovered at the intersec-
class probabilities, the samples are divided into tion point of two dominant distribution curves. This
three categories, namely coarse-grained, fine- approach has the advantage that the noise in the data
grained and transitional. It should be pointed out is considerably suppressed. The disadvantage, how-
that the transformation of EN classes into these cat- ever, is that thin layers may be ignored; moreover, a
egories allows for the direct identification of soil very uneven distribution of categories may lead to
layer boundaries. The class probabilities are used inaccurate positions of the predicted soil layer bound-
to categorize the classified CPT measuring points aries. The methodological steps can be summarized
on the basis of empirical threshold values. In more as follows (Rauter and Tschuchnigg 2022):
detail, a sample is classified as “coarse-grained” if
the proportional sum of gravel and sand exceeds 1. For each EN class, the determinant soil class is
80%; inversely, the measuring points are regarded as evaluated. For example, for siSa (si = silt, Sa =
"fine-grained" if (1) the proportion of clay and silt sand), sand would be identified as dominant soil
exceeds 65% and (2) the proportion of gravel and class as well as the corresponding target class
sand is less than 20%. Points for which no distinc- with respect to the ML model.
tion can be made are categorized as "transitional". 2. A classification model based on a random forest
The threshold values were defined on the basis of algorithm is built and trained using the input fea-
Vol.: (0123456789)
13
Geotech Geol Eng
tures (qc, Rf , depth) and the new targets (gravel, 5. The soil class density distributions computed
sand, silt, clay, organics). with the KDEs are plotted in one graph, whereas
3. In the scope of the predicted soil class labels, the curves with the highest magnitude are locally
class probability is of particular interest as it is considered as prevailing soil type. Intersections
used to distinguish between the fine-grained, of the distribution curves represent soil layer
coarse-grained or transitional soil labels. For boundaries.
example, if the sum of the probabilities for sand
and gravel is greater than the sum of the probabil- The interested reader may refer to Rauter and
ities for clay and silt (e.g., 80/20 ratio), the sam- Tschuchnigg (2022) for more detailed information
ple would be classified as coarse-grained, oth- about the ML-based methodology, in combination
erwise as fine-grained, and if no clear statement with a simple benchmark test.
can be made, the sample would be classified as Figure 3 visualizes the presented workflow. The
transitional soil. first plot shows the CPT measurement data ( Rf and
4. The identified soil classes (coarse-grained, fine- qc). The second plot indicates the classification
grained and transitional) are displayed by means according to Robertson’s normalized Soil Behaviour
of a binary plot, where each CPT recording Type chart (Robertson 2009), which serves as refer-
is associated with a unique soil class (coarse- ence basis for comparison with the EN-based classi-
grained, fine-grained and transitional). Based fication. The third plot yields the probability distri-
on this distribution, a kernel density estimation bution of the EN classes. The fourth plot shows the
is performed for each class. It is found that a binary plot with distinct soil class labels in vertical
Gaussian kernel with a bandwidth = 0.9 (that is, direction (coarse-grained, fine-grained and transi-
a hyperparameter that controls the smoothness of tional). The fifth plot depicts the probability density
the class probability curves) is reasonable to pre- distribution of the three soil classes. In the present
dict realistic soil layer boundaries. case, two intersections corresponding to two soil
Fig. 3 Example for the presented workflow. Plot of the CPT data, SBT classification, class probabilities of EN-classification, clas-
sification in the three newly introduced categories and density distribution (f.l.t.r.)
Vol:. (1234567890)
13
Geotech Geol Eng
Fig. 4 Cross-section A-A. Comparison between predicted soil layer boundaries and reference solution (dashed line)
Fig. 5 Cross-section B-B. Comparison between predicted soil layer boundaries and reference solution (dashed line)
layer boundaries can be identified (black dash-dotted defined by engineers in the course of a construction
lines). project. In addition to the CPTs, core drillings and
In the next subsection, the presented workflow is seismic dilatometer tests (SDMT) served as a basis
applied to seven different CPTs to identify the soil for soil layer determination in the reference model,
layer boundaries of the test site described in Sect. 2. which will be referred to as reference solution or ref-
For validation purposes, a comparison with the engi- erence boundaries.
neering ground model generated in the course of the Figures 4b and 5b show the spatial distribution of
construction project is provided as well. the predicted soil layers from two different perspec-
tives, in which the black vertical lines show the posi-
3.5 Results tion of the individual CPTs, the red surfaces mark the
top of the ground model, the green surfaces indicate
The ground model under study follows the dimen- the layer boundaries between coarse-grained and tran-
sions of the rectangular test site (approximately sitional soil, and the blue surfaces illustrate the tran-
90x50 m and a depth of 40 m). The arrangement sition from transitional to fine-grained soil. The sur-
of the CPTs in the ground model corresponds to faces were recovered employing the widely-applied
the actual arrangement of the soundings in the field surface triangulation interpolation technique; com-
(see Fig. 1). The sections used for comparison were pare Felic (2023).
Vol.: (0123456789)
13
Geotech Geol Eng
In the next step, the predicted soil layers are com- utilize an appropriate database that shall be continu-
pared to the reference solution along two selected ver- ously maintained to ensure a satisfactory performance
tical cross-sections. For the purpose of comparison, of the presented strategy in future applications. Fur-
the locations of these cross-sections are based on the thermore, the hyperparameter set employed in the ML
sectioning originally established during the design of and KDE models have to be evaluated and updated as
the original construction project. The first cross-sec- soon as the database is extended with new samples.
tion extends along CPTu 1/12, 2/12 and 3/12, and is This aspect is part of ongoing research at the Insti-
illustrated in Fig. 4a. The coloured areas represent the tute of Soil Mechanics, Foundation Engineering and
individual layers according to the three categories, Computational Geotechnics.
i.e., coarse-grained, fine-grained and transitional. The
dashed lines mark the soil layer boundaries of the
reference solution (as used for the design). As can be 4 Dynamic Updating Strategy for CPT Layouts
observed from the results, a very good agreement of
the predicted soil layer boundaries and the reference The spatial interpretation of CPT measurements, for
solution is achieved. The maximum deviation from example, on the basis of ML techniques as discussed
the reference solution is around 1 m. in Sect. 3, is sensitive to the CPT investigation layout
The second cross-section used for comparison (Jaksa et al. 2005b). Since the spatial interpretation is
extends along CPTu 2/12, 5/12 and CPT 1/15. (Note: deterministic at measured locations, it is unknown at
CPT 1/15 and CPTu x/12 were executed at different unmeasured locations and needs to be assessed based
times and from non-identical levels). Results con- on predictions from the available limited number of
cerning cross-section B-B are shown in Fig. 5a. Simi- measurements (Kulatilake and Um 2003). In many
lar to cross-section A-A, the soil layer boundary at the instances, the accuracy of predictions increases as the
interface between “transitional” and “fine-grained” scope of investigations and available data increases.
agrees reasonably well with the reference solution. However, narrow-spaced CPT layouts parallel both
However, the soil layer boundary at the interface high costs spent in carrying out the investigations and
between “coarse-grained” and “transitional” shows redundant information. Moreover, since every CPT
considerable deviations from the reference solution, leaves a zone of permanently disturbed soil where
especially close to CPT 1/15. soil properties could be considerably different from
the original in-situ state, there is need to account for
3.6 Applicability in Engineering Practice a minimum spacing between adjacent CPT locations
(Al-Sammarraie et al. 2022). Consequently, the selec-
The presented methodology for data-driven and auto- tion of the optimal location and number of CPT tests
matic identification of soil layer boundaries based on represents a multivariate optimization problem that
CPT measurement data represents an attractive alter- may be regarded as trade-off between stratigraphic
native to classical filter algorithms that are currently uncertainty and site investigation costs; cf. Kahlström
used for the determination of soil layers boundaries. et al. (2021).
Comparative analyses presented in this section indi- In common practice, the scope of the geotechni-
cate a good agreement of the predicted soil layer cal site investigations rarely addresses the antici-
boundaries with the reference solutions. However, pated variability of the ground, but it is rather chosen
several open research aspects deserve attention. First, to reduce initial costs. It is widely accepted that this
the performance of ML models used for geotechni- cost-driven approach is likely to result in cost-over-
cal problems is significantly controlled by the qual- runs, for example, due to construction delays invoked
ity of training samples. In addition, it relies on the by unforeseen conditions or uneconomic designs
amount of data and does not extrapolate well beyond resulting from inadequate stratigraphic models (Maz-
the scope of provided training patterns. There is also zoccola et al. 1997; Goldsworthy et al. 2004; Jaksa
a need to develop tailored machine learning methods et al. 2005b). The above discussion inevitably raises
that can account for uncertainty in the global data, the following question (Xie et al. 2022): How to use
variability between sites, and realistic sample sizes a limited budget to estimate the ground variability
(Phoon and Zhang 2023). It is therefore crucial to with the lowermost uncertainty? This section aims to
Vol:. (1234567890)
13
Geotech Geol Eng
contribute towards answering this question by means As indicated by Fig. 6a, 𝜙(x)-values close to 1
of an intuitive strategy that allows for an increase of signify a high degree of spatial variability in related
the ratio between gained understanding of the spatial soil profiles (for example, due to the nugget effect
subsoil variability and costs spent for carrying out (Jaksa et al. 2005a)), whereas 𝜙(x)-values slightly
CPTs. The applicability of this method is demon- above 0 indicate regions of nominally homogeneous
strated based on both synthetic and real test cases. soil properties. In simple terms, 𝜙(x) allows to iden-
tify spatial regions of low correlation between neigh-
4.1 Methodology bouring CPT datasets, thereby taking into account
the proximity of unsampled points to available CPT
Unlike conventional methods that traditionally con- locations by means of a deterministic spatial weight-
cern the ground variability solely based on the dis- ing approach. To prevent the execution of CPTs
tance between CPTs, such as kriging-based (Krige that result in redundant information being obtained,
1951) or distance-from-nearest-information (Weil the CPT layout should be sequentially densified in
2020) sources, the presented framework combines regions with high 𝜙(x)-values. With reference to 6a,
geotechnical and spatial sampling information to carrying out a new CPT at x1 would yield valuable
reduce the stratigraphic uncertainty in the field where insight to the soil stratification, whereas additional
CPT measurements are conducted. In more detail, the information gained from placing an additional CPT at
proposed strategy characterizes the 3D stratigraphic x2 is expected to be relatively limited. From a practi-
model based on the dimensionless scalar-valued field cal perspective, this enables to gain maximum insight
quantity termed level of fluctuation 𝜙(x) ∈ [0, 1] (not to the 3D stratigraphic model at minimum costs.
be confused with the scale of fluctuation occupying Nevertheless, it must be noted that 𝜙(x) introduces a
the unit m (Vanmarcke 1977)). In this respect, 𝜙(x) relative measure that helps to detect local regions of
embodies an unbiased estimator capable of character- high spatial variability on a site-specific level only; as
izing the spatial variability of the CPT measurements. a consequence, quantifying generally applicable 𝜙(x)
Fig. 6 (a) Schematic description of scalar-valued field quantity termed level of fluctuation 𝜙(x) and (b) framework of proposed
model
Vol.: (0123456789)
13
Geotech Geol Eng
-limits that can readily be adopted to any site investi- number of CPTs considered, the vertical coordinates
gation lies beyond its current capabilities. z, zmin and zmax are described in Fig. 6a. Equation 1
Figure 6b illustrates the process of generating the is numerically integrated using the trapezoidal New-
level of fluctuation which has been implemented in ton-Cotes rule with a step size equal to the CPT pen-
an object-oriented framework using the programming etration interval, traditionally h = 0.01 m. According
language Python. It is worth mentioning that the pre- to this nomenclature, L2-norm matrix entries A(i,j)
sented framework is theoretically applicable to any describe the deviation of qc-profiles obtained from the
continuously sampled 1D quantity, such as the sleeve ith and jth CPT recording, respectively. As diagonal
friction fs, friction ratio Rf or cone tip resistance qc. terms (i = j ) concern the inverse correlation between
For the sake of clarity, subsequent considerations are identical qc-profiles, the corresponding diagonal fill-
restricted to qc-samplings since the latter is frequently ins reduce to zero (i.e., they are perfectly correlated;
used as critical parameter in the CPT-based pile design see Fig. 7a).
(Bustamante and Gianesselli 1982; Hamza and Bellis Step 3: Partition of stratigraphic model
2016; Abu-Farsakh and Shoaib 2024). The details of As indicated in Fig. 7b, a regular tessellation pat-
this procedure can be summarized as follows: tern is used to generate a quadratic grid. The level
Step 1: Loading and pre-processing of CPT data of fluctuation 𝜙 is evaluated at cell-centered discrete
The one-dimensional qc-profiles obtained from points xp ∈ ℝ2. As suggested by previous researchers
CPT field tests are simultaneously loaded and cleaned. in view of similar problems (Jaksa et al. 2005b; Xie
The latter is realized by means of the 1D Gaussian fil- et al. 2022), it is recommended to select a minimum
ter technique, similar to Liu et al. (2021). This allows grid size of 0.5 x 0.5 m. This ensures a convenient
to minimize spurious contributions to the calculation resolution of the continuous field quantity by means
of 𝜙(x) originating from noise and outliers. of discrete 𝜙(xp )-values, as well as a high level of
Step 2: L2-norm matrix computational efficiency considering the large scope
The inverse degrees of correlation between indi- of many geotechnical fields.
vidual pairs of qc-profiles, q(CPTi)
c (z) and qc
(CPTj)
(z), are Step 4: Level of fluctuation
quantified through the scalar-valued entries A(i,j) of The discrete evaluation of 𝜙 at all cell-centered
the symmetric L2-norm matrix A ∈ ℝn×n: grid points xp forms the core of the multi-step pro-
(CPTj )
cedure and comprises several sub-steps; see Fig. 8a.
A(i,j) =‖q(CPT
c
i ) (z) − q
c (z)‖L2 (zmin ,zmax ) The first sub-step concerns the evaluation of the dis-
� tance vector d ∈ ℝn which contains the horizontal
zmax (1)
∫zmin
length between xp and all CPT positions; see Fig. 8b.
(CPTi ) (CPTj )
= [qc (z) − qc (z)]2 dz
The calculated values are stored in the form of an
attribute table at all xp, similar to Fig. 7a. In the next
In Eq. 1, i, j ∈ [1, n] denote CPT index numbers,
such as considered in Fig. 7b, where n is the total
Fig. 7 (a) Numbering system of L2-norm matrix A and (b) corresponding CPT layout
Vol:. (1234567890)
13
Geotech Geol Eng
Fig. 8 (a) Sub-steps required to evaluate 𝜙(xi ) and (b) point-wise evaluation of distance vector d
sub-step, d is used for a point-wise evaluation of the distributed soil properties where further investiga-
weight matrix W(xp ) ∈ ℝn×n, formally written as: tion is supposed to yield redundant information; cf.
Fig. 6.
1∕[di (xp ) ⋅ dj (xp )]2 It should be pointed out that the CPT layout opti-
W(xp )(i,j) = ∑n ∑n (2)
i=1 j=1 1∕[di (xp ) ⋅ dj (xp )]
2 mization procedure presented in this subsection
can readily be extended to analyse the ground vari-
In Eq. 2, the weight matrix entries W(xp )(i,j) account ability in the third dimension (vertical z-direction)
for the proximity of xp to the locations of considered as well; for this purpose, the level of fluctuation is
CPT pairs, and satisfy the condition of an unbiased alternatively computed at voxel-centered discrete
∑n ∑n
estimator i=1 j=1
W (i,j) = 1. As formulated in points xp ∈ ℝ3. This allows, for example, to antici-
Eq. 3, the final sub-step links contributions originat- pate potential consequences of a CPT investigation
ing from geotechnical data A(i,j) with spatial sample depth reduction, or to cross-check whether FE sub-
information Wp (i,j) in the form of the level of fluctua- domains experiencing high pile load concentrations
tion 𝜙(xp ): are circumscribed by reliable site investigation data.
∑n ∑n (i,j)
From a computational point of view, this requires
i=1 j=1 A ⋅ W (i,j) the consecutive application of the model framework
𝜙(xp ) = �∑ ∑
n n (i,j) (i,j)
� (3) described in Fig. 6b on multiple equally spaced lay-
max A ⋅ W
i=1 j=1
ers with constant depth interval, instead of a single
formation spanning the entire CPT investigation
The normalizing denominator component depth, see Fig. 6a. As a consequence, the integra-
involves a loop over all discrete points and facili- tion limits ( zmin , zmax ) in Eq. 1 have to be replaced
tates a systematic quantification of the sought field by corresponding pairs of interval boundaries,
quantity in a range between 0 and 1. whereas the mean value representing the z-coordi-
Step 5: Visualization and interpretation nate is additionally stored at all xp . Moreover, since
The level of flucation at discrete data points the normalization is now enforced on xp occupied
𝜙(xp ) is effectively visualized using standard trian- by different layers, the normalizing denomina-
gular irregular networks in combination with 2D tor component in Eq. 3 has to be equipped with an
filled contours. On a site-specific scale, this allows additional loop over all layers, each of which incor-
to anticipate regions with high spatial variability porates a set of xp where the three-dimensional level
inside the 3D stratigraphic model, which can be of fluctuation 𝜙3D (xp ) ∈ [0, 1] is explicitly com-
mathematically described by 𝜙(xp ) → 1−. Inversely, puted. For the sake of clarity, corresponding figures
𝜙(xp ) → 0+ characterize areas of homogeneously explaining both strategies (𝜙 , 𝜙3D ) are reported in
the subsequent Sects. 4.2 and 4.3.
Vol.: (0123456789)
13
Geotech Geol Eng
Fig. 9 Site investigation schemes and corresponding 𝜙(xp )-distributions obtained with (a) homogeneous, (b) cross-symmetric and
(c) heterogeneous case; (d) CPT data considered
Fig. 10 L2-norm matrices obtained with (a) homogeneous and (b) heterogeneous CPT layout
Vol:. (1234567890)
13
Geotech Geol Eng
vertical and horizontal site axis, peak-valued 𝜙(xp )- Unlike the synthetic test cases investigated in
areas evaluated for the heterogeneous test case are (as Sect. 4.2, 𝜙-values are also calculated at spatial
expected) non-symmetric. Obviously, they are shifted points xp that are located outside the convex hull-
towards the site edge connecting CPT 4 and CPT 1, polygon; i.e., the connection line of outermost CPT
or more specifically, the site edge between the CPT samplings (Fig. 11a). At a first glance, the extrapo-
pair occupying the maximum L2-norm matrix entry, lated 𝜙-regions appear consistent with their interpo-
see Fig. 10b. Again, this observation replicates the lated counterparts in close proximity to the convex
anticipated pattern, and confirms the reliability of the hull-polygon. However, due to the lack of physical
implemented workflow. evidence in extrapolated regions the use of related
results inherently implies a certain degree of ran-
4.3 Real Case Application domness (Henke et al. 2020); hence, care should be
taken when using extrapolated 𝜙-values for CPT lay-
In this subsection, the proposed strategy is employed out optimization purposes. As expected, minimum 𝜙
to identify both the optimal CPT location and a rea- -values are obtained in close proximity to the position
sonable investigation depth to supplement CPT sam- of recorded CPT samplings. This observation fol-
pling data described in Sect. 2. The corresponding lows the same pattern as discussed in Sect. 4.2 and
qc-recordings obtained from 10 CPTs are plotted in showcases the influence of the weight matrix W ; see
Fig. 11b. In accordance with Sect. 3.5, the adopted Sect. 4.1. On the contrary, Fig. 11a predicts the maxi-
local coordinates comprise a vertical z-axis starting mum level of fluctuation at coordinates (x = 20 m,
at the basement level. In this respect, the investiga- y = 36 m); in turn, the latter represents the optimal
tion area includes the subsoil formation extending position for the execution of subsequent CPT testing
between 0 m and -30 m of the test site, with at least in order to acquire maximum insight to the natural
two relevant soil layer boundaries; cf. Sect. 3. variability of qc, respectively, at minimum costs.
Fig. 11 (a) Site investigation layout and corresponding 𝜙(xp )-contour plot, as well as (b) 𝜙3D (xp )-contour plots presented in y–z and
x–y plane, respectively; (c) CPT data considered
Vol.: (0123456789)
13
Geotech Geol Eng
The above observations incorporate information early design phase, the ability to effectively com-
on the horizontal variability of qc, but only limited municate geotechnical information and risk associ-
information on its vertical distribution. Nevertheless, ated with the 3D stratigraphic model to all project
soil properties are typically distributed with greater stakeholders including authorities, architects, cli-
variation in the vertical than in the horizontal direc- ents, contractors and engineers is a central challenge
tion, primarily due to geological processes that form for geotechnical engineers (Kahlström et al. 2021).
soils (Jaksa et al. 2005b); for example, this tendency In this context, the proposed strategy may help geo-
can be triggered by changes in the porewater chemis- technical engineers to meet this demand. In principal,
try and groundwater levels, the influence of climatic the authors have identified three use cases where the
effects or the diagenesis; cf. Graham and Shields presented strategy may extend the toolbox of geotech-
(1985). Consequently, it may be important to account nical engineers: (1) objectification and prioritisation
for the vertical ground variability when updating the of CPT layout design, (2) effective communication
CPT layout. The incorporation of this aspect also of geo-spatial uncertainty to project stakeholders
facilitates a data-driven determination of the CPT through the power of visualization and (3) quality
investigation depth on the basic premise of uncer- assurance of CPT measurements (note that peak 𝜙
tainty reduction, rather than randomness. /𝜙3D-values may also be triggered by measurement
As theoretically explained at Sect. 4.1, the method- errors).
ical framework can readily be extended to allow for a In view of the current trend to integrate 3D strati-
three-dimensional analysis of the test site soil domain graphic models to the Building Information Modeling
on the basis of 𝜙3D (xp ). From a geometrical point of (BIM) process (Henke and Lerch 2020; Giangiulio
view, the only modification in the presented workflow et al. 2022), 𝜙/𝜙3D-values may be stored as soil vol-
for calculating the level of fluctuation concerns the ume attribute. In this way, they can be readily used
partition of the stratigraphic model into cubic voxels, as relative measure that link the level of uncertainty
instead of two-dimensional quadratic grid elements; with geotechnical parameters listed in respective soil
see step 3 in Sect. 4.1. Consequently, 𝜙3D-values are volume attribute tables. Moreover, 𝜙/𝜙3D-attributes
calculated at voxel-centered discrete points xp ∈ ℝ3 may be dynamically updated as new CPT samplings
which are defined in all three spatial dimensions. As are performed using the framework documented in
depicted in Fig. 11a, the abundant data is effectively Fig. 6b.
visualized by means of perpendicular cross-sections For the sake of clarity, the demonstration cases
presented in both the x–z and y–z-plane, respectively. presented above assess the geotechnical contribution
Obviously, peak 𝜙3D-values indicating a high verti- to 𝜙/𝜙3D-quantities solely based on qc-recordings (sin-
cal variability occur in the near-surface whereas soil gle-factor approach). Depending on the site investiga-
regions at vertical distances greater than 6 m below tion objectives and the soil properties, it may be ben-
the basement are dominated by fairly homogene- eficial to replace Eq. 1 by a weighted-averaging
ous geotechnical conditions. From a practical point scheme, in which the L2 −norm matrix entries A(i,j)
of view, subsequent site investigation effort should subsume contributions stemming from the cone tip
therefore focus on the near-surface subsoil instead resistance A(i,j)
q
, the sleeve friction Af and the fric-
(i,j)
tives, which could be a conceivable alternative for approach, however, is beyond the scope of this paper,
site investigation in near-surface regions, cf. Palla and subject of ongoing research.
et al. (2008).
5 An Automated System for Determining
4.4 Generalisation and Employment in Practice
Constitutive Model Parameters
It is widely acknowledged that, in building projects
Numerical analyses have several advantages com-
and civil engineering, the main source of financial
pared to traditional methods. One of the main advan-
and technical risk lies in the ground (Jaksa et al.
tages is the level of detail that could be obtained in
2005b; Rana et al. 2023). In particular during the
Vol:. (1234567890)
13
Geotech Geol Eng
Vol.: (0123456789)
13
Geotech Geol Eng
Nevertheless, other SBT charts (such as Robertson’s Table 2 SBT zones Robertson (2010)
normalized chart (Robertson 2009) and Robertson’s Zone Soil behaviour type (SBT)
updated normalized chart (Robertson 2016)) could
be used as well. Afterwards, the CPT profile is strati- 1 Sensitive fine-grained
fied into several layers sharing the same SBT. At the 2 Clays—organic soil
moment APD does not use any ML based strata iden- 3 Clays: clay to silty clay
tification (as discussed in Sect. 3). However, the pos- 4 Silt mixtures: clayey silt & Silty clay
sibility of using ML for the soil layer detection (with 5 Sand mixtures: silty sand to sandy silt
SBT charts) will be added to the framework. For each 6 Sands: clean sands to silty sands
layer, the CPT measurements (qc, fs and u2) are aver- 7 Dense sand to gravelly sand
aged in a layer-wise manner. The third module (Layer 8 Stiff sand to clayey sand (overconsolidated)
state) uses the averaged CPT measurements to deter- 9 Stiff fine-grained (overconsolidated)
mine the state of the layers (overconsolidation ratio
OCR and coefficient of earth pressure K0). The fourth
module (Graph-based approach) uses the output of 5.1.3 Graph‑Based Approach
modules 2 and 3 to connect parameters to equations
(correlations) and calculates the parameters. The Graph theory is a branch of discrete mathematics
final module (Constitutive model parameters) con- where relationships between different objects within
verts parameters calculated in module 4 to constitu- a network are studied. The network is described by
tive model parameters. The system is built in the pro- two different objects, namely nodes and edges. Nodes
gramming language Python. describe the entities of the graph, while edges illus-
The layering process and determination of the trate the relationship between two nodes. One of the
"Layer state" are not considered (modules 2 and 3) main advantages of graphs is their ability to repre-
in this contribution. It has to be pointed out that the sent complex systems (e.g., transportation network
layering used in this case study is based on the strati- (Likaj et al. 2013) and social media (Chakraborty
fication obtained from the ML models presented in et al. 2018)). The APD framework uses a weighted
Sect. 3, thus, the implemented layering algorithms directed graph, where the direction between two
have not been used. Furthermore, this section only nodes (sharing a relationship) in the graph is defined
illustrates the determination of soil parameters (out- (Van Berkom et al. 2022). In addition, weights could
put of module 4) without the transition to constitutive be assigned to the edges between the nodes. One-way
model parameters (module 5). roads are an example of weighted directed graphs,
where roads could have an assigned weight represent-
5.1.2 SBT Interpretation ing distance or travel time.
Van Berkom et al. (2022) illustrated the con-
In the present study, the CPT profile is classified cept of the graph-based approach employed in APD
according to Robertson’s modified non-normalized in detail. Figure 13 summarizes the graph-based
SBT chart (Robertson 2010). The chart is based on approach, where source parameters (CPT raw data)
the dimensionless cone resistance (qc ∕pa, where pa are linked via intermediate parameters to destination
is the atmospheric pressure) and friction ratio ( Rf in parameters (soil or constitutive model parameters).
percent, Rf = fs ∕qc 100%). The chart is divided into By using a given set of correlations, the system
nine different zones corresponding to different soil creates all paths (chains of correlations) that link
behaviour types (Table 2). Therefore, fine and coarse- source parameters to the destinations parameters,
grained soils are distinguished by this module. Never- and calculates the values of the destination param-
theless, the classification is not limited to Robertson’s eters from the input values of the source parameters
modified non-normalized SBT chart, as other charts (CPT raw data). Within the framework of APD, the
are already implemented in the system (e.g., Robert- terms ‘correlation’, ‘formula’, ‘equation’, ‘rule of
son’s normalized SBT chart (Robertson 2009) and thumb’ are replaced by the term ‘method’. Param-
Robertson’s updated normalized SBT chart (Robert- eters could be determined based on several ways
son 2016)).
Vol:. (1234567890)
13
Geotech Geol Eng
Fig. 13 Graph-based
approach implemented in
APD
(e.g., tables and charts), therefore this general term a result, method nodes have several incoming edges
is used (Van Berkom et al. 2022). which indicate all the required parameters for this
Searching for a path in a network from one node method (Van Berkom et al. 2022). Methods and
to another is a well-known problem, which is widely parameters that share a relationship must be linked.
used in several applications (Shu-Xi 2012). Sev- For example, a method to compute the coefficient of
eral graph algorithms exist which allow for solving earth pressure according to Jaky (1944) considers
the shortest path problem (finding the shortest path the well-known relationship K0 = 1 − sin(𝜑� ), where
between two nodes in a graph). Nevertheless, graph K0 is the coefficient of earth pressure at rest and
algorithms do not apply to branching paths. Within 𝜑′ denotes the effective internal friction angle of
the parameter determination framework, a path to the the soil. The system must determine the input and
destination node can have more than one source node output for this method (𝜑′ and K0 respectively) and
as the parameters in the path can be obtained from generate links connecting these parameters.
multivariable formulas, which depend on multiple
input parameters (branching paths occur in the frame- 5.1.4 Generating the Graph
work). Consequently, the existing graph algorithms
cannot be applied to the parameter determination As discussed in Sect. 5.1.3, the relationships between
framework (Van Berkom et al. 2022). parameters and methods are defined based on the
To deal with branching paths, two types of nodes output and input(s) of different methods. Parameters
are implemented, namely parameters (green nodes and methods are considered as external inputs to the
in Fig. 14) and methods (blue nodes in Fig. 14). system. Two input files are required, namely, "meth-
Most often method nodes are empirical correla- ods" and "parameters". Due to the adaptability of the
tions that depend on more than one parameter. As framework, users may extend the standard database
of methods and parameters provided by the current
Vol.: (0123456789)
13
Geotech Geol Eng
version of the system. The system generates links recommended to define the units of all parameters.
between the methods and parameters and calculates The Constraints field is used to apply lower and upper
the intermediate and destination parameters. The two bounds to the parameters. If the calculated value is
different input files used to generate the graph are lower than the lower bound or higher than the upper
provided in comma-separate values (CSV) format bound, it would be discarded. This allows to neglect
(corresponding to parameters and methods). unrealistically high or low values for parameters.
Each CSV file is characterised by special prop- Similar to the reference field in the methods CSV file,
erties. With regard to the methods CSV file, it is the description field is an optional argument, where
required to define the several properties, namely the user could describe the parameter (e.g., OCR is
method_to, formula, parameters_in, parameters_out, the overconsolidation ratio).
validity and reference. The latter should be provided By defining these two CSV files (methods &
by the user in the methods CSV file. Following the parameters), the system imports the two files and cre-
above example concerning the earth pressure at ates links between related methods and parameters
rest coefficient, method_to presents the name of the (parameters_in and parameters_out). The output of
method; in this case it could be method_to_K0. In the module 4 (Graph-based approach) is a graph present-
formula’s field, the corresponding equation should be ing all the links between all the defined parameters
provided, i.e., 1 − sin(𝜙� ). Parameters_in implicitly and methods as well as the calculated values for dif-
defines the input(s) for this method, i.e., 𝜙′. Follow- ferent parameters (e.g., see Fig. 14).
ing the same definition, parameters_out states the
output of the method, i.e., K0. The applicability of
different methods is defined in the validity field. As 5.2 CPT Interpretation
some methods are suitable for all types of soils, other
methods are only valid for coarse-grained soils, while A standard validated database for methods and
others are only applicable for fine-grained soils. parameters has been compiled and is continuously
As presented in Table 2, the SBT is based on Rob- updated and improved. The current version of the
ertson’s modified non normalized SBT chart (Rob- database consists of more than 100 methods and
ertson 2010). Consequently, the validity is (mainly) parameters. Nevertheless, users are responsible for
defined in terms of SBT. If the method is only appli- validating the outcome of the system, even if they
cable to silt, the validity would be SBT(4). Concern- used the provided standard database. They still need
ing the method of coefficient of earth pressure at rest, to apply their experience and knowledge to the out-
the validity would be SBT(1234567). Reference is put of the system. However, with limited geotechnical
an optional field, where the user could mention the knowledge, the system should provide reasonable val-
author of the method (e.g., Jaky_1944). ues for different parameters. However, using all of the
Moving to the parameters CSV file, the following methods in the database could lead to a scatter in the
properties need to be defined: symbol, value, unit, obtained values, which will make the representation
constraints and description. Any parameter that has of the results challenging. To circumvent this problem
been defined in the methods CSV file (in the fields in the present study, graphs are only created based on
of formula, parameters_in and parameters_out) must a selected number of methods and parameters that are
be defined in the parameters CSV file. In the symbol presented as follows:
field, the notation of the parameter (that has been
used in the methods CSV file) is stated (e.g., u for 5.2.1 Initial Parameters
porewater pressure). In the value field, the user could
specify a value for a parameter (e.g., unit weight of An initial estimate of the total (𝜎v) and effective
water). It could also be used by the user to provide (𝜎 v ′) stress is required to compute some of the CPT
"known" values as direct "input" for some param- parameters (e.g., normalized cone resistance). Con-
eters. Unit is an optional field, where the user could sequently, the unit weight needs to be assessed at
provide the unit for all parameters. In order to avoid the beginning of the analysis. In the considered case
problems originating from unit conversion (e.g., using study the initial unit weight is computed according to
qc in MPa in a method that requires qc in kPa), it is Robertson and Cabal (2010):
Vol:. (1234567890)
13
Geotech Geol Eng
𝛾t = 𝛾w [0.27[log Rf ] + 0.36[log (qt ∕pa )] + 1.236] (4) In the following Sects. 5.2.1, 5.2.2, 5.2.3 and 5.2.4,
the selected methods and parameters are presented.
where 𝛾w is the unit weight of water and qt is These methods and parameters serve as basis for gen-
the corrected cone resistance, i.e., defined as erating the graphs in module 4, as will be shown in
qt = qc + (1 − a) × u2, where a is the cone tip net area Sect. 5.3. As mentioned above, in total more than 100
ratio. methods are implemented in the APD framework.
The porewater pressure (u0) is calculated based on
the ground water level (GWL). The GWL is provided
5.2.2 Stress History
in the CPT GEF file, otherwise it could be speci-
fied by the user. Based on the previous information,
Stress history is often represented by the overconsoli-
the total and effective stress are computed. As a key 𝜎p�
result, the following “CPT parameters” are computed: dation ratio defined as OCR = 𝜎v�
, where 𝜎p′ is the ver-
tical preconsolidation stress. Numerous methods are
• Normalized cone resistance available in the database to compute OCR. In this
q t − 𝜎v study, two methods were selected to determine OCR
Qt = (5) as follows:
𝜎v�
𝜎p′ 0.33(qt − 𝜎v )m
′
− 0.15 ≤ 1.0
( 𝜎� )
n =0.381(Ic ) + 0.05 v
(8) OCR = 0.33 × Qtn (12)
pa 2.
by Kulhawy and Mayne (1990) and Robertson
√ (2009), where Qtn is defined in Eq. 7. This cor-
Icn = (3.47 − log Qtn )2 + (log Fr + 1.22)2 (9) relation is only valid for fine-grained soils.
adequate method for assessing the unit weight as it by Mayne et al. (2009), where Qt and Bq are defined
influences the total and effective stress, which in turn in Eqs. 5 and 6, respectively. The valid range for this
influences the calculated “CPT parameters”. correlation is 0.1 ≤ Bq ≤ 1.0 and 20◦ ≤ 𝜙′ ≤ 45◦ and
it is only applicable for fine-grained soils.
Vol.: (0123456789)
13
Geotech Geol Eng
G0 = 2.78q1.335
c (19) Table 3 Machine learning based stratification of CPT 5/12
1.
by Mayne and Rix (1993); this correlation is Layer Start (m) End (m)
only valid for clays.
Layer 1—coarse-grained soil 0.00 8.50
G0 = 50pa (qt − 𝜎v )m∗
(20) Layer 2—transitional-soil 8.50 21.90
2. Layer 3—fine-grained-soil 21.90 40.00
Vol:. (1234567890)
13
Geotech Geol Eng
were generated by using a limited number of methods of these parameters were used as input parameters
and parameters. for the selected methods presented in Sects. 5.2.1,
5.2.2, 5.2.3 and 5.2.4. For each layer, the first and
5.3 Determination of Soil Parameters last measurements are filtered out. This procedure
reduces the effects of the neighbouring layers. For
This subsection demonstrates the output of the APD. layer 3 the averaged CPT data yield qc = 1392.8 kPa,
The two CSV files (imported by the system as inputs) fs = 29.4 kPa and u2 = 1008.7 kPa at 30.95 m depth.
include the parameters and methods presented in The groundwater level (GWL) is located at 3 m below
Sects. 5.2.1, 5.2.2, 5.2.3 and 5.2.4. the ground level. The cone tip net area ratio is pro-
The CPT data processed in this study is derived vided in the CPT GEF file as 0.85 (a = 0.85). The
from the test site described in Sects. 2, namely CPTu unit weight of water is defined as 10 kN∕m3. The
5/12 (Fig. 2). The CPTu soundings collected for this atmospheric pressure ( pa) corresponds to 100 kPa.
study revealed issues with the u2 measurements, par- Figure 14 shows the generated graph for layer 3.
ticularly in Layer 1 (coarse-grained material) and Two types of nodes are presented in the graph.
Layer 2 (sand-silt alteration). However, u2 measure- Parameters are represented by green nodes, while
ments were only used in Layer 3 (fine-grained mate- blue nodes correspond to methods. The links between
rial) to determine the friction angle (Eq. 13). Conse- parameters and methods within the system are illus-
quently, these issues did not affect the results of the trated by the arrows. These arrows have a defined
other sections. direction, either going from a parameter to a method
The ML-based stratification of this CPT is pre- or from a method to a parameter. The graph displays
sented in Sect. 3.5 and Fig. 5a. The first steps within the selected parameters and methods, while Table 4
APD were as follows: The system imported the CPT describes the parameters. Table 4 also illustrates the
GEF file, whereas the layers were provided manually values of the source parameters. These source param-
according to Table 3. For each layer, a SBT (accord- eters are either averaged CPT measurements or aver-
ing to Table 2) was assigned. Focusing on the 3rd aged initial parameters that were computed before
layer, SBT(3) (clay) was selected for this layer as generating the graph (e.g., total and effective stress).
estimated by the ML-based stratification. Know- In this case study, Eq. 4 was used to calculate the unit
ing the boundaries of the layer, the CPT raw data weight. As the system is created in an adaptable way,
(qc, fs and u2) as well as the parameters calculated the user could select other correlations for computing
from Eqs. (4)–(9) were averaged. The average values the initial unit weight. The output of each method is
Vol.: (0123456789)
13
Geotech Geol Eng
presented on the link connecting this method to the corner in Fig. 14, the selected method resulted in a
intermediate / destination parameter (e.g., output of friction angle of 29.68◦.
method_to_OCR_2 is 1.11). The value of intermedi- The constrained modulus was obtained from the
ate / destination parameter is shown next to the node selected method defined by Eq. 21, which resulted in a
(e.g., 2 values were computed for OCR, 1.09 and value of 3263.63 kPa. As a side note, the prediction of
1.11). the constrained modulus could be improved by using
It should be pointed out that it was not possible to other in-situ tests; to this end, the DMT (Marchetti
compare the obtained values (for different parameters) 1980) is currently incorporated in the APD frame-
to reference values, as no laboratory tests could be work. Generally, the incorporation of a higher number
adopted from the test site under study. Nonetheless, of in-situ tests to derive the parameters (and compare
the main aim of this case study is to present proof of the outcome of the different in-situ tests) leads to an
concept of the parameter determination framework. increase of confidence in the derived parameters.
Starting with the OCR located at the lower left corner The shear wave velocity (located at the right side
in Fig. 14, the selected two methods used to calculate in Fig. 14) is computed by four selected methods.
OCR resulted in values of 1.09 and 1.11, respectively. These methods resulted in values of 191.78, 187.26,
Moving to the friction angle located at the upper right 163.78 and 184.14 m/s, respectively. The shear
Vol:. (1234567890)
13
Geotech Geol Eng
modulus at small strains is located at the lower right to constitutive model parameters is considered as one
side in Fig. 14. Selected methods_to_G0_2 and 3 of the key aspects of the research project.
resulted in 48,555 and 45,184 kPa, respectively. As Currently, APD is able to interpret CPT and DMT
four different values of Vs were computed, method_ (Marzouk et al. 2023b) data; an extension of the
to_G0_1 results in four different values, 64962.54, framework (by including additional in-situ tests and
61932.96, 47377.17 and 59885.99 kPa. Ongoing measured in-situ shear wave velocity), validation
validation studies (not shown here) indicated that the of the output (by comparing the output of APD to
reliability of methods used to determine Vs is highly interpreted values from laboratory tests) and update
site dependent. Moreover, as could be expected, using and improvement of the database is part of ongoing
more accurate values of Vs will result in a more accu- research.
rate prediction of G0, which underlines the neces-
sity of creating a shear wave velocity module that is
able to import in-situ Vs measurements. This planned 6 Conclusions
module will be incorporated in module 1 (Fig. 12),
where in-situ Vs will be imported and used in further This work examines three different CPT data inter-
analyses. pretation strategies, with particular focus on the
identification of soil layer boundaries, spatial vari-
ability of CPT readings and soil parameter determi-
5.4 Final Comments Related to APD nation. Theoretical aspects and relevant use cases of
the underlying workflows, as well as relative merits
Figure 14 presents the generated graph for a clay and limitations compared to related approaches docu-
layer using a limited number of methods. In case mented in the literature are discussed throughout this
more methods are used (e.g., all of the methods in manuscript. The applicability and capabilities of the
the standard database of methods), this will lead to CPT data interpretation strategies are demonstrated
a considerable scatter in some of the derived param- considering in-situ CPT recordings recovered from
eters; cf. Fig. 15. Evaluating the scatter and selecting a test site in Salzburg, Austria. The primary objec-
a suitable approach for choosing a specific value from tive of this work is to advance towards an automated
the range of the calculated values is part of ongoing approach for layer detection, parameter determina-
research; the same applies to the process of validating tion, and evaluation of fluctuation degree, and to con-
and updating the correlations database. This is done nect it with numerical analysis.
by comparing the output of different correlations to In Sect. 3, it is demonstrated that the suggested
available laboratory tests; cf. Marzouk et al. (2023a). ML-based framework for the automatic identifica-
The uncertainties associated with using CPT data tion of soil layers predicts soil boundary distributions
for parameter determination are not currently taken that are well in agreement with the reference solution;
into account. In the current version of APD, the sys- hence, it offers a powerful alternative to classical filter
tem can only process one CPT at a time. The con- algorithms based on classification charts. Moreover,
sideration of several CPTs at the same time (e.g., the main motivation of automating the layer detec-
executed close to each other or from the same pro- tion process is to integrate it with other analyses, such
ject) would help to investigate whether the uncertain- as numerical analysis. Additionally, a ground model
ties due to the execution of the CPTs influence the approach adapted for "real-time" updates by modi-
obtained value(s) for the parameters or not. fying the soil stratification when new information
In this study, the transition from soil parameters to becomes available is being planned. Nevertheless, it
constitutive model parameters was not considered in is also highlighted that the accuracy of this approach
detail. Several correlations between soil parameters is considerably influenced by the quality and quantity
and some constitutive models, such as the Harden- of the database employed; in this respect, parameters
ing Soil model with small-strain stiffness (HSsmall) of the ML and KDE models should be updated and
(Benz 2007), are already available in the methods evaluated whenever the database is extended with
database. The transition from the CPT measurements additional samples. Furthermore, there is still a lack
of studies regarding interpolation strategies between
Vol.: (0123456789)
13
Geotech Geol Eng
Vol:. (1234567890)
13
Geotech Geol Eng
currently provided by the system. However, identify- Benz T (2007) Small-strain stiffness of soils and its numerical
ing the representative value for different parameters consequences. PhD thesis, University of Stuttgart
Brinkgreve RBJ (2019) Automated model and parameter selec-
is the biggest challenge due to the wide range of tion. Geostrata. https://doi.org/10.1061/geosek.0000115
values obtained from the various correlations (meth- Bustamante M, Gianesselli L (1982) Pile bearing capacity pre-
ods) used. Therefore, a statistical module is currently diction by means of static penetrometer CPT. In: Balkema
being developed to aid in selecting the representative AA (ed) Proceedings of the 2nd European symposium on
penetration testing, Amsterdam, pp 493–500
value. Moreover, APD is currently undergoing expan- Chakraborty A, Dutta T, Mondal S, Nath A (2018) Applica-
sion and validation. tion of graph theory in social media. Int J Comput Sci Eng
6:722–729. https://doi.org/10.26438/ijcse/v6i10.722729
Acknowledgements The authors would like to acknowl- Ebid AM (2021) 35 years of (AI) in geotechnical engineering:
edge the members of APD group, Arny Lengkeek and Ronald state of the art. Geotech Geol Eng 39(2):637–690. https://
Brinkgreve. doi.org/10.1007/s10706-020-01536-7
EN ISO 14688-1 (2019) Geotechnical investigation and test-
Funding Open access funding provided by Graz University ing—Identification and classification of soil: part 1: iden-
of Technology. No funding was received to assist with the tification and description
preparation of this manuscript. Fattahi H, Babanouri N (2017) Applying optimized support
vector regression models for prediction of tunnel boring
Availability of Data and Materials The database used in machine performance. Geotech Geol Eng 35(5):2205–
this study is available at https://www.tugraz.at/institute/ibg/ 2217. https://doi.org/10.1007/s10706-017-0238-4
forschung/numerische-geotechnik/datenbank/ Felic H, Schlicke D, Granitzer AN, Tschuchnigg F (2023)
Enhanced interoperability between geotechnical and
Declarations structural engineering for 3D building models. RILEM
Bookseries, vol 43. Springer, pp 217–228. https://doi.org/
Conflict of interest The authors declare no conflicting inter- 10.1007/978-3-031-33211-1_20
ests. Giangiulio M, Granitzer AN, Tschuchnigg F, Hoffmann J
(2022) BIM-to-FEM: development of a software tool to
increase the operational efficiency of dam construction
Open Access This article is licensed under a Creative Com-
projects. https://doi.org/10.1007/978-3-031-20241-4_13
mons Attribution 4.0 International License, which permits
Goldsworthy GS, Jaksa MB, Kaggwa WS, Fenton GA, Grif-
use, sharing, adaptation, distribution and reproduction in any
fiths DV, Poulos HG (2004) Cost of foundation failures
medium or format, as long as you give appropriate credit to the
due to limited site investigations. In: Wang CM, Murugap-
original author(s) and the source, provide a link to the Crea-
pan K (eds) Proceedings of the international conference
tive Commons licence, and indicate if changes were made. The
on structural and foundation failures. Singapore
images or other third party material in this article are included
Gomes Correia A, Cortez P, Tinoco J, Marques R (2013) Arti-
in the article’s Creative Commons licence, unless indicated
ficial intelligence applications in transportation geotech-
otherwise in a credit line to the material. If material is not
nics. Geotech Geol Eng 31(3):861–879. https://doi.org/10.
included in the article’s Creative Commons licence and your
1007/s10706-012-9585-3
intended use is not permitted by statutory regulation or exceeds
Graham J, Shields DH (1985) Influence of geology and geo-
the permitted use, you will need to obtain permission directly
logical processes on the geotechnical properties of a plas-
from the copyright holder. To view a copy of this licence, visit
tic clay. Eng Geol 22(2):109–126. https://doi.org/10.1016/
http://creativecommons.org/licenses/by/4.0/.
0013-7952(85)90042-0
Hamza O, Bellis A (2016) Investigation into the effect of
uncertainty of CPT-based soil type estimation on the
References accuracy of CPT-based pile bearing capacity analysis.
In: Icelandic geotechnical society, Proceeding of the
17th Nordic geotechnical meeting. https://doi.org/10.
Abu-Farsakh MY, Shoaib MM (2024) Machine learning mod-
13140/RG.2.2.35614.74565
els to evaluate the load-settlement behavior of piles from
Hegazy YA, Mayne P (1995) Statistical correlations between
cone penetration test data. Geotech Geol Eng. https://doi.
Vs and CPT data for different soil types. In: Proceedings
org/10.1007/s10706-023-02737-6
of symposium on cone penetration testing (CPT’95), vol
Al-Sammarraie D, Kreiter S, Kluger MO, Mörz T (2022) Reli-
2, pp 173–178
ability of CPT measurements in sand—influence of spac-
Henke S, Lerch H (2020) Building information modeling
ing. Géotechnique 72(1):48–60. https://doi.org/10.1680/
(BIM) in der Geotechnik—Aktuelle Entwicklungen und
jgeot.19.P.247
Erfahrungen. Bauingenieur 95(12):490–500
Anagnostopoulos A, Koukis G, Sabatakakis N, Tsiambaos G
Henke S, Ebers-Ernst J, Rust M, Schäferhoff G (2020) DAS
(2003) Empirical correlations of soil parameters based
BIM-Baugrundmodell im Kontext des BIM-Gesamt-
on cone penetration tests (CPT) for Greek soils. Geo-
modells—estmodellvorstellung unter Einbeziehung
tech Geol Eng 21(4):377–387. https://doi.org/10.1023/B:
zahlreicher Fachgewerke. In: Bundesanstalt für Was-
GEGE.0000006064.47819.1a
serbau (ed) Digitalisierung in der Geotechnik—Von
Vol.: (0123456789)
13
Geotech Geol Eng
der Entwicklung zur Anwendung eines digitalen Marzouk I, Tschuchnigg F, Brinkgreve RBJ (2023b) Expansion
Baugrundmodells of an automated system for determining soil parameters
Jaksa MB, Goldsworthy JS, Fenton GA, Kaggwa WS, Grif- using in-situ tests. In: Proceedings of the 10th European
fiths DV, Kuo YL, Poulos HG (2005) Discussion: towards conference on numerical methods in geotechnical engi-
reliable and effective site investigations. Géotechnique neering (NUMGE 2023), London. https://doi.org/10.
55(8):625–626. https://doi.org/10.1680/geot.2005.55.8. 53243/NUMGE2023-70
625 Mayne P (2007) In-situ test calibrations for evaluating soil
Jaksa MB, Goldsworthy JS, Fenton GA, Kaggwa WS, Griffiths parameters. In: Tan TS (ed) Characterisation and engi-
DV, Kuo YL, Poulos HG (2005) Towards reliable and neering properties of natural soils. Balkema, Lisse and
effective site investigations. Géotechnique 55(2):109–121. Exton. https://doi.org/10.1201/NOE0415426916.ch2
https://doi.org/10.1680/geot.55.2.109.59531 Mayne P (2014) Interpretation of geotechnical parameters from
Jaky J (1944) The coefficient of earth pressure at rest. In: Hun- seismic piezocone tests. In: Proceedings, 3rd international
garian (a nyugalmi nyomas tenyezoje). J Soc Hung Eng symposium on cone penetration testing (CPT14, Las
Arch (Magyar Mernok es Epitesz-Egylet Kozlonye) pp Vegas), vol 102, pp 47–73
355–358 Mayne P (2017) Stress history of soils from cone penetration
Kahlström M, Mortensen PA, Hauser C, Hansen Børner tests. Soils Rocks 40:203–216. https://doi.org/10.28927/
N (2021) Use of a 3D stratigraphic model as tool for SR.403203
improved communication and risk assessment in large Mayne P, Rix G (1993) Gmax-qc relationships for clays. Geo-
infrastructure projects. In: IOP conference series: earth tech Test J. https://doi.org/10.1520/GTJ10267J
and environmental science, vol 710, no 1, p 012038. Mayne P, Rix G (1995) Correlations between shear wave veloc-
https://doi.org/10.1088/1755-1315/710/1/012038 ity and cone tip resistance in natural clays. Soils Found
Krige DB (1951) A statistical approach to some basic mine 35(2):107–110
valuation problems on the Witwatersrand. J Chem Met- Mayne P, Coop M, Springman S., Huang AB, Zornberg J
all Min Soc South Afr. https://journals.co.za/doi/pdf/10. (2009) State-of-the-art paper (SOA-1): geomaterial behav-
10520/AJA0038223X_4792 ior and testing. In: Proceedings of the 17th international
Kulatilake PHSW, Um JG (2003) Spatial variation of cone tip conference on soil mechanics and geotechnical engineer-
resistance for the clay site at Texas A &M University. ing, Alexandria, vol 4
Geotech Geol Eng 21(2):149–165. https://doi.org/10. Mazzoccola DF, Millar DL, Hudson JA (1997) Information,
1023/A:1023526614301 uncertainty and decision making in site investigation for
Kulhawy FH, Mayne P (1990) Manual on estimating soil prop- rock engineering. Geotech Geol Eng 15(2):145–180.
erties for foundation design. Electric Power Research Inst https://doi.org/10.1023/A:1018499222495
Kumar D, Samui P, Kim D, Singh A (2021) A novel methodol- Miller HJ (2004) Tobler’s first law and spatial analysis. Ann
ogy to classify soil liquefaction using deep learning. Geo- Assoc Am Geogr 94(2):284–289
tech Geol Eng 39(2):1049–1058. https://doi.org/10.1007/ Niazi F (2022) CPT-based geotechnical design manual, Vol-
s10706-020-01544-7 ume 1: CPT interpretation-estimation of soil properties.
Likaj R, Shala A, Mehmetaj M, Hyseni P, Bajrami X (2013) Purdue University. https://doi.org/10.5703/1288284317
Application of graph theory to find optimal paths for the 346
transportation problem. IFAC Proc Vol 46(8):235–240. Oberhollenzer S, Marte R, Gasser D, Premstaller M, Leitich A
https:// d oi. o rg/ 1 0. 3 182/ 2 0130 6 06-3- X K- 4 037. 0 0031. (2019) Microstructure of the Salzburger Seeton—charac-
(15th IFAC Workshop on International Stability, Tech‑ terization based on cone penetration tests. Geomech Tunn
nology, and Culture) 12(4):340–351. https://doi.org/10.1002/geot.201900012
Liu J, Liu J, Li Z, Hou X, Dai G (2021) Estimating CPT Oberhollenzer S, Fankhauser A, Marte R, Tschuchnigg F,
parameters at unsampled locations based on kriging inter- Premstaller M (2021a) Characterization of microstructure
polation method. Appl Sci 11(23):11,264. https://doi.org/ in silt-dominated sediments. In: Proceedings of the 6th
10.3390/app112311264 international conference on geotechnical and geophysical
Lunne T, Robertson PK, Powell J (1997) Cone penetration site characterization
testing in geotechnical practice. Soil Mech Found Eng. Oberhollenzer S, Premstaller M, Marte R, Tschuchnigg F,
https://doi.org/10.1007/s11204-010-9072-x Erharter GH, Marcher T (2021) Cone penetration test
Marchetti S (1980) In situ tests by flat dilatometer. J Geo- dataset Premstaller Geotechnik. Data Brief 34(106):618.
tech Eng Div 106(3):299–321. https://doi.org/10.1061/ https://doi.org/10.1016/j.dib.2020.106618
AJGEB6.0000934 ÖNORM B 1997-2 (2017) Eurocode 7: Entwurf, Berechnung
Marzouk I, Tschuchnigg F, Paduli F, Lengkeek HJ, Brinkgreve und Bemessung in der Geotechnik—Teil 2: Erkundung
RBJ (2022) Determination of fine-grained soil param- und Untersuchung des Baugrunds
eters using an automated system. In: Proceedings of the Palla N, Gudavalli S, Subedi B, Jao M (2008) Comparison
5th international symposium on cone penetration testing, among SPT, CPT and Texas cone penetration test. In:
Bologna. https://doi.org/10.1201/9781003308829-77 ASCE (ed) Proceedings of ASCE Texas section spring
Marzouk I, Oberhollenzer S, Tschuchnigg F (2023a) An auto- meeting, pp 1–10
mated system for determining soil parameters: case study. Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B,
In: 8th international symposium on deformation charac- Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg
teristics of geomaterials, Porto V et al (2011) Scikit-learn: machine learning in Python. J
Mach Learn Res 12:2825–2830
Vol:. (1234567890)
13
Geotech Geol Eng
Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Samui P (2012) Application of relevance vector machine for
Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg prediction of ultimate capacity of driven piles in cohesion-
V et al (2021) Scikit-learn: user guide. https://scikit-learn. less soils. Geotech Geol Eng 30(5):1261–1270. https://
org/stable/user_guide.html doi.org/10.1007/s10706-012-9539-9
Phoon KK, Zhang W (2023) Future of machine learning in Schnaid F (2009) In situ testing in geomechanics: the main
geotechnics. Georisk Assess Manag Risk Eng Syst Geo- tests. Taylor & Francis, London
hazards 17(1):7–22. https://doi.org/10.1080/17499518. Shu-Xi W (2012) The improved Dijkstra’s shortest path algo-
2022.2087884 rithm and its application. Procedia Eng 29:1186–1190.
Rana H, Pandit B, Sivakumar Babu GL (2023) Estimation of https://doi.org/10.1016/j.proeng.2012.01.110
uncertainties in soil using MCMC simulation and effect of Umar SK, Samui P, Kumari S (2018) Deterministic and proba-
model uncertainty. Geotech Geol Eng 41(8):4415–4429. bilistic analysis of liquefaction for different regions in
https://doi.org/10.1007/s10706-023-02523-4 Bihar. Geotech Geol Eng 36(5):3311–3321. https://doi.
Rauter S, Tschuchnigg F (2021) CPT data interpretation org/10.1007/s10706-018-0498-7
employing different machine learning techniques. Geo- Van Berkom IE, Brinkgreve RBJ, Lengkeek HJ, De Jong AK
sciences 11(7):265. https://doi.org/10.3390/geoscience (2022) An automated system to determine constitutive
s11070265 model parameters from in situ tests. In: Proceedings of the
Rauter S, Tschuchnigg F (2022) Identification of soil strata 20th international conference on soil mechanics and geo-
from in-situ test data using machine learning. pp 37–44. technical engineering, Sydney 2022
https://doi.org/10.1007/978-3-031-12851-6_5 Vanmarcke EH (1977) Probabilistic modeling of soil profiles. J
Ricceri G, Simonini P, Cola S (2002) Applicability of piezo- Geotech Eng Div 103(11):1227–1246. https://doi.org/10.
cone and dilatometer to characterize the soils of the Ven- 1061/AJGEB6.0000517
ice Lagoon. Geotech Geol Eng 20(2):89–121. https://doi. Weil J (2020) Digitale Baugrundmodelle im Tunnelbau—Sta-
org/10.1023/A:1015043911091 tus, Chancen und Risiken. Geomech Tunn 13(2):221–236.
Robertson PK (2009) Interpretation of cone penetration tests— https://doi.org/10.1002/geot.201900078
a unified approach. Can Geotech J 46(11):1337–1355. Xie J, Huang J, Lu J, Burton G, Zeng C, Wang Y (2022) Devel-
https://doi.org/10.1139/T09-065 opment of two-dimensional ground models by combining
Robertson PK (2010) Soil behaviour type from the CPT: an geotechnical and geophysical data. Eng Geol. https://doi.
update. In: 2nd international symposium on cone penetra- org/10.1016/j.enggeo.2022.106579
tion testing, Huntington Beach
Robertson PK (2015) Guide to cone penetration testing for geo- Publisher’s Note Springer Nature remains neutral with regard
technical engineering. In: Proceedings 3rd international to jurisdictional claims in published maps and institutional
symposium on cone penetration testing (CPT14, Las affiliations.
Vegas)
Robertson PK (2016) Cone penetration test (CPT)-based soil
behaviour type (SBT) classification system—an update.
Can Geotech J 53(12):1910–1927. https://doi.org/10.
1139/cgj-2016-0044
Robertson PK, Cabal KL (2010) Estimating soil unit weight
from CPT. In: 2nd international symposium on cone pen-
etration testing, Huntington Beach
Vol.: (0123456789)
13