STM-based Symbolic Regression For Strength Predict
STM-based Symbolic Regression For Strength Predict
com/scientificreports
Keywords Machine learning, Symbolic regression, Strut-and-tie model, Deep beams, Corbels, CatBoost
model
Deep reinforced concrete (RC) elements, such as deep beams (RCDBs) and corbels (RCCs), characterized by a
small span-to-height ratio1,2, are frequently used as load-transferring components in structures, such as transfer
girders and pile caps, due to their superior shear strength compared to slender members. Despite their extensive
usage, designing these elements is challenging due to the nonlinear influence of various parameters on their
shear behavior. Multiple shear strength models have been studied, including those utilizing machine learning
methods3–8, the strut-and-tie model (STM)9–11, the compression field method12, and finite element analysis13.
Traditional design methods, such as the STM, often fail to adequately capture the complex relationships between
parameters impacting shear strength, resulting in inaccurate strength prediction. Additionally, existing code
provisions, such as ACI 31814, and EC215, along with previous mechanical models10,11, offer straightforward
procedures for calculating the shear capacity of RCDBs and RCCs. However, their conservative approach and
inconsistency with test results fail in developing a comprehensive model that can accurately estimate the shear
capacity of these elements.
Deep beams and corbels are frequently characterized by discontinuity regions, commonly called “D-region
members.” Within these D-regions, nonlinear strain distributions arise from sudden changes in geometry
or loading configurations16, making traditional design methods based on classical Bernoulli beam theory
inadequate. The strut-and-tie model (STM) has emerged as a robust and effective approach for designing deep
reinforced concrete elements. Numerous STMs have been proposed in the literature for estimating the shear
capacities of the D-regions11,17−19. However, despite the numerous STMs proposed, some models have been
criticized for their inconsistent and complex predictions of shear strength2,4,8,19.
Department of Structural Engineering, Mansoura University, PO BOX 35516, Mansoura, Egypt. email:
[email protected]
Machine learning (ML) has recently become increasingly prominent in various engineering applications,
offering an alternative approach to traditional mechanical theories. ML algorithms, including artificial neural
networks, genetic algorithms, and ensemble learning methods, have been widely applied to predict the shear
capacity of deep RC elements3–8. For instance, Ma et al4. applied six different ML models to forecast the
shear capacity of RCDBs and compared their performance with previous state-of-the-art models. Feng et al5.
investigated four standard ensemble learning models—random forests, gradient boosting regression trees,
adaptive boosting, and extreme gradient boosting (XGBoost)—to estimate the shear capacity of RCDBs using
a dataset comprising 271 samples. Ashour et al3. utilized genetic expression programming to formulate an
empirical expression for the shear capacity of RCDBs relying on 141 test data points. In addition, Shahnewaz et
al6. and Wakjira7employed genetic algorithm for shear strength prediction of RCDBs. However, these models
are challenging to apply in practical engineering design because the purely data-driven prediction procedure
cannot be translated into a feasible mathematical expression for structural application. Consequently, data-
driven algorithms are often viewed as black-box models. Furthermore, many expressions derived from ML
techniques, such as genetic expression programming (GEP) and genetic algorithms (GA), are often criticized for
their lack of physical significance and excessive complexity3,6,7, as outlined in Table 1. In addition, recent studies
have demonstrated the potential of ML to optimize and innovate within the field of civil engineering. Inqiad
et al. used GEP for predicting the compressive strength of self-compacting concrete. Moreover, Khaled et al20.
used different ML models to predict the axial capacity of rectangular concrete-filled concrete columns, including
the Gaussian process (GPR) and the extreme gradient boosting model (XGBoost). Additionally, Moradi et al.
[46] introduced a novel experimental approach to enhance the shear capacity of RC beams by utilizing fiber-
reinforced polymer wraps. These advancements highlight the growing role of machine learning in improving the
accuracy, efficiency, and innovation of civil engineering practices.
Due to the differing prediction mechanisms of explainable models (e.g., strut-and-tie models) and black-
box models (e.g., data-driven models), these approaches have traditionally been considered independent in
resistance prediction3–8. Previous studies have favored strut-and-tie models for their interpretable mechanisms,
while black-box models have been preferred for their superior performance. This study uses a machine-learning-
based symbolic regression (SR) technique to develop an ML-aided STM for predicting the shear strength of
RCDBs and RCCs. This technique serves as a hybrid model or an intermediate solution, effectively bridging
the gap between mechanical-based and black-box models, thereby gaining popularity in recent studies21,22.
Given the robust performance of STM in estimating the shear capacity of deep RC elements, integrating the SR
technique can calibrate the STM, resulting in a hybrid model with high prediction accuracy.
This study aims to develop a symbolic regression model based on the Strut-and-Tie Model (SR-STM) to
predict the shear strength of RCDBs and RCCs precisely. The novelty of this research lies in integrating the
symbolic regression (SR) technique with the robust performance of STM. An enforced structure tree for symbolic
regression is utilized to achieve three primary objectives: (1) reduce the search space, thereby enhancing the
efficiency of the genetic programming process; (2) provide explainable expressions consistent with the Strut-
and-Tie Model; and (3) diverge from previous research that generated uninterpretable functions using SR.
The term “interpretability” in this context refers to the model’s ability to produce equations that are not only
computationally feasible but also aligned with the physical behavior of the structures, making them meaningful
and actionable for engineers. This approach ensures that the resulting models are not only computationally
feasible but also maintain interpretability and consistency with established mechanical principles. Additionally,
the developed model was calibrated against a comprehensive database of test results, including 810 deep beams
and 371 corbels, which is more extensive than those considered by state-of-the-art models11,17–19. The full list
of input features, including geometric configurations, concrete properties, reinforcement characteristics, and
loading plate dimensions, is detailed in the next section (see the Experimental Database section). Various metrics
were applied to evaluate the model’s accuracy and generality. Finally, the predictions obtained from the proposed
STM-based model were compared with those from the CatBoost ensemble machine learning technique, known
for its superior performance8, and three existing closed-form models11,17–19. This comparison highlights the
effectiveness and reliability of the proposed model in predicting shear strength while ensuring interpretability
and alignment with mechanical principles.
Table 1. Summary of previous GP and GA models for predicting shear strength of deep RC elements.
Experimental database
The schematic diagrams of the shear mechanisms of RCDBs and RCCs are illustrated in Figs. 1 and 2, respectively.
A comprehensive dataset was collected to develop ML models, consisting of 810 RCDB and 371 RCC experiments
sourced from existing literature and a database gathered by Chetchotisak et al19. Detailed information about the
collected database is available in the supplementary data. Numerous experimental and theoretical studies9,11,17,18,
have identified that the shear capacity of deep RC elements is influenced by various factors, categorized into
six groups: (1) geometric configurations: beam height (h), effective height (d), width (bw), shear span (a) and
shear span-to-depth ratio (a/d); (2) concrete properties, i.e., concrete strength (fc’); (3) bottom longitudinal
reinforcement characteristics: reinforcement ratio (ρl), and strength (fyl); (4) web reinforcement characteristics:
vertical web reinforcement ratio (ρv) and strength (fyv), horizontal web reinforcement ratio (ρh) and strength
(fyh); (5) loading plate dimensions: top plate width (wtp) and bottom plate width (wbp); (6) horizontal force for
corbels (Nc). Table 2 summarizes statistical information for the established database.
Strut-and-tie model
Figures 1 and 2 illustrate two load-transferring mechanisms in a typical D-region for RCDB and RCC members:
the diagonal strut mechanisms (Figs. 1(a), 2(a)) and truss mechanisms (Figs. 1(b), 2(b)). In the diagonal strut
mechanisms, the primary force-transferring system consists of diagonal compression struts (depicted as dashed
lines) and tension ties (depicted as solid lines), which intersect at nodes called nodal zones. While the truss
mechanism describes how the shear force is resisted by the orthogonal shear reinforcement11,23. This mechanism
is composed of two subtrusses: the horizontal subtruss, where tension forces are transferred by the horizontal
web reinforcement (red lines), and the vertical subtruss, where tension forces are transferred by the vertical web
reinforcement (blue lines). The term θ in Figs. 1 and 2 represent the angle of the diagonal concrete strut relative
to the horizontal plane and can be calculated using Eq. (1).
jd
tan θ = (1)
a
where jd = d − kd/3is the height of the moment arm11,17, with d being the effective depth and kdthe depth of
the compression zone of the member. The term kdcan be derived from the elastic bending theory11,17 as:
kd = (nρ l )2 + 2 (nρ l ) − (nρ l ) d(2)
where n represents the ratio of Young’s modulus of steel to concrete. For the case of RCC under a horizontal force
Nc (Fig. 2), the term ρl in Eq. (2) is assumed by previous studies17,19,23 as:
Al − Nc/fyl
ρl = (3)
bw d
where ft is the concrete tensile strength and σt and σc indicate the principal tensile and compressive stresses inside
the diagonal strut, respectively, as shown in Fig. 1(a). For simplicity, the stress ratio σt/ft in Eq. (5) is assumed as
the ratio of the tie tension force Tto the tensile strength of longitudinal steel bars13,19 as follows:
Figure 4. (a) Flow charts of Symbolic regression. (b) The optimal tree-based individuals for RC deep beam
strength, where operator O, in the forced structure part, can be any arbitrary binary operator while, in the
optimized structure part, variable C is constant.
σt T
≈ (6)
ft (ρ l bw d) fyl
As depicted in Fig. 1(a), the tension force T at the CCT node (node surrounded by two struts and one tie) is
calculated through the balance of horizontal forces as follows:
T = Vc/tan θ (7)
While the diagonal compressive stress σ c acting on the concrete strut is given by:
Cc Vc
σc = = (8)
Astr Astr sin θ
where Astr is diagonal strut area as illustrated in Figs. 1(a) and 2(a) by the following expression:
Astr = bw (kd)2 + (wl )2(9)
where wlis the projected horizontal width of the CCC node (node surrounded by three struts)11,17,19,23,27. For
simplicity in this study, wl is assumed to be equal to the width of the top plate for deep beams and wc/2 for doubly
symmetric corbels, where wc is the column width (Fig. 2(a)).
By substituting Eqs. (6) and (8) into Eq. (5), the shear capacityof the diagonal strut mechanism (Vc) can be
expressed as:
1 1 1
= + (10)
Vc Vs Vt
where Vs and Vt define the strength of the diagonal strut and tension tie, respectively, given by:
Vs = fc′ Astr sin θ , Vt = ρ l bw dfyl tan θ (11)
Table 3. The parameters of the SR model used in generating expressions. (a)The ‘^’:(–1,10) constraint means
that the left argument of the power function can exhibit any level of complexity, whereas the right argument is
restricted to a maximum complexity of 10 nodes. (b)Nested constraints govern how operators can be combined
or nested. The constraint ‘^’:{‘^’:0,’/’:1} specifies that ‘^’ operator cannot be used inside another ‘^’ operator,
but ‘/’ operator can be nested once in ‘^’ operator.
Current study
For deep beams, vs = 0.8αd ks1.1, vt = 1.6(ρel)0.7, vh = 0.14ρeh, vv = 0.31ρev
For corbels, vs = 0.8α c ks0.78, vh = 0.17(ρ eh)0.83, vv = 0.27ρ ev
vt = 3.0ρ el,
with α d = fc′ /30 − 0.15, α c = fc′ /60 + 3.1 , ks = k 2 + (wl /d)2, ρ el = ρ l ff′l , ρ eh = ρ h ffh′ , ρ ev = ρ v ffv′
c c c
Table 4. Summary of previous mechanical models in predicting shear strength. where K is strut-and-tie index
accounting for the influence of the web reinforcement., wt is the widths the tie.
Equation (10) expresses the strength contributions of the concrete strut Vs and the tension tie Vt to the overall
shear capacity. Rewriting Eq. (10) in a dimensionless form results in:
1 1 1
= + e (12)
vc kssin θ ρl tan θ
where
Vc fyl
vc = , ks = (k)2 + (wl /d)2, ρel = ρl (13)
bw dfc′ fc′
where Fh and Fv refer to the average tensile forces carried by the horizontal and vertical web reinforcements,
respectively, in the D-regions. Accordingly, in a manner similar to Eq. (12), the shear strength of the truss
mechanism Vw can be rewritten in dimensionless form as follows:
Vw fyh fyv
vw = = ρehtan θ + ρev , with ρeh = ρh , ρev = ρv (15)
bw dfc′ fc′ fc′
From Eqs. (12) and (15), the overall contribution of the diagonal strut and truss mechanisms can be formulated
as:
1
Vn = 1 1 + ρhtan θ + ρv bw dfc′ (16)
e e
ks sin θ + ρe tan θ
l
Symbolic regression
Symbolic regression (SR)28,29is a genetic programming technique30designed to discover simple and interpretable
analytic equations that best fit a given problem through exploring a predefined space of mathematical expressions
and functions. SR is approached as a multi-objective optimization problem, balancing predictive accuracy and
model complexity. Genetic programming techniques, including the principles of natural selection and evolution,
are commonly used in SR to iteratively refine candidate mathematical expressions until satisfactory expressions
are achieved. This research employs a Python library called PySR31 to identify interpretable, simple expressions
for the shear strength of RCDBs and RCCs.
The SR algorithm starts with creating an initial population composed of a random combination of operational
symbols (e.g., +, -, *,/, ^, etc.) and terminals, such as input variables and constants. Each individual in the
population is represented by a tree-like expression. Selection is then performed probabilistically, favoring those
individuals that demonstrate superior performance. To prevent the generation of overly complex individuals by
SR, a complexity limit of 30 is set, meaning the total number of operators, constants, and variables in the equations
cannot exceed this value. In addition, overly complex expressions are excluded from the SR expressions such as
high exponential terms, e.g. (•)^(•^•). The selected individuals undergo mutation (Fig. 3(a, b)) or crossover
(Fig. 3(c)) to produce a new generation of populations. Figure 4(a) presents the core steps of the SR approach.
This evolutionary process employs a fitness function, defined in Eq. (17)31, to evaluate and identify the most
optimal individuals in each generation, ensuring the progressive refinement of solutions.
l (E) = lpred (E) .exp (frecency [C (E)])(17)
where lpred(E) represents the model prediction error, C(E) denotes the expression complexity E, quantified by
the total number of nodes in the expression. The term frecency [C(E)] accounts for the frequency and recency
of the expression E occurring at complexity C(E) within the population. This measure is crucial for avoiding
the overcomplication and redundancy of the generated expressions, ensuring a balance between minimizing
error and maintaining simplicity. Details of the SR parameters used for generating expressions in this study are
summarized in Table 3.
The process of finding the best expression requires many iterations and a detailed assessment of each one. Each
equation generated through these iterations is subjected to thorough evaluation and refinement, considering
factors such as the complexity of the equations, their accuracy, and their ease of interpretation.
The process of finding the best expression requires many iterations, where different parameters, such as the
number of generations, total population, and population size, are varied to generate distinct equations. Each
equation is then subjected to a detailed assessment and refinement, considering complexity, accuracy, and ease
of interpretation factors. This iterative approach ensures that the final model balances interpretability with
predictive accuracy.
1
Vn = 1 1 + vh (ρeh) tan θ + vv (ρev ) bw dfc′ (18)
vs (f ′ , ks )sin θ + v
c ( e
t ρl )tan θ
where the coefficient expressions for each component of the STM that need to be optimized by SR technique are:
vs = vs (fc′ , ks) , vt = vt (ρel) , vh = vh (ρeh) , vv = vh (ρev )(19)
f f f
with: ks = k 2 + (wl /d)2, ρel = ρl fyl′ , ρeh = ρh fyh′ , ρev = ρv fyv′
c c c
Symbolic regression is employed to optimize the four expressions in Eq. (19) using the PySR library. This
process involves developing a custom objective loss function designed explicitly for symbolic regression, which
predefines the form of the shear strength equation with the four expressions. The symbolic trees accepted in this
process must have of a structure of ((T1 ◦ T2) ◦ (T3 ◦ T4)), where subtrees T1, T2, T3, and T4 define the four
Figure 5. Distribution of the databases and the relationships between different parameters.
Figure 5. (continued)
functions vs, vt, vh, and vv, respectively, and operator ∘ could be any arbitrary binary operator. This structure is
referred to as the enforced structure of the tree generated using the developed objective function, as illustrated
in Fig. 4(b). The details of the objective function are illustrated in Algorithm 1. The objective function enforces
constraints on the structure of the symbolic expressions and penalizes undesirable characteristics.
As explained in Algorithm 1, the degree of the tree head is checked, and if it is not equal to two, meaning it
should combine two subtrees, TL = (T1 ◦ T2) and TR = (T3 ◦ T4), a significant penalty of 1000 is imposed. The
left subtree (TL) and right subtree (TR) are also checked for a degree of 2, with a smaller penalty of 100 imposed
if they do not meet this criterion. The left child of the left subtree (TLL or T1) represents vs (strut contribution)
and must be a function of fc’ (concrete strength) and the ratio ks. If vs contains invalid features or negative values,
penalties are added in proportion to the number of violations. Similarly, the right child of the left subtree (TLR
or T2) is processed as vt (tie contribution), which should only be a function of ρ el. The contributions from
the left subtree (TLL, TLR) are combined to derive the concrete contribution. Similarly, the right subtree (TR)
should be branched into two children (TRL, TRR) or (T3, T4), represent the contributions of horizontal and
vertical web reinforcement, respectively. It should be noted that the penalty term increases progressively by how
far it deviates from the constraints, effectively guiding the genetic algorithm towards the correct factorization.
Finally, the loss function combines the penalty terms and the Mean Absolute Percentage Error (MAPE) of the
predictions.
The optimal tree-based individuals (Fig. 4(b)) fitting the training experimental database for RC deep beams is
findings by Kani32, which show that beams exhibit higher shear resistance at higher angles θ (lower a/d ratios).
Design examples illustrating the above equations can be found in the supplementary data.
The developed expressions for the SR-STM model are not only simple and robust but also carry physical
significance, in contrast to the GEP and GA models from previous studies (see Table 1). Moreover, the enforced
structure tree for symbolic regression reduces the search space and ensures that the resulting expressions are
both explainable and consistent with the Strut-and-Tie Model (STM).
**: The expression NOV(TLL ≠ f (fc′ , ks) && TLL < 0) quantifies the number of violations where the TLL
subtree is not a function of the features fc′ and ks, or where it includes other features, or when its value is
negative.
n n n n
y i − y i )2
i=1 ( 1 yi 100% yi
, RM SE = 1
R2 = 1 − 2 , µ = , M AP E = − 1 yi − yi)2(22)
(
n − n y
i=1 i
n i=1 yi n i=1
i=1 y i − y
In
−
this context, yi refers to the predicted value for the i-th specimen, yi is the corresponding actual output value,
y is the average of the actual observations, and n indicates the total number of samples in the database. The a20-
index33 assesses the percentage of specimens where the ratio yi/yi falls within the range of 0.80 to 1.20.
As demonstrated in Tables 5 and 6, both SR-STM and CATB models exhibit high accuracy, with mean µ,
R2, and a20-index values approaching 1.0, along with low CoV, MAPE, and RMSE. Specifically, the CATBoost
model achieves MAPE values of approximately 3.3% and 10.7% for RC deep beams and 3.5% and 10% for RC
Figure 8. Summary plot for inputs influencing shear strength index Vn/bwd fc’ .
corbels in the training and testing sets, respectively, which are the lowest among the compared models. The
SR-STM model, while slightly less accurate with MAPE values of 10.84% and 11.76% for RC deep beams and
13.84% and 14.16% for RC corbels, still provides a strong performance. Furthermore, the introduced SR-STM
model yields µ values of 0.999 and 1.004, R2 values of 0.913 and 0.862, and CoV values of 14.55% and 14.95% for
RC deep beams and RC corbels, respectively. Although the SR-derived formulas show slightly lower accuracy
compared to the CATBoost (CATB) model, they are more accessible and easier to interpret, which significantly
enhances their practical utility in engineering applications. While the CATBoost model demonstrates superior
prediction accuracy, its black-box nature restricts its practical application in engineering design. In contrast,
the SR-STM model provides a more interpretable approach that bridges the gap between theoretical insight and
practical use.
prediction. On the other hand, the prediction results of closed-form models reveal deficiencies in their prediction
mechanisms. In particular, the Hwang expression18 for RCCs shows the most significant deviation between
experimental and predicted values, with a distribution skewed over the diagonal line, indicating a tendency
toward conservative predictions. The improvement of SR-STM reflects not only higher prediction accuracy
compared to closed-form models but also in providing more specific physical significance and mathematical
equations compared to purely data-driven models e.g., the CATB model.
Figure 7 displays the prediction errors of both existing closed-form expressions and the developed ML
models. In Fig. 7(a), the CATB and ST-STM models exhibit high precision, with over 83% of testing samples
within a 20% error range. In contrast, the Hwang and Russo formulas have almost 70% of samples within the
same range. The MIST model also performs well, capturing 80% of samples within the 20% error range. In
Fig. 7(b), the ST-STM and MIST formulas for RCCs show similar performance, with a slight advantage for
the ST-STM. While the results of the proposed equations and the MIST formula are comparable, the proposed
equations are more straightforward to implement. In addition, the ST-STM significantly outperforms the Hwang
model by having nearly three times the number of test samples within the same error ranges for RC corbels.
Moreover, all performance metrics for the introduced SR-STM, as detailed in Tables 5 and 6, exceed those of the
previously introduced mechanical models. These results highlight the superior performance of employing ML
techniques, such as the CATB and ST-STM models, in predicting the shear strength of RC deep beams and RC
corbels.
Figure 9. Features importance for inputs influencing shear strength index Vn/bwd fc’.
index. This positive effect is also reflected in the SHAP analysis, where these parameters increase the predicted
shear strength. Therefore, the proposed equation’s behavior is consistent with the SHAP analysis, highlighting
the model’s ability to accurately capture the relationships between the input features and the shear strength of
RCDBs and RCCs.
Conclusions
In conclusion, this study compiled a comprehensive database of 810 experimental tests for the shear strength of
RC deep beams (RCDBs) and 371 RC corbels (RCCs) tests from various research papers. It employed symbolic
regression (SR) techniques to refine and calibrate the Strut-and-Tie Model (SR-STM). From the evaluation
results, the following conclusions can be drawn:
• The integration of symbolic regression with the Strut-and-Tie Model successfully enhances prediction ac-
curacy while maintaining the interpretability and consistency of the models with established mechanical
principles.
• The SR-STM model yields µ values of 0.999 and 1.004, R² values of 0.913 and 0.862, and CoV values of 14.55%
and 14.95% for RC deep beams and RC corbels, respectively, indicating high predictive stability and robust-
ness.
• Compared to existing closed-form models by Hwang and Lee18, Russo et al11,17., and Chetchotisak et al.
(MIST)19, the SR-STM model shows better predictive performance, with improved CoV values and concen-
trated prediction-to-test ratios around unity.
• While the CATBoost model demonstrates superior performance with CoV values of 7.79% for RCDBs and
7.19% for RCCs, its black-box nature limits practical application in engineering design, highlighting the need
for more interpretable models like SR-STM.
• The SR-STM model significantly outperforms the Hwang model by having nearly three times the number of
test samples within the 20% error range for RC corbels, and it surpasses all performance metrics compared to
previously introduced mechanical models.
• The alignment between the SR-STM model’s equations and the SHAP analysis confirms the model’s effective-
ness in accurately capturing the key factors influencing the shear strength of RCDBs and RCCs.
The SR-STM not only achieves significant improvements in shear strength prediction but also effectively
combines the advantages of white-box and black-box models. Compared to purely data-driven approaches, the
SR-STM, with its explicit mathematical equations, is more accessible and reliable for engineers to utilize in
practical applications. While the SR-STM model demonstrates strong predictive capabilities, its effectiveness
is limited by the dataset’s scope, necessitating future research to broaden the range of geometries and material
properties to enhance model applicability and accuracy. In summary, integrating the ML-based approach
presents a promising method for accurately predicting the shear strength of RC deep elements, providing
valuable insights for engineering applications.
Data availability
All data generated or analyzed during this study are included in this published article and available in a public
repository: https://github.com/kmegahed/SR-STM.
References
1. MacGregor, J. G., Wight, J. K., Teng, S. & Irawan, P. Reinforced concrete: Mechanics and design, vol. 3. Prentice Hall Upper Saddle
River, NJ, (1997).
2. Kassem, W. Strength prediction of corbels using strut-and-tie Model Analysis. Int. J. Concr Struct. Mater. 9 (2), 255–266. https://
doi.org/10.1007/s40069-015-0102-y (2015).
3. Ashour, A. F., Alvarez, L. F. & Toropov, V. V. Empirical modelling of shear strength of RC deep beams by genetic programming.
Comput. Struct. 81 (5), 331–338. https://doi.org/10.1016/S0045-7949(02)00437-6 (2003).
4. Ma, C. et al. Prediction of shear strength of RC deep beams based on interpretable machine learning, Constr. Build. Mater., vol. 387,
no. July p. 131640, 2023, doi: (2022). https://doi.org/10.1016/j.conbuildmat.2023.131640
5. Feng, D. C., Wang, W. J., Mangalathu, S., Hu, G. & Wu, T. Implementing ensemble learning methods to predict the shear strength
of RC deep beams with/without web reinforcements. Eng. Struct. 235, 111979. https://doi.org/10.1016/j.engstruct.2021.111979
(2021).
6. Shahnewaz, M., Rteil, A. & Alam, M. S. Shear strength of reinforced concrete deep beams – A review with improved model by
genetic algorithm and reliability analysis. Structures. 23, 494–508. https://doi.org/10.1016/j.istruc.2019.09.006 (2020).
7. Wakjira, T., Ibrahim, M., Sajjad, B. & Ebead, U. Shear capacity of reinforced concrete deep beams using genetic algorithm. IOP
Conf. Ser. Mater. Sci. Eng. 910 (1). https://doi.org/10.1088/1757-899X/910/1/012002 (2020).
8. Megahed, K. Prediction and reliability analysis of shear strength of RC deep beams. Sci. Rep. 14 (1), 14590. https://doi.org/10.1038/
s41598-024-64386-w (2024).
9. Park, J. & Kuchma, D. Strut-and-tie model analysis for strength prediction of deep beams. ACI Struct. J. 104, 657–666 (2007).
10. Matamoros, A. B. & Wong, K. H. Design of Simply Supported Deep Beams Using Strut-and-Tie Models, ACI Struct. J., vol. 100, no.
6, pp. 704–712, [Online]. Available: (2003). https://www.scopus.com/inward/record.uri?eid=2-s2.0-0346243623&partnerID=40&
md5=3f8071840dea1f601ddc0f176bdb303b
11. Russo, G., Pauletta, M. & Venir, R. Reinforced concrete deep beams- Shear Strength Model and Design Formula. ACI Struct. J., 102,
3, https://doi.org/10.14359/14414
12. Vecchio, F. J. & Collins, M. P. The Modified Compression-Field Theory for Reinforced Concrete Elements Subjected to Shear,
318Reference, vol. 19, no. 16.
13. Tang, C. Y. & Tan, K. H. Interactive Mechanical Model for Shear Strength of Deep Beams, J. Struct. Eng. - J STRUCT ENG-ASCE,
vol. 130, doi: 10.1061/(ASCE)0733-9445(2004)130:10(1534). (2004).
14. A. C. I. C. 318, Building code requirements for structural concrete: (ACI 318 – 19) ; and commentary (ACI 318R-19). Farmington
Hills, MI: American Concrete Institute, [2019].
15. Hendy, C. R. & Smith, D. A. Designers’ Guide to EN 1992-2: Eurocode 2: Design of Concrete Structures: Part 2: Concrete Bridgesvol.
17 (Thomas Telford, 2007).
16. Schlaich, J., Schaefer, K. & Jennewein, M. Toward a consistent design of structural concrete. PCI J. 32 (3), 74–150. https://doi.
org/10.15554/pcij.05011987.74.150 (1987).
17. Russo, G., Venir, R., Pauletta, M. & Somma, G. Reinforced concrete corbels - Shear strength model and design formula, ACI Struct.
J., vol. 103, no. 1, pp. 3–10, [Online]. Available: (2006). https://www.scopus.com/inward/record.uri?eid=2-s2.0-32544444278&par
tnerID=40&md5=5ea8be15e672cd267a9207886dc8e81d
18. Shyh-Jiann, H. & Hung-Jen, L. Strength Prediction for Discontinuity Regions by Softened Strut-and-Tie Model, J. Struct. Eng., vol.
128, no. 12, pp. 1519–1526, Dec. doi: 10.1061/(ASCE)0733-9445(2002)128:12(1519). (2002).
19. Chetchotisak, P., Teerawong, J. & Yindeesuk, S. Modified interactive strut-and-tie modeling of reinforced concrete deep beams and
corbels, Structures, vol. 45, no. September, pp. 284–298, doi: (2022). https://doi.org/10.1016/j.istruc.2022.08.116
20. Megahed, K., Mahmoud, N. S. & Abd-Rabou, S. E. M. Application of machine learning models in the capacity prediction of RCFST
columns. Sci. Rep. 13 (1), 20878. https://doi.org/10.1038/s41598-023-48044-1 (2023).
21. Ben Chaabene, W. & Nehdi, M. L. Genetic programming based symbolic regression for shear capacity prediction of SFRC beams.
Constr. Build. Mater. 280, 122523. https://doi.org/10.1016/j.conbuildmat.2021.122523 (2021).
22. Megahed, K., Mahmoud, N. S. & Abd-Rabou, S. E. M. Prediction of the axial compression capacity of stub CFST columns using
machine learning techniques. Sci. Rep. 14 (1), 2885. https://doi.org/10.1038/s41598-024-53352-1 (2024).
23. Lu, W. Y., Lin, I. J. & Hwang, S. J. Shear strength of reinforced concrete corbels. Mag Concr Res. 61 (10), 807–813 (2009).
24. Zhang, N. & Tan, K. H. Direct strut-and-tie model for single span and continuous deep beams. Eng. Struct. 29 (11), 2987–3001.
https://doi.org/10.1016/j.engstruct.2007.02.004 (2007).
25. Hanoon, A. N., Jaafar, M. S., Al Zaidee, S. R., Hejazi, F. & Aziz, F. N. A. A. Effectiveness factor of the strut-and-tie model for
reinforced concrete deep beams strengthened with CFRP sheet, J. Build. Eng., vol. 12, no. June pp. 8–16, 2017, doi: (2016). https://
doi.org/10.1016/j.jobe.2017.05.001
26. Li, R., Deng, M., Zhang, Y. & Wei, D. Shear strengthening of reinforced concrete deep beams with highly ductile fiber-reinforced
concrete jacket, J. Build. Eng., vol. 48, no. December p. 103957, 2022, doi: (2021). https://doi.org/10.1016/j.jobe.2021.103957
27. Hwang, S. J., Lu, W. Y. & Lee, H. J. Shear strength prediction for deep beams, ACI Struct. J., vol. 97, no. 3, pp. 367–376, [Online].
Available: (2000). https://www.scopus.com/inward/record.uri?eid=2-s2.0-0034191709&partnerID=40&md5=740d3368c45f408c
b6472d616e219e3e
28. Koza, J. R. Genetic programming as a means for programming computers by natural selection. Stat. Comput. 4 (2), 87–112. https://
doi.org/10.1007/BF00175355 (1994).
29. Udrescu, S. M., Tegmark, M., Feynman, A. I. & A physics-inspired method for symbolic regression. Sci. Adv. 6 (16), eaay2631
(2020).
30. Goldberg, D. E. & Holland, J. H. Genetic algorithms and machine learning. Mach. Learn. 3 (2), 95–99. https://doi.
org/10.1023/A:1022602019183 (1988).
31. Cranmer, M. Interpretable Machine Learning for Science with PySR and SymbolicRegression.jl, 2023, [Online]. Available: http://
arxiv.org/abs/2305.01582
32. Kani, G. How safe are our large reinforced concrete beams? in Journal Proceedings, vol. 64, no. 3, pp. 128–141. (1967).
33. Asteris, P. G. & Mokos, V. G. Concrete compressive strength using artificial neural networks. Neural Comput. Appl. 32, 11807–
11826. https://doi.org/10.1007/s00521-019-04663-2 (2020).
34. Wang, J., Lu, R. & Cheng, M. Application of ensemble model in capacity prediction of the CCFST columns under axial and
eccentric loading. Sci. Rep. 13 (1), 9488. https://doi.org/10.1038/s41598-023-36576-5 (2023).
35. Chen, H., Yi, W. J. & Ma, Z. J. Shear size effect in simply supported RC deep beams. Eng. Struct. 182, 268–278. https://doi.
org/10.1016/j.engstruct.2018.12.062 (2019).
36. Sakalauskas, K. & Kaklauskas, G. Pure shear model for crack width analysis of reinforced concrete members. Sci. Rep. 13 (1),
13883. https://doi.org/10.1038/s41598-023-41080-x (2023).
37. Gerges, N. N. et al. Flexural capacity of eco-friendly reinforced concrete beams. Sci. Rep. 13 (1), 20142. https://doi.org/10.1038/
s41598-023-47283-6 (2023).
38. Ahmad, I. & Shokouhian, M. Promoting Sustainable Green Infrastructure: Experimental and Numerical Investigation of Concrete
Reinforced with Recycled Steel Fibers, Arch. Adv. Eng. Sci., no. SE-Research Articles, pp. 1–13, Jun. doi: (2024). https://doi.
org/10.47852/bonviewAAES42022837
39. Bin Inqiad, W., Ali Raza, M. & Asim, M. Predicting 28-Day Compressive Strength of Self-Compacting Concrete (SCC) Using
Gene Expression Programming (GEP), Arch. Adv. Eng. Sci., no. SE-Research Articles, pp. 1–13, Nov. doi: (2023). https://doi.
org/10.47852/bonviewAAES32021606
Acknowledgements
This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-
profit sectors.
Author contributions
K.M is repsonsable for all manscript.
Funding
Open access funding provided by The Science, Technology & Innovation Funding Authority (STDF) in cooper-
ation with The Egyptian Knowledge Bank (EKB).
Competing interests
The author declares no competing interests.
Additional information
Supplementary Information The online version contains supplementary material available at https://doi.
org/10.1038/s41598-024-74803-9.
Correspondence and requests for materials should be addressed to K.M.
Reprints and permissions information is available at www.nature.com/reprints.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and
institutional affiliations.
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which
permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give
appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and
indicate if changes were made. The images or other third party material in this article are included in the article’s
Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included
in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or
exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy
of this licence, visit http://creativecommons.org/licenses/by/4.0/.
1. use such content for the purpose of providing other users with access on a regular or large scale basis or as a means to circumvent access
control;
2. use such content where to do so would be considered a criminal or statutory offence in any jurisdiction, or gives rise to civil liability, or is
otherwise unlawful;
3. falsely or misleadingly imply or suggest endorsement, approval , sponsorship, or association unless explicitly agreed to by Springer Nature in
writing;
4. use bots or other automated methods to access the content or redirect messages
5. override any security feature or exclusionary protocol; or
6. share the content in order to create substitute for Springer Nature products or services or a systematic database of Springer Nature journal
content.
In line with the restriction against commercial use, Springer Nature does not permit the creation of a product or service that creates revenue,
royalties, rent or income from our content or its inclusion as part of a paid for service or for other commercial gain. Springer Nature journal
content cannot be used for inter-library loans and librarians may not upload Springer Nature journal content on a large scale into their, or any
other, institutional repository.
These terms of use are reviewed regularly and may be amended at any time. Springer Nature is not obligated to publish any information or
content on this website and may remove it or features or functionality at our sole discretion, at any time with or without notice. Springer Nature
may revoke this licence to you at any time and remove access to any copies of the Springer Nature journal content which have been saved.
To the fullest extent permitted by law, Springer Nature makes no warranties, representations or guarantees to Users, either express or implied
with respect to the Springer nature journal content and all parties disclaim and waive any implied warranties or warranties imposed by law,
including merchantability or fitness for any particular purpose.
Please note that these rights do not automatically extend to content, data or other material published by Springer Nature that may be licensed
from third parties.
If you would like to use or distribute our Springer Nature journal content to a wider audience or on a regular basis or in any other manner not
expressly permitted by these Terms, please contact Springer Nature at