Applied Artificial Intelligence For Predicting Construction Projects Delay
Applied Artificial Intelligence For Predicting Construction Projects Delay
net/publication/354871960
CITATIONS READS
20 1,445
5 authors, including:
Some of the authors of this publication are also working on these related projects:
Marginalization of end-user stakeholder’s in public private partnership road projects in Nigeria View project
All content following this page was uploaded by Christian Nnaemeka Egwim on 06 October 2021.
∗ Corresponding author.
E-mail addresses: [email protected] (C.N. Egwim), [email protected] (H. Alaka), [email protected] (L.O. Toriola-Coker),
[email protected] (H. Balogun), [email protected] (F. Sunmola).
https://doi.org/10.1016/j.mlwa.2021.100166
Received 19 February 2021; Received in revised form 28 May 2021; Accepted 13 September 2021
Available online 25 September 2021
2666-8270/© 2021 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license
(http://creativecommons.org/licenses/by-nc-nd/4.0/).
C.N. Egwim, H. Alaka, L.O. Toriola-Coker et al. Machine Learning with Applications 6 (2021) 100166
delays impact profitability and productivity negatively; and at a project the Artificial Intelligence (AI)/Machine Learning (ML) which has been
level where delay foster industry client’s dissatisfaction on its overall widely adopted across other industries, but construction industry is
performance, cessation of contracts by the owner, and unprofitability slow to adopt (Blanco et al., 2018). The adoption of AI/ML algorithms
for contractor(s). Furthermore it has been argued (Kumar, 2016) that in construction is relatively evolving, especially when compared to
delay often lead to project cost overruns, insolvency of organization, other industries like healthcare: guiding in the choice of treatment;
loss of opportunity of future projects, dispute among project stakehold- education: virtual lectures; and transportation: autonomous vehicles,
ers. Major delay factors have been identified by several researchers as it currently uses lots of methods that were used in the centuries past
as evident in vast body of international literature (from amongst the (Marks, 2017).
oldest articles to the most recent), e.g. bad weather and jurisdic- As a commonplace the industry produces massive amount of data
tional/contractual disputes in both United States of America (USA) and daily on every project. For example data produced from images cap-
UK by Baldwin, Manthei, Rothbart, and Harris (1971) and Sullivan and tured from smart devices, IoT sensors, Building Information Modelling
Harris (1986) respectively; variation orders in both Nigeria and United (BIM is defined as structured model of data that represents building
Arab Emirates (UAE) by Motaleb and Kishk (2010) and Odeyinka and elements with its usage spanning beyond the pre-construction phase to
Adebayo (1997) respectively; planning and scheduling deficiencies in the post-construction phase (Ameziane, 2000)) etc, presents a window
Australia, delay in payment certificates in Ghana and poor site man- of opportunity for the industry and its customers to examine and gain
agement in Malaysia by Shah (2016); ground problems and inefficient profits from insights generated from past construction data through
structural connections for prefabricated components in both the UK and the aid of AI and ML. AI is defined as a collection of state-of-the-
India by (Agyekum-Mensah & Knight, 2017; Ji et al., 2018) respectively art technologies that permit machines or any computer programme to
and finally shortage of adequate equipment and poor communication sense, comprehend, act and learn (Goyal, 2019). ML on the other hand
among contracting parties in China by Chen et al. (2019). is a branch of AI that allows computers to learn by a direct route from
Several research approaches and guidelines for mitigating delay examples, data and experience replacing the traditional approaches to
of construction projects have been established over the decades. For programming that relied on hardcoded step by step rules (Royal Soci-
instance, Sullivan and Harris (1986) suggested more teamwork espe- ety, 2017). Several ML algorithms such as Genetic Algorithm, Neural
cially at the early stages of project planning. According to Alaghbari Networks, Linear Regression, Logistic Regression, Nearest-Neighbour
and Sultan (2018), Assaf, Al-Khalil, and Al-Hazmi (1995), Enshassi, Mapping, Decision Trees, K-Means Clustering, Random Forests, and
Al Najjar, and Kumaraswamy (2009) and Owalabi et al. (2014) clients Support Vector Machines exist for ML model implementation. Which
should adhere to timely payment of progress fee and consider funding ML algorithm to use depends on lot of factors, e.g., ease of use, accu-
levels at the planning stage of project. Furthermore, the survey by racy, training time, etc. Few researchers have attempted the use of AI
Gondia et al. (2020), Yaseen et al. (2020) recommended the use of and ML algorithms in some aspect of construction. Poh, Ubeynarayana,
predictive models to mitigate delay risks and time claim in construction and Goh (2018) used five popular ML algorithms to predict accident
projects. Despite all these delay factors and recommendations towards occurrence and severity of construction sites in Singapore; Zou and
mitigating delay in construction, delay still strives in the industry, Ergan (2019) Leveraged on three ML techniques to predict the influence
hence the first motivation of this study. Interestingly, only a few of construction projects on urban quality of life; Arditi and Pulket
studies have taken the advantage of the contemporary analysis method (2005) and Mahfouz and Kandil (2012) used only one and three ML
which best explains the factors that can be affecting a phenomenon models respectively to forecast end results of construction litigation all
like delay based on its predictive capabilities. This analysis method is in the USA.
2
C.N. Egwim, H. Alaka, L.O. Toriola-Coker et al. Machine Learning with Applications 6 (2021) 100166
Table 1
List of features and target.
3
C.N. Egwim, H. Alaka, L.O. Toriola-Coker et al. Machine Learning with Applications 6 (2021) 100166
three factors. All these made a total of twenty-four factors as features how they affect the phenomenon (cause and effect) and often uses large
(independent variables). Also, the responders were asked to rate how samples (Easterby-Smith et al., 2008; Morgan, 1980).
long the entire project delayed for in the final Section E; this represents Prior to distribution of the questionnaire, pilot testing was con-
the target (dependent variable) for the EMLA implementation. In the ducted by asking group of experts in construction to comment on the
end, a total of 302 questionnaire were distributed. A total of 120 representativeness and suitability of the questions. This was done to
responses were received and since sampling cannot be done in isolation ensure thorough understanding of the questions by the responders and
as there are no special right decision for determining sample size to avoid errors when recording data, to assess questions validity and the
for a research Flick (2014), this number of responses are considered likely reliability of data to be collected (Saunders et al., 2009, p.425).
satisfactory (Delice, 2001; Durbarry, 2019; James, Joe, & Chadwick, The responders of the questionnaire were experienced stakeholders
2001). from the construction industries in Nigeria. They were instructed to
The questions were designed on a Likert scale with a scale of one have in mind any project of choice they have worked on in the past
to five. Although questions in each section were analysed individually, while answering the questions. Since this study aims to develop a
they were also linked together in such a way that their respective multilayer high performant ensemble of ensembles predictive model
answers accumulatively helped to arrive at a finding (delay). The use using hyperparameter optimized EMLA to predict delay of construction
of questionnaire research signifies independent observation — implies projects makes it a deductive research which further reinforces its
the questionnaire will be completed in the absence of the researcher, positivism. Convenient sampling method was selected due to its ease of
and since one of the objectives of this study is set out to establish the accessibility, geographical proximity and affordability which satisfied
true (most) applicable factor to construction delay makes it a positivist this research. Convenient sampling (also called haphazard/accidental
research. A positivist researcher is usually independent (of the subject) sampling) is a typical nonprobability/non-random sampling where a
as an observer, reduces a phenomenon to simpler measurable factors researcher considers the most convenient object(s) and time, effort
(causes of delay in construction projects deduced from several literature and money for conducting data collection (Matthews, Ross, & Ellison,
was reduced using Likert scale.), explains the elements with regards to 2010).
4
C.N. Egwim, H. Alaka, L.O. Toriola-Coker et al. Machine Learning with Applications 6 (2021) 100166
The collected data received via Google forms were extracted and ROC curve modelling evaluation metrics were employed to measure the
converted into a comma-separated values file. To achieve the sec- new model and EMLA performance on the testing dataset as shown in
ond objectives, this raw data was pre-processed into a clean and an Fig. 2
analysable dataset by carrying out data imputation and outlier detec-
tion. Scaling and encoding feature engineering techniques were em- 3. Results and analysis
ployed to enable the selection of features or input variables to increase
the predictive power (hyperparameter optimization) of the EMLA. The 3.1. Reliability analysis of survey outcomes
resulting clean, pre-processed and feature engineered dataset was split
randomly into two in a ratio of 60% to 40% of training dataset A reliability analysis from the Alpha Test of Cronbach was carried
and testing dataset respectively. EMLA were imported into a running out to test the reliability of the respondents ’ answers for all 24 factors.
instance of Jupiter Notebook using Scikit-learn — an integral Python Alpha of Cronbach (𝛼) can be written as:
programming language module with a broad spectrum of state-of-the- 𝑁.𝑐
𝛼=
art algorithms for supervised and unsupervised medium-scale problems 𝑣 + (𝑁 − 1) .𝑐
(Pedregosa et al., 2011). Since EMLA fit input variables (delay factors) where, 𝑁 is the number of factors, 𝑐 the average covariance between
to a known output variable (delay) supervised modelling taxonomy factor-pairs and 𝑣 the average variance. The main purpose of 𝛼 was
was undoubtedly chosen. The training dataset (60% of total dataset) to assess how accurate the data obtained from the survey were, by
was used to fit different EMLA while their knobs were optimized evaluating the internal consistency coefficient of data. In addition, it
during successive runs to further improve the performance for making was important to decide whether the combined factors help to calculate
predictions on unseen test dataset (40% of total dataset). The resulting the same construct (delay).
best performing EMLA selected via the hard and soft voting rule were While there is no lower bound, the higher the alpha coefficient of
used as new input variables which produced a multilayer high perfor- Cronbach is to 1 , the greater the internal accuracy of the factors (Gliem
mant ensemble of ensembles algorithm (to achieve the third objective). & Gliem, 2003). An 𝛼 of 0.7 or higher is known to be symptomatic
Finally, Accuracy, Confusion Matrix, Precision, Recall, 𝐹1 -Score and of strong inner harmony of the factors in determining the reliability
5
C.N. Egwim, H. Alaka, L.O. Toriola-Coker et al. Machine Learning with Applications 6 (2021) 100166
Table 3
Algorithms and libraries.
Table 4
Confusion matrix.
Prediction
Negative Positive
(delay < threshold limit) = 0 (delay > threshold limit) = 1
Negative True negative False positive
Actual
(delay < threshold limit) = 0
Positive False negative True positive
(delay > threshold limit) = 1
of the construct (Bhatnagar, Kim, & E. Many, 2014). However, It is (F25) represent the target/dependent variable as shown in Table 1.
the viewpoint of Nunnally (1978) that the 𝛼 should surpass 0.8 for Descriptive statistics of these columns showed they contain discrete
fundamental science to consider accurate responses to a factor. The categorical data with ordinal values from 1–5. Furthermore, Fig. 3
findings of this study on the 24 factors in this analysis show strong displays a summary of the central tendency, dispersion and shape of
inner stability 𝛼 of 0.938 as shown in Table 2. the dataset’s distribution, as relates to its mean, median and standard
deviation (std). For instance, F3 has a mean of 2.48, median of 3, and
3.2. Data pre-processing std of 1.11 — implies that on the average, during the course of most
of the project on which each respondent answers were based, Inflation
An initial investigation on the data through Exploratory Data Anal- or sudden increase in good/commodities (F3) was medium.
ysis (EDA) showed that the data is a two-dimensional array with 120 After EDA, correlation analysis was done to identify multicollinear-
rows and 25 columns where the 1st to the 24th columns (F1–F24 factor ity among predictors (features vs target) using their respective correla-
IDs) represent the features/independent variables and the 25th column tion coefficient values (See Fig. 4). The correlation matrix plot in Fig. 4
6
C.N. Egwim, H. Alaka, L.O. Toriola-Coker et al. Machine Learning with Applications 6 (2021) 100166
Fig. 7a. DT ROC AUC Plot. Fig. 7d. Extra Tree ROC AUC Plot.
Fig. 7b. RF ROC AUC Plot. Fig. 7e. AdaBoost ROC AUC Plot.
shows the cross correlation between each feature (F1–F24) and the 3.3. Feature engineering
target (F25). For example, F13 has a positive correlation of 0.4 to F25,
F5 has a negative correlation of −0.07 to F25 and so on. In general, the
As a habitual requirement for most ML estimators owing to their
existence of multicollinearity implies an absolute correlation coefficient
underlying assumptions of any given dataset to be normally distributed,
>0.7 among two or more predictors (Dormann et al., 2013). Evidently, with zero mean and unit variance (Pedregosa et al., 2011), this study
there exist multicollinearity between F12 and 13, F13 and F14, F13 and utilized standardization feature scaling method to meet this require-
F16 etc as shown below. ment by subtracting the mean from each feature observation and
7
C.N. Egwim, H. Alaka, L.O. Toriola-Coker et al. Machine Learning with Applications 6 (2021) 100166
the target (F25). Consequently, data from column F25 were encoded
into 0 (no delay) for its ordinal values containing any number <= 3
and 1(delay) for its ordinal values containing 4 and 5 such that on
each occurrence, this value (0 or 1) can then help to show if a category
(delay) is present or not. Finally, the dataset was split using Scikit-
learn’s train_test_split function at a ratio of 60:40 for training (72 data
points) and testing (48 data points) respectively.
8
C.N. Egwim, H. Alaka, L.O. Toriola-Coker et al. Machine Learning with Applications 6 (2021) 100166
Table 5
Default parameter ensemble’s performance metrics report.
Naive Bayes ensemble methods we used follows the Bayes’ theorem 3.6. Algorithms performance metrics
which establishes the link between dependent variable y and related
independent variables vector 𝑥1 to 𝑥𝑛 as shown below: Typical performance metrics for evaluating classification-based prob-
( ) lems like the one for this study are, accuracy score, confusion matrix,
( ) 𝑃 (𝑦) 𝑃 𝑥1 , … , 𝑥𝑛 |𝑦 ∏𝑛
( ) precision, recall, 𝑓1 score and Compute Area Under the Receiver
𝑃 𝑦|𝑥1 , … , 𝑥𝑛 = ∝ 𝑃 (𝑦) 𝑃 𝑥𝑖 |𝑦
𝑃 (𝑥1 ..., 𝑥𝑛 ) Operating Characteristic Curve (ROC AUC).
𝑖=1
9
C.N. Egwim, H. Alaka, L.O. Toriola-Coker et al. Machine Learning with Applications 6 (2021) 100166
Table 6
Hyperparameter optimized ensemble’s performance metrics report.
Accuracy score is the sum of accurate estimates that have been made cases with delay > threshold accurately predicted to surpass the total
and separated into one per cent by the overall number of predictions number of cases with delay > threshold in the test data of this study.
that have been made. Accuracy score is generally not the only preferred It is expressed mathematically as:
metrics to use for classifiers especially with skewed datasets. The
True Positives
formula used to calculate accuracy score for this study is: Recall =
True Positives + False Negatives
True Negatives + True Positives
Accuracy score = 𝑥100 𝐹1 Score is the harmonic mean of precision and recall. As regular
Number of Predictions
Confusion matrix is a 2 x 2 matrix description of the number of mean gives equal weight to all values, harmonic mean gives more
accurate and inaccurate predictions made by a classifier. The confusion weight to low values. The 𝐹1 score favours classifiers that have similar
matrix result used for EMLA experimentations is shown in Table 4. precision and recall in this study. It is expressed mathematically as:
Precision (false positive rate) measures the accuracy of positive 2 Precision 𝑥 Recall
𝐹1 = = 2𝑥
predictions. Hence, in this study it is the accurately predicted ratio 1
+ 1 Precision + Recall
of cases with delay < threshold limit to be less than or equal to the Precision Recall
threshold limit to the total number of cases with delay < threshold limit The Receiver Operator Characteristic Curve (ROC) is a recall plot
in the test data. It is expressed mathematically as: of the 𝑦-axis against precision of the 𝑥-axis. In this study, the threshold
True Positives of the algorithm, which ranges from zero to one with a scale of 0.1, is
Precision =
True Positives + False Positives seen on the vertical axis to the right of the plot and on the curve as
Recall (sensitivity or true positive rate) is the ratio of positive well. Area under the curve (AUC) is the area under the ROC curve that
instances that are correctly detected by the classifier. It is the ratio of is generally recognized as the best indicator of the overall performance
10
C.N. Egwim, H. Alaka, L.O. Toriola-Coker et al. Machine Learning with Applications 6 (2021) 100166
of a classifier. Since the maximum value of the precision and recall of BNB: binarize = True, alpha = 0, fit_prior = True, class_prior = None;
the x and y axes of the ROC curve are 1, the maximum AUC value, (ix) MNB: alpha = 100, fit_prior = False, class_prior = None; (x) GNB:
indicating excellent accuracy, is 1. The minimum AUC value, however, priors = None, var_smoothing = 1e-01.
is 0.5.
Cross validation is a resampling technique for evaluating machine 3.7. Algorithms performance evaluation
learning models on a small dataset. Since the estimators (see Table 3)
for this study are classifiers and the target variable is binary, stratified At first the EMLA experimentations was performed on the first
k-fold a variant of k-fold that returns stratified folds containing about 7 algorithms (see Table 3) with their default parameters (without
the same proportion of target class as the initial dataset is used to hyperparameter optimization) on the unseen test dataset (40% of total
evaluate cross validation scores. As a result, the variation between dataset) after training the EMLA (model fitting on training dataset).
the estimates is minimized, and the average error estimation is more Table 5 shows their respective evaluation metrics report on test dataset.
accurate, hence mitigates potential overfitting. To obtain the cross A close attention to column 9 of Table 5 shows that the best of them
validation score in this study, we took the mean of stratifield10-fold only had a 15% increase in the minimum AUC value. Hyperparameter
(i.e., where k = 10) for each EMLA. optimization became more necessary, hence we proceeded with it and
The main parameters for each model are as follows. (i) DT: ran- decided not to continue the model training and testing experiment with
dom_state = 42, min_samples_leaf = 5, criterion = ’gini’, the default parameter for the last algorithm due to computational cost
min_samples_split = 4, n_jobs = -1; (ii) RF: n_estimators = 100, n_jobs and time.
= -1, random_state = 42, bootstrap = True, warm_start = False; (iii) Interestingly, we obtained almost double (27%) in value after hy-
Bagging: n_estimators = 100, bootstrap = True, n_jobs = -1, ran- perparameter optimization on the EMLA when compared to the initial
dom_state = 42, min_samples_leaf = 2, min_samples_split = 3, verbose EMLA experimentations with their default parameters on test dataset
= 1; (iv) Extra-Trees: min_samples_split = 4, random_state = 42, (40% of total dataset) as shown in Table 6.
criterion = ’entropy’, n_jobs = -1, min_samples_leaf = 2, n_estimators Table 6 presents the evaluation metrics report of the models de-
= 100; (v) AdaBoost: max_depth = 1, n_estimators = 100, learn- veloped with the 11 algorithms used for EMLA experimentations on
ing_rate = 3, min_samples_split = 3, n_jobs = -1, random_state = 42; the test data (40% of total dataset) for a weak learner DT as the base
(vi) GBM: loss: ‘deviance’, n_jobs = -1, random_state = 42, learn- estimator. Comparing the ensemble of ensembles (stacking) approach
ing_rate = 2, min_samples_split = 3, min_samples_leaf = 3; (vii) XG- proposed with existing naïve bayes method used to potentially predict
Boost: base_score = 1, booster = ’gbtree’, colsample_bylevel = 1.9, construction project delay risk by Gondia et al. (2020), this report
colsample_bynode = 1, colsample_bytree = 1, gamma = 0, gpu_id = clearly shows and confirms that although naïve bayes which is consid-
-1, importance_type = ’gain’, learning_rate = 1, max_delta_step = 2, ered as one of the most effective and efficient inductive EMLA due to
max_depth = 6, min_child_weight = 1, n_estimators = 100, n_jobs = -1, the conditional independence assumption on which it is theoretically
num_parallel_tree = 3, random_state = 42, tree_method = ’exact’; (viii) built, is however rarely valid in a real-world applications such as
11
C.N. Egwim, H. Alaka, L.O. Toriola-Coker et al. Machine Learning with Applications 6 (2021) 100166
the case in this study (Zhang, 2004). Furthermore, the better perfor- Gaussian, however although one of them (multinomial) did not perform
mance (75%, 76%, and 0.7448 of accuracy score, cross validation score as much as the base estimator, we proceeded with vote casting, which
and ROC AUC respectively) of Gaussian naïve bayes over Bernoulli ultimately yielded a third performant model. Figs. 8–10 shows how
naïve bayes (60.42%, 60% and 0.6171 of accuracy score, cross vali- these ensembles made their respective decisions interdependently.
dation score and ROC AUC respectively) and multinomial naïve bayes Where Ensemble 1, 2, and 3 are the resulting new performant
(60.42%, 75% and 0.6031 of accuracy score, cross validation score and models obtained during vote casting in ensemble bagging, boosting,
ROC AUC respectively) owes to the existence of dependences between and naïve bayes respectively. All in all, we conclusively proceeded to
its features since our dataset was optimally standardized to normally use these aggregated predictions from Ensemble 1, 2, and 3 to train a
distributed features (see Section 3.3). Fig. 11 (see Appendix A) shows new algorithm called Ensemble of Ensembles using Scikit-learn’s Mlens
the tree structure of this base estimator while Figs. 7a to 7j present the library. Consequently, resulting to a more performant model with a
multiple ROC AUC plot for the algorithms used. much more better accuracy score, confusion matrix, precision, recall, f1
For a start, we benched marked our EMLA on DT (typical unstable score, and Compute Area Under the Receiver Operating Characteristic
model) as their base estimator, then we proceeded the experimentation Curve (ROC AUC) as shown in Table 6.
by trying to gain stability for the base estimator using 3 bagging
ensemble algorithms namely optimized RF (a natural ensemble of DT), 4. Conclusion & recommendations
Bagging and Extremely Randomized Trees. As expected, they all per-
formed better than the base estimator based on all evaluation metrics The perpetual occurrence of a global phenomenon — delay in
(see Table 6), the challenge, however, was how to identify the best construction sector despite considerable mitigation efforts remains a
ensemble algorithm. To avoid bias and to enhance generalizability, we huge concern to its policy makers. Interestingly, this sector which
casted a vote with the bagging ensembles using the hard and soft voting produces massive amount of data from IoT sensors, building infor-
rule in Scikit-learn’s VotingClassifier. mation modelling, on most of its projects daily is slow in taking the
A resulting performant model (higher accuracy score and low vari- advantage of the contemporary analysis method — artificial intel-
ance) undoubtedly emerged. The experiment was repeated with the ligence/machine learning which best explains the factors that can
base estimator but this time using 3 boosting ensemble algorithms: affect a phenomenon like delay based on its predictive capabilities
Adaptive Boosting (CART), Gradient Boosting Machine, and Extreme haven been widely adopted across other sectors. In this study there-
Gradient Boosting and not surprisingly similar to the bagging ensem- fore, a premise to use ensemble machine learning algorithms (EMLA)
bles, they all performed better than the base estimator (see Table 6). for predicting delay of construction projects was architected, built
We again casted votes amongst them which yielded yet another perfor- and presented. First a review of existing body of knowledge on in-
mant model. Finally, we again repeated the experiment with the base fluencing factors of construction projects delay was used to conduct
estimator using 3 naïve bayes ensembles: Bernoulli, multinomial and survey of experts as an approach to its data collection and exploration.
12
C.N. Egwim, H. Alaka, L.O. Toriola-Coker et al. Machine Learning with Applications 6 (2021) 100166
The resulting dataset applied to EMLA was used to develop hyperpa- construction project within the sector, the unique data transformation
rameter optimized predictive models: Decision Tree, Random Forest, employed in this study may not, as typical of any data driven model,
Bagging, Extremely Randomized Trees, Adaptive Boosting (CART), Gra- be transferable to the data from other regions. Nevertheless, other
dient Boosting Machine, and Extreme Gradient Boosting. Finally, a region’s project datasets can be applied to the processes described in
multilayer high performant ensemble of ensembles predictive model this study. Also, the sample size of the respondents of this study may
was developed to maximize the overall performance of the EMLA not be representative of the total population size of the region. In order
combined. to produce improved classification outcomes, future studies should
Results from the algorithm evaluation metrics: accuracy score, con- be targeted at extending the algorithms either by further parameter
fusion matrix, precision, recall, F1, and ROC AUC indeed proved that optimization or feature engineering. Other methods used in the creation
EMLA are capable of improving the predictive force relative to the of ensemble models, apart from bagging, boosting, naïve bayes and
use of a single algorithm in predicting construction projects delay. stacking, should also be considered for predicting construction projects
By developing a multilayer high performant ensemble of ensembles delay.
predictive model, the current research contributes to the effort of
improving time efficiency of construction projects – a key performance CRediT authorship contribution statement
indicator for successful projects. Ultimately, this model can subse-
quently be integrated into construction information system to promote
Christian Nnaemeka Egwim: Conceptualization, Methodology,
evidence-based decision-making, thereby enabling constructive project
Software, Formal analysis, Writing – original draft. Hafiz Alaka: Su-
risk management initiatives. As compared to existing numerical or
pervision, Writing – review & editing, Investigation, Visualization.
statistical approaches, which used pure mathematical techniques such
Luqman Olalekan Toriola-Coker: Data curation. Habeeb Balogun:
as the arithmetic mean, standard deviation, hypothesis testing, etc.
Resources, Data curation. Funlade Sunmola: Supervision, Writing –
to draw inference from data, our predictive analytics approach used
review & editing.
known results (input variables), statical methods and advance ML
algorithms to develop a novel multilayer high performant ensemble
Appendix A
of ensembles predictive model to forecast futuristic delay values for
complex and new data of typical construction projects. Thus, will help
improve the quality of decisions and risks to be taken by several See Fig. 11.
construction sector stakeholders on their present or future construction
projects which as a result will foster trust, increase in productivity Appendix B. Supplementary data
and revenue and more importantly yield timely delivery of construc-
tion projects in the sector. While the proposed contemporary method Supplementary material related to this article can be found online
of analysis is assumed to be applicable in mitigating delay of any at https://doi.org/10.1016/j.mlwa.2021.100166.
13
C.N. Egwim, H. Alaka, L.O. Toriola-Coker et al. Machine Learning with Applications 6 (2021) 100166
References Ameziane, F. (2000). Information system for building production management. Interna-
tional Journal of Production Economics, 64(1), 345–358. http://dx.doi.org/10.1016/
S0925-5273(99)00071-7, Elsevier Science B.V..
Abdul-Rahman, H., et al. (2011). Project schedule influenced by financial issues: Arditi, D., & Pulket, T. (2005). Predicting the outcome of construction litigation using
Evidence in construction industry. Scientific Research and Essays, 6(1), 205–212. boosted decision trees. Journal of Computing in Civil Engineering, 19(4), 387–393.
http://dx.doi.org/10.5897/SRE10.989. http://dx.doi.org/10.1061/(ASCE)0887-3801(2005)19:4(387), American Society of
Civil Engineers.
Agyekum-Mensah, G., & Knight, A. D. (2017). The professionals’ perspective on the
Asadi, A., Alsubaey, M., & Makatsoris, C. (2015). A machine learning approach for
causes of project delay in the construction industry. Engineering, Construction and
predicting delays in construction logistics. International Journal of Advanced Logistics,
Architectural Management, 24(5), 828–841. http://dx.doi.org/10.1108/ECAM-03-
4(2), 115–130. http://dx.doi.org/10.1080/2287108x.2015.1059920, Informa UK
2016-0085.
Limited.
Alaghbari, W., & Sultan, B. (2018). Delay factors impacting construction projects Assaf, S. A., & Al-Hejji, S. (2006). Causes of delay in large construction projects.
in Sana ’ a -Yemen 1. PM World Journal, VII(December), 1–28, Available at: International Journal of Project Management. Pergamon, 24(4), 349–357. http://dx.
https://www.researchgate.net/publication/329656460%0ADelay. doi.org/10.1016/j.ijproman.2005.11.010.
14
C.N. Egwim, H. Alaka, L.O. Toriola-Coker et al. Machine Learning with Applications 6 (2021) 100166
Assaf, S. A., Al-Khalil, M., & Al-Hazmi, M. (1995). Causes of delay in large building Kumar, R. D. (2016). Causes and effects of delays in construction industry. International
construction projects. Journal of Management in Engineering. American Society of Research Journal of Engineering and Technology, 3(4), 1831–1837, Available at:
Civil Engineers, 11(2), 45–50. http://dx.doi.org/10.1061/(ASCE)0742-597X(1995) www.irjet.net.
11:2(45). Mahfouz, T., & Kandil, A. (2012). Litigation outcome prediction of differing site
Badawi, H., et al. (2019). Use of ensemble methods for indirect test of RF circuits: Can condition disputes through machine learning models. Journal of Computing in
it bring benefits? In LATS 2019-20th IEEE Latin American test symposium (no. 1). Civil Engineering, 26(3), 298–308. http://dx.doi.org/10.1061/(ASCE)CP.1943-5487.
http://dx.doi.org/10.1109/LATW.2019.8704641. 0000148.
Baldwin, J. R., Manthei, J. M., Rothbart, H., & Harris, R. B. (1971). Causes of delay in Marks, M. (2017). Construction: The next great tech transformation Voices Michael
the construction industry. Journal of the Construction Division, 177–187, Available at: Marks.
https://cedb.asce.org/CEDBsearch/record.jsp?dockey=0018302. (Accessed 23 April Matthews, B., Ross, L., & Ellison, N. (2010). RESEARCH METHODS : A great starting
2020). point for students and would-be social researchers. Available at: www.pearsoned.
Bartholomew, S. H. (2001). Construction contracting: business and legal principles. Prentice co.uk/matthews. (Accessed 24 September 2020).
Hall. Morgan, D. H. J. (1980). Sociological paradigms and organisational analysis. Sociology,
Bhatnagar, R., Kim, J., & E. Many, J. (2014). Candidate surveys on program evaluation: 14(2), 332–333. http://dx.doi.org/10.1177/003803858001400219.
Examining instrument reliability, validity and program effectiveness. American Motaleb, O., & Kishk, M. (2010). An investigation into causes and effects of construction
Journal of Educational Research. Science and Education Publishing Co. Ltd, 2(8), delays in UAE. In Association of Researchers in Construction Management, ARCOM
683–690. http://dx.doi.org/10.12691/education-2-8-18. 2010 - Proceedings of the 26th Annual Conference (pp. 1149–1157). Available at:
Blanco, J. L., et al. (2018). Artificial intelligence : Construction technology https://www.researchgate.net/publication/266174953. (Accessed 24 April 2020).
’ s next frontier. Mckinsey & Company, (April), 1–8. Available at: Nunnally, J. C. (1978). Phychometric theory (p. 640).
https://www.mckinsey.com/industries/capital-projects-and-infrastructure/our- Odeyinka, H., & Adebayo, Y. (1997). The causes and effects of construction delays on
insights/artificial-intelligence-construction-technologys-next-frontier. completion cost of housing projects in Nigeria. Journal of Financial Management of
Chen, G. X., et al. (2019). Investigating the causes of delay in grain bin construction Property and Construction, 2(3), 31–44, Available at: https://www.researchgate.net/
projects: the case of China. International Journal of Construction Management. Taylor publication/249643683. (Accessed 24 April 2020).
and Francis Ltd., 19(1), 1–14. http://dx.doi.org/10.1080/15623599.2017.1354514. Opitz, D., & Maclin, R. (1999). Popular ensemble methods: An empirical study. Journal
Delice, A. (2001). The sampling issues in quantitative research. Educational Sciences: of Artificial Intelligence Research. Morgan Kaufmann Publishers, 11, 169–198. http:
Theory & Practices, 10(4), 2001–2019. //dx.doi.org/10.1613/jair.614.
Dietterich, Thomas G. (2000). Ensemble methods in machine learning. In Lecture notes Owalabi, J. D., et al. (2014). Causes and effects of delay on project construction delivery
in computer science (including subseries lecture notes in artificial intelligence and lecture time. International Journal of Education and Research, 2(4), 197–208, Available at:
notes in bioinformatics), (pp. 1–15). http://dx.doi.org/10.1007/3-540-45014-9_1. www.ijern.com. (Accessed 24 April 2020).
Dormann, C. F., et al. (2013). Collinearity: A review of methods to deal with it and a Pedregosa, F., et al. (2011). Scikit-learn: Machine learning in Python. Journal of Machine
simulation study evaluating their performance. Ecography. Blackwell Publishing Ltd, Learning Research, 12(85), 2825–2830, Available at: http://scikit-learn.sourceforge.
36(1), 27–46. http://dx.doi.org/10.1111/j.1600-0587.2012.07348. net. (Accessed 7 January 2021).
Durbarry, R. (2019). Quantitative research. Research Methods for Tourism Students, Poh, C. Q. X., Ubeynarayana, C. U., & Goh, Y. M. (2018). Safety leading indicators for
(August), 98–113. http://dx.doi.org/10.4324/9780203703588-9. construction sites: A machine learning approach. Automation in Construction, 93,
Easterby-Smith, M., et al. (2008). Management research: theory and practice. 375–386. http://dx.doi.org/10.1016/j.autcon.2018.03.022, Elsevier B.V..
cloudgene.uibk.ac.at. Available at: https://cloudgene.uibk.ac.at/l2xq3pav9aoy/12- Rhodes, C. (2019). Construction industry: statistics and policy (01432), (pp. 1–13). House
alysha-lowe/read-9781847871770-management-research-theory-and-practice- of Commons Library.
paperbac.pdf. (Accessed 24 September 2020). Robinson, T. G. (2015). Global construction market to grow $ 8 trillion by 2030 :
Egan, S. J. (2018). Rethinking the Report of the Construction Task Force, Construction driven by China, US and India. Global Construction, 44.
Task Force, Available at: http://constructingexcellence.org.uk/wp-content/uploads/ Royal Society, T. (2017). Machine learning: The power and promise of computers that
2014/10/rethinking_construction_report.pdf. learn by example.
Egwim, C. N., et al. (2021). Extraction of underlying factors causing construction Saunders, M., et al. (2009). Research methods for business students (5th ed.). Pearson
projects delay in Nigeria. Journal of Engineering, Design and Technology, http://dx. Education Limited, Available at www.pearsoned.co.uk. (Accessed 24 September
doi.org/10.1108/jedt-04-2021-0211, ahead-of-p(ahead-of-print). 2020).
Enshassi, A., Al Najjar, J., & Kumaraswamy, M. (2009). Delays and cost overruns in the Seni, G., & Elder, J. F. (2010). Synthesis lectures on data mining and knowledge discovery:
construction projects in the Gaza Strip. Journal of Financial Management of Property vol. 2, Ensemble methods in data mining: improving accuracy through combining
and Construction, 14(2), 126–151. http://dx.doi.org/10.1108/13664380910977592. predictions (1), (pp. 1–126). Morgan & Claypool Publishers LLC, http://dx.doi.org/
European Comission (2017). European construction sector observatory. (p. 27). (June). 10.2200/s00240ed1v01y200912dmk002.
Flick, U. (2014). An introduction to qualitative research. Available at: https://us.sagepub. Shah, R. K. (2016). An exploration of causes for delay and cost overruns in construction
com/en-us/nam/an-introduction-to-qualitative-research/book240398. projects: Case study of Australia, Malaysia & Ghana. Journal of Advanced College
Flyvbjerg, B. (2014). What you should know about megaprojects and why: An overview. of Engineering and Management, 2, 41. http://dx.doi.org/10.3126/jacem.v2i0.16097,
Project Management Journal, 45(2), 6–19. http://dx.doi.org/10.1002/pmj.21409. Nepal Journals Online (JOL).
Gliem, J. a., & Gliem, R. R. (2003). Calculating, interpreting, and reporting cronbach’s Sullivan, A., & Harris, F. C. (1986). Delays on large construction projects. International
alpha reliability coefficient for likert-type scales. In 2003 Midwest research to Journal of Operations & Production Management, 6(1), 25–33. http://dx.doi.org/10.
practice conference in adult, continuing, and community education (1992), (pp. 82–88). 1108/eb054752.
http://dx.doi.org/10.1109/PROC.1975.9792. Wang, T. (2018). China: construction industry’s contribution share to GDP 2018–2021 |
Gondia, A., et al. (2020). Machine learning algorithms for construction projects delay statista. Statista. Available at: https://www.statista.com/statistics/1068213/china-
risk prediction. Journal of Construction Engineering and Management. American Society construction-industry-gdp-contribution-share/. (Accessed 18 April 2020).
of Civil Engineers (ASCE), 146(1), Article 04019085. http://dx.doi.org/10.1061/ Wang, T. (2019). Value added of U.S. construction as a percentage of GDP 2018.
(ASCE)CO.1943-7862.0001736. Statista. Available at: https://www.statista.com/statistics/192049/value-added-by-
Goyal, M. (2019). Artificial intelligence : A tool for hyper personalization. International us-construction-as-a-percentage-of-gdp-since-2007/. (Accessed 18 April 2020).
Journal of 360 Management Review, 07, 2320–7132. Wei, G., et al. (2020). A novel hybrid feature selection method based on dynamic
Haq, S., et al. (2017). Effects of delay in construction projects of punjab-Pakistan : An feature importance. Applied Soft Computing, 93, Article 106337. http://dx.doi.org/
empirical study effects of delay in construction projects of punjab-Pakistan. Journal 10.1016/j.asoc.2020.106337.
of Basic and Applied Scientific Research, 3(January 2014), 87–96. Woetzel, J., et al. (2017). Bridging infrastructure gaps has the world made progress?.
Hastie, T., Tibshirani, R., & Friedman, J. (2009). Ensemble learning. (pp. 1–20). McKinsey Global Institute, (October), p. 10. Available at: https://www.
http://dx.doi.org/10.1007/b94608_16. mckinsey.com/~/media/McKinsey/Industries/CapitalProjectsandInfrastructure/
James, E., Joe, W., & Chadwick, C. (2001). Organizational research : Determining OurInsights/Bridginginfrastructuregapshastheworldmadeprogress/
appropriate sample size in survey research. Information Technology, Learning, and BridginginfrastructuregapsHowhastheworldmadeprogressv2/MGI-Bridging-
Performance Journal, 19(1), 43–50. infrastructure-gaps.
Ji, Y., et al. (2018). Assessing and prioritising delay factors of prefabricated concrete Yaseen, Z. M., et al. (2020). Prediction of risk delay in construction projects using a
building projects in China. Applied Sciences, 8(11), 2324. http://dx.doi.org/10. hybrid artificial intelligence model. Sustainability, 12(4), 1514. http://dx.doi.org/
3390/app8112324, (Switzerland). MDPI AG. 10.3390/su12041514.
Kira, K., & Rendell, L. A. (1992). A practical approach to feature selection. In Machine Zhang, H. (2004). The optimality of naive Bayes naive Bayes and augmented naive
learning proceedings 1992 (pp. 249–256). Elsevier, http://dx.doi.org/10.1016/b978- Bayes. Aa, 1(2), 3, Available at: www.aaai.org. (Accessed 27 May 2021).
1-55860-247-2.50037-1. Zou, Z., & Ergan, S. (2019). Leveraging data driven approaches to quantify the impact of
construction projects on urban quality of life. (pp. 1–31). arXiv preprint arXiv:1901.
09084. Available at: https://arxiv.org/abs/1901.09084. (Accessed 8 May 2020).
15