Physical Energy and Data-Driven Models in Building Energy Prediction
Physical Energy and Data-Driven Models in Building Energy Prediction
Energy Reports
journal homepage: www.elsevier.com/locate/egyr
Review article
article info a b s t r a c t
Article history: The difficulty in balancing energy supply and demand is increasing due to the growth of diversified
Received 3 June 2021 and flexible building energy resources, particularly the rapid development of intermittent renewable
Received in revised form 15 January 2022 energy being added into the power grid. The accuracy of building energy consumption prediction is
Accepted 19 January 2022
of top priority for the electricity market management to ensure grid safety and reduce financial risks.
Available online 10 February 2022
The accuracy and speed of load prediction are fundamental prerequisites for different objectives such
Keywords: as long-term planning and short-term optimization of energy systems in buildings and the power grid.
Building energy modeling The past few decades have seen the impressive development of time series load forecasting models
Load prediction focusing on different domains and objectives. This paper presents an in-depth review and discussion of
Machine learning building energy prediction models. Three widely used prediction approaches, namely, building physical
Building energy simulation energy models (i.e., white box), data-driven models (i.e., black box), and hybrid models (i.e., grey box),
Time series forecasting were classified and introduced. The principles, advantages, limitations, and practical applications of
each model were investigated. Based on this review, the research priorities and future directions in the
domain of building energy prediction are highlighted. The conclusions drawn in this review could guide
the future development of building energy prediction, and therefore facilitate the energy management
and efficiency of buildings.
© 2022 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY license
(http://creativecommons.org/licenses/by/4.0/).
Contents
1. Introduction..................................................................................................................................................................................................................... 2657
1.1. Literature reviews .............................................................................................................................................................................................. 2657
1.2. Objectives and structure of the review........................................................................................................................................................... 2658
2. Building physical energy models – ‘‘white box’’ ........................................................................................................................................................ 2658
2.1. EnergyPlus .......................................................................................................................................................................................................... 2659
2.2. TRNSYS ................................................................................................................................................................................................................ 2659
2.3. Dymola ................................................................................................................................................................................................................ 2660
2.4. Other tools .......................................................................................................................................................................................................... 2660
2.5. Discussion on building physical energy models ............................................................................................................................................ 2660
3. Data-driven models using machine learning algorithms – ‘‘black box’’.................................................................................................................. 2660
3.1. Linear regression (LR)........................................................................................................................................................................................ 2661
3.2. Support vector machine (SVM) ........................................................................................................................................................................ 2661
3.3. Random forest (RF)............................................................................................................................................................................................ 2662
3.4. Extreme gradient boosting (XGBoost) and lightGBM .................................................................................................................................... 2662
3.5. Artificial neural network (ANN)....................................................................................................................................................................... 2663
3.6. Recurrent neural network (RNN) ..................................................................................................................................................................... 2663
3.7. Other models...................................................................................................................................................................................................... 2664
3.8. Discussion of data-driven models.................................................................................................................................................................... 2665
4. Hybrid models – ‘‘grey box’’ ......................................................................................................................................................................................... 2665
4.1. resistance–capacitance (RC) thermal network ............................................................................................................................................... 2665
∗ Corresponding authors.
E-mail addresses: [email protected] (Y. Chen), [email protected] (Z. Chen).
https://doi.org/10.1016/j.egyr.2022.01.162
2352-4847/© 2022 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
Y. Chen, M. Guo, Z. Chen et al. Energy Reports 8 (2022) 2656–2671
recent years. Deep learning is an example of particularly suc- building owner to select a suitable model in practical engineering
cessful (Sun et al., 2020; Gassar et al., 2019). In the data-driven (4) summarizing the widely used models at present and pointing
model domain, two main factors, i.e., feature importance and out future direction of building energy prediction models. The
algorithm selection, were considered in the previous literature. paper is organized as follows. Section 2 elaborates on the physical
The input feature variables, including external and internal fac- building energy models by introducing and comparing different
tors, are the key elements for the prediction performance of the commonly used simulation tools. Section 3 studies the data-
algorithms (Zhang and Wen, 2019a; Luo et al., 2020). Although driven energy prediction models, and the hybrid methods are
the data-driven model has the merit of requiring less building introduced in Section 4. The advantages and disadvantages of
information to develop the model, the prediction performance is
each approach are presented in Section 5, and the conclusions
unstable, especially when the model is applied to other building
are drawn in Section 6.
cases. In addition, hybrid models have been developed simultane-
ously to improve the prediction performance by integrating the
advantages of physical and data-driven models. 2. Building physical energy models – ‘‘white box’’
There are several review papers about building energy pre-
diction (Foucquier et al., 2013; Wang and Srinivasan, 2017a;
Building physical energy models, also called physical models,
Amasyali and El-Gohary, 2018). However, there are still two
research gaps. First, most of the review papers focused on data- are based on heat and mass balance equations, which present the
driven models using machine learning algorithms (i.e., ‘‘black dynamic thermal behavior of buildings. Three heat transfer mod-
box’’ model). Despite the importance of these review efforts, els (i.e., conduction, convection and radiation) between building
physical and hybrid models are also important and well- envelop and its surroundings are considered in the heat balance
developed that they should be included and further discussed. analysis of physical energy models. Various commercial or open-
Second, review studies that cover overall building energy con- source software products such as EnergyPlus, Dymola, TRNSYS,
sumption prediction research in terms of different prediction DOE-2, and Matlab are available for building energy modeling to
spans (i.e., STLF and LTLF) are still insufficient. Such a review is construct and solve these equations conveniently (Harish and Ku-
essential for building owners to select an appropriate prediction mar, 2016), though the cooling and heating load can be calculated
model. Distinguishing from the published review papers, the manually. The description of heat and mass balance equations
novelty of this paper is to elaborate on building energy prediction and detailed steps to calculate the building heating and cooling
models based on prediction span. In the field of building energy loads was introduced in this paper (Hensen and Lamberts, 2012).
prediction, the prediction span is diverse according to practical Understanding the overall physical characteristics of buildings
engineering needs. Different models have different performances is important for using these building simulation tools. The heat
in manifold tasks of various prediction spans. In this review study, flow through the building envelope is determined not only by
three types of methods in different prediction spans (i.e., STLF and
the temperature difference, thermal resistance, and surface area,
LTLF) were investigated. The principle, advantages, limitations,
but also by the thermal inertia effect of the thermal mass, which
and practical applications of each method were investigated. In
results in heat lag. In general, detailed building information is
summary, this paper paves the way for a better understanding of
required to develop such models. Building envelope parameters,
the methodology for building energy prediction.
HVAC systems setting, internal heat gains, equipment and occu-
1.2. Objectives and structure of the review pancy schedules, thermal zones, location, and weather data are
essential to construct a physical building energy model (Crawley
The goal of this paper is to provide a comprehensive review et al., 2001). Zonal (Inard et al., 1996) and nodal (Zhai et al., 2011)
of building energy prediction approaches. The goals of this pa- approaches are two common methods for developing a physical
per are fourfold: (1) presenting a systematic review (including model. These approaches are a fast and simple way to estimate
physics-based, data-driven, and hybrid approaches) to facilitate the heat behavior of buildings (Foucquier et al., 2013). The rest
the development of energy prediction models; (2) describing the of this section describes the modeling process, advantages and
key processes and tactics of each approach; (3) paving a way for limitations, and applications of the commonly used software.
2658
Y. Chen, M. Guo, Z. Chen et al. Energy Reports 8 (2022) 2656–2671
Table 1
Brief description of different simulation tools.
Software tool Modeling approach Advantages Disadvantages Representative Reference
Require small computation time; Require a significant amount of time, U.S. Department of Energy (2021),
Applicable to large scale buildings; experience, and effort to enter the Trcka and Hensen (2010)
EnergyPlus Causal Good at envelope modeling; detailed parameters; and Fumo et al. (2010)
Friendly for beginners; Some required parameters are not
Free available;
Not good at HVAC systems
Require small computation time; Not good at building physical model; Anon (2021d), Quesada et al. (2011)
Flexibility and customization; Chargeable and Jani et al. (2020)
TRNSYS Causal Modular design;
Good at solar energy systems;
Friendly for beginners
Flexibility and customization; Not friendly for beginners; Anon (2021c), Hafner et al. (2014)
Modular design; Relatively long computation time; and Violidakis et al. (2020)
Dymola Acausal
Good at HVAC systems modeling; Chargeable
High reuse of components
DOE-2 Causal Good at building physics modeling; Not good at energy systems; Carriere et al. (1999) and
A traditional building simulation tool Unfriendly user interface Winkelmann and Selkowitz (1985)
31.9%; short-term: CV-RMSE 20.2%) are not good for this long- hidden layer ANN is shown in Fig. 8. A typical ANN has three
term task. XGBoost is good at long-term prediction, as other interconnected layers: input, middle (i.e., hidden), and output
studies have found. Lu et al. (2020) proposed a novel model that layers. Theoretically, the hidden layer consists of many sub-layers
combines XGBoost to predict the long-term energy consump- depending on the complexity and nature of the task (Mandal
tion of an intake tower. The MAPE of the prediction results of et al., 2006; Kiartzis et al., 1997).
the different methods are as follows: CEEMDAN-XGBoost: 4.85%, In addition to the factor of prediction accuracy, computing
XGBoost: 8.06%, CEEMDAN-RF: 6.26%, and PSO-SVM: 7.92%. time is another critical factor in evaluating the performance of a
In the building energy demand, the load demand of HVAC model. Generally, increasing the dimension of input features can
systems is the main difficulty to estimate because of its nonlinear improve prediction accuracy, but this strategy may also increase
character. Lu and Meng (2020) found that XGBoost is the best the computation cost, particularly for massive data processing.
model for forecasting AC energy use in residential buildings in Ahmad et al. (2017) compared an ANN and RF for HVAC electricity
Chongqing. For the cooling season of AC in buildings, they found consumption prediction of a hotel. Outdoor air-dry bulb temper-
that 11 input variables have a great influence on cooling energy ature, outdoor air relative humidity, day of week, hour of day,
use. These variables mainly included outdoor air temperature, occupancy schedule, and total rooms booked were considered as
running time of the AC, and temperature differences between input variables. They found that the ANN model’s prediction re-
indoor air and set-point, whereas no building physical and ther- sults were slightly better (MAE: 9.18% vs. 9.31%) when the model
mophysical variables such as window–wall ratio and total heat used all variables (ten features) instead of only using the impor-
transfer coefficient of the envelope were considered. The predic- tant variables (four features). In their study, the computation time
tion performance is not well compared with the results in Wang was not provided; nevertheless, using the main input features in
et al. (2020a) XGBoost (CV-RMSE: 62%), RF (CV-RMSE: 64%), SVR the model is a good way to optimize the prediction model in
(CV-RMSE: 64%), and ANN (CV-RMSE: 73%). Wang et al. (2019b)
practice. Mena et al. (2014) predicted the short-term electricity
tested several popular models (i.e., XGBoost, RF, ANN, and SVR) to
demand of a bioclimatic building in Spain by using an ANN-based
predict the heating energy consumption of a residential building
model. To avoid the use of unimportant variables in the model,
in Tianjin, China. Six input features (i.e., outdoor dry bulb temper-
input feature variable selection was implemented ahead of the
ature, dew point temperature, outdoor relative humidity, wind
data training and testing. The order of features’ importance of
speed, solar radiation, and hour of day) were used in their models.
different input variables is as follows: solar radiation, outdoor
The CV-RMSE of the prediction results of the different models is
temperature, wind speed, outdoor humidity, and wind direction,
as follows: average RF: 5.0%, XGBoost: 5.8%, SVR: 6.2%, and ANN:
and a mean error of 11.48% has been realized.
7.0%.
In addition, lightGBM is a tree-based gradient boosting frame-
3.6. Recurrent neural network (RNN)
work similar to XGBoost. It was first released on October 17,
2016 as a part of Microsoft’s Distributed Machine Learning Toolkit
project (Anon, 2021i). It was designed to be fast and distributed Elman RNN, LSTM, and gated recurrent unit (GRU) are three
with the advantages of faster training speed and higher efficiency, common RNN algorithms. LSTM was designed for handling se-
lower memory usage, supporting parallel and GPU learning, and quential data and was first introduced by Hochreiter and Schnid-
capable of handling large-scale data. LightGBM uses histogram- huber in 1997 (Hochreiter and Schmidhuber, 1997). Compared
based algorithms to bucket continuous features into discrete bins with the traditional neural network, LSTM can pass the informa-
so that it can reduce communication cost and memory usage (Jin tion from the last steps to the next time step (i.e., backpropa-
and Agrawal, 2003; Ke et al., 2017). Thus, lightGBM is a promising gation). Fig. 7 shows this memory passing process. LSTM can be
algorithm for energy prediction in massive data sources. considered as an integration of many traditional neural networks.
Based on this feature, LSTM is an inborn network that processes
3.5. Artificial neural network (ANN) sequential data such as building load. It can solve complex and
long-time-lag tasks that traditional RNN algorithms can barely
ANN is a nonlinear statistical algorithm inspired by biological solve. In the study (Wang et al., 2020a), LSTM performed better
neural networks. It can deduce the complicated hidden relation- for short-term load prediction compared with LR, SVM, RF, and
ship between inputs and outputs. The principle of a typical one XGBoost.
2663
Y. Chen, M. Guo, Z. Chen et al. Energy Reports 8 (2022) 2656–2671
Fig. 12. Frequency of use of different algorithms for STLF and LTLF in 25 core
references.
Fig. 14. Scheme of a hybrid model combining building physics with the data-driven algorithm.
Table 2
Advantages and disadvantages of different prediction approaches.
Prediction approaches Advantages Disadvantages
• The relationship between input and output is • It requires a significant amount of efforts to
explainable input building information and parameters
‘‘White box’’ models • The parameters are easily modified • Computation cost is huge
• No historical data is required • It requires previous knowledge of thermal
dynamics and software
• No specific expertise is required • It requires large amounts of historical data
‘‘Black box’’ models
• Model development period and computational • It is easy overfitting
time are short • It is usually no-explainable
• The developed model is easy to be generalized
• Less data is required • It couples two distinct scientific domains
‘‘Grey box’’ models • Only bounds on physical parameters are • It is not easy to develop
required
et al., 2015). Table 2 summarizes the advantages and disadvan- random forest (RF), extreme gradient boosting (XGBoost), recur-
tages of these three approaches, and it benefits engineers to select rent neural network (RNN), and artificial neural network (ANN).
an appropriate model for their own needs. The last category is the hybrid model (i.e. grey box), which
relies on both physical model and data-driven model. Through
6. Conclusion a well-rounded study of these three approaches, the following
conclusions were drawn:
In this paper, we presented an in-depth review of the main ap- (1) It is better to use physical models for their reliability, inter-
proaches applied to building energy prediction. These approaches pretability and accuracy although the modeling process is time-
were clustered into three groups including the most commonly consuming and repetitive. Data-driven models are extremely use-
used methods. First, the building physics-based simulation tools ful when owners have sufficient historic data from existing build-
(i.e., white box) have been introduced. These tools can be divided ings, whereas there is no or insufficient information of design
into zonal and nodal approaches. Second, data-driven models building. Hybrid models make a great trade-off between physical
(i.e., black box) have been reviewed. There are six main algo- and data-driven models, and they could be a better option when
rithms: linear regression (LR), support vector machine (SVM), the required information is insufficient for the other two models.
2667
Y. Chen, M. Guo, Z. Chen et al. Energy Reports 8 (2022) 2656–2671
Table A.1
(2) STLF and LTLF are two different types of needs in building such as the number of floors, wall area, and glazing area, can be
energy management projects. STLF is crucially important for con- used to improve the prediction accuracy.
trol goals such as energy system optimization and DR, whereas
LTLF is of great interest for long-term energy planning, such as Declaration of competing interest
system planning and energy policy formulation.
(3) Simulation tools such as TRNSYS and Dymola are good at
The authors declare that they have no known competing finan-
establishing energy systems, while EnergyPlus is good at building
cial interests or personal relationships that could have appeared
envelop simulation. In addition, some data-driven models such as
to influence the work reported in this paper.
XGBoost are preferred for LTLF, while RNN is good at STLF, and
ANN and RF can be used for both time spans.
(4) Feature selection is the most popular strategy for a data- Acknowledgments
driven model. Outdoor dry bulb temperature, outdoor relative
humidity, solar radiation, day of week, and hour of day are the This work was funded by the China Postdoctoral Science
five most important and frequently used input features in data- Foundation (No. 2020M681347) and the National Natural Science
driven models. In addition, the physical information of buildings, Foundation of China (No. 51908006).
2668
Y. Chen, M. Guo, Z. Chen et al. Energy Reports 8 (2022) 2656–2671
Appendix Chong, A., Augenbroe, G., Yan, D., 2021. Occupancy data at different spatial res-
olutions: Building energy performance and model calibration. Appl. Energy
286.
See Table A.1.
Chu, Y., Xu, P., Li, M., Chen, Z., Chen, Z., Chen, Y., et al., 2020. Short-term
metropolitan-scale electric load forecasting based on load decomposition and
ensemble algorithms. Energy Build. 225, 110343.
References Chuan, L., Shaohui, Z., Yi, Q., Edgar, E., 2020. A systematic review of deep transfer
learning for machinery fault diagnosis. Neurocomputing 407, 121–135.
Abu-Et-Magd, M.A., Findla, R.D., 2003. A new approach using artificial neural Crawley, D.B., Lawrie, L.K., Winkelmann, F.C., Buhl, W.F., Huang, Y.J., Peder-
network and time series models for short-term load forecasting. In: Canadian sen, C.O., et al., 2001. Energyplus: creating a new-generation building energy
Conference on Electrical and Computer Engineering. Quebec, Canada. simulation program. Energy Build. 33, 319–331.
Adrian, C., Weili, X., Song, C., Ngoc-Tri, N., 2019. Continuous-time Bayesian Dawn, A., Nam, H.K., Joo-Ho, C., 2015. Practical options for selecting data-driven
calibration of energy models using BIM and energy data. Energy Build. 194. or physics-based prognostics algorithms with reviews. Reliab. Eng. Syst. Saf.
Ahmad, T., Chen, H., 2019. Nonlinear autoregressive and random forest ap- 133.
proaches to forecasting electricity load for utility energy management Ding, Y., Zhang, Q., Yuan, T., Yang, K., 2018. Model input selection for building
systems. Sustain. Cities Soc. 45, 460–473. heating load prediction: A case study for an office building in tianjin. Energy
Ahmad, M.W., Mourshed, M., Rezgui, Y., 2017. Trees vs neurons: Comparison Build. 159, 254–270.
between random forest and ANN for high-resolution prediction of building Do, H., Cetin, K.S., 2018. Evaluation of the causes and impact of outliers on
energy consumption. Energy Build. 147, 77–89. residential building energy use prediction using inverse modeling. Build.
Ahmad, T., Zhang, H., Yan, B., 2020. A review on renewable energy and electricity Environ. 138, 194–206.
requirement forecasting models for smart grid and buildings. Sustain. Cities Dong, B., Li, Z., Rahman, S.M.M., Vega, R., 2016. A hybrid model approach
Soc. 55, 102052. for forecasting future residential electricity consumption. Energy Build. 117,
Alipour, M., Aghaei, J., Norouzi, M., Niknam, T., Hashemi, S., Lehtonen, M., 2020. 341–351.
A novel electrical net-load forecasting model based on deep neural networks Duc-Hoc, T., Duc-Long, L., Chou, J., 2020. Nature-inspired metaheuristic ensemble
and wavelet transform integration. Energy 205, 118106. model for forecasting energy consumption in residential buildings. Energy
Amasyali, K., El-Gohary, N.M., 2018. A review of data-driven building energy 191, 116552.
consumption prediction studies. Renew. Sustain. Energy Rev. 81, 1192–1205. Dudek, G., 2015. Short-term load forecasting using random forests. In: Advances
Anon, 2006. On-Site Generation Simulation with EnergyPlus for Commercial in Intelligent Systems and Computing.
Buildings. Microgrids at Berkeley Lab. Dynamic Modeling Laboratory User Manual Volume 1, Dassault Systèmes A.B.,
Anon, 2021a. Kaggle Competitions. https://www.kaggle.com/competitions. 2017. Dynamic Modeling Laboratory User Manual Volume 1. Dassault
Anon, 2021b. Open source library for building energy and control systems. Systèmes AB.
Lawrence Berkeley National Laboratory. http://simulationresearch.lbl.gov/ Elmqvist, H., 1978. A Structured Model Language for Large Continuous Systems.
modelica/index.html. Lund University.
Anon, 2021c. The introdction of Modelica language, Modelica association. https: Fahrmeir, L., Thomas, K., Stefan, L., Brian, M., 2013. Regression Models. Springer,
//www.modelica.org/modelicalanguage. Berlin, Heidelberg.
Anon, 2021d. A transien system simulation program. http://www.trnsys.com/. Fan, C., Sun, Y., Xiao, F., Ma, J., Lee, D., Wang, J., et al., 2020. Statistical investiga-
Anon, 2021e. Building Energy Software Tools Directory, The United State tions of transfer learning-based methodology for short-term building energy
Department of Energy https://www.buildingenergysoftwaretools.com/. predictions. Appl. Energy 262, 114499.
Anon, 2021f. EnergyPlus Documentation - Tips & Tricks for Using EnergyPlus Fan, C., Xiao, F., Wang, S., 2014. Development of prediction models for next-day
Insider secrets to Using EnergyPlus. https://energyplus.net/sites/default/files/ building energy consumption and peak power demand using data mining
docs/site_v8.3.0/Tips_and_Tricks_Using_EnergyPlus/Tips_and_Tricks_Using_ techniques. Appl. Energy 127, 1–10.
EnergyPlus/index.html#hvac-sizing-equipment-simulation-and-controls. Fan, C., Xiao, F., Zhao, Y., 2017. A short-term building cooling load prediction
Anon, 2021g. BIM/GIS and Modelica Framework for building and community method using deep learning algorithms. Appl. Energy 195, 222–233.
energy system design and operation. https://ibpsa.github.io/project1/index. Foucquier, A., Robert, S., Suard, F., Stephan, L., Jay, A., 2013. State of the art in
html. building modelling and energy performances prediction: A review. Renew.
Anon, 2021h. Get Started with XGBoost. https://xgboost.readthedocs.io/en/latest/ Sustain. Energy Rev. 23, 272–288.
get_started.html. Fu, Y., Li, Z., Zhang, H., Xu, P., 2015. Using support vector machine to predict next
Anon, 2021i. Distributed machine learning toolkit—big data, big model, day electricity load of public buildings with sub-metering devices. Procedia
flexibility, efficiency. http://www.dmtk.io/. Eng..
Butch, Q., 2020. Machine Learning with Spark: Covers XGBoost, LightGBM, Fumo, N., Mago, P., Luck, R., 2010. Methodology to estimate building en-
Spark NLP, Distributed Deep Learning with Keras, and more. Springer ergy consumption using EnergyPlus benchmark models. Energy Build. 42,
Science+Business Media New York, New York, NY 10004. 2331–2337.
Cai, M., Pipattanasomporn, M., Rahman, S., 2019. Day-ahead building-level load Gao, Y., Ruan, Y., Fang, C., Yin, S., 2020. Deep learning and transfer learning mod-
forecasts using deep learning vs. traditional time-series techniques. Appl. els of energy consumption forecasting for a building with poor information
Energy 236, 1078–1088. data. Energy Build. 223, 110156.
Carriere, M., Schoenau, G.J., Besant, R.W., 1999. Investigation of some large Gassar, A.A.A., Yun, G.Y., Kim, S., 2019. Data-driven approach to prediction
building energy conservation opportunities using the DOE-2 model. Energy of residential energy consumption at urban scales in London. Energy 187,
Convers. Manage. 40, 861–872. 115973.
Cetin, K.S., Fathollahzadeh, M.H., Kunwar, N., Huyen, D., Tabares-Velasco, P.C., Gaurav, C., Joshua, N., Jibonananda, S., Piljae, I., Zheng, O.N., Vishal, G., 2016.
2019. Development and validation of an HVAC on/off controller in EnergyPlus Evaluation of autotune calibration against manual calibration of building
for energy simulation of residential and small commercial buildings. Energy energy models. Appl. Energy 182.
Build. 183, 467–483. Guo, J., Liu, R., Xia, T., Pouramini, S., 2021. Energy model calibration in an office
Chammas, M., Makhoul, A., Demerjian, J., 2019. An efficient data model for building by an optimization-based method. Energy Rep. 7.
energy prediction using wireless sensors. Comput. Electr. Eng. 76, 249–257. Guo, Y., Wang, J., Chen, H., Li, G., Liu, J., Xu, C., et al., 2018. Machine learning-
Chargui, R., Sammouda, H., Farhat, A., 2012. Geothermal heat pump in heating based thermal response time ahead energy demand prediction for building
mode: Modeling and simulation on TRNSYS. Int. J. Refrig. 35, 1824–1832. heating systems. Appl. Energy 221, 16–27.
Chen, Z., Chen, Y., Xiao, T., Wang, H., Hou, P., 2021. A novel short-term load fore- Hafner, I., Roessler, M., Heinzl, B., Koerner, A., Landsiedl, M., Breitenecker, F.,
casting framework based on time-series clustering and early classification 2014. Investigating communication and step-size behaviour for co-simulation
algorithm. Energy Build. 251. of hybrid physical systems. J. Comput. Sci.-Neth. 5, 427–438.
Chen, Y., Chen, Z., Xu, P., Li, W., Sha, H., Yang, Z., et al., 2019. Quantification of Hagan, M.T., Behr, S.M., 1989. The time series approach to short-term load
electricity flexibility in demand response: Office building case study. Energy forecasting. Trans. Power Syst. PWRS-2.
188, 116054. Harish, V.S.K.V., Kumar, A., 2016. A review on modeling and simulation of
Chen, Y., Tan, H., 2017. Short-term prediction of electric demand in building building energy systems. Renew. Sustain. Energy Rev. 56, 1272–1292.
sector via hybrid support vector regression. Appl. Energy 204, 1363–1374. Hassid, S., 1985. A linear-model for passive solar calculations-evaluation of
Chen, Y., Xu, P., Chu, Y., Li, W., Wu, Y., Ni, L., et al., 2017. Short-term performance. Build. Environ. 20, 53–59.
electrical load forecasting using the support vector regression (SVR) model He, Y., Liu, R., Li, H., Wang, S., Lu, X., 2017. Short-term power load probability
to calculate the demand response baseline for office buildings. Appl. Energy density forecasting method using kernel-based support vector quantile
195, 659–670. regression and copula theory. Appl. Energy 185, 254–266.
Chen, Y., Xu, P., Gu, J., Schmidt, F., Li, W., 2018. Measures to improve energy He, F., Zhou, J., Mo, L., Feng, K., Liu, G., He, Z., 2020. Day-ahead short-term load
demand flexibility in buildings for demand response (DR): A review. Energy probability density forecasting method with a decomposition-based quantile
Build. 177, 125–139. regression forest. Appl. Energy 262, 114396.
2669
Y. Chen, M. Guo, Z. Chen et al. Energy Reports 8 (2022) 2656–2671
Hensen, J.L.M., Lamberts, R., 2012. Building Performance Simulation for Design Mazzeo, D., Matera, N., Cornaro, C., Oliveti, G., Romagnoni, P., De Santoli, L., 2020.
and Operation: Taylor and Francis. CRC Press. Energyplus, IDA ice and TRNSYS predictive simulation accuracy for building
Hochreiter, S., Schmidhuber, J., 1997. Long short-term memory. Neural Comput. thermal behaviour evaluation by using an experimental campaign in solar
9, 1735–1780. test boxes with and without a PCM module. Energy Build. 212, 109812.
Hong, T., Fan, S., 2016. Probabilistic electric load forecasting: A tutorial review. Mena, R., Rodriguez, F., Castilla, M., Arahal, M.R., 2014. A prediction model based
Int. J. Forecast. 32, 914–938. on neural networks for the energy consumption of a bioclimatic building.
Hong, T., Pinson, P., Fan, S., 2014. Global energy forecasting competition 2012. Energy Build. 82, 142–155.
Int. J. Forecast. 30, 357–363. Mohammad, H.S., Usman, A., Elen, I., James, O., 2020. A framework for
uncertainty quantification in building heat demand simulations using
Hou, D., Hassan, I.G., Wang, L., 2021. Review on building energy model
reduced-order grey-box energy models. Appl. Energy 275, 115141.
calibration by Bayesian inference. Renew. Sustain. Energy Rev. 143.
Moon, J., Kim, Y., Son, M., Hwang, E., 2018. Hybrid short-term load forecasting
Inard, C., Bouia, H., Dalicieux, P., 1996. Prediction of air temperature distribution
scheme using random forest and multilayer perceptron. Energies 11, 328312.
in buildings with a zonal model. Energy Build. 24, 125–132.
Mun, S., Kwak, Y., Huh, J., 2019. A case-centered behavior analysis and operation
Jani, D.B., Bhabhor, K., Dadi, M., Doshi, S., Jotaniya, P.V., Ravat, H., et al., 2020.
prediction of AC use in residential buildings. Energy Build. 188, 137–148.
A review on use of TRNSYS as simulation tool in performance prediction of Murphy, M.D., O’Sullivan, P.D., Da Graça, G.C., O’Donovan, A., 2021. Development,
desiccant cooling cycle. J. Therm. Anal. Calorim. 140, 2011–2031. calibration and validation of an internal air temperature model for a
Ji, Y., 2016. FDD Method of HVAC System in Large Public Buildings with naturally ventilated nearly zero energy building: Comparison of model types
Submetering Electricity Data (in Chinese). Tongji University, Shanghai. and calibration methods. Energies 14.
Ji, Y., Xu, P., Duan, P., Lu, X., 2016. Estimating hourly cooling load in commercial Nageler, P., Schweiger, G., Pichler, M., Brandl, D., Mach, T., Heimrath, R., et al.,
buildings using a thermal network model and electricity submetering data. 2018. Validation of dynamic building energy simulation tools based on a real
Appl. Energy 169, 309–323. test-box with thermally activated building systems (TABS). Energy Build. 168,
Jin, R.M., Agrawal, G., 2003. Communication and memory efficient parallel 42–55.
decision tree construction. pp. 119–129. Neto, A.H., Fiorelli, F.A.S., 2008. Comparison between detailed model simulation
Ke, G., Meng, Q., Finley, T., Wang, T., Chen, W., Ma, W., et al., 2017. LightGBM: and artificial neural network for forecasting building energy consumption.
A highly efficient gradient boosting decision tree. In: 31st Conference on Energy Build. 40, 2169–2176.
Neural Information Processing Systems. Long Beach, CA, USA. Nilsson, N.J., 1965. Learning Machines: Foundations of Trainable Pattern-
Ke, M., Yeh, C., Jian, J., 2013. Analysis of building energy consumption parameters Classifying Systems. McGraw-Hill, New York.
and energy savings measurement and verification by applying eQUEST Nouidui, T.S., Phalak, K., Wangda, Z., Wetter, M., 2012. Validation and application
software. Energy Build. 61, 100–107. of the room model of the modelica buildings library. In: Proceedings of the
Kiartzis, S.J., Zoumas, C.E., Theocharis, J.B., Bakirtzis, A.G., Petridis, V., 1997. Short- 9th International Modelica Conference. Munich, Germany.
term load forecasting in an autonomous power system using artificial neural Qian, F., Gao, W., Yang, Y., Yu, D., 2020. Potential analysis of the transfer
networks. Ieee T Power Syst. 12, 1591–1596. learning model in short and medium-term forecasting of building HVAC
Kim, E., He, X., Roux, J., Johannes, K., Kuznik, F., 2019. Fast and accurate energy consumption. Energy 193, 315–324.
district heating and cooling energy demand and load calculations using Quesada, B., Sanchez, C., Canada, J., Royo, R., Paya, J., 2011. Experimental results
reduced-order modelling. Appl. Energy 238, 963–971. and simulation with TRNSYS of a 7.2 kWp grid-connected photovoltaic
system. Appl. Energy 88, 1772–1783.
Kim, J.B., Jeong, W., Clayton, M.J., Haberl, J.S., Yan, W., 2015. Developing a
Ribeiro, M., Grolinger, K., ElYamany, H.F., Higashino, W.A., Capretz, M.A.M., 2018.
physical BIM library for building thermal energy simulation. Autom. Constr.
Transfer learning with seasonal and trend adjustment for cross-building
50, 16–28.
energy forecasting. Energy Build. 165, 352–363.
Kumar, S., Pal, S.K., Singh, R.P., 2018a. Intra ELM variants ensemble based model
Sala-Cardoso, E., Delgado-Prieto, M., Kampouropoulos, K., Romeral, L., 2018.
to predict energy performance in residential buildings. Sustain. Energy Grids
Activity-aware HVAC power demand forecasting. Energy Build. 170, 15–24.
Netw. 16, 177–187.
Salkuti, S.R., 2019. Day-ahead thermal and renewable power generation
Kumar, S., Pal, S.K., Singh, R.P., 2018b. A novel method based on extreme learning scheduling considering uncertainty. Renew. Energy 131, 956–965.
machine to predict heating and cooling load through design and structural Seyedzadeh, S., Rahimian, F.P., Rastogi, P., Glesk, I., 2019. Tuning machine
attributes. Energy Build. 176, 275–286. learning models for prediction of building energy loads. Sustain. Cities Soc.
Kusiak, A., Li, M., Zhang, Z., 2010. A data-driven approach for steam load 47, 101484.
prediction in buildings. Appl. Energy 87, 925–933. Siddharth, V., Ramakrishna, P.V., Geetha, T., Sivasubramaniam, A., 2011. Auto-
Lahouar, A., Slama, J.B.H., 2015. Day-ahead load forecast using random forest matic generation of energy conservation measures in buildings using genetic
and expert input selection. Energy Convers. Manage. 103, 1040–1051. algorithms. Energy Build. 43, 2718–2726.
Leung, M.C., Tse, N.C.F., Lai, L.L., Chow, T.T., 2012. The use of occupancy space Somu, N., Raman, G.M.R., Ramamritham, K., 2020. A hybrid model for building
electrical power demand in building cooling load prediction. Energy Build. energy consumption forecasting using long short term memory networks.
55, 151–163. Appl. Energy 261, 114131.
Li, Q., Meng, Q., Cai, J., Yoshino, H., Mochida, A., 2009. Applying support vector Son, H., Kim, C., 2015. Forecasting short-term electricity demand in residential
machine to predict hourly cooling load in the building. Appl. Energy 86, sector based on support vector regression and fuzzy-rough feature selection
2249–2256. with particle swarm optimization. 118, pp. 1162–1168.
Li, Y., O’Neill, Z., Zhang, L., Chen, J., Im, P., DeGraw, J., 2021. Grey-box modeling Sun, Y., Haghighat, F., Fung, B.C.M., 2020. A review of the -state-of-the-art in
and application for building energy simulations - A critical review. Renew. data -driven approaches for building energy prediction. Energy Build. 221,
Sustain. Energy Rev. 146. 110022.
Liu, C., Sun, B., Zhang, C., Li, F., 2020. A hybrid prediction model for residential Swati, Z.N.K., Zhao, Q., Kabir, M., Ali, F., Ali, Z., Ahmed, S., et al., 2019. Brain
electricity consumption using holt-winters and extreme learning machine. tumor classification for MR images using transfer learning and fine-tuning.
Appl. Energy 275. Comput. Med. Imaging Graph. 75, 34–46.
Trcka, M., Hensen, J.L.M., 2010. Overview of HVAC system simulation. Autom.
Lu, H., Cheng, F., Ma, X., Hu, G., 2020. Short-term prediction of building energy
Constr. 19, 93–99.
consumption employing an improved extreme gradient boosting model: A
Tsanas, A., Xifara, A., 2012. Accurate quantitative estimation of energy per-
case study of an intake tower. Energy 203, 117756.
formance of residential buildings using statistical machine learning tools.
Lu, Y., Meng, L., 2020. A simplified prediction model for energy use of air
Energy Build. 49, 560–567.
conditioner in residential buildings based on monitoring data from the cloud
Tuhus-Dubrow, D., Krarti, M., 2010. Genetic-algorithm based approach to opti-
platform. Sustain. Cities Soc. 60, 102194.
mize building envelope design for residential buildings. Build. Environ. 45,
Luo, X.J., Lukumon, O.O., Anuoluwapo, O.A., Olugbenga, O.A., Hakeem, A.O., 1574–1581.
Ashraf, A., 2020. Feature extraction and genetic algorithm enhanced adaptive U.S. Department of Energy, Energy Efficiency and Renewable Energy Office,
deep neural network for energy consumption prediction in buildings. Renew. Building Technology Program. https://energyplus.net/downloads.
Sustain. Energy Rev. 131, 109980. Vapnik, V., 2013. The Nature of Statistical Learning Theory. Springer, New York.
Luo, X.J., Oyedele, L.O., Ajayi, A.O., Monyei, C.G., Akinade, O.O., Akanbi, L.A., 2019. Violidakis, I., Zeneli, M., Atsonios, K., Strotos, G., Nikolopoulos, N., Karellas, S.,
Development of an IoT-based big data platform for day-ahead prediction of 2020. Dynamic modelling of an ultra high temperature PCM with combined
building heating and cooling demands. Adv. Eng. Inform. 41, 100926. heat and electricity production for application at residential buildings. Energy
Magnier, L., Haghighat, F., 2010. Multiobjective optimization of building design Build. 222, 110067.
using TRNSYS simulations, genetic algorithm, and artificial neural network. Vrablecova, P., Ezzeddine, A.B., Rozinajova, V., Sarik, S., Sangaiah, A.K., 2018.
Build. Environ. 45, 739–746. Smart grid load forecasting using online support vector regression. Comput.
Mandal, P., Senjyu, T., Funabashi, T., 2006. Neural networks approach to forecast Electr. Eng. 65, 102–117.
several hour ahead electricity prices and loads in deregulated market. Energy Wang, C., Grozev, G., Seo, S., 2012. Decomposition and statistical analysis for
Convers. Manage. 47, 2128–2142. regional electricity demand forecasting. Energy 41, 313–325.
2670
Y. Chen, M. Guo, Z. Chen et al. Energy Reports 8 (2022) 2656–2671
Wang, Z., Hong, T., Piette, M.A., 2019a. Predicting plug loads with occupant count Winkelmann, F.C., Selkowitz, S., 1985. Daylighting simulation in the DOE-2
data through a deep learning approach. Energy 181, 29–42. building energy analysis program. Energy Build. 8, 271–286.
Wang, Z., Hong, T., Piette, M.A., 2020a. Building thermal load prediction through Wu, J., Zhao, Z., Sun, C., Yan, R., Chen, X., 2020. Few-shot transfer learning for
shallow machine learning and deep learning. Appl. Energy 263, 114683. intelligent fault diagnosis of machine. Measurement.
Wang, S., Liu, X., Gates, S., 2015. An introduction of new features for conventional Xing, J., Ren, P., Ling, J., 2015. Analysis of energy efficiency retrofit scheme
and hybrid GSHP simulations in eQUEST 3.7. Energy Build. 105, 368–376. for hotel buildings using equest software: A case study from tianjin. China
Wang, R., Lu, S., Feng, W., 2020b. A novel improved model for building energy Energy Build. 87, 14–24.
consumption prediction based on model integration. Appl. Energy 262, Xu, X., Taylor, J.E., Pisello, A.L., Culligan, P.J., 2012. The impact of place-based af-
114561. filiation networks on energy conservation: An holistic model that integrates
Wang, R., Lu, S., Li, Q., 2019b. Multi-criteria comprehensive study on predictive the influence of buildings, residents and the neighborhood context. Energy
algorithm of hourly heating energy consumption for residential buildings. Build. 55, 637–646.
Sustain. Cities Soc. 49, 101623. Yang, Y., Che, J., Deng, C., Li, L., 2019. Sequential grid approach based support
Wang, Z., Srinivasan, R.S., 2017a. A review of artificial intelligence based building vector regression for short-term electric load forecasting. Appl. Energy 238,
energy use prediction: Contrasting the capabilities of single and ensemble 1010–1021.
prediction models. Renew. Sustain. Energy Rev. 75, 796–808. Zhai, Z.J., Johnson, M., Krarti, M., 2011. Assessment of natural and hybrid
Wang, Z., Srinivasan, R.S., 2017b. A review of artificial intelligence based building ventilation models in whole-building energy simulations. Energy Build. 43,
energy use prediction: Contrasting the capabilities of single and ensemble 2251–2261.
Zhang, G., Tian, C., Li, C., Zhang, J.J., Zuo, W., 2020. Accurate forecasting of
prediction models. Renew. Sustain. Energy Rev. 75, 796–808.
building energy consumption via a novel ensembled deep learning method
Wei, L., Tian, W., Zup, J., Yang, Z., Liu, Y., Yang, S., 2016. Effects of building form
considering the cyclic feature. Energy 201, 117531.
on energy use for buildings in Cold Climate Regions. 146, pp. 181–188.
Zhang, L., Wen, J., 2019a. A systematic feature selection procedure for short-term
Wei, Y., Xia, L., Pan, S., Wu, J., Zhang, X., Han, M., et al., 2019. Prediction
data-driven building energy forecasting model development. Energy Build.
of occupancy level and energy consumption in office building using blind
183, 428–442.
system identification and neural networks. Appl. Energy 240, 276–294.
Zhang, L., Wen, J., 2019b. A systematic feature selection procedure for short-term
Westphal, F.S., Lamberts, R., 2005. Building simulation calibration using sensi-
data-driven building energy forecasting model development. Energy Build.
tivity analysis. In: Proceedings of the 9th International IBPSA Conference.
183, 428–442.
Montreal, Canada.
Zhe, W., Tianzhen, H., Mary, A.P., 2020. Building thermal load prediction through
Wetter, M., Christoph, H., 2006. Modelica versus TRNSYS–a comparison between shallow machine learning and deep learning. Appl. Energy 263.
an equation-based and a procedural modeling language for building energy Zhou, Y., Zheng, S., 2020. Machine-learning based hybrid demand-side controller
simulation. In: Proceedings of the 2nd SimBuild Conference. Cambridge, MA, for high-rise office buildings with high energy flexibilities. Appl. Energy 262,
USA. 114416.
2671