Goyal 2020
Goyal 2020
Abstract—The utilization of energy in the most efficient 32% energy consumption. The third major area is Transport
manner is an urgent demand of the modern era, as energy is with 28% energy consumption. These studies motivated to
being used in each and every field. Globally buildings consume devise solutions for energy optimization in buildings.
the largest percentage of energy, and HVAC system consumes
most of the energy in a building. HVAC maintains desired Further studies show that within buildings, HVAC
temperature within a building by meeting Heating load and (Heating, Ventilation and Air conditioning) system is the
Cooling load requirements. These requirements should be less largest energy consumer accounting for 40-50%
to lessen energy consumption and achieve energy efficiency. consumption [6]-[7]. HVAC consumes energy to maintain
Some of the characteristics of buildings greatly affect the the desired temperature within a building and controlling
Heating load and Cooling load requirements. This research humidity. It is responsible for meeting the Heating load and
analyses eight important characteristics of a building- Relative Cooling load of a building. These two are related to the
Compactness, Surface Area, Wall Area, Roof Area, Overall thermal load of the building. When the building is cold, the
Height, Orientation, Glazing Area, Glazing Area Distribution thermal load is converted into Heating load and when the
and uses them for predicting Heating load and Cooling load. building is hot, thermal load is converted into Cooling load
Experiments have been performed using three Machine [8].
Learning algorithms- Multiple Linear Regression, K Nearest
Neighbours, Support Vector Regression and three Ensemble The Heating load and Cooling load of a building directly
algorithms- Random Forests, Gradient Boosting Machines, affect its energy performance. It requires analysis of factors
Extreme Gradient Boosting. Models have been evaluated using that affect the Heating and Cooling load. Studies reveal that
performance metrics: RMSE, MSE, MAE, R Squared and various characteristics of a building and its structure affect
Accuracy. Results show that Ensemble techniques outperform Heating and Cooling loads to a major extent [9]- [10].
Machine Learning techniques with an appreciable margin.
This paper focusses on several important features of
Keywords—Machine Learning; Energy Optimization; buildings, analyses them and uses them for predicting the
Random Forests; Multiple Linear Regression; Gradient Boosting Heating load and Cooling load using three well known
Machines; Extreme Gradient Boosting; K nearest neighbors; Machine Learning techniques and three Ensemble
Support Vector Regression techniques.
The paper is organized as follows: Section 1 introduces
I. INTRODUCTION the problem. Section 2 describes the Machine Learning
The use of energy is widespread and reached almost techniques and Ensemble techniques used in research. The
every known area, from Industries to means of transport, Literature is reviewed in Section 3. The Methodology of the
from offices to households. In the modern world, no aspect work done is described in Section 4. Section 5 explains the
of life can be thought of which doesn’t consume energy. experiments performed and results obtained. Section 6
Such massive usage of energy leads us to work towards its concludes the paper.
conservation and optimization. Optimal measures should be
taken so that energy is used optimally to safeguard the II. LITERATURE SURVEY
environment [1]. Studies done by experts all over the world This section reviews the analysis and findings of several
show that the highest percentage of energy is consumed by authors worldwide in the field of energy consumption and
buildings [2]-[5]. Buildings consume about 40% of the total optimization in buildings using various individual Machine
energy consumed in the world. After buildings, the second Learning and Ensemble techniques as shown in TABLE I.
major energy consumer is Industry which is reported for
Authorized licensed use limited to: UNIVERSITY OF WESTERN ONTARIO. Downloaded on October 11,2020 at 02:31:16 UTC from IEEE Xplore. Restrictions apply.
by MAPE
Jain R. K. et. Al., Support Vector Regression Developed a sensor-based model for Spatial granularity impact the prediction
2014 [2] forecasting energy consumption in multi- power of the model significantly
family residential buildings
Wei X. et. Al., 2015 MLP ensemble Model developed for HVAC energy Energy savings were more when Internal Air
[13] optimization Quality was taken into consideration
Park H. S. et. Al., Decision Tree Developed a new energy benchmark to Proposed benchmark better than
2016 [14] improve the operational rating system of conventional and baseline system
office buildings
Candanedo L.M. et. MLR, SVM, RF, GBM Model developed for predicting energy GBM outperformed other models;
Al., 2017 [15] usage by appliances in residential building atmospheric pressure is an important
predictor
Manjarres D. et. Al., Random Forest A framework developed to optimize HVAC Energy consumption reduced by 48% for
2017 [16] energy consumption heating and 39% for cooling
Peng Y. et. Al., 2018 K Nearest Neighbour A model developed to optimize energy Energy savings of 7-52% obtained
[17] consumption in building space according to
occupancy
Gallagher C. V. et. Al. LSR, DT, KNN, ANN, SVM ML algorithms used for measurement and Error reduced by 51.09%
, 2018 [18] verification of energy saved in Industrial
buildings
Deb C. et. Al., 2018 MLR, ANN Prediction models developed for energy ANN more accurate with MAPE of 14.8%
[19] savings in HVAC in office buildings
Nayak S. C., 2019 ARIMA, RBFNN, MLP, Developed a new model which linearly Model developed was better in terms of
[20] SVM, FLANN combined the five ML models for better feasibility and performance
accuracy
Sethi J.K & Mittal M., DT, Naïve Bayes, SVM, RF, ML techniques applied to predict accurate Ensemble techniques outperformed others
2019 [21] LR, Stacking ensemble, Air quality index
Voting ensemble
1034
Authorized licensed use limited to: UNIVERSITY OF WESTERN ONTARIO. Downloaded on October 11,2020 at 02:31:16 UTC from IEEE Xplore. Restrictions apply.
IV. METHODOLOGY B. Data Analysis and Pre-processing
This section explains the workflow approach followed in Data pre-processing is a process that consists of checking
this research. Fig. 1 shows the steps of the methodology the dataset for missing values and filling them with
followed. appropriate values, detecting and removing any outliers,
converting it into a particular form suitable for applying
algorithm, attribute selection etc. [12].
The dataset used in this research has been specifically
created by a Civil Engineer for performing energy analysis
in different kinds of buildings, so much pre-processing was
not required. Pearson Correlation Coefficient was calculated
to derive the strength of the relationship among several
variables of the dataset. This coefficient facilitates the
derivation of optimal filters for reducing noise. A zero value
for the Pearson Correlation Coefficient means the variables
are not correlated i.e. they are independent. A value closer
to 1 indicates a high correlation among variables [29]. The
obtained Correlation Coefficient values are represented in
matrix form in Fig. 2.
1035
Authorized licensed use limited to: UNIVERSITY OF WESTERN ONTARIO. Downloaded on October 11,2020 at 02:31:16 UTC from IEEE Xplore. Restrictions apply.
V. EXPERIMENTS AND RESULTS
All the experiments of the research were performed using
Python. Three Machine Learning algorithms namely MLR,
KNN and SVR and three Ensemble techniques namely, RF,
GBM, and XGBoost have been experimented on the
collected dataset. The results of the experiments are
described in Table III and Table IV.
1036
Authorized licensed use limited to: UNIVERSITY OF WESTERN ONTARIO. Downloaded on October 11,2020 at 02:31:16 UTC from IEEE Xplore. Restrictions apply.
TABLE III. RESULTS OF ENSEMBLE ALGORITHMS
1037
Authorized licensed use limited to: UNIVERSITY OF WESTERN ONTARIO. Downloaded on October 11,2020 at 02:31:16 UTC from IEEE Xplore. Restrictions apply.