Modeling and Analysis of Commercial Building Electrical Loads for Demand Side Management

A Thesis
Submitted to the Faculty
of
Drexel University
by
Jonathan Berardino
in partial fulfillment of the
requirements for the degree
of
Doctor of Philosophy
December 2016

© Copyright 2016
Jonathan Berardino. All Rights Reserved.
ACKNOWLEDGMENTS
First, I would like to thank my advisor Dr. Chika Nwankpa for your time,
guidance, and support in performing the research that led to this thesis. I am forever
grateful to you for accepting me as a student and providing me with the opportunity to pursue this research.
I would like to thank Dr. Miu, Dr. Niebur, Dr. Benson, and Dr. Kwatny for
serving on my thesis committee. Your comments have greatly improved the final version
of this thesis, and I appreciate the advice and insights you have provided, which will enhance my future work.
Thank you to my fellow members of the Center for Electric Power Engineering. I
have been very fortunate to work with a group of people who have been valuable
colleagues and (more importantly) friends. I wish you all the best and look forward to seeing what you accomplish next.
Lastly, and only because the last paragraph is always the most important, I would
like to thank my wife Lindsay. Pursuing this degree has taken up enormous amounts of
my time and nobody has felt that impact more than you. Thank you for your patience and
selflessness over the years. This would not have happened without you.
TABLE OF CONTENTS
1. INTRODUCTION
… RESOURCES
… MANAGEMENT RESEARCH
… INTERVALS
… PROCEDURE
LIST OF TABLES
Table 3.1. Indicator variables representing the day of the week
Table 3.3. Quantiles of day-ahead MAPE performance across all out-of-sample forecasts
Table 3.5. MAPE (%) performance broken up by weekdays and weekends/holidays
Table 3.6. Hourly MAPE (%) across all out-of-sample forecasts
Table 3.8. Hourly SDE (kW) across all out-of-sample forecasts
Table 4.1. % deviation in empirical coverage compared to the nominal coverage rate
Table 5.1. Parameter estimation results for the test described in Section 5.2.1
Table 6.4. Load footprint and flexibility for each building used in this case study
Table 6.5. Base building parameter values used in the simulations
Table 6.6. Optimal load schedule in kW for Test 1 (Scenario 1)
Table 6.7. Optimal load schedule in kW for Test 2 (Scenario 1 with reduced load …)
Table 6.8. Optimal load schedule in kW for Test 4 (Scenario 2)
Table 6.9. Optimal load schedule in kW for Test 5 (Scenario 3 – Pthresh = 150 kW)
Table 6.10. Optimal load schedule in kW for Test 6 (Scenario 3 – Pthresh = 75 kW)
Table 6.11. Optimal load schedule in kW for Test 6 (Scenario 4)
Table 6.12. Optimal load schedule in kW for Test 8: +10 kW uniform load forecast uncertainty
Table B.1. MAPE results at 5-minute resolution for all 116 day-ahead forecasts
LIST OF FIGURES
Figure 2.2. Breakdown of commercial building energy use in the United States (2014) [19]
Figure 2.3. Linear correlation between demand and several variables: OAT (top), OAT-Temp (middle), and OAT-Stpt (bottom)
Figure 2.4. Temporal correlation for 24 hours of building demand. Sample size is 4 months of weekdays (88 days – 2011)
Figure 2.5. Example of a baseline demand profile during a DSM event
Figure 3.2. Breakup of data set and how each portion is used in the forecasting process
Figure 3.3. Daily building demand curves for the 116 days used for out-of-sample testing. The thicker line shows the mean daily profile
Figure 3.5. A general multi-layer, feed-forward artificial neural network with N hidden layers
Figure 3.7. Hourly MAPE for all forecasts generated by the MLR models
Figure 3.8. Hourly MAPE for all forecasts generated by the NN models
Figure 3.9. Hourly MAPE for all forecasts generated by the SVM models
Figure 3.10. Hourly MAPE for all forecasts generated by the SAM models
Figure 3.11. Box plots of hourly MAPE results: (top) MLR without building measurements, (bottom) with building measurements
Figure 3.13. Box plots of hourly MAPE results: (top) SVM without building measurements, (bottom) with building measurements
Figure 3.14. Box plots of hourly MAPE results: (top) SAM without building measurements, (bottom) with building measurements
Figure 3.15. Hourly bias for all forecasts generated by the MLR models
Figure 3.16. Hourly bias for all forecasts generated by the NN models
Figure 3.17. Hourly bias for all forecasts generated by the SVM models
Figure 3.18. Hourly bias for all forecasts generated by the SAM models
Figure 3.19. Hourly standard deviation of the forecast error for all forecasts generated by the MLR models
Figure 3.20. Hourly standard deviation of the forecast error for all forecasts generated by the NN models
Figure 3.21. Hourly standard deviation of the forecast error for all forecasts generated by the SVM models
Figure 3.22. Hourly standard deviation of the forecast error for all forecasts generated by the SAM models
Figure 3.23. Empirical distribution of forecast errors for the NN model without building measurements
Figure 3.24. Empirical distribution of forecast errors for the NN model with building measurements
Figure 3.25. Frequency of forecast errors within a ±10 kW error margin for the MLR models
Figure 3.26. Frequency of forecast errors within a ±30 kW error margin for the MLR models
Figure 3.27. Frequency of forecast errors within a ±10 kW error margin for the NN models
Figure 3.28. Frequency of forecast errors within a ±30 kW error margin for the NN models
Figure 3.29. Frequency of forecast errors within a ±10 kW error margin for the SVM models
Figure 3.30. Frequency of forecast errors within a ±30 kW error margin for the SVM models
Figure 3.31. Frequency of forecast errors within a ±10 kW error margin for the SAM models
Figure 3.32. Frequency of forecast errors within a ±30 kW error margin for the SAM models
Figure 3.33. Hourly skew of the forecast error for all forecasts generated by the MLR models
Figure 3.34. Hourly skew of the forecast error for all forecasts generated by the NN models
Figure 3.35. Hourly skew of the forecast error for all forecasts generated by the SVM models
Figure 3.36. Hourly skew of the forecast error for all forecasts generated by the SAM models
Figure 3.37. Hourly kurtosis of the forecast error for all forecasts generated by the MLR models
Figure 3.38. Hourly kurtosis of the forecast error for all forecasts generated by the NN models
Figure 3.39. Hourly kurtosis of the forecast error for all forecasts generated by the SVM models
Figure 3.40. Hourly kurtosis of the forecast error for all forecasts generated by the SAM models
Figure 4.1. Example of the difference between a 95% confidence interval (CI) and a 95% prediction interval
Figure 4.2. Seasonal block segmentation (a) and double seasonal block segmentation (b)
Figure 4.3. Block bootstrapped time series (a) and double seasonal block bootstrapped time series (b)
Figure 4.9. Average hourly skill score for the MLR models. Nominal coverage rate = 25%
Figure 4.10. Average hourly skill score for the MLR models. Nominal coverage rate = 75%
Figure 4.11. Average hourly skill score for the NN models. Nominal coverage rate = 25%
Figure 4.12. Average hourly skill score for the NN models. Nominal coverage rate = 75%
Figure 4.13. Average hourly skill score for the SVM models. Nominal coverage rate = 25%
Figure 4.14. Average hourly skill score for the SVM models. Nominal coverage rate = 75%
Figure 4.15. Average hourly skill score for the SAM models. Nominal coverage rate = 25%
Figure 4.16. Average hourly skill score for the SAM models. Nominal coverage rate = 75%
Figure 5.1. Electrical load data in %FLA and chilled water outlet temperature data in °F
Figure 5.2. Electric load (blue line) and temperature (red line) response to raising the chilled water temperature setpoint
Figure 5.3. Response to a step change in the chilled water temperature setpoint value
Figure 5.4. Application of the curve fit approach on the collected data for parameter estimation
Figure 6.1. Sample building load forecast and dispatched load
Figure 6.2. Relationship between load margin and building load forecast uncertainty
Figure 6.4. LMP data for Monday July 21st, 2014
Figure 6.5. Dispatch schedule for building 7 with decreasing load flexibility (no dispatch …)
Figure 6.6. Dispatch schedule for building 7 with decreasing load footprint (no dispatch …)
Figure 6.7. Cost savings as a function of load uncertainty. Break-even line shown in blue
Figure 6.8. Decreasing number of intervals where the DR revenue constraint can be met
Figures B.1–B.24. MLR empirical error distributions: (L) no building measurements, (R) with building measurements
Figures B.25–B.48. NN empirical error distributions: (L) no building measurements, (R) with building measurements
Figures B.49–B.72. SVM empirical error distributions: (L) no building measurements, (R) with building measurements
Figures B.73–B.96. SAM empirical error distributions: (L) no building measurements, (R) with building measurements
Figures B.97–B.116. Average hourly skill scores for the MLR models at various nominal coverage rates
Figures B.117–B.134. Average hourly skill scores for the NN models at various nominal coverage rates
Figures B.135–B.153. Average hourly skill scores for the SVM models at various nominal coverage rates
Figures B.154–B.172. Average hourly skill scores for the SAM models at various nominal coverage rates
ABSTRACT
Modeling and Analysis of Commercial Building Electrical Loads for Demand Side
Management
Jonathan Berardino
Chikaodinaka Nwankpa, Ph.D.
In recent years there has been a push in the electric power industry for more
customer involvement in the electricity markets. Traditionally the end user has played a
passive role in the planning and operation of the power grid. However, many energy
markets have begun opening up opportunities to consumers who wish to commit a certain
amount of their electrical load under various demand side management programs. The
potential benefits of more demand participation include reduced operating costs and new
revenue opportunities for the consumer, as well as more reliable and secure operations for
the utilities. The management of these load resources creates challenges and
opportunities to the end user that were not present in previous market structures.
This work examines the behavior of commercial-type building electrical loads and
their capacity for supporting demand side management actions. This work is motivated
by the need for accurate and dynamic tools to aid in the advancement of demand side
operations. A dynamic load model is proposed for capturing the response of controllable building loads, and building-specific load forecasting methods are developed that incorporate building management system information. These approaches are tested using Drexel University building data. The application of building-specific load forecasts and dynamic load modeling to the optimal scheduling of building load resources for demand side management is also examined.
1. INTRODUCTION
1.1. OVERVIEW
This thesis presents a study of commercial buildings as potential resources for demand
side management. Within this work, methods for modeling and forecasting building electric load
behavior are presented. These problems are examined through extensive case studies involving
Drexel University building data. The resulting modeling and forecasting approaches are applied
to the problem of optimally scheduling building loads as part of several demand side management scenarios, subject to real-world building operational constraints and building load variability.
Power system operators are tasked with ensuring adequate generation is available to meet
projected demand. Traditionally this requirement has been met by controlling the supply side (i.e., generators) of the power system; limited priority has been placed on involving the demand side.
However, increased investment in the development of a “Smart Grid” [1] is driving new
opportunities for the demand side to take a more active role in power system planning and
operations. Demand Side Management (DSM), also often referred to as Demand Response (DR), is a class of programs designed to motivate end-use customers to modify their electricity usage. This could be through the shifting of electricity use to another time (e.g., off-peak hours) or through an overall reduction in consumption.
The idea of managing demand side resources has been discussed since the 1890s, as
described in detail in [2]. However, it was not until the restructuring and deregulation of electric
utilities in the 1990s and subsequent issues that began to arise in the new wholesale markets that
a concerted effort was made to include DSM as an essential aspect of these new markets [3]. In
support of this, the United States government has issued a number of policies in an effort to
remove potential barriers to DSM participants [4] [5] [6] [7]. These programs have been
identified as having the potential to provide a wide range of benefits to both the power system
operator and the end-user [8] [9] [10]. Potential benefits include:
• Reduction in grid demand during peak hours and a subsequent reduction in the reliance on expensive peaking units [11]. In the United States, DSM potential is estimated to be between 38 GW and 188 GW by 2019 [8]. Peak reduction also has the added potential to defer transmission system infrastructure upgrades that may otherwise be required.
• Reduction in wholesale energy prices and decreased price volatility. Even a small increase in demand elasticity can offset the extreme increase in generation cost at peak demand levels.
• Reduced electricity costs and new revenue opportunities for the end-user through the local management of their own loads.

Commercial building loads are particularly well suited to supporting power system operations owing to a high load footprint [19] and controllability that is at least on par with that of generators [20]. The future involvement of building loads in DSM programs is further enabled by the continued rollout of advanced metering infrastructure (AMI). Such advancements bring a new level of two-way communication and control between the utility and the end-user.
The motivation for this thesis is the need for improved methods to assess building-
specific DSM opportunities and support increased involvement of building loads in various DSM
programs. In order for building participation in DSM programs to continue to evolve, the tools available to the end-user must evolve as well. To that end, this thesis will consider what tools are needed from the perspective of the end-user in
order to understand and plan their own building loads as part of a DSM program.
1.3. OBJECTIVES
• Describe the existing standard of DSM programs and the potential for using commercial building loads as DSM resources
o Drexel-specific load forecasting case study with results and observations
2. COMMERCIAL BUILDING LOADS AS DEMAND SIDE MANAGEMENT RESOURCES
2.1. OVERVIEW
This chapter examines the potential of using commercial building loads as controllable DSM resources. The factors that drive building demand are discussed, along with existing approaches for modeling building loads and evaluating building DSM capabilities. These methods will be contrasted with the approach taken in this work.
As mentioned previously, a wide variety of DSM programs may be available to the end-user. These programs are generally grouped into two basic types: price-based and incentive-based programs [10]. Figure 2.1 shows this classification of various DSM programs. Price-based programs, as the name implies, use variable electricity pricing to influence consumption. Time-of-use (TOU) programs, for example, provide variable rate plans to customers where the rates are defined well in advance of pre-defined usage periods [21]. These usage periods can include, for example, daily peak or off-peak hours or seasonal time windows, where each period is priced differently.
In contrast to TOU pricing, dynamic pricing sets electricity rates that are not known to
the end-user so far in advance. These rates may be set on a day-ahead or an hour-ahead time
scale. Dynamic pricing is a key factor in both Critical Peak Pricing (CPP) and Real Time Pricing
(RTP) programs. During CPP events, which occur on high system demand days several times a year, enrolled participants receive higher electricity rates to encourage reduced consumption, typically in exchange for discounted rates during non-event periods [22]. In RTP programs, the cost of electricity is variable from day to day or hour to hour.
At present, RTP programs are not very common. These types of programs rely on two-way
communication between the utility and end-user for pricing information. As more AMI projects
and other communication and control advancements are put into operation, RTP programs
should become more widespread. It is arguable that the evolution of DSM will rely heavily on the success of such dynamic pricing programs.

Incentive-based programs include both direct control and market-based programs. Direct control refers to programs where the end-user allows the utility
to control their load in exchange for an agreed upon payment or credit. Interruptible Load
Management (ILM), which targets mainly large industrial customers, and Direct Load Control
(DLC) programs, which focus mainly on residential customers, have been around since the
1970s [2] to support peak load management. These programs continue to be employed by
utilities today.
Market based programs are a more recent expansion of DSM opportunities which allow
the end-user to participate in the wholesale electricity market. In Demand Bidding and Capacity
Market programs, the end-user bids demand reductions and, if the bids are accepted, must
provide demand reductions at the specified time [23]. In these programs the demand-side bids
are optimized with the supply-side bids, effectively treating loads and generators as equal players
in the market. Participation in the Ancillary services market includes, for example, registering
controllable demand as resources for regulation or spinning reserve applications [14] [15] [16]
[17] [18].
The DSM programs described above continue to grow in participation as time goes on. It
is expected these markets will continue to advance and that building loads will be increasingly
utilized as DSM resources in all programs [8]. While building loads make attractive DSM participants, practical challenges remain in evaluating their participation in DSM applications. Several qualities of the loads themselves need to be better understood: what factors drive building demand, what temporal behavior they present, whether that behavior can be described accurately in a form conducive to automated decision making, and what uncertainties are associated with predicting it.
One of the most appealing aspects of using commercial building loads for DSM purposes
is the large electrical footprint and potential for controllability. In 2014, commercial building
energy use made up 34% of the total United States electricity demand [19]. Within these
buildings, over 50% of the electricity was consumed by the HVAC and lighting systems, as shown in Figure 2.2.
Figure 2.2. Breakdown of commercial building energy use in the United States (2014) [19]
It stands to reason given this breakdown that improved operation of HVAC systems has
been identified as a means of potential energy savings and increased energy efficiency in
commercial buildings [24]. It also indicates a strongly coupled relationship between the building
thermal mass and building demand. By exploring this relationship, important information can be
obtained in regards to factors driving building load and the dynamics of building load behavior.
The approach taken in this thesis is to focus on measurements available from the building management systems (BMS) typical of commercial buildings that influence building electrical
behavior. These measurements are directly linked to the thermal-electrical behavior that drives a
large portion of the building demand. Additionally, this approach can lead to the development of
data-driven building models that are well suited for integration into a DSM scheduling tool.
Many medium to large commercial and industrial buildings operate a BMS that is
responsible for the monitoring and control of various building mechanical and electrical system
performance, such as the heating and cooling systems. These BMS are typically configured as a
distributed control system, with a software layer managing the functions of hardware controllers located throughout the building. In order to study building load features, electrical and thermal data has been collected using the
Drexel University BMS over the last several years. Three main variables are considered from
this data:
1. Outside Air Temperature (OAT)
2. Temperature Gradient: Outside Air Temp. – Indoor Air Temp. (OAT-Temp)
3. Target Temperature Gradient: Outside Air Temp. – Indoor Air Temp. Setpoint (OAT-Stpt)
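As a brief illustration (not part of the original data processing), the two gradient variables can be derived directly from the raw BMS readings. The following R sketch assumes a data frame bms with hypothetical column names demand_kw, oat, indoor_temp, and setpoint; the values shown are placeholders rather than actual Drexel measurements.

# Assumed example of 5-minute BMS readings; column names and values are hypothetical
bms <- data.frame(
  demand_kw   = c(182, 185, 190),   # building electrical demand (kW)
  oat         = c(88, 89, 90),      # outside air temperature (deg F)
  indoor_temp = c(74, 74, 75),      # measured indoor air temperature (deg F)
  setpoint    = c(72, 72, 72)       # indoor air temperature setpoint (deg F)
)

# Derived predictor variables described above
bms$oat_temp <- bms$oat - bms$indoor_temp  # temperature gradient (OAT-Temp)
bms$oat_stpt <- bms$oat - bms$setpoint     # target temperature gradient (OAT-Stpt)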
For the studies presented in this chapter, the data used is from the summer months (May-
August) of 2011 and 2012. A large portion of the load for campus buildings is the HVAC
system, making them good candidates as controllable loads during the typically warm and humid
summers. Consequently, that makes these good months for studying building behavior to
support DSM planning activities. It should be noted that Drexel does not have electric heating in
the majority of buildings on campus (and none where data recording capabilities exist).
However, the approach employed in this thesis could easily be applied to characterizing the building load behavior if heating loads were also a potential controllable resource.
Figure 2.3 shows the linear correlation between demand and each BMS-measured variable.
Figure 2.3. Linear correlation between demand and several variables: OAT (top), OAT-Temp (middle), and
OAT-Stpt (bottom)
Particularly strong correlation is noted between demand and the target temperature
gradient. This relationship makes sense intuitively, as the building HVAC system needs to draw
more energy to maintain a given setpoint inside the building when temperature outside increases.
These results indicate a reasonable correlation, particularly when considering the relationship
between demand and temperature is inherently nonlinear. Additional correlation studies have been performed as part of this work.
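This type of screening can be reproduced with a few lines of R. The sketch below is illustrative only and assumes the bms data frame from the earlier example, with the derived gradient columns already added; it simply computes the Pearson correlation between demand and each candidate predictor.

# Linear (Pearson) correlation between demand and each candidate predictor
predictors   <- c("oat", "oat_temp", "oat_stpt")
correlations <- sapply(predictors, function(v) cor(bms$demand_kw, bms[[v]], use = "complete.obs"))
print(round(correlations, 2))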
In addition to correlation with measured BMS variables, there are temporal dependencies
that arise due to the thermal-electrical coupling present in large HVAC electrical loads [26] [27],
which make up a considerable portion of the load base for commercial buildings [28] [19].
Figure 2.4 shows an example of the correlation between measured demand values with future
demand levels.
[Figure 2.4 plot: "Linear Correlation Between Demand Readings (Summer 2011)"; both axes are time of day in hours (0–24), correlation scale from 0.2 to 1.0.]
Figure 2.4. Temporal correlation for 24 hours of building demand. Sample size is 4 months of weekdays (88
days – 2011)
Figure 2.4 depicts how the demand at each point in time is correlated to the demand at a
future point in time over 24 hours, with samples taken every 5 minutes. For this example there
are large pockets of high correlation shown, particularly between 4am and 8am and 1pm and
8pm. This serial correlation between demand observations is representative of the temporal
dependencies found in building load behavior. The HVAC consumption, and indeed that of
other load components, is driven not only by environmental factors but also building operational
schedules and occupant behaviors. When compared to the residential sector, commercial
buildings experience less variance day in and day out due to more consistent schedules and
occupant behaviors. However, important intraday patterns can still be extracted from the demand usage data and used to develop improved, data-informed building demand profiles [29]. These patterns are inherently building-specific.
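The serial correlation illustrated in Figure 2.4 can be examined in a similar way by correlating the demand series with lagged copies of itself. The sketch below is a simplified stand-in for the full time-of-day correlation matrix and assumes demand is a numeric vector of 5-minute readings (the toy bms series from the earlier example is far too short to be meaningful here).

# Serial correlation of demand with lagged copies of itself (5-minute samples)
demand       <- bms$demand_kw        # replace with a full summer of 5-minute readings
lags_hours   <- c(1, 3, 6, 12)       # look-ahead times in hours
lags_samples <- lags_hours * 12      # 12 five-minute samples per hour
lag_corr <- sapply(lags_samples, function(k) {
  n <- length(demand)
  if (n <= k + 1) return(NA)         # not enough data for this lag
  cor(demand[1:(n - k)], demand[(1 + k):n])
})
names(lag_corr) <- paste0(lags_hours, "h")
print(round(lag_corr, 2))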
It has now been established that this thesis will focus on applying information available from the BMS to the modeling and forecasting of commercial building loads. In the next section, existing research efforts related to predicting and modeling building resources for DSM are reviewed and contrasted with the approach taken in this thesis.
DEMAND SIDE MANAGEMENT RESEARCH
Before buildings can be deployed as part of a DSM program, two questions must be
addressed: how accurately can the load of the building be predicted and how can a building load
be integrated in a DSM application. This thesis argues that the answer to both questions is more
realistic, and consequently more accurate, if measurements from the BMS play a role in the analysis.
Given the potential of buildings as DSM resources, it is not surprising that considerable research has been conducted in attempting to answer these questions. The next two subsections look at the state of the art in predicting building loads and in modeling building loads for DSM applications.
The load shed during a DSM event must be measured against a forecast of what the
demand would be if no adjustment occurred. This forecast is often referred to as the baseline
demand. Figure 2.5 shows an example of the baseline demand and actual metered load during a
DSM event. The area between the magenta and blue vertical lines is the period of time in which the DSM event takes place.
[Figure 2.5 plot: baseline and metered load (approximately 120–220 kW) versus time of day (hours).]
Utilities generally use simple baseline models that involve averaging the daily demand
over several days (excluding DSM event days) [30]. For example, CAISO introduced a “3-in-
10” method where the baseline is based on the hourly average of the three highest energy usages
in the past ten similar days [31]. PJM uses the same approach except that it considers four of the past five similar days [23]. Recent adjustments to both the CAISO and PJM models have been
made to improve the accuracy. Observations of metered demand are collected several hours
prior to the beginning of a DSM event and the original baseline calculations are adjusted up or
down based on these more recent demand readings. This adjustment has improved the accuracy
in quantifying demand behavior for post-event economic settlement. However, this adjustment provides no benefit for forecasting building loads except in near real-time applications, since it requires metered demand readings collected shortly before the event.
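To make the averaging-based baseline concrete, the following R sketch implements one reading of a CAISO-style "3-in-10" calculation together with a simple additive same-day adjustment. The synthetic data, the choice of an additive correction, and the pre-event hours are all illustrative assumptions and do not reproduce the exact tariff rules.

# hourly_history: 10 x 24 matrix of hourly average demand (kW) for the ten most
# recent similar, non-event days (rows = days, columns = hours); synthetic data here
hourly_history <- matrix(150 + 40 * sin((1:24) / 24 * pi), nrow = 10, ncol = 24, byrow = TRUE) +
  matrix(rnorm(240, sd = 5), nrow = 10)

# "3-in-10": average the three days with the highest energy use, hour by hour
daily_energy <- rowSums(hourly_history)
top3         <- order(daily_energy, decreasing = TRUE)[1:3]
baseline     <- colMeans(hourly_history[top3, ])

# Simple additive same-day adjustment using metered demand just before the event
pre_event_hours   <- 9:11                            # hours preceding a noon event (illustrative)
metered_pre_event <- baseline[pre_event_hours] + 8   # stand-in for actual meter readings
adjustment        <- mean(metered_pre_event - baseline[pre_event_hours])
adjusted_baseline <- baseline + adjustment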
Regression-based baseline models have also been proposed as alternatives to the averaging methods discussed above [32] [33] [34] [35]. These methods include predictor
variables in the model such as weather and calendar information in an attempt to better predict
load. However, a comprehensive analysis of baseline calculation methods prepared for PJM
showed only 2 of 13 unadjusted methods applied a regression approach [36] and only one
method was actually employed by a utility (ERCOT). The results of this study also indicated the
regression models, as currently designed, did not necessarily outperform the other models in the study.
Beyond the averaging and regression-based baseline calculation approaches that have
been discussed, there is a surprising lack of more sophisticated approaches in the technical
literature [37]. This can be attributed more to terminology than to a lack of research effort, however.
The DSM community tends to use the term baseline as opposed to forecast. Baseline has the
very specific connotation as the load profile against which incentive-based DSM performance is
measured when the utility and end-user reach settlement following a DSM event. There are
other research efforts that focus on building-specific forecasts for studying building load
behavior. The models in these works tend to be more complex and less intuitive than the
baseline models but they more thoroughly capture building load behavior and produce more
consistently accurate load forecasts. These works are not solely concerned with quantifying
building behavior for DSM, although that is still often a motivation. Going forward, this thesis
will use the term forecast to refer to any approach concerned with predicting the future building
electricity demand.
Methods for forecasting end-user facility demand are presented in [30] [25] [38] [39] [40]
[41] [42] [43] [44] [45] [46] [47] [48] [49]. These works have applied a wide variety of methods
that are often used in short term load forecasting (STLF) studies but at a campus or building
level. In [30] [25] [38] [39] regression models are used. [40] [41] [42] [43] [44] employ several
different forms of neural network models. [45] proposes a new day-ahead probabilistic model
based on Gaussian processes for an industrial facility. Support vector machines (SVM) are used
in [46] to forecast monthly demand at four commercial sites in Singapore and in a short-term
application in [47]. In [48] and [49], several forecasting models are combined to create
ensemble models for forecasting building demand. All of these works recognize that forecasting
at the building level must deal with a higher degree of variability, a feature that becomes washed
out to a degree when forecasting at a transmission substation level with aggregated loads.
However, of these references, only [47] included internal building measurements in their forecast
models. The measurements were collected from a limited sensor network installed for testing
purposes. These measurements included occupancy information from sensors at the two
building entrances and four temperature sensors distributed through the building.
In contrast to the works above, the approach taken in this thesis is to focus on including information available from the building management systems (BMS) typical of commercial buildings that influence building electrical behavior. As discussed earlier in this chapter, these
measurements are directly linked to the thermal-electrical behavior that drives a large portion of
the building demand. The goal is to enhance existing STLF techniques with these building-
specific measurements and compare the performance against similarly trained models that do not include this information.

A separate body of building modeling research comes from the HVAC engineering community. This research tends to provide highly detailed
models of the building thermal dynamic behavior [50]. Often these models require a thorough
knowledge of the building construction and equipment profiles. In general these models are far
too complex to be used for evaluating DSM applications for a given building, let alone a group
Power systems researchers are increasingly interested in how building loads can be
integrated into the electric grid through DSM. However, the majority of this research has centered around the integration of bulk demand response by the independent system operator (ISO)
when scheduling system resources and performing system security analysis [51] [52] [53] [54]
[55]. It is assumed that building loads can play a role as DSM resources in these scenarios, but little information can be derived from these processes that would help the end-user achieve the required load behavior in practice.
From the perspective of the end-user it is crucial to have a method of evaluating one’s
own potential for DSM activity; this includes models of building behavior and a formal method of
determining when and how to dispatch loads. This is particularly true when demand
modifications are driven through control of the HVAC system given the potential for significant
impact to the comfort levels of the building. Modeling the behavior of thermostatically
controllable loads (TCL) for DSM has been studied in [14] [56] [57] [58]. While there are potential applications for these models, they are better suited for the aggregation of many
residential or small commercial loads [14] and overlook the fact that cycling HVAC systems
between on and off states is undesirable for reliability and efficiency reasons [59].
There remains a need for a model that captures the thermal-electrical dynamics of
commercial building loads with an eye towards controllability and simple implementation in a
DSM scheduling for the consumer. In this thesis such a model is developed in Chapter 5 using
methods applied previously for the development of power system dynamic load models and
again leveraging information collected from the BMS. Chapter 6 evaluates how this model,
combined with the building-specific forecasts from Chapters 3 and 4, can be used in a method of
planning commercial building load resources for DSM directed at the end-user.
3.1. OVERVIEW
In this chapter, the problem of building-specific load forecasting is studied. The main focus is on enhancing conventional forecasting techniques with building-specific information and evaluating the resulting performance. This chapter includes the following:
• A case study where conventional forecasting models are trained on data collected
from the Drexel University BMS and a series of day-ahead load forecasts are
performed.
• Results from the case study are used to evaluate building-specific forecasting performance.
Historically, load forecasting (in particular, short-term load forecasting (STLF)) has played a
critical role in ensuring power system dispatchers schedule adequate generating capacity in the
most economical way possible. The importance of having access to accurate and consistent
methods of load forecasting is obvious. Consequently, a considerable amount of work has been
done to develop and examine methods of forecasting power system loads over varying time
horizons [60] [61]. In this thesis the focus is on day-ahead forecasts, which fall into the STLF
window.
Load forecasting approaches generally fall into two categories: statistical approaches
such as classical time series analysis and regression–based models, and artificial intelligence
(AI) models such as artificial neural networks and fuzzy logic models [62]. Methods from both
of these categories have been applied to solving the STLF problem. A very thorough review and
critique of STLF techniques introduced prior to 2010 is presented in [63]. The popularity of AI
methods has led to a lot of research resources being devoted to applying newly developed AI
techniques to STLF. Far less attention is placed on applying modern statistical methods to this
problem. This is despite the fact that, as noted in [63], statistical methods are much more widely used in practice.
An important contribution from [63] is that the focus is not only on the variety of techniques
but also what predictor variables are employed in the forecasting methods and how said variables
can lead to conclusions regarding the causality of load consumption. Nearly all of the research
surveyed included some predictor variables, and this thesis will consider their impact on building-specific forecasting performance (such inputs are referred to here as "predictor variables"). One of the conclusions is that successful inclusion of such variables depends on an
understanding of the geography of the power system under study and time frame of interest as
much as the method used to integrate these variables into a model. Without understanding this
behavior, the chosen method will not successfully capture the link between demand and any candidate predictor variable.
In the years since the review in [63], the technical literature related to STLF has explored
additional methods for capturing the relationship between demand and additional predictor
variables. Artificial neural network models continue to be widely used owing to the inherent
capability of being able to learn and capture complex linear and nonlinear relationships from the
data to be modeled [64]. Such models have been recently applied in a variety of forms [65]
[66] [67] [68] [69] [70] to take advantage of this property. Additional AI techniques such as
Support Vector Machines (SVM) and other supervised learning algorithms [71] [72] [73] [74]
have also been used to achieve suitable results. On the statistical method front, semi-parametric
additive models have recently shown very good results at both the transmission and distribution
system level [75] [76] [77] [78]. These works employ a regression-based structure with smooth,
nonlinear functions used to capture the link between demand and a number of covariates.
In general terms, forecasting is the estimation of the future value of a variable given a set of information about said variable.
historical observations of the variable of interest (univariate case) or observations and predictions
of related predictor variables as well (multivariate case). There is an implicit assumption that the available information is related to the future behavior of the variable of interest.
For this work, we are interested in predicting the demand level p for a given building.
Although electric power is consumed by a building continuously, the observations are discrete
based on the sampling of the electric meter. Therefore, demand is represented by a discrete time series {p_t}, t = 1, ..., T, where p_t is the metered demand at time t and T is the number of time indices in the collection.
The general load forecasting problem can be defined as a function f of the available information:

p̂_{t+k|t} = f(Φ_t)    (3.1)

where

K: Forecast horizon
k = 1, ..., K: Time index within the forecast horizon
p̂_{t+k|t}: Estimate of p given observations up to time t, looking k steps ahead
Φ_t: Collection of historical demand and predictor variables
Figure 3.1 shows a sample load forecast. In the plot, the division is shown between historical observations p_{t−m} (blue line) and estimated demand p̂_{t+k|t} (grey line), also noting the forecast horizon K.
The forecast horizon K, also referred to as the look-ahead time in the forecasting
literature, defines how far into the future one is looking to forecast. If K > 1, the forecast is
called a multiple step ahead forecast. The estimated model remains fixed for the duration of the
forecast horizon. The predictor variable estimates x̂_{t+k|t} are used as inputs to the model in order to generate the final forecast p̂_{t+k|t}. An iterative method can also be used which effectively turns the problem into a single step ahead forecast. At each step k, the estimate p̂_{t+k|t} is found. This
estimate is then treated as a previous observation, becoming an input to the same model in order
to forecast the subsequent point. The process continues in this manner until reaching the end of
the forecast horizon. This approach is more often used for univariate forecasts and has the
disadvantage that the errors in the predicted values are accumulated into the next predictions.
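A minimal sketch of this iterative strategy is shown below. The one-step model here is a deliberately trivial placeholder (the average of the last three observations) rather than any of the forecasting methods used later in this chapter; it only serves to show how each prediction is fed back in as if it were an observation.

# Recursive multi-step forecast: feed each one-step prediction back in as an observation
recursive_forecast <- function(one_step_model, history, K) {
  forecasts <- numeric(K)
  obs <- history
  for (k in 1:K) {
    forecasts[k] <- one_step_model(obs)   # estimate of p at step k
    obs <- c(obs, forecasts[k])           # treat the estimate as a past observation
  }
  forecasts
}

# Example with a trivial placeholder model: the mean of the last three values
toy_model <- function(x) mean(tail(x, 3))
recursive_forecast(toy_model, history = c(180, 185, 190, 200), K = 5)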
The predictor variables contained in Φ will ideally include only variables that have a
causal relationship with the demand p. Identification of such variables can be done prior to
estimation of the function f by observing correlation plots such as the one shown in Figure 2.3.
Variable identification can also be done by training several forecast models with different
observations, and selecting the “best” model. In this thesis, two groups of predictor variables
will be used: one which includes building-specific measurements and one that does not. The
specifics of this are described in the model building procedure in subsection 3.4.3.
In order to test the idea of enhancing conventional load forecasting methods with
building-specific information, several STLF techniques will be used to forecast the day-ahead
demand of a building on Drexel University’s campus. The following subsections will describe
the data set used in this study, introduce the selected forecasting methods and describe the model
building procedure.
Data has been collected using the Drexel University building management system (BMS)
over the last several years. For the studies presented in this thesis, the data used is from the
summer months (May-August) of 2011, 2012, and 2014. As mentioned previously, the summer
weather conditions and large HVAC building load make these good months to study load
forecasting performance in support of demand side management activities. This case study will
focus on the Hagerty Library; however, the approach is generic to any similar commercial-type
building.
The raw data includes four variables, all recorded at 5-minute intervals: building demand (kW), outside air temperature, indoor air temperature, and the indoor air temperature setpoint.
The summer months (May-August) of 2011 and 2012 are used as the training set and
2014 held out to be used for out-of-sample testing. This distinction is shown in Figure 3.2
below. By training and testing the models on separate data sets the problem of overfitting can be
avoided. The training set can be broken into a sample used for estimating model parameters and
a sample used to validate these parameters. This is not required for all methods in this study.
Instances where this is the case will be discussed in the methods section 3.4.2.
Figure 3.2. Breakup of data set and how each portion is used in the forecasting process
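A sketch of this split is shown below. It assumes the 5-minute observations are held in a single data frame with a POSIXct timestamp column; the frame constructed here is a tiny synthetic stand-in, and with the real data set the 2014 rows would form the out-of-sample test set.

# Split the summer data by year: 2011-2012 for training/validation, 2014 held out for testing
bms_all <- data.frame(
  timestamp = seq(as.POSIXct("2011-05-01 00:00"), by = "5 min", length.out = 10),
  demand_kw = runif(10, 150, 250)
)
yr <- as.integer(format(bms_all$timestamp, "%Y"))

train_set <- bms_all[yr %in% c(2011, 2012), ]  # model estimation (and validation where needed)
test_set  <- bms_all[yr == 2014, ]             # out-of-sample day-ahead forecast evaluation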
Unfortunately the 2013 data had to be excluded from this study due to a large number of
bad measurements in this data set. These issues are attributed to BMS data collection
functionality problems. However, the building involved in this study is known to have operated consistently over this period, so it was deemed acceptable to leave out this data set for the purposes of this research effort.
Figure 3.3 below shows the daily building demand measurements for the test set portion
of the data (2014). The superimposed dark line represents the mean load profile for the test set.
It is these measurements against which the forecasts will be compared in this case study.
Figure 3.3. Daily building demand curves for the 116 days used for out of sample testing. The thicker line
shows the mean daily profile
This work examines the performance of four popular forecasting methods used in a
building-specific application. The selected methods were chosen to represent a broad cross
section of techniques. Detailed descriptions of each approach are well established in other works,
but the general problem formulations are outlined below, along with information specific to how these methods are applied in this work.
A multiple linear regression model relates a dependent (or response) variable p to two or
more independent (or predictor) variables x. The general model is shown in (3.3) [79]:

p_t = \beta_0 + \beta_1 x_{1,t} + \beta_2 x_{2,t} + \dots + \beta_m x_{m,t} + \varepsilon_t    (3.3)
In many cases, the predictor variables are quantitative such as temperature or wind speed.
However, the formulation in (3.3) does not limit the predictor variables to quantitative ones.
Qualitative predictor variables, often referred to as indicator or dummy variables, can also be
included in the model and are particularly important in forecasting applications. Indicator
variables with values 0 and 1 can be used to identify the category of a quantitative variable, for
example indicating a weekday versus a weekend. A qualitative variable with c categories must
be represented by c - 1 indicator variables. For example, the qualitative variable
representing the day of the week has 7 categories (i.e. Sunday, Monday, ..., Saturday) and is
represented by 6 indicator variables as shown in Table 3.1. Choosing to use c variables instead would
make the model matrix linearly dependent and the coefficients unidentifiable.
The coefficients β represent the partial effect of one predictor variable when all others are
held constant [79]; in other words, they represent the marginal effect of each predictor variable.
Given a set of training data, the β values are estimated using the ordinary least squares estimation
method. The models used in this thesis were implemented in R using the 'lm' function, which performs this estimation directly.
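As a minimal sketch of this formulation (synthetic data and hypothetical predictor names, not the actual model of this case study), a multiple linear regression with both quantitative and indicator variables can be fit in R with the 'lm' function:

# Multiple linear regression sketch with quantitative and indicator variables
# (synthetic data; the real models use the predictor variables of Table 3.2).
set.seed(1)
n <- 200
train <- data.frame(
  temp       = runif(n, 15, 35),
  lag_demand = runif(n, 50, 250),
  day        = factor(sample(c("Sun","Mon","Tue","Wed","Thu","Fri","Sat"), n, replace = TRUE))
)
train$demand <- 20 + 4 * train$temp + 0.5 * train$lag_demand + rnorm(n, sd = 10)

# lm() expands the 7-level 'day' factor into 6 indicator variables automatically
# and estimates the beta coefficients by ordinary least squares.
fit  <- lm(demand ~ temp + lag_demand + day, data = train)
coef(fit)
pred <- predict(fit, newdata = train[1:5, ])   # forecasts for new predictor values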
Artificial neural networks (NN) are mathematical tools inspired by the way the human
brain processes information. The most basic computational unit of an NN is the neuron. The
neuron receives information, processes it internally, and provides a response. Figure 3.4 shows a general representation of a single neuron.
In general, the information is processed in two stages. First, the input values are linearly
combined. Each value of the input array is associated with a weight value wi. An additional
input, a constant bias term θ, with a weight value equal to 1 is also applied. Second, this
combination becomes the argument of a non-linear activation function. There are a number of
possible functions but a very common choice (and the one implemented in this thesis) is the sigmoid function shown in (3.4):

f(x) = \frac{1}{1 + e^{-x}}    (3.4)
The organization of the neurons defines the architecture of the NN. A feed-forward
multilayer perceptron (MLP) neural network is a typical NN architecture employed for STLF. In
this architecture, the neurons are organized in layers where no neuron in a given layer is
connected to another neuron in the same layer, though they can share inputs. The term feed-forward
means that the outputs of one layer become the inputs to the following layer. An example of this architecture is shown in Figure 3.5.
Figure 3.5. A general multi-layer, feed-forward artificial neural network with N hidden layers
The parameters of this network are the matrices of weights between each neuron and its
associated input. Input in this case also means connections between neurons in different layers.
Estimation of these weight matrices is referred to as “training” the network. The most widely
applied approach to training in STLF is supervised learning. For this approach, sets of inputs
and matched outputs are used as teaching patterns and the network weights that provide the best
fit between the network output and teaching output are found. Best fit is determined through
minimization of a loss function. The available training algorithms and loss functions are varied,
but historically the back-propagation method and mean squared error criterion are common choices in STLF applications.
The neural network used in this thesis is a three-layer feed-forward MLP implemented
using the Matlab Neural Network Toolbox. The network architecture and training method used
are as described in the preceding paragraphs. The model is used to forecast multiple steps ahead;
that is, 288 point forecasts, corresponding to a single day-ahead forecast at 5-minute
resolution. This results in a large neural network model that may be computationally
undesirable for implementation in a building-level energy management system. However, for the
purposes of this study it is sufficient, and more computationally efficient techniques suited to building-level implementation are left for future work.
One other item of note when constructing the NN model used in this work is that there
are no hard and fast rules guiding the selection of the number of hidden neurons [64]. The
hidden neurons are the neurons in the layer (or layers) between the input and output layers. The
model in this work uses 25 hidden neurons. This number was determined by estimating the
model on a portion of the training set, as shown in Figure 3.2, while varying the number of
neurons and observing the accumulated error on the validation set. Varying the number of
neurons did not appear to significantly affect the error results, which is not uncommon [64], and
25 was selected.
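The thesis model itself was built with the Matlab Neural Network Toolbox; purely as an illustration of a single-hidden-layer feed-forward network with 25 hidden neurons, a comparable sketch in R (using the nnet package and synthetic data, both assumptions not taken from this work) might look as follows:

# Feed-forward network sketch (illustration only; the thesis model was built
# with the Matlab Neural Network Toolbox, not this package).
library(nnet)
set.seed(1)
x <- matrix(runif(500 * 3), ncol = 3)                 # synthetic scaled inputs
y <- 0.3 * x[, 1] + 0.5 * sin(pi * x[, 2]) + 0.1 * x[, 3] + rnorm(500, 0, 0.02)

# One hidden layer with 25 sigmoid neurons, linear output for regression.
net  <- nnet(x, y, size = 25, linout = TRUE, maxit = 500, trace = FALSE)
yhat <- predict(net, x)                               # fitted outputs
mean((y - yhat)^2)                                    # squared-error loss on the training data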
Support vectors and support vector machines (SVM) are a machine learning technique
used for data classification and regression [80]. Given training data (x_1, y_1), ..., (x_n, y_n),
where x_i are the inputs and y_i the corresponding outputs, support vector regression solves the following problem:
\min_{w, b, \xi, \xi^*} \; \frac{1}{2} w^T w + C \sum_{i=1}^{n} \left( \xi_i + \xi_i^* \right)    (3.5)

subject to

y_i - \left( w^T \phi(x_i) + b \right) \le \delta + \xi_i^*    (3.6)

\left( w^T \phi(x_i) + b \right) - y_i \le \delta + \xi_i    (3.7)

\xi_i, \xi_i^* \ge 0, \quad i = 1, \dots, n    (3.8)

where x_i is mapped to a higher dimensional space by the function \phi, \xi_i^* is the upper
training error and \xi_i the lower, subject to the \delta-insensitive tube |y_i - (w^T \phi(x_i) + b)| \le \delta.
The parameters which control regression quality are the error cost C, the width of the tube \delta, and the mapping function \phi.
As noted in [81], since the function \phi can map x_i to a high or even infinite dimensional
space, the problem is solved in its dual form:

\min_{\alpha, \alpha^*} \; \frac{1}{2} (\alpha - \alpha^*)^T Q (\alpha - \alpha^*) + \delta \sum_{i=1}^{n} (\alpha_i + \alpha_i^*) + \sum_{i=1}^{n} y_i (\alpha_i - \alpha_i^*)    (3.9)

subject to

\sum_{i=1}^{n} (\alpha_i - \alpha_i^*) = 0    (3.10)

0 \le \alpha_i, \alpha_i^* \le C, \quad i = 1, \dots, n    (3.11)
where Q_{ij} = \phi(x_i)^T \phi(x_j). Because this inner product may be computationally
difficult to evaluate, a "kernel trick" is implemented to do the mapping implicitly; that is,
special functions are used which are inner products in a higher dimensional space yet can be
calculated in the original space. There are many options for kernel functions, but for this work
the radial basis function (RBF) kernel is used. The expression for this kernel is shown in (3.12):

\phi(x_i)^T \phi(x_j) = K(x_i, x_j) = e^{-\gamma \| x_i - x_j \|^2}    (3.12)
The RBF kernel can handle nonlinear relationships between xi and yi unlike the linear
kernel yet has fewer parameters than a polynomial kernel, reducing model complexity. This
kernel selection has been used to great effect in STLF applications [80] [73].
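For concreteness, the kernel in (3.12) can be evaluated directly; the input vectors and γ value in this small sketch are arbitrary:

# Radial basis function (RBF) kernel of (3.12): K(xi, xj) = exp(-gamma * ||xi - xj||^2)
rbf_kernel <- function(xi, xj, gamma) exp(-gamma * sum((xi - xj)^2))

xi <- c(25.0, 60.0, 180.0)   # e.g. temperature, humidity, lagged demand (arbitrary values)
xj <- c(27.5, 55.0, 190.0)
rbf_kernel(xi, xj, gamma = 0.01)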
There are two key parameters that must be found when training the SVM models: the error
cost C and the RBF parameter γ. To decide the proper parameter values, the training set is
segmented such that the model performance can be evaluated. A portion of the training set is
used for updating the model parameters while the validation set is used to observe the
corresponding model performance. To determine suitable values for C and γ, v-fold cross
validation [82] is performed by dividing the training set into v equally sized subsets. With one
subset held out for validation, the model is trained on the other v-1 subsets. This process is
repeated using each subset as a validation set and the model performance aggregated. In this
thesis the models are implemented using the LIBSVM extension in R [81]. This package
automatically handles the parameter estimation procedure by efficiently solving (3.9) while performing the cross validation described above.
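A minimal sketch of this training procedure is given below, assuming the e1071 package (one R interface to LIBSVM) and synthetic data; the parameter grids are illustrative and not the values used in this study:

# Support vector regression sketch using e1071 (an R interface to LIBSVM);
# synthetic data, and the C / gamma grids are illustrative only.
library(e1071)
set.seed(1)
n <- 300
x <- data.frame(temp = runif(n, 15, 35), lag_demand = runif(n, 50, 250))
y <- 30 + 3 * x$temp + 0.6 * x$lag_demand + rnorm(n, 0, 8)

# v-fold cross validation over the error cost C and RBF parameter gamma.
tuned <- tune.svm(x, y, kernel = "radial",
                  cost = 2^(-1:5), gamma = 2^(-6:-1), epsilon = 0.1,
                  tunecontrol = tune.control(cross = 5))
best <- tuned$best.model                 # SVR model with the selected C and gamma
yhat <- predict(best, x)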
In a manner similar to the regression structure established for the multiple linear
regression model, the semi-parametric additive models capture nonlinear relationships using the general form shown in (3.13):

p_t = \beta_0 + \sum_{k} f_k(x_{k,t}) + \varepsilon_t    (3.13)

The functions f_k are non-linear, smooth functions that can be well estimated from
observed data. These functions can be multivariate (e.g. – f(x1,x2) ) but for this thesis they will
only be a function of a single variable. It is common for these functions to be estimated via
penalized regression in a spline basis. If b_i(x) is the i-th basis function, then f_k can be represented
as in (3.14):

f_k(x_k) = \sum_{i=1}^{d} b_i(x_k) \, \beta_i    (3.14)
where d is the dimension of the spline basis and bi(xk) are the corresponding spline
functions. There are many possible spline functions available. This work uses cubic regression
splines as were applied in previous STLF works [75] [76]. In order to estimate f_k, a penalized
least squares problem is solved, as shown in (3.15):

\min_{\beta} \; \| p - B\beta \|^2 + \sum_{q=1}^{k_v} \lambda_q \, \beta^T S_q \beta    (3.15)

where B is the model matrix formed from the basis functions, S_q are the associated penalty
matrices, and \lambda_q are smoothing parameters that control the
model smoothness. The models used in this work are implemented in R using the mgcv package
[84]. The problem in (3.15) is solved using the methodology presented in [85] [86], which
involves minimization of the Generalized Cross Validation (GCV) criterion. This is done
automatically in mgcv with the appropriate spline function and training set specified.
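A minimal sketch of such a model, assuming the mgcv package with cubic regression spline smooths and synthetic data (the real models use the predictors of Table 3.2), is shown below:

# Semi-parametric additive model sketch using mgcv; cubic regression spline
# smooths (bs = "cr") for each continuous predictor, synthetic data only.
library(mgcv)
set.seed(1)
n <- 500
dat <- data.frame(temp = runif(n, 15, 35), hour = runif(n, 0, 24))
dat$demand <- 80 + 40 * sin(pi * dat$hour / 24) + 2 * dat$temp + rnorm(n, 0, 5)

# Smoothing parameters are chosen automatically by minimizing the GCV criterion.
fit  <- gam(demand ~ s(temp, bs = "cr") + s(hour, bs = "cr"),
            data = dat, method = "GCV.Cp")
pred <- predict(fit, newdata = dat[1:5, ])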
The overall STLF process is summarized in Figure 3.6. (Modified from [63]).
The first step is to use historical information to estimate the model. The predictor
variables contained in Φ include the data required to estimate the necessary model parameters.
When training each model, the considered predictor variables are shown in Table 3.2.
Table 3.2. General and Building-specific predictor variables
For each forecasting method tested in this thesis, two models are built: one which
includes the general variables and building-specific variables and one that only includes the
general variables. The 2011 and 2012 data are used for model estimation. For those forecasting
methods that use a validation set (refer to section 3.4.2 for details), the August 2012 data is set
aside as the validation sample.
the day-ahead demand for the 2014 test set. The forecasts are generated in 5 minute steps
matching the sample rate of the historical data. A total of 116 day-ahead forecasts were
performed (May 8th - Aug. 31st 2014). The first week of May was excluded as no historical
lagged measurement observations were available prior to May 1st to support forecast generation.
In practice, generating a day-ahead forecast requires feeding estimates of the predictor variables \hat{x}_{t+k|t} into the model. For example, a weather forecast for the next day
would be used as an input as opposed to actual temperature information. Because they use only information
known in advance, these forecasts are known as ex-ante forecasts. These are the only "true"
forecasts since all future information is unknown. In this thesis, because the goal is model
analysis, estimates of the predictor variables \hat{x}_{t+k|t} are replaced with observed values x_{t+k}, and
the generated forecast \hat{p}_{t+k|t} is compared to the known value p_{t+k}. These types of forecasts are
known as ex-post forecasts and are useful in demonstrating the potential accuracy of a model given perfect input information. The questions of interest here are whether building
measurements improve the forecast accuracy, how consistent that performance is, and how
the forecast uncertainty changes when building information is included. Most of the forecasting
literature is focused solely on the forecast accuracy. The majority of papers follow the same
general formula: develop a forecast model and then proceed to use an accuracy metric that
demonstrates how well a particular model performs for that given test. This type of analysis is useful but
limited; it struggles to address the consistency of the forecast performance and provides little
insight into the uncertainty of the forecasts. Therefore, this thesis will address not only the overall accuracy of the forecasts but focus particularly on their consistency and uncertainty.
Forecast error is defined as the difference between the predicted power value and the measured value:

e_{t+k|t} = \hat{p}_{t+k|t} - p_{t+k}    (3.16)

The error is composed of a systematic part \mu_{t+k|t} and a random part \varepsilon_{t+k|t}:

e_{t+k|t} = \mu_{t+k|t} + \varepsilon_{t+k|t}    (3.17)
Ideally, the systematic error is zero and the random part is white noise (zero mean,
Gaussian random variable). However, in practice these conditions rarely hold, and
examining each part of the error is necessary to understand the impact of building measurements on the forecast performance.
The standard metric for defining the forecast accuracy in the load forecasting literature is
the mean absolute percentage error (MAPE), defined in (3.18):

MAPE = \frac{100}{N} \sum_{k=1}^{N} \left| \frac{e_{t+k|t}}{p_{t+k}} \right|    (3.18)
where N is the number of forecasted points in the interval of interest. By varying the timeframe
over which the MAPE is calculated, certain behaviors of the forecast method of interest can be
observed. This metric is an overall accuracy measure to which both the systematic and random errors contribute.
Another basic metric is the forecast method’s bias, given by the mean error over a
specified interval.
bias = \bar{e} = \frac{1}{N} \sum_{k=1}^{N} e_{t+k|t}    (3.19)
where N is the number of forecasted points in the interval of interest. The bias shows if the
method tends to under- or overestimate the forecast. It corresponds to the systematic part of the
forecast error et+k|t. Ideally the forecasts will be unbiased but in practice this is often unrealistic.
The variability of the forecast performance can be observed by calculating the standard
deviation of the forecast errors (SDE), defined in (3.20):

SDE = \sqrt{ \frac{1}{N} \sum_{k=1}^{N} \left( e_{t+k|t} - \bar{e} \right)^2 }    (3.20)
where N is the number of forecasted points in the interval of interest. The standard deviation
corresponds to the random part of the forecast error. Histograms of the forecast errors represent the empirical distributions of these errors. This type of analysis is
important for characterizing the consistency of the forecast performance by answering the
question “How often does a given forecasting method result in a specific error level?”. For
example, two methods might result in nearly identical MAPE values over a given time window
but have radically different error distribution shapes, thus different frequencies of large errors.
Evaluating the moments of the forecast error distributions will shed light on several important
characteristics:
The mean \bar{e} is a measure of the central tendency of the error distribution. As mentioned
above, it corresponds to the forecast bias. The skewness describes the lack of symmetry of a distribution, indicating the most
likely direction of expected forecast errors. The skewness is calculated using (3.21) [87]:
skewness = \frac{ \frac{1}{N} \sum_{k=1}^{N} \left( e_{t+k|t} - \bar{e} \right)^3 }{ \left[ \sqrt{ \frac{1}{N} \sum_{k=1}^{N} \left( e_{t+k|t} - \bar{e} \right)^2 } \right]^{3} }    (3.21)
The kurtosis represents the "tailedness" of the distribution, indicating the propensity for
outliers in the forecast errors. The kurtosis is calculated using (3.22) [87]:

kurtosis = \frac{ \frac{1}{N} \sum_{k=1}^{N} \left( e_{t+k|t} - \bar{e} \right)^4 }{ \left[ \frac{1}{N} \sum_{k=1}^{N} \left( e_{t+k|t} - \bar{e} \right)^2 \right]^{2} }    (3.22)
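For concreteness, the metrics in (3.18) - (3.22) could be computed from a vector of forecasts and observations as in the following sketch (sample-moment formulas; the data are synthetic):

# Forecast error metrics per (3.18) - (3.22); 'p' are observed values and
# 'phat' the corresponding forecasts over the interval of interest.
error_metrics <- function(p, phat) {
  e    <- phat - p                     # forecast error, as in (3.16)
  mape <- 100 * mean(abs(e / p))       # (3.18)
  bias <- mean(e)                      # (3.19)
  sde  <- sqrt(mean((e - mean(e))^2))  # (3.20)
  skew <- mean((e - mean(e))^3) / sde^3   # (3.21)
  kurt <- mean((e - mean(e))^4) / sde^4   # (3.22)
  c(MAPE = mape, bias = bias, SDE = sde, skewness = skew, kurtosis = kurt)
}

set.seed(1)
p    <- runif(288, 100, 250)           # one day of 5-minute observations (synthetic)
phat <- p + rnorm(288, 0, 10)          # hypothetical forecasts
error_metrics(p, phat)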
Error margin plots will be used to shed more light on the error histograms. These plots
show what proportion of forecast errors fall above a certain threshold for a given window of
time. This will help answer the question of how often the forecasts result in unacceptable errors.
The following subsection will use the above metrics and distribution-based methods to
evaluate the 116 day-ahead building-specific load forecasts. The forecasts correspond to 33408
total point predictions. Comparisons are made between the models that include building
measurements and those that do not. The forecasts are observed over several different time scales.
3.4.5. RESULTS
Before taking a more detailed look at the forecast performance, it is typical of forecasting
studies to observe the overall accuracy results. First, the day-ahead MAPE for each of the 116 out-of-sample forecasts is summarized in Table 3.3.
Table 3.3. Quantiles of day-ahead MAPE performance across all out-of-sample forecasts
MLR NN SVM SAM
quantile No Bldg Bldg No Bldg Bldg No Bldg Bldg No Bldg Bldg
10% 3.06 2.87 3.88 3.58 2.72 2.53 3.36 3.04
25% 3.82 3.64 4.74 4.25 3.15 2.97 4.01 3.94
50% 5.03 4.52 5.65 5.36 4.18 4.08 5.34 5.29
75% 6.74 5.93 8.55 6.68 6.44 6.39 7.72 7.62
90% 8.75 7.49 11.99 9.02 9.26 9.60 11.78 10.86
The nearly universal improvement when building-specific measurements are included is
apparent for all 4 forecasting methods; the only deviation is for the SVM model in the 90th
percentile of forecasted days. It is important to note that the MAPE values of the models without
building measurements are already consistently good (and in some cases excellent) by today’s
forecasting standards. Improving upon a poorly performing model by feeding it new variables
would say little about the actual impact. That the models with building information consistently
outperformed the other set in this case study demonstrates the predictive capacity of these
variables.
Other breakdowns of forecast performance are shown in Table 3.4 and Table 3.5. Table
3.4 shows the MAPE calculated on a monthly basis. Table 3.5 shows the MAPE calculated for
weekdays and for weekends/holidays; in both cases, improvement with building measurements is
demonstrated for all methods. It is worth noting the improvement in July and August vs. May
and June. Although classes at Drexel University are in session year-round, the period between
July and September tends to have fewer people on campus. The forecast improvement during
these months could be attributed to the diminished volatility of occupancy effects during that period.
One of the objectives of this thesis is to study a tool for improving end-user energy
management decisions. While the results in Table 3.3 – Table 3.5 indicate solid performance
improvement, good daily or monthly performance is not sufficient to confidently inform DSM
operations and could in fact be misleading. The performance during hourly (or even shorter)
intervals must be explored to provide value for demand side management planning.
For this study the forecasts have been generated in 5 minute steps as stated previously.
The error statistics presented in the remainder of this chapter are aggregated so as to be reported
hourly. The decision to present error statistics hourly is motivated by typical settlement period
lengths for day-ahead demand response programs [23]. This decision also makes
observations of the forecast performance a little more straightforward, but no less accurate, with
a condensed set of results. A full set of results at 5 minute resolution can be found in Appendix
A2.
Table 3.6 shows the hourly MAPE for all 4 methods across all 116 day-ahead forecasts.
When looking at the hourly performance we again see an overall measurable improvement when
building measurements are included in the models. There are also a number of additional
observations that can be made about the consistency of forecast performance from these results.
Table 3.6. Hourly MAPE (%) across all out-of-sample forecasts
MLR NN SVM SAM
Hour No Bldg Bldg No Bldg Bldg No Bldg Bldg No Bldg Bldg
1 5.16 5.12 8.06 6.54 5.49 5.55 7.03 6.31
2 5.37 5.33 7.85 5.62 6.86 6.81 7.51 6.62
3 8.53 8.61 8.61 7.95 6.47 6.24 10.80 9.39
4 8.37 8.56 12.50 8.08 6.72 7.02 11.52 10.16
5 8.18 8.26 10.23 6.85 6.11 6.35 10.56 9.38
6 10.01 9.61 10.38 7.73 8.66 9.18 10.01 10.31
7 6.55 6.61 6.77 6.14 5.95 5.93 7.26 6.66
8 5.27 5.31 6.43 5.23 5.09 5.17 6.05 5.76
9 5.47 5.46 6.94 5.52 6.30 6.15 6.19 6.17
10 5.25 5.19 5.96 5.40 5.65 5.55 5.94 5.86
11 5.00 4.89 5.99 5.25 5.49 5.23 5.42 5.43
12 4.93 4.84 6.37 4.89 4.87 4.84 5.62 5.36
13 4.41 4.33 5.64 4.81 4.43 4.60 5.06 4.86
14 4.43 4.42 6.24 5.12 4.08 4.01 5.05 4.70
15 4.89 4.71 5.59 4.70 3.99 3.83 5.30 5.03
16 5.06 4.86 5.77 5.06 4.20 3.93 5.41 5.13
17 4.89 4.78 5.68 5.64 4.23 3.88 5.58 4.99
18 4.56 4.53 5.06 5.11 4.00 3.35 5.24 4.61
19 4.41 4.22 4.83 6.98 4.15 3.72 5.32 4.64
20 4.68 4.48 4.83 6.63 4.13 3.87 5.75 5.03
21 4.57 4.62 5.05 4.77 3.96 3.85 5.80 4.85
22 4.98 5.30 7.62 5.18 4.41 4.48 6.51 5.27
23 5.09 5.36 8.25 5.20 4.38 4.49 6.62 5.46
24 4.98 5.10 6.27 5.52 4.70 4.94 6.58 5.78
Table 3.6 describes the day-ahead MAPE on an hourly interval. The hourly MAPE is
generally higher during the early morning hours. This is a time of lower building demand (see
Figure 3.3) so any error in the forecast will reflect more heavily on a percentage metric. During
a number of these early morning time intervals the MLR and SVM models actually perform
slightly better without the building measurements included. This is in contrast to the neural
network and additive models, and also to how all of the models generally behave over the rest of
the day.
The hourly MAPE values tend to drop as the day progresses and the building demand
increases considerably (Figure 3.3). More importantly, during this time window the models
which include building measurements improve the forecast in nearly all intervals, for all
42
methods, and by a greater amount. This is important for DSM planning purposes. This is the
time window when building demand resources will most often be dispatched and having a better
estimate of the building load is essential. Figure 3.7 - Figure 3.10 below show the hourly MAPE
for each forecasting method to visualize the results from Table 3.6.
Figure 3.7. Hourly MAPE for all forecasts generated by the MLR models
Figure 3.8. Hourly MAPE for all forecasts generated by the NN models
Figure 3.9. Hourly MAPE for all forecasts generated by the SVM models
Figure 3.10. Hourly MAPE for all forecasts generated by the SAM models
It is also helpful to visualize the range of MAPE results on an hourly basis. The box plots in Figure 3.11 -
Figure 3.14 look at the hourly MAPE across all out-of-sample forecasts. Each box plot displays
a notched mark at the median value with the box edges showing the 25th and 75th percentiles
and the whiskers capturing the most extreme results. These plots help identify during which
intervals of the day the models perform less consistently (in terms of MAPE values) and how this consistency changes when building measurements are included.
Figure 3.11. Box plots of hourly MAPE results. (top) MLR without building measurements (bottom) MLR
with building measurements
Figure 3.12. Box plots of hourly MAPE results. (top) NN without building measurements (bottom) NN with
building measurements
Figure 3.13. Box plots of hourly MAPE results. (top) SVM without building measurements (bottom) SVM
with building measurements
Figure 3.14. Box plots of hourly MAPE results. (top) SAM without building measurements (bottom) SAM
with building measurements
The plots in Figure 3.11 - Figure 3.14 reinforce the results from Table 3.6. Including the
building measurements generally improves the forecast performance, with a couple of
exceptions. Again, the hourly MAPE performance is more variable in the early morning hours
for all models and improves later in the day. Of the four methods studied in this work, the
neural network model overall performance is the most greatly improved with building
measurements included in the model. This is especially true in terms of reducing the day-ahead
forecast variability. The SVM models are generally better when BMS measurements are
included but far less consistently compared to the other approaches. It would be interesting in
future work to try to identify what methods can be employed to optimize how building-specific
measurements can be included in forecast models. The variability in Figure 3.11 - Figure 3.14 underscores this point.
While MAPE is the most commonly applied metric for quantifying forecast accuracy, it
is also helpful to look at other error metrics. Table 3.7 shows the hourly bias for all 4 methods
across all 116 day-ahead forecasts. As stated earlier, the ideal bias value is zero.
Table 3.7. Hourly bias (kW) across all out-of-sample forecasts
MLR NN SVM SAM
Hour No Bldg Bldg No Bldg Bldg No Bldg Bldg No Bldg Bldg
1 ‐2.53 ‐2.97 ‐7.15 ‐8.06 ‐5.50 ‐6.23 ‐1.49 ‐1.02
2 ‐2.67 ‐3.08 ‐7.61 0.59 ‐8.94 ‐9.06 ‐1.20 ‐1.07
3 ‐2.03 ‐1.01 0.04 4.53 0.20 ‐1.17 2.10 ‐0.35
4 ‐3.26 ‐0.66 ‐12.57 ‐5.76 ‐3.47 ‐3.98 2.59 ‐2.41
5 ‐4.85 ‐2.93 ‐7.32 ‐1.93 ‐1.89 ‐2.64 1.17 ‐2.78
6 ‐6.50 ‐6.57 ‐2.65 2.93 ‐5.03 ‐7.63 ‐2.78 ‐4.72
7 0.77 0.59 1.75 3.08 ‐1.28 ‐1.94 2.35 1.08
8 ‐0.56 ‐0.70 ‐0.30 1.26 ‐4.42 ‐4.95 0.87 ‐0.50
9 ‐1.73 ‐1.69 ‐3.85 ‐2.16 ‐5.64 ‐5.78 0.57 ‐0.81
10 ‐0.78 ‐0.64 ‐0.57 2.27 ‐4.22 ‐4.73 1.80 0.98
11 ‐2.10 ‐2.25 ‐0.81 ‐0.19 ‐5.42 ‐5.35 0.56 0.28
12 1.08 0.74 ‐6.88 2.42 ‐3.68 ‐4.34 3.48 3.01
13 0.91 0.11 ‐1.67 3.42 ‐3.63 ‐5.39 2.70 2.51
14 1.81 0.67 ‐3.39 3.84 ‐2.75 ‐3.83 3.69 3.27
15 3.40 1.67 0.46 3.10 0.38 ‐1.50 4.80 5.17
16 4.24 2.15 ‐2.79 6.63 0.98 ‐0.71 5.45 5.98
17 3.70 1.32 ‐3.33 8.51 2.06 0.47 4.63 5.26
18 2.76 0.23 ‐3.50 6.68 3.26 0.53 3.21 3.63
19 4.10 1.74 ‐2.06 12.39 5.20 3.29 3.79 4.77
20 3.65 1.79 ‐1.41 10.41 3.82 2.67 3.06 4.31
21 0.76 ‐1.00 ‐4.29 3.58 ‐0.39 ‐1.00 ‐0.14 1.29
22 ‐2.17 ‐4.15 ‐12.89 3.17 ‐3.07 ‐3.85 ‐3.57 ‐1.55
23 ‐3.89 ‐4.91 ‐13.94 ‐2.91 ‐2.25 ‐3.35 ‐4.35 ‐2.99
24 ‐3.71 ‐4.46 ‐9.54 2.83 ‐3.00 ‐4.76 ‐3.82 ‐2.46
Figure 3.15 - Figure 3.18 below show the hourly bias for each forecasting method to visualize the results from Table 3.7.
Figure 3.15. Hourly Bias for all forecasts generated by the MLR models
Figure 3.16. Hourly Bias for all forecasts generated by the NN models
Figure 3.17. Hourly Bias for all forecasts generated by the SVM models
Figure 3.18. Hourly Bias for all forecasts generated by the SAM models
These results do not provide any conclusive evidence that building measurements affect
the bias of the forecasts. In fact, the shape of the bias as it evolves over time is basically the
same whether building measurements are included or not. The only exception to this is a few
intervals for the neural network models. This implies that the bias is a function of the model
building process more so than a function of the input variables. It is also interesting that the
jump in bias for the NN model with building measurements seen between 18:00 and 21:00 in
Figure 3.16 corresponds to the diminished accuracy during the same time period seen in the hourly MAPE results of Table 3.6.
The next metric presented in this section is the standard deviation of the forecast errors
(SDE). This metric is very important for understanding the consistency of the forecast
performance. Table 3.8 shows the hourly SDE for all 4 methods across all 116 day-ahead
forecasts.
Figure 3.19 - Figure 3.22 below show the hourly SDE for each forecasting method to visualize the results from Table 3.8.
Figure 3.19. Hourly standard deviation of the forecast error for all forecasts generated by the MLR models
Figure 3.20. Hourly standard deviation of the forecast error for all forecasts generated by the NN models
Figure 3.21. Hourly standard deviation of the forecast error for all forecasts generated by the SVM models
Figure 3.22. Hourly standard deviation of the forecast error for all forecasts generated by the SAM models
Unlike with the bias, the building measurements have a definite overall positive impact
on the SDE. This improvement is most pronounced in the NN models where there is a
significant decrease over the majority of the day. The SDE for the SVM and SAM models also
decreases but more substantially later in the day than during the morning hours. Only the MLR models show little change in SDE when building measurements are included.
The next step is to analyze the shape of the error distributions. As an example, the
following two histograms show the forecast errors during the 09:00-10:00 time window for the
neural network models. The model in Figure 3.23 does not include building measurements, while the model in Figure 3.24 does.
Figure 3.23. Empirical distribution of forecast errors for the NN model without building measurements
included (09:00-10:00)
Figure 3.24. Empirical distribution of forecast errors for the NN model with building measurements
included (09:00-10:00)
The overall change in shape of the distribution is noticeable: the tail ends are reduced
in both directions in Figure 3.24, as are a number of the peaks.
distributions for each forecast model are located in Appendix A2. It is not necessary to show all
the distributions here as the characteristics of these distributions will be measured in several
ways below.
First, the error margin plots corresponding to two error thresholds are shown in Figure
3.25 - Figure 3.26. The definition of “unacceptable error” is subjective and must be determined
on a case by case basis. This study will look at the 10kW and 30kW thresholds. These plots are shown for each forecasting method below.
Figure 3.25. Frequency of forecast errors within a ±10 kW error margin for the MLR models
Figure 3.26. Frequency of forecast errors within a ±30 kW error margin for the MLR models
Figure 3.27. Frequency of forecast errors within a ±10 kW error margin for the NN models
Figure 3.28. Frequency of forecast errors within a ±30 kW error margin for the NN models
Figure 3.29. Frequency of forecast errors within a ±10 kW error margin for the SVM models
Figure 3.30. Frequency of forecast errors within a ±30 kW error margin for the SVM models
Figure 3.31. Frequency of forecast errors within a ±10 kW error margin for the SAM models
Figure 3.32. Frequency of forecast errors within a ±30 kW error margin for the SAM models
The error margin plots reflect similar performance as in the previous tests. The MLR
models perform nearly identically regardless of the inclusion of building measurements. The NN models with building measurements show a clear improvement in
frequency at both the 10kW and 30kW thresholds. One observation of note is a sharp decrease
in frequency at the 10kW threshold in Figure 3.27, corresponding to the jump in bias for the NN
model with building measurements seen previously between 18:00 and 21:00. Again, the SVM
and SAM models also perform better with building measurements, but more substantially later in the day than during the morning hours.
The next metric presented in this section is the skewness of the forecast error
distributions. Table 3.9 shows the hourly skewness for all 4 methods across all 116 day-ahead
forecasts.
Table 3.9. Hourly skewness across all out-of-sample forecasts
MLR NN SVM SAM
Hour No Bldg Bldg No Bldg Bldg No Bldg Bldg No Bldg Bldg
1 -0.84 -0.74 -0.97 -0.84 -0.75 -0.90 0.41 -0.41
2 -0.95 -0.89 -0.99 -0.92 -0.42 -0.64 0.53 -0.20
3 0.00 -0.25 -0.39 -0.04 -0.01 -0.26 -0.14 -0.39
4 -0.13 -0.05 -1.19 -0.22 -0.69 -0.75 -0.02 -0.60
5 0.03 0.30 -1.02 0.40 0.10 -0.10 0.39 -0.59
6 -0.11 -0.12 -0.16 -0.46 0.61 0.62 0.26 -0.46
7 -0.55 -0.54 -0.10 0.11 -0.20 -0.25 0.06 -0.44
8 -0.38 -0.33 0.30 -0.03 -0.02 -0.12 0.30 -0.52
9 0.26 0.31 -0.32 0.27 -0.08 -0.22 0.49 -0.11
10 0.15 0.20 0.51 0.06 -0.02 -0.01 0.17 -0.32
11 -0.43 -0.41 -0.57 -0.70 -1.18 -1.22 -0.10 -0.84
12 0.51 0.59 0.33 -0.18 -0.14 -0.04 1.23 -0.18
13 -0.23 -0.15 -0.22 -0.53 -0.71 -0.61 0.31 -0.32
14 -0.12 -0.03 -0.02 -0.47 -0.47 -0.55 0.33 -0.09
15 0.03 0.15 -0.13 -0.22 -0.34 -0.59 0.50 0.24
16 -0.07 0.10 0.01 -0.38 -0.71 -0.65 0.47 0.13
17 0.04 0.15 -0.14 -0.45 -0.47 -0.52 0.49 0.23
18 -0.03 0.16 -0.20 -0.29 0.04 -0.26 0.56 0.08
19 -0.08 0.02 0.06 -0.72 0.12 -0.11 0.27 -0.14
20 -0.91 -0.78 -0.84 -1.65 -0.74 -1.11 -0.36 -0.75
21 -0.66 -0.54 -0.85 -0.99 -0.70 -0.99 -0.21 -0.54
22 -0.50 -0.35 -0.51 -1.00 -0.40 -0.68 0.02 -0.37
23 -0.46 -0.29 -0.19 -1.20 -0.35 -0.53 0.11 -0.24
24 -0.66 -0.54 -0.89 -1.01 -0.12 -0.37 0.06 -0.10
Figure 3.33 - Figure 3.36 below show the hourly skew for each forecasting method to visualize the results from Table 3.9.
Figure 3.33. Hourly skew of the forecast error for all forecasts generated by the MLR models
Figure 3.34. Hourly skew of the forecast error for all forecasts generated by the NN models
Figure 3.35. Hourly skew of the forecast error for all forecasts generated by the SVM models
Figure 3.36. Hourly skew of the forecast error for all forecasts generated by the SAM models
Much like the bias measurement, there is little that can be drawn from the skewness of
the error distributions. The evolution of skewness over the course of the day is roughly the same
whether or not the forecast models included building measurements. One interesting observation
from Figure 3.36 is that the SAM model without building measurements tends to overestimate
the load while the SAM model with building measurements tends to underestimate the load, both for the majority of the day.
The last metric presented in this section is the kurtosis of the forecast error distributions.
If the errors are normally distributed then the kurtosis is equal to 3. Table 3.10 shows the hourly
kurtosis for all 4 methods across all 116 day-ahead forecasts.
Figure 3.37 - Figure 3.40 below show the hourly kurtosis for each forecasting method to visualize the results from Table 3.10.
Figure 3.37. Hourly kurtosis of the forecast error for all forecasts generated by the MLR models
Figure 3.38. Hourly kurtosis of the forecast error for all forecasts generated by the NN models
Figure 3.39. Hourly kurtosis of the forecast error for all forecasts generated by the SVM models
Figure 3.40. Hourly kurtosis of the forecast error for all forecasts generated by the SAM models
For the MLR and SVM models the kurtosis of the error distributions is nearly identical
throughout the day whether models include building measurements or not. The only method
where building measurements tend to make the errors more “normal” is the SAM method.
3.4.6. REMARKS
There are a number of observations that can be made from the analysis above. Most
prominently, there is a clear trend in forecast performance throughout the day. Early in the day
the forecast error is higher and more variable. Heading into the afternoon and evening hours the
forecasts become more accurate and more consistent. This is true in general for all
methods and all models. The impact of building measurements on both the accuracy and
consistency also follows this performance trend: improvements are smaller during the morning
hours and more pronounced later on. Knowing the operational patterns of the building can help explain this behavior.
For energy efficiency reasons, large commercial HVAC systems operate under multiple
control schemes [88]. The Hagerty Library operates with a “nighttime setback” where the
building setpoint is reset (e.g. – raised) from 02:00-06:00 which is considered an “unoccupied”
time period. Referring back to Figure 3.3, this time window also corresponds to the low point in
the daily load profile. This low level of demand approaches the base load of the building which
is almost completely thermally insensitive [89]. It stands to reason that including BMS
measurements tied directly to the HVAC system will have less of an impact on the forecasts
during this period. Conversely, later in the day the building HVAC equipment is required to
maintain the building internal temperature in the face of much higher ambient temperature and
increased internal heat gains related to occupancy. This increased cooling load corresponds to
the period when the building measurements carry the most predictive information. Looking at the individual methods applied to building-specific forecasting applications, several interesting observations can be made about the
performance of each model. The characteristics of the MLR forecasts are impacted less heavily
by the inclusion of building measurements than the other methods. It is, however, one of the
most accurate and consistent models overall. In contrast, the NN forecasts when the models
include building measurements are the most greatly improved by nearly every metric observed.
The SAM models also see a similar level of improvement. It makes sense that the only linear
model of those tested (MLR) would see the smallest impact from the addition of these variables.
The good overall MLR model forecast performance is a result of well-developed models. The
nonlinear models are better able to capture the underlying relationship between demand and the
building measurements but they are more complicated to build. It will be an area of future
work to fine tune the model building approach to try to determine if there is an ideal model for
building-specific applications. The examination of several models in this thesis is an important step in that direction.
INTERVALS
4.1. OVERVIEW
The previous chapter focused on the generation of point forecasts. No matter how
accurate the selected method is, point forecasts will always be incorrect to a certain degree. The
estimation of forecasting distributions, and the prediction intervals that are obtained from them,
can provide information that is more relevant to risk assessment for the end-user than simple
point forecasts. This Chapter will apply a nonparametric method of estimating prediction
intervals for building-specific load forecasting. The same case study from Chapter 3 will provide
the basis for assessing the impact of building measurements on the resulting prediction intervals.
Traditionally, load forecasts are presented in the form of point forecasts. This form has
the advantage of being very easy to interpret: each number corresponds directly to an estimate of
the load at a future time. The majority of the forecasting literature focuses on refining the
models and methods that generate these point forecasts. However, one shortcoming of point
forecasts is the lack of any information about the forecast uncertainty. The error analysis
performed in Section 3.4.5 can describe the historical performance of a given method but cannot
fully describe the uncertainty related to a given prediction. Conservative demand side planning
activities require some measure of understanding of the forecast uncertainty. This uncertainty
can be expressed in the form of a prediction interval, also referred to as an interval forecast. The
prediction interval is a range in which a future observation is expected to lie with a pre-assigned
probability [90].
It bears mentioning here that there is some confusion surrounding the use of
confidence intervals versus prediction intervals. While these quantities are often used
interchangeably in the literature, they do not describe the same thing. A confidence interval (CI)
is an interval within which an estimated quantity, such as the fitted regression line, lies with a specified probability. A prediction
interval (PI) is an interval associated with future observations of a random variable, where said
observations lie within the interval with a specified probability. Figure 4.1 below shows an
example of the difference between a 95% CI and 95% PI for a simple regression example.
Figure 4.1. Example of the difference between a 95% confidence interval (CI) and 95% prediction interval
(PI)
The confidence interval tells us with a 95% probability that the fitted line lies within the
interval. The prediction interval tells us with a 95% probability that the data points will be
within the given interval. Figure 4.1 shows the prediction interval encloses the confidence
interval and it was proven in [92] this is necessarily the case. This thesis is interested in the
accuracy of building-specific forecasts in relation to the actual metered load. Therefore this
71
thesis will look at prediction intervals rather than confidence intervals to assess the impact of
Although traditionally the forecasting community has focused on point forecasts, there
has been a more recent push towards methods that can provide valuable information about the
forecast uncertainty. Uncertainty estimates are a necessary input to a number of power systems
analyses such as unit commitment [93] [94] and reliability planning [95] [96] [97]. This type of
forecasting research falls into the area broadly classified as probabilistic forecasting. In [98], a
very recent and thorough review of probabilistic load forecasting (PLF) is presented. Related
review papers have been written on probabilistic wind forecasting [99] and probabilistic
electricity price forecasting [100]. Similar concepts are shared among all three applications.
As defined in [98] there are a number of techniques employed in the PLF problem. The
PLF process addresses the stochastic nature of the problem by focusing on one of three parts: the
input, the model, or the output. Approaches that focus on the input are concerned with scenario
generation with simulated predictors. These types of works are most often concerned with long-
term forecasting (LTF). Modeling techniques for producing PLFs either extend existing point
forecasting models or construct new models that directly produce
probabilistic forecasts. The last category includes PLF methods that post-process the outputs of
point forecasts, for example by producing density functions of the point forecast residuals.
This thesis will employ a modified version of the method from [75] where forecasting
residuals are simulated via a modified bootstrap method in order to produce density functions of
the electrical demand. The details of this procedure are laid out in the following section. There
are several reasons why this method is chosen. By focusing on the out-of-sample test residuals
this method takes advantage of the existing accurate point forecasting methods developed in the
previous chapter. Additionally this method assumes no underlying distribution of the forecast
errors. Typical parametric methods assume an underlying, often Gaussian, distribution. It has
been shown in [101] [75] and many other works that this assumption is usually incorrect. Furthermore,
for methods that rely on forecasts of predictor variables, the exact distribution of the load
forecast depends on the distribution of the predictor variables. This makes such distributions
difficult to characterize analytically. Instead, the out-of-sample forecast residuals are
used to construct empirical forecasting distributions, and the desired prediction intervals can be
obtained from these distributions. The next section will describe what defines a prediction
interval applied to the load forecasting problem and the details of the chosen method.
The formal definition of a forecast prediction interval estimated at time t looking k steps
ahead is given in (4.1):

P\left( p_{t+k} \in \hat{I}_{t+k|t} \right) = P\left( p_{t+k} \in \left[ \hat{L}_{t+k|t}, \hat{U}_{t+k|t} \right] \right) = 1 - \alpha    (4.1)
where \hat{L}_{t+k|t} and \hat{U}_{t+k|t} are the lower and upper bounds of the interval \hat{I}_{t+k|t} and (1-\alpha) is the pre-assigned probability.
A prediction interval is defined by its lower and upper bounds. The probability (1-α)
within which the observed value is expected to lie is called the nominal coverage rate [90].
This term is preferred to confidence level which is often used in the literature but may lead to
confusion with the confidence interval. The lower and upper bounds are defined in (4.2) - (4.3)
below.
\hat{L}_{t+k|t} = \hat{q}_{t+k|t}^{\,\alpha/2}, \qquad P\left( p_{t+k} \le \hat{L}_{t+k|t} \right) = \alpha/2    (4.2)

\hat{U}_{t+k|t} = \hat{q}_{t+k|t}^{\,1-\alpha/2}, \qquad P\left( p_{t+k} \le \hat{U}_{t+k|t} \right) = 1 - \alpha/2    (4.3)
The boundaries correspond to the quantiles \hat{q}_{t+k|t} of the estimated prediction distribution
\hat{F}^p_{t+k|t}. This defines a central prediction interval centered on the median of \hat{F}^p_{t+k|t}. If a Gaussian
distribution is assumed for \hat{F}^p_{t+k|t} then a central prediction interval is sufficient since the mean and
median are the same, and thus the point forecast will lie within the prediction interval. However,
for an asymmetric distribution the mean and median can differ significantly. For small nominal
coverage rates the point forecasts may not even lie within the prediction interval. Therefore,
when the distribution is expected to be non-Gaussian, prediction intervals can be constructed that are instead centered on the point forecast, as in (4.4):
P\left( p_{t+k} \in \left[ \hat{L}_{t+k|t}, \hat{p}_{t+k|t} \right] \right) = P\left( p_{t+k} \in \left[ \hat{p}_{t+k|t}, \hat{U}_{t+k|t} \right] \right) = \frac{1 - \alpha}{2}    (4.4)
The chosen method relies on a collection of recent k-step ahead forecast errors denoted S_{t,k}. It is assumed that these error samples are representative
of the true predictive distribution F^p_{t+k|t}. For a well-developed point forecasting method this is a
reasonable assumption. Thus there is an implicit assumption that the future uncertainty can be
expressed based on the recent performance of the point prediction method. To find an estimate
\hat{F}^p_{t+k|t} of the distribution, the cumulative distribution function is introduced in (4.5).
\hat{G}_{t,k}(\varepsilon) = \frac{1}{n} \, \#\left\{ \varepsilon_i \in S_{t,k} \;\middle|\; \varepsilon_i \le \varepsilon \right\}    (4.5)

where the cumulative distribution function gives the fraction of collected errors \varepsilon_i less than or equal to a given value \varepsilon.
Recall from (4.2) and (4.3) that the prediction interval bounds with nominal coverage rate
(1-\alpha) correspond to the quantiles of the estimated prediction distribution \hat{F}^p_{t+k|t}. Thus the final
form of the prediction interval lower and upper bounds are defined in (4.6) and (4.7) below:

\hat{L}_{t+k|t} = \hat{p}_{t+k|t} + \hat{G}_{t,k}^{-1}(\alpha/2)    (4.6)

\hat{U}_{t+k|t} = \hat{p}_{t+k|t} + \hat{G}_{t,k}^{-1}(1 - \alpha/2)    (4.7)

where \hat{G}_{t,k}^{-1} is the quantile function. Effectively, the prediction interval bounds are
calculated by "dressing" the point predictions with estimates of the prediction distribution.
In order to construct the prediction intervals as defined in (4.6) and (4.7), this thesis will
generally employ the modified bootstrap method from [75] as stated previously. The main idea
is to use bootstrapped forecast residuals to build empirical forecasting distributions from which prediction intervals will be obtained. The steps for this process are as follows.
In practice, a rolling window of forecasts and corresponding measured load values would
be collected. For this analysis the out-of-sample load forecasts that are described in section 3.4.3
are used. Of these 116 day-ahead forecasts, the first 3 months (May-July) are collected to
estimate the prediction intervals. The August forecasts are held out to evaluate the prediction
intervals. Data from the Chapter 3 training set (2011 and 2012 data) cannot be used, in order to avoid underestimating the out-of-sample forecast errors.
The residuals are calculated as per equation (3.16). This forms the collection of residuals S_{t,k}.
The standard method for bootstrapping time series data is the block bootstrap [102]. This
process involves taking random segments of the historical time series and rearranging them to
form new artificial series. The length of each random segment (or block) must be long enough to
capture any serial correlations in the data but short enough to allow for a large number of
possible simulated series. At this step of the process this thesis will deviate slightly from the
procedure used in [75]. Rather than use a single block bootstrap, a double seasonal block
bootstrap is used. Seasonality in this case refers to periodic correlations in the data, such as
annual or daily patterns. This method was introduced in [102] to address the double seasonality
(annual and daily) contained in historical temperature observations. Building loads express
similar double seasonality, both daily and hourly, which makes this approach more appropriate.
The difference in how the residual time series are segmented is highlighted in Figure 4.2. The
diagram (a) shows a single seasonal block segmentation and (b) the double seasonal block
segmentation.
Figure 4.2. Seasonal block segmentation (a) and double seasonal block segmentation (b)
The original residual time series is double seasonal bootstrapped to obtain simulated
forecast errors. The rearranged time series are shown in Figure 4.3. The diagram (a) shows an
example of a single seasonal block bootstrap and (b) the double seasonal block bootstrap. Note
for this work only (b) is implemented. A total of 500 bootstrap replications are made.
Figure 4.3. Block bootstrapped time series (a) and double seasonal block bootstrapped time series (b)
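A simplified sketch of this idea is given below, assuming residuals arranged as a days-by-intervals matrix: each block keeps its within-day position but is drawn from a randomly selected day. This is only one reading of the procedure in [75] [102], with synthetic residuals and an arbitrary block length:

# Simplified double seasonal block bootstrap sketch (illustrative only):
# residual blocks keep their within-day position but are drawn from randomly
# selected days, preserving the daily error pattern.
set.seed(1)
n_days <- 90; n_int <- 288; block <- 12                            # 12 x 5-min = 1-hour blocks
resid_mat <- matrix(rnorm(n_days * n_int, 0, 8), nrow = n_days)    # synthetic residuals

bootstrap_day <- function(resid_mat, block) {
  n_days <- nrow(resid_mat); n_int <- ncol(resid_mat)
  sim <- numeric(n_int)
  for (start in seq(1, n_int, by = block)) {
    idx <- start:min(start + block - 1, n_int)
    sim[idx] <- resid_mat[sample(n_days, 1), idx]                  # same time of day, random day
  }
  sim
}

replications <- t(replicate(500, bootstrap_day(resid_mat, block))) # 500 simulated error days
dim(replications)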
Using the accumulated simulated forecast errors, empirical forecasting distributions can
be constructed. Distributions for each hour of the day are estimated via kernel density
estimation. Using the definition of the lower and upper bounds of the prediction interval from
(4.6) and (4.7), the prediction interval can be determined from the constructed distributions. The
nominal coverage rate 1- α is varied in order to evaluate the forecasts over a range of coverages.
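Assuming a collection of simulated errors such as those produced above, the bounds in (4.6) and (4.7) could be obtained roughly as follows; empirical quantiles are used here in place of explicit kernel density estimation, and all names are illustrative:

# Prediction interval bounds per (4.6) - (4.7) for one hour of the day,
# built from simulated forecast errors and a point forecast 'phat_hour'.
pi_bounds <- function(sim_errors, phat_hour, alpha) {
  q <- quantile(sim_errors, probs = c(alpha / 2, 1 - alpha / 2), names = FALSE)
  c(lower = phat_hour + q[1], upper = phat_hour + q[2])
}

set.seed(1)
sim_errors <- rnorm(5000, 0, 9)                       # stand-in for bootstrapped hourly errors
pi_bounds(sim_errors, phat_hour = 180, alpha = 0.25)  # 75% nominal coverage interval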
It is relatively simple to evaluate a point forecast: the size of the deviation between the
estimated value and the true value determines whether or not a forecast is acceptable. Evaluating
probabilistic forecasts is far more difficult. Research into establishing methods for evaluating
probabilistic load forecasts is almost nonexistent [98]. There are a few commonly used attributes
for probabilistic wind forecasting evaluation, which have been borrowed from the meteorological
forecasting community, that are of interest to the probabilistic load forecasting problem and will be applied here.
Three main attributes are used to characterize the prediction intervals developed using the
procedure laid out in the previous section. These attributes are called reliability, sharpness, and
resolution. Reliability refers to how close the predicted distribution is to the actual. In other
words it is a comparison of the empirical coverage to the nominal coverage rate 1-α. To check
the reliability of a given prediction interval, it is first necessary to introduce an indicator variable:

\xi_{t,k} = \begin{cases} 1, & \text{if } p_{t+k} \in \hat{I}_{t+k|t} \\ 0, & \text{otherwise} \end{cases}    (4.8)

If the true power value p_{t+k} lies in the prediction interval for a given α, then the indicator
value equals 1; otherwise it is zero. The collection of "hits" and "misses" over the interval of
interest are used to assess the reliability of the estimated prediction intervals.
n_{k,1} = \sum_{t=1}^{N_T} \xi_{t,k}    (4.9)

n_{k,0} = \sum_{t=1}^{N_T} \left( 1 - \xi_{t,k} \right)    (4.10)

where n_{k,0} is the number of "misses" over the interval of interest, n_{k,1} is the number of "hits"
over the interval of interest, and N_T is the number of realizations over the interval of interest.
Given the above expressions, the reliability of a prediction interval is obtained using
(4.11).
\hat{a}_k = \frac{1}{N_T} \sum_{t=1}^{N_T} \xi_{t,k} = \frac{n_{k,1}}{n_{k,0} + n_{k,1}}    (4.11)
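A small sketch of this calculation, with synthetic observations and toy interval bounds (not results from this study), is shown below:

# Empirical coverage (reliability) per (4.8) - (4.11): the fraction of observed
# values falling inside their prediction intervals.
empirical_coverage <- function(p, lower, upper) {
  hits <- as.numeric(p >= lower & p <= upper)   # indicator variable of (4.8)
  mean(hits)                                    # a-hat of (4.11)
}

set.seed(1)
p     <- runif(1000, 100, 250)                  # synthetic observed demand
phat  <- p + rnorm(1000, 0, 10)                 # hypothetical point forecasts
empirical_coverage(p, lower = phat - 15, upper = phat + 15)   # compare to nominal 1 - alpha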
Sharpness and resolution are concerned with describing the shape of the forecast
distributions. Sharpness refers to how tightly the estimated distribution covers the actual one.
Narrower distributions are preferable from a decision making context (provided they still
maintain the desirable reliability). For example, if the lower and upper bounds of the prediction
interval are very close to the predicted 99% interval then the interval is said to be very sharp.
Denoting the prediction interval size by \delta_{t,k} = \hat{U}_{t+k|t} - \hat{L}_{t+k|t}    (4.12), the sharpness is expressed as the mean interval size:

\bar{\delta}_k = \frac{1}{N_T} \sum_{t=1}^{N_T} \delta_{t,k}    (4.13)
As a complement to the sharpness (the mean prediction interval size) the variation of the
interval size over time (e.g. – resolution) can be expressed as the standard deviation of the
interval size for a future time k and a given nominal coverage 1-α.
\sigma_k = \sqrt{ \frac{1}{N_T - 1} \sum_{t=1}^{N_T} \left( \delta_{t,k} - \bar{\delta}_k \right)^2 }    (4.14)
When comparing rival approaches it is beneficial to have a single numerical value that includes all relevant
information about the probabilistic forecast. Using an appropriate skill score provides such a
comprehensive measure. As noted previously, this
thesis is focused on comparing forecast performance with and without building measurements
included in the models so such a metric will be a helpful tool. In addition to providing a
convenient measure for comparison, a skill score presented in tandem with observations of the
reliability allows for the assessment of both the sharpness and resolution without having to
calculate these values separately [90]. A separate analysis of the reliability is required to divorce
its influence on the skill score from that of the sharpness and resolution.
The use of skill scores for this purpose is extremely limited in the load forecasting literature. The pinball loss function was used as a
criterion in the GEFCom 2014 competition [104]. This skill score is formulated specifically to
assess quantile regression forecasts. More appropriate for this work is the Winkler score [105]
shown in (4.15), which has been used for evaluating wind power forecast prediction intervals in
[90] and for probabilistic load forecasting in a slightly modified form in [106]. It can be seen in
(4.15) that the Winkler score will reward narrow prediction intervals while penalizing observations that fall outside the interval.

Sc_{t,k} = \begin{cases} -2\,\delta_{t,k} - 4\left( \hat{L}_{t+k|t} - p_{t+k} \right), & \text{if } p_{t+k} < \hat{L}_{t+k|t} \\ -2\,\delta_{t,k}, & \text{if } p_{t+k} \in \hat{I}_{t+k|t} \\ -2\,\delta_{t,k} - 4\left( p_{t+k} - \hat{U}_{t+k|t} \right), & \text{if } p_{t+k} > \hat{U}_{t+k|t} \end{cases}    (4.15)
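A sketch of this score, following the reconstruction of (4.15) above and using synthetic values, could be computed as follows:

# Winkler-type skill score per (4.15): narrow intervals are rewarded and
# observations falling outside the interval are penalized (values are <= 0).
winkler_score <- function(p, lower, upper) {
  delta <- upper - lower                               # interval width
  score <- -2 * delta
  score[p < lower] <- score[p < lower] - 4 * (lower[p < lower] - p[p < lower])
  score[p > upper] <- score[p > upper] - 4 * (p[p > upper] - upper[p > upper])
  mean(score)                                          # average over the interval of interest
}

set.seed(1)
p    <- runif(500, 100, 250)                           # synthetic observed demand
phat <- p + rnorm(500, 0, 10)                          # hypothetical point forecasts
winkler_score(p, lower = phat - 12, upper = phat + 12)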
To evaluate the prediction intervals, this thesis will use the following approach. First, the reliability will be calculated. Results will be presented as
reliability diagrams. These plots will show the deviations from the nominal coverage as a
function of the nominal coverage, indicating whether the given method over or underestimates
the uncertainty. In addition, the skill score defined in (4.15) will be calculated and presented
hourly for a range of nominal coverage rates. This analysis will again focus on evaluating the
impact to the forecast when building measurements are included in the forecast models.
Between the reliability and skill score metrics, a comprehensive comparison of the forecast methods can be made.
4.5. RESULTS
The reliability results for the August 2014 test data are presented first. To demonstrate the
impact of building measurements on reliability, reliability diagrams are presented in Figure 4.5 -
Figure 4.8.
[Figure 4.5 - Figure 4.8. Reliability diagrams (empirical coverage (%) vs. required probability (%)) for each forecasting method, with and without building measurements.]
For the above reliability diagrams, the x-axis gives the required probability (i.e. the
nominal coverage rate of the prediction intervals). The y-axis shows the empirical coverage
calculated from (4.11). The deviation from the dashed blue line represents the deviation from
the ‘perfect reliability’ situation for which the empirical coverage would equal the nominal
one. For example a 1% deviation for a required 50% coverage rate means the empirical
coverage rate is 51%. These plots indicate a systematic overestimation of the forecast
uncertainty across all methods. There is also a general trend where the deviation is greater for
mid-range coverage rates. Inclusion of building-specific variables in the models does not stray
too much from these observed trends. However, for all but the MLR models, the inclusion of
building-specific measurements drives the reliability closer to the nominal coverage. This effect
is most pronounced for the NN and SAM models. Table 4.1 below quantifies the deviation in % between the empirical coverage and the nominal coverage rate.
Table 4.1. % Deviation in empirical coverage compared to the nominal coverage rate
MLR NN SVM SAM
Coverage No Bldg Bldg No Bldg Bldg No Bldg Bldg No Bldg Bldg
5% 3.92 4.06 2.72 3.12 4.33 4.19 4.06 4.06
10% 7.31 7.31 5.43 4.76 7.31 7.31 6.37 7.04
15% 9.89 10.03 5.59 5.05 8.68 8.68 8.68 9.49
20% 11.26 11.26 6.56 5.35 9.25 8.98 10.86 10.73
25% 12.50 12.50 7.53 5.24 11.16 9.95 13.44 11.96
30% 12.80 12.66 7.02 5.27 12.12 11.45 15.35 11.85
35% 12.96 13.49 8.25 5.43 11.88 10.94 15.78 12.42
40% 14.60 14.19 7.34 5.46 11.77 11.64 15.13 12.04
45% 14.76 15.03 6.69 5.48 11.26 12.20 16.51 12.07
50% 13.71 15.05 5.24 5.78 11.29 10.75 17.47 12.37
55% 14.01 14.27 4.60 4.06 9.30 9.84 19.11 12.12
60% 11.75 13.76 5.56 2.88 9.33 5.83 17.39 9.19
65% 11.64 12.04 5.86 2.37 8.28 5.59 15.00 8.28
70% 10.05 12.07 5.62 2.53 5.48 6.29 14.35 8.17
75% 7.80 9.95 3.23 2.69 5.24 4.03 11.42 6.72
80% 5.94 7.82 2.45 2.18 3.52 4.06 9.70 5.13
85% 4.49 6.24 2.47 1.26 1.40 3.15 6.91 4.35
90% 3.44 5.19 1.02 0.62 1.69 1.02 5.59 2.77
95% 1.32 2.26 0.65 0.38 0.78 0.24 3.20 1.45
The average hourly skill score for the August test data is calculated for each nominal
coverage rate. A total of 19 coverage rates are considered (5% to 95% in 5% increments). A full
set of the results is located in Appendix A2. A subset is shown here in Figure 4.9 - Figure
4.16. Each forecast method is shown for nominal coverage rates of 25% and 75%.
Figure 4.9. Average hourly skill score for the MLR models. Nominal coverage rate = 25%
Figure 4.10. Average hourly skill score for the MLR models. Nominal coverage rate = 75%
Figure 4.11. Average hourly skill score for the NN models. Nominal coverage rate = 25%
Figure 4.12. Average hourly skill score for the NN models. Nominal coverage rate = 75%
Figure 4.13. Average hourly skill score for the SVM models. Nominal coverage rate = 25%
Figure 4.14. Average hourly skill score for the SVM models. Nominal coverage rate = 75%
Figure 4.15. Average hourly skill score for the SAM models. Nominal coverage rate = 25%
Figure 4.16. Average hourly skill score for the SAM models. Nominal coverage rate = 75%
The closer to zero the skill score, the better the performance. Figure 4.9 and Figure 4.10
demonstrate that the skill of the MLR model forecasts is not sensitive to inclusion of building-
specific measurements. The other three methods, however, are measurably improved. One
notable characteristic common between the point forecast accuracy results and the skill scores
above is the drop in average performance during the morning hours compared to later in the day.
4.6. REMARKS
The probabilistic forecasts in this chapter were constructed from recent point forecast errors in order to assess the forecast uncertainty. Applying the same
method for generating probabilistic forecasts to a set of different point forecasting methods
allows for a comprehensive statistical comparison of the resulting prediction intervals. There are
a number of observations that can be made from this analysis. First, the reliability diagrams show a systematic overestimation of the forecast uncertainty. Overestimating (or underestimating as well) the uncertainty is not a bad thing in and of itself. That determination must be made by the forecaster and is subjective. However it is necessary to characterize the degree of over- or under-coverage so that such a judgment can be made. An alternative approach of evaluating reliability based on statistical hypothesis testing has been considered in [103], [107].
These tests rely on the assumption of independence in the forecasting residuals. It is widely
understood in load forecasting (as well as wind and electricity price forecasting) that this
assumption generally does not hold. Therefore the reliability as presented in this thesis provides a more appropriate empirical characterization of the prediction interval performance.
The reliability of a probabilistic forecast is similar to the bias metric for point forecasts.
It is interesting then to compare these reliability results to the bias results presented in Section
3.4.5 for point forecasts. The bias of the point forecasts followed no particular trend across each
of the four methods. This is in contrast to the reliability plots which show a distinct trend that is
common across each of the four methods. Notably for both the bias and reliability results the
inclusion of building-specific measurements had very little impact. It is reasonable to conclude then that the point forecasting method drives the error bias, while the method used to construct the prediction intervals drives the reliability.
Unlike with the reliability results, the skill score is measurably improved when building
measurements are included for all methods except the MLR forecasts. Similar to the conclusions
made in Section 3.4.5 for point forecasts, the skill of the probabilistic forecasts is more positively
impacted for the nonlinear models. As stated in Section 4.4, the combination of the reliability
and skill score metrics provides a comprehensive measure of probabilistic forecast
performance. The reliability has been shown to be driven by the method employed to construct
the probabilistic forecast. Thus the improvement in skill observed for the models including
building measurements can be attributed to these variables. This improvement in skill reflects
tighter, more consistently sized prediction intervals (i.e. – improved sharpness and resolution).
Forecasts that produce narrower intervals are more valuable in a decision making context. This
is important where building-specific load forecasts are used to assess demand side management
capabilities.
It stands to reason that the quality of the method for constructing prediction intervals is
tied to the quality of the related point forecasts themselves. The sharper the error distributions,
the higher the quality of the resulting probabilistic forecasts. It is then an area of future interest
to develop direct probabilistic forecasts of building demand, removing the reliance on point
forecasts in order to possibly improve the quality of the probabilistic forecasts. For this thesis, however, it is an important first contribution to the area of building-specific forecasting that probabilistic forecasts of building demand have been produced and evaluated, and that the value of including building measurements has been demonstrated.
5.1. OVERVIEW
In this chapter a dynamic load model for the controllable load of a building is developed.
In this work the controllable load is considered to be the HVAC chiller. The coupling of a
building’s electrical and thermal characteristics is not adequately captured using existing static
models. Therefore a dynamic exponential load model is proposed. This allows for more
accurate planning of the dispatching of load for DSM. The model in this thesis is derived from measurement-based dynamic load models used at the transmission level, adapted here to chiller data collected at Drexel University.
Traditionally in power systems, loads are represented in aggregate. Loads are grouped by
bus, or substation, and modeled as one complex power injection (for a given bus) of the form
shown in (5.1).
While this structure is convenient in that it reduces model complexity when performing
power system analysis, studies over the last 20 years have shown such static models are not
appropriate for studies where load dynamics have critical impacts [108] [109]. In terms of DSM
applications, the model in (5.1) neglects dynamic behavior and lacks the resolution required to
analyze and optimize control of individual components. An alternative approach is one where a
direct mathematical model is developed based on measurements. The advantages of this model
will be two-fold:
1) The increased granularity will provide more opportunities for customers to participate in DSM activities.
2) The dynamic response of HVAC loads due to the natural coupling of the electric demand and the thermal response of the building can be clearly understood.
At the power system level, a measurement-based approach is used in [110] [111] [112]
[113] to study the impact of voltage changes on real power and reactive power respectively.
Measured data is fit to an assumed model of the dynamic behavior. The model used in these
works is a dynamic exponential load model. In this thesis the same model will be used to
capture the thermal electrical behavior of an HVAC chiller. As noted in [110] [111] this model
was proposed as a means of capturing the unique effects of large electrical heating loads on the power system.
If the end purpose is to use this model for DSM, characterizing the electrical load response
to a control action is essential. In the case of an HVAC chiller, raising or lowering the outlet
chilled water temperature is a straightforward action a facilities manager can take to increase or
decrease the electric power drawn by the machine. Therefore, in the following sections, a
mathematical model for the HVAC chiller electrical load response to a change in outlet water temperature setpoint is developed.
Data was collected during tests performed in the fall of 2009 for a building on Drexel University's campus whose cooling load was carried on a single HVAC chiller. For this
particular chiller, the nominal outlet water temperature is 44 °F. At approximately 09:00, the temperature setpoint was raised to 51 °F and around 15:00 it was lowered back to 44 °F. These
setpoint changes were the only change made to the HVAC system. All other system operations
were allowed to operate as normal (i.e. fan speeds, damper positions, etc.). While this may not
be the most effective action in terms of a DSM activity since the overall power draw of the
system might not decrease as much as desired, this allowed us to observe the chiller electrical
response to the setpoint change isolated from any other HVAC actions. Chiller load data and
outlet water temperature was recorded. These values are shown in Figure 5.1.
Figure 5.1. Electrical load data in %FLA and chilled water outlet temperature data in °F. Data was collected over a 12-hour window.
Note that this data was recorded by the Drexel BMS which measures the chiller electrical
load in terms of percentage of full load amps (FLA). Because there are no local voltage
measurements available in the BMS, it is assumed that there is a constant voltage profile and the
% FLA reading is a direct reflection of the real power drawn by the chiller motor. Another
advantage of plotting the electrical power in this manner is that we can easily view the electrical
response and outlet temperature response on the same graph. This is demonstrated more clearly
in Figure 5.2. This plot shows the electrical response together with the temperature response during the setpoint increase; for developing a mathematical model of this component it is more telling to examine this time period.
Figure 5.2. Electric load (blue line) and temperature (red line) response to raising the chilled water temperature.
Several important points can be drawn from observing the response in Figure 5.2. The
recovery of the electrical load does not exhibit the “fast” response normally associated with
electrical systems. Due to the dependence of the electrical load on the temperature of the system,
there are long time delays involved in the load recovery. While the speed of the response is
unique to this type of system, the general electrical power response has been exhibited before in
dynamic load studies of transmission systems as noted previously [110] [113] [114]. The
exponential recovery model used in these works can be used for the HVAC chiller model as well.
To express the electric power response of the chiller, a similar set of equations as in [110] [113] [114] is applied, the difference being that the chiller load is a function of time and outlet water temperature change in lieu of time and voltage. The static and transient behavior is described by

    P_s(θ) = P_o (θ / θ_o)^{β_s}        (5.2)

    P_t(θ) = P_o (θ / θ_o)^{β_t}        (5.3)

where P_o and θ_o are the initial operating power and outlet water temperature, θ is the outlet water temperature, P_s is the new steady-state power level, P_t is the transient (minimum) power level, and β_s and β_t are the static and transient load indices. The full dynamic load response P_d(t) follows the recovery equation

    T_R dP_d(t)/dt + P_d(t) = P_s + T_R dP_t(t)/dt        (5.4)

where T_R is the load recovery time constant. From [110], a closed form solution may be derived to Equation (5.4) in the event of a step change in the chilled water temperature setpoint:

    P_d(t) = P_s + (P_t − P_s) e^{−(t − t_c)/T_R}        (5.5)

where t_c is the time delay between the setpoint change and the onset of the temperature response.
The equations (5.4)-(5.5) describing this response are plotted in Figure 5.3 for an
arbitrary increase in the chilled water temperature setpoint. Subplots (a) and (b) show the
temperature response. Of note is the time delay tc between when the temperature setpoint is
raised and when the actual chilled water temperature begins to increase, as well as the time
required for the temperature to reach its new value. The electric power response is shown in
subplots (c) – (e). The transient model Pt and the static model Ps show the minimum power
value and new steady state operating power value respectively. The dynamic load function Pd
shows the full dynamic power response. It is important to observe that the power level recovers
to a new steady state power value that is less than the initial power value Po. The level of steady-
state power recovery is dictated by the static load model index βs.
Figure 5.3. Response to a step change in the chilled water temperature setpoint value according to Equations
(5.4)-(5.5)
5.2.3. DREXEL-SPECIFIC EXAMPLE
Given the mathematical model presented here, the next step is to estimate the model
parameters based on the observed test data. The specific method of parameter estimation is not a
focus of this thesis. As noted in [111] and [115] the model parameters can be estimated from
    min_{p ∈ Z}  Σ_{k=1}^{N} [ y(k) − ŷ(k) ]²        (5.6)

where y(k) is the measured output, ŷ(k) is the output predicted by the model for the parameter vector p, Z is the set of feasible parameter values, and N is the number of data samples.
Another approach is to use curve fitting and manual inspection [113] [116]. While this approach is not well suited for automation, when considering a single test it provides greater context as to the physical meaning of each parameter.
The set of model parameters from (5.4)-(5.5) that need to be estimated includes θo, θs, Po,
tc, TR, βs, and βt. These parameters will be solved for by directly observing the data to measure
the values, as well as applying appropriate curve fits. The parameters will be solved for in the
following order:
1. Determination of tc
The time delay tc is measured as the time delay between raising the chilled water
temperature setpoint and when the actual temperature begins to rise. This delay is measured
directly from the data. The time delay is measured as the point in time after the setpoint change
where the observed temperature is greater than the maximum temperature from the previous minute
in order to distinguish the response from measurement noise. Through several tests performed at
Drexel, this time delay was observed to be on the order of 100 – 1000 seconds. These numbers
seem reasonable but more testing or research into the chiller control system is warranted to better characterize this delay.
2. Determination of Po, Pt, Ps, θo, θs, and TR
Each of these parameters was estimated by curve fitting the collected data. Figure 5.4
shows the curve fit approach to the measured test data. As shown in this figure, the fit can be
broken into three sections. Line segment 1 is applied to the initial load behavior prior to raising
the temperature setpoint. This defines the initial operating power Po and temperature θo. Line
segment 2 is applied to the drop in load following the setpoint change. This line segment runs
between the initial power Po and the minimum transient power Pt. The load recovery response is
fit using an exponential regression of the general form Y = A(1 − e^(−BX)). The recovery time is an estimate of TR and this segment settles out to the new steady-state power level Ps. This regression analysis was performed using both the Excel regression tool as well as Matlab's curve fitting tools.
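For reference, an equivalent fit can be reproduced with standard tools. The sketch below (Python with SciPy; the data are synthetic and only loosely resemble the measured recovery) fits the form Y = A(1 − e^(−BX)) to a recovery segment, with 1/B playing the role of the recovery time TR:

    import numpy as np
    from scipy.optimize import curve_fit

    def recovery(x, A, B):
        # exponential recovery of the general form Y = A * (1 - exp(-B * X))
        return A * (1.0 - np.exp(-B * x))

    # synthetic recovery segment: seconds after the load minimum vs. load rise in %FLA
    t = np.linspace(0.0, 6000.0, 120)
    y = 18.5 * (1.0 - np.exp(-t / 1690.0)) + np.random.default_rng(1).normal(0.0, 0.8, t.size)

    (A_hat, B_hat), _ = curve_fit(recovery, t, y, p0=(20.0, 1e-3))
    print(f"steady-state rise ~ {A_hat:.1f} %FLA, recovery time 1/B ~ {1.0 / B_hat:.0f} s")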
Figure 5.4. Application of the curve fit approach on the collected data for parameter estimation
3. Determination of βs and βt
    βs = ln(Ps / Po) / ln(θs / θo)        (5.7)

    βt = ln(Pt / Po) / ln(θs / θo)        (5.8)
The parameter estimation method outlined in this section was applied to the test data shown in Figure 5.4.
Table 5.1 shows the resulting parameter estimates for the Drexel test described in Section
5.2.1. Note that power values are presented as a percentage of full load.
Table 5.1 Parameter estimation results for the test described in Section 5.2.1
Parameter Estimated Value
Po 49.75%
Ps 46.81%
Pt 28.25%
θo 44.11 °F
θs 51.27 °F
βs -0.405
βt -3.762
tc 143 sec.
TR 1690 sec.
One important observation from Table 5.1 shows that the load power recovery time TR is
approximately 28 minutes. This demonstrates the effects of the thermal time constant of the
building on the electrical response. Looking back at Figure 5.1, Figure 5.2, and Figure 5.4 there
is a significant amount of noise contained in the data. There is a concern that this amount of
noise can lead to inaccurate parameter estimates regardless of the parameter estimation method
used. In addition to the noise, there is a known hunting issue with this particular chiller. This
issue is believed to be a considerable source of noise in this data. Correcting this problem prior
to future testing may help achieve test data reflecting more regular behavior.
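To illustrate what the fitted parameters imply for the shape of the response, the following sketch (Python; the instantaneous drop to Pt at the start of the temperature response is a simplification of the measured transient) evaluates the closed-form expression (5.5) using the Table 5.1 estimates:

    import numpy as np

    # parameter estimates from Table 5.1 (power in % of full load amps)
    P_o, P_s, P_t = 49.75, 46.81, 28.25
    t_c, T_R = 143.0, 1690.0                    # seconds

    t = np.arange(0.0, 3.0 * 3600.0, 60.0)      # three hours at one-minute resolution
    P_d = np.where(t < t_c,
                   P_o,                                               # before the delayed response
                   P_s + (P_t - P_s) * np.exp(-(t - t_c) / T_R))      # recovery per (5.5)

    # about 63% of the recovery from P_t toward P_s is complete roughly one T_R (~28 min) after t_c
    idx = int((t_c + T_R) // 60.0)
    print(f"P_d at t_c + T_R ~ {P_d[idx]:.1f} %FLA (P_s = {P_s} %FLA)")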
5.3. REMARKS
The HVAC chiller dynamic load model presented above can be used as a tool for
implementing DSM plans for any customer where their HVAC load is a significant portion of
their total electrical load. The model allows a good amount of flexibility to the end-user
depending on the time window of interest. In a simple case, the model can be used to accurately
predict a specific amount of a facility’s electric load being shed through a temperature setpoint
adjustment. Provided the dispatch interval is sufficiently long, the static load model in (5.2) can be used for this purpose.
Even when using the static model, the impact of the load dynamics can still be applied.
Knowing the recovery time TR, the time-varying properties can be programmed into ramp
constraints that ensure load recovery response times meet the required scheduling windows. In
a more dynamic case, a customer can use the full dynamic exponential load model to look at the
behavior during the transient period in addition to how long it will take to reach a new steady
state load level. In the next section of this thesis, this load model will be used in a method of
planning commercial building load resources for DSM. The end goal is to determine the optimal
dispatch schedule for a group of buildings and including the load dynamics established here is
essential.
6.1. OVERVIEW
The previous chapters of this thesis focused on modeling and predicting building
electrical demand behavior. The study of these topics is an essential piece in determining how to
use building load resources for demand side management. Given a method to predict building
load and a model of the controllable HVAC load, we can now develop a systematic way of scheduling these load resources. This chapter will:
o Define the optimal load scheduling problem, including the variables, assumptions, and constraints involved.
o Formulate and solve the scheduling problem for several DSM scenarios; these problems will address the impacts of real-time pricing, demand response revenue, and load forecast uncertainty.
6.2.1. GOALS
The goal of the building load scheduling problem is to find the optimal day-ahead, hourly
load dispatch schedule for a multi-building campus, subject to a set of operational constraints. In
this work optimal refers to a load schedule that minimizes total electricity cost through usage
reduction and (in some cases) demand response (DR) participation. The constraints will include
electrical and thermal operating limits of the buildings. The output of the scheduling problem is
the optimal reduction of the controllable load for each building. Simulations will focus on
the impact of several key practical features of this problem. This includes the role of several
DSM programs and sensitivity to a number of model parameters defined for this problem.
Several terms need to be defined in order to formulate the load scheduling problem. This
work is concerned with finding optimal schedules that will deviate from the normal demand profile of each building. A baseline is therefore required to assess the performance resulting from any control actions. The baseline is a forecast
of the building load assuming no demand side management actions are taken and this is the load
profile against which DSM activities are measured. The controllable load for this work is
considered to be the building HVAC chiller. This is the only portion of the load that can be
adjusted. Figure 6.1 below shows the relationship between the forecast PF and the dispatched controllable load ∆PC.
Often a campus may have a variety of building stock available as potential DSM
resources. To characterize this potential variety, several features of the available buildings are defined:
1) Load Footprint
First, the size (electrically speaking) of a given building is referred to as the load
footprint. The load footprint of a building is defined by the maximum power level of the
building. In practice this value can be derived from observed historical data or from the electrical ratings of the building equipment.
2) Load Flexibility
Some buildings will act as more responsive load resources. This responsiveness is
defined as the load flexibility. Load flexibility refers to the amount the controllable load ∆PC changes in response to a building temperature setpoint change. This responsiveness is captured by the load model index β developed in Chapter 5.
3) Load Margin
The last term that needs to be defined for this problem is the load margin. Load margin
refers to the amount of load that is available for dispatch during a given interval. This parameter
captures two important issues with dispatching building loads. First, the controllable load
margin is defined by how close the controllable nominal operating level is to any operating
limits. This margin will constrain the amount of load that can be dispatched at any given time.
This allowable range is defined in (6.1), where it is assumed the building is designed such that the nominal operating level lies within these limits.
The overall building load margin is also impacted by any uncertainty in the load forecast.
The margin as defined above constrains how much the controllable load can be changed.
However, the total load change ∆P observed in relation to the forecast may be much lower.
Recall that this comparison is how settlement is done in determining DSM savings. This
situation is shown in Figure 6.2. The bounds of the load forecast Û and L̂ are depicted with
dashed lines. For identical amounts of ∆PC, the amount of load shed that qualifies as a DSM
action can be quite different. This distinction in available load margin becomes particularly
important in situations where the utility may impose penalties for not achieving the promised load reduction.
Figure 6.2. Relationship between load margin and building load forecast uncertainty (expected worst case)
The influence of these 3 new parameters (load footprint, load flexibility, and load
margin) on the optimal load dispatch schedule will be studied in this Chapter. Each will be
addressed within the context of several DSM scenarios defined in the next section.
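To make these descriptors concrete, the following sketch (Python; the class, field names, and numerical values are hypothetical and chosen only for illustration) gathers the three quantities for a single building and computes the controllable load margin as the distance from the nominal operating level to its limits:

    from dataclasses import dataclass

    @dataclass
    class BuildingResource:
        footprint_kw: float   # load footprint: maximum building power level (kW)
        beta: float           # load flexibility: load model index from Chapter 5
        p_c_nom: float        # nominal controllable (chiller) load level (kW)
        p_c_min: float        # minimum allowable controllable load level (kW)
        p_c_max: float        # maximum allowable controllable load level (kW)

        def margin_down(self) -> float:
            # controllable load available for shedding in the interval
            return self.p_c_nom - self.p_c_min

        def margin_up(self) -> float:
            # controllable load available for an increase in the interval
            return self.p_c_max - self.p_c_nom

    bldg = BuildingResource(footprint_kw=800.0, beta=-1.0,
                            p_c_nom=300.0, p_c_min=180.0, p_c_max=340.0)
    print(bldg.margin_down(), bldg.margin_up())   # 120.0 40.0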
6.2.3. SCENARIOS
Four different scenarios are considered in this work. Each scenario corresponds to a
different demand side management program. All four scenarios are defined below:
Scenario 1 – Cost Minimization with Constant Pricing
This scenario reflects a pricing arrangement common among large electricity users who typically negotiate annually a constant cost per kWh,
provided the user stays within agreed upon minimum and maximum total electrical
energy limits. In this scenario, the controllable load for each building will be dispatched
during each interval as determined to achieve the greatest electricity cost savings.
Scenario 2 – Cost Minimization with Variable Pricing
As discussed in Chapter 2, there has been a push for dynamic pricing structures to
be implemented in the retail electricity market. These changes will serve to incentivize customers to take active control of their electricity usage. The only difference between Scenario 2 and Scenario 1 is that electricity cost is allowed to vary throughout the day. The goal remains minimizing electricity costs over the day by dispatching controllable building loads.
Scenario 3 – Cost Minimization with Constant Pricing and Demand Response Revenue
Many electricity markets now allow large end-users or demand aggregators to bid
a certain amount of load into the market. Generally a threshold exists for the amount of
load that needs to be shed in order to qualify for a market payout [23]. In this scenario
the goal remains to find the load schedule that minimizes total daily electricity cost. For
each interval the possibility of earning revenue is considered as an input to the overall
cost function.
Scenario 4 – Cost Minimization with Variable Pricing and Demand Response Revenue
The only difference between Scenario 4 and Scenario 3 is that electricity cost is again allowed to vary throughout the day. These distinctions between the four scenarios carry through to the problem formulations that follow.
6.2.4. ASSUMPTIONS
A1: Network effects can be neglected. It is assumed that the electric meters where demand is measured are located directly at
the building service entry and thus the effects of the network do not need to be
considered.
A2: Day-ahead load forecasts and price forecasts are available for all buildings. It is assumed that load forecasts are produced using building-specific methods for load forecasting (such as those in Chapter 3) and that day-ahead price forecasts are provided and are correct. Price volatility is not considered explicitly in this work, though the role price plays in the solution will be examined.
A3: The nominal operating conditions (controllable load level and temperature) for all
buildings are known. The nominal operating level PC,Nom of the building HVAC
chiller varies throughout the day as a function of the cooling requirements of the
building. Methods exist for accurately estimating this value and this work will
assume the value is readily available [89]. Therefore these quantities are assumed to be known.
A4: Load variability within the dispatch intervals is ignored. All analysis is concerned with the average behavior over each dispatch interval. Intra-interval behavior is of interest during real-time operations but is outside the scope of this study.
A5: The dispatch interval is sufficiently long to ignore controllable load dynamics.
This assumption does not mean the dynamics of the load are ignored entirely. As
will be shown in the optimization problem formulations in the next section, load recovery rates are accounted for by applying a ramp rate constraint. This assumption, like A4 before it, means that the intra-interval transient behavior of the load is not explicitly modeled.
The optimization problem for scenario 1 and 2 is constructed as a mixed integer linear
programming (MILP) problem and is presented below. This formulation assumes for now a
perfect load forecast is available for each building. This assumption will be relaxed later on.
    min  Σ_{t=1}^{T} Σ_{m=1}^{M} λ_{t,m} P^C_{t,m}        (6.2)

subject to, for t = 1,…,T and m = 1,…,M, the load limit and ramp rate constraints (6.3)-(6.6) together with the cycling constraint

    Σ_{t=1}^{T} s_{t,m} ≤ γ^{on}_m        (6.7)
where
P^C_{t,m} : Dispatched controllable load at time t for building m
λ_{t,m} : Price of electricity at time t for building m
s_{t,m} : State (on/off) of controllable load at time t for building m
P^{C,nom}_{t,m} : Nominal controllable load level at time t for building m
P^{C,min}_{t,m} : Minimum controllable load level at time t for building m
P^{C,max}_{t,m} : Maximum controllable load level at time t for building m
δ^{down}_{t,m} : Ramp rate limit (down) at time t for building m
δ^{up}_{t,m} : Ramp rate limit (up) at time t for building m
γ^{on}_m : Limit on the number of intervals P^C can deviate from P^{C,nom} for building m
M > 0 : Total number of buildings
T > 0 : Total number of dispatch intervals
with the controllable load limits P^{C,min}_{t,m} and P^{C,max}_{t,m} determined via (6.8)-(6.9) from the following quantities:
θ^{max}_{t,m} : Maximum allowable temperature at time t for building m
θ^{min}_{t,m} : Minimum allowable temperature at time t for building m
P^{max}_{t,m} : Maximum allowable load level at time t for building m
P^{min}_{t,m} : Minimum allowable load level at time t for building m
This problem seeks to minimize the cost associated with dispatching building demand for
all buildings M over every time interval T. The optimal value will be less than or equal to zero, since taking no action (dispatching no load) is always feasible at zero cost. The price of electricity λ_{t,m} is held constant in scenario 1; in scenario 2 the price varies at each interval t but is the same for each building m.
These constraints do two things. First, the quantity of dispatched load must not cause the
building to operate above or below practical limits (i.e.- controllable load margin). These
minimum and maximum values (determined via (6.8) - (6.9)) capture both the electrical limits
(e.g. – HVAC machine ratings) and thermal comfort limits. Whichever is more restrictive at
The second function of these constraints is to force Pt C,m =0 during intervals when the
load is not being dispatched. Inserting the operating state s_{t,m} in these inequality constraints avoids the situation where P^C_{t,m} and s_{t,m} would be multiplied in the objective function, thus keeping the problem linear.
These linear constraints capture the controllable load recovery dynamics. The ramp rate limits δ^{down}_{t,m} and δ^{up}_{t,m} are derived from the recovery time of the nonlinear load model. These constraints ensure that raising or lowering the load to a new operating point is achievable within the dispatch interval.
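One plausible way to derive such a ramp limit from the chiller model (a sketch only; the thesis does not prescribe this exact rule) follows from (5.5): within an interval of length Δt the load can move at most a fraction 1 − e^(−Δt/TR) of the way between its transient and steady-state levels. Using the Table 5.1 values:

    import numpy as np

    T_R = 1690.0                  # recovery time constant (s), Table 5.1
    dt = 3600.0                   # one-hour dispatch interval (s)
    swing = 46.81 - 28.25         # |P_s - P_t| in %FLA, Table 5.1

    # fraction of the transient swing recoverable within one dispatch interval, per (5.5)
    ramp_limit = swing * (1.0 - np.exp(-dt / T_R))
    print(f"achievable change per interval ~ {ramp_limit:.1f} %FLA")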
It is necessary to limit the number of times the controllable load is operated off of its
normal operating point. Even though the thermal comfort issue is managed by the min/max
constraints, there is a practical need to limit the time the HVAC system operates off its normal
level. As was discussed earlier, repeated cycling of the HVAC system is not good for reliability
reasons. Additionally, HVAC systems are designed for specific operating conditions for
efficiency reasons. Therefore it is prudent to limit the total number of times a day the HVAC system is operated away from its nominal level.
Decision variables
The decision variables are P^C_{t,m} and s_{t,m}. There are (M × T) of each variable: one for each building at every time step. The dispatched power P^C_{t,m} is a real valued number and the load state is a binary on/off indicator {0,1}. The total number of decision variables is therefore 2(M × T).
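To make the structure of this formulation concrete, the following sketch poses a toy scenario 1 instance using SciPy's mixed integer linear programming interface (all prices and limits are hypothetical, the ramp rate constraints are omitted for brevity, and this is not the OPTI/GLPK implementation used later in this chapter). The binary state forces the dispatched load to zero in intervals where the building is not operated off-nominal, and the cycling limit caps the number of dispatched intervals per building:

    import numpy as np
    from scipy.optimize import milp, LinearConstraint, Bounds

    T, M = 4, 2                                   # toy horizon (intervals) and campus size
    lam = np.array([30.0, 45.0, 60.0, 40.0])      # price ($/MWh), same for every building
    d_dn = np.array([[-50.0, -80.0]] * T)         # P_C_min - P_C_nom per (t, m), in kW
    gamma = np.array([2, 3])                      # max number of off-nominal intervals

    n = T * M                                     # n continuous P_C variables, then n binary s
    c = np.concatenate([np.repeat(lam, M) / 1000.0, np.zeros(n)])   # $/kWh times kW shed

    rows, lb, ub = [], [], []
    for t in range(T):
        for m in range(M):
            k = t * M + m
            row = np.zeros(2 * n)
            row[k], row[n + k] = 1.0, -d_dn[t, m]  # P_C - d_dn * s >= 0: shed only when s = 1
            rows.append(row); lb.append(0.0); ub.append(np.inf)
    for m in range(M):                             # cycling limit: sum over t of s_{t,m} <= gamma_m
        row = np.zeros(2 * n)
        row[n + m::M] = 1.0
        rows.append(row); lb.append(-np.inf); ub.append(float(gamma[m]))

    cons = LinearConstraint(np.array(rows), lb, ub)
    bounds = Bounds(np.concatenate([np.full(n, d_dn.min()), np.zeros(n)]),
                    np.concatenate([np.zeros(n), np.ones(n)]))
    integrality = np.concatenate([np.zeros(n), np.ones(n)])

    res = milp(c, constraints=cons, integrality=integrality, bounds=bounds)
    print(res.x[:n].reshape(T, M))                 # dispatched load (kW), negative = shed
    print(res.fun)                                 # objective value in $, non-positive

As expected, the solver sheds the maximum allowed load in the highest-priced intervals permitted by each building's cycling limit.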
The optimization problem for scenario 3 and 4 is likewise constructed as a mixed integer linear programming problem:

    min  Σ_{t=1}^{T} Σ_{m=1}^{M} λ_{t,m} P^C_{t,m}  +  Σ_{t=1}^{T} Σ_{m=1}^{M} π_{t,m} P^DR_{t,m}        (6.10)

subject to the scenario 1 and 2 constraints together with constraints (6.11)-(6.15), which use a large constant B and a binary state s^DR_t to link the total dispatched load Σ_{m=1}^{M} P^C_{t,m} and the DR revenue variables P^DR_{t,m} to the threshold P^Thresh_t. Here π_{t,m} is the LMP used to value load shed that qualifies for DR revenue; each of these elements is described below.
An additional term is added to the cost function to capture potential revenue earned through demand response (DR) programs. A new decision variable P^DR_{t,m} is introduced to quantify load shed that qualifies for DR revenue. This variable equals P^C_{t,m} if the required load reduction threshold is met and is zero otherwise.
The price of electricity λ_{t,m} is kept constant in scenario 3. In scenario 4 the price varies at each interval t but is the same for each building m. For both scenarios, the LMP π_{t,m} is used as the price paid for load shed that qualifies for DR revenue.
In order to qualify for DR revenue the amount of load shed must exceed a certain threshold P^Thresh_t. Another decision variable for the state of DR revenue is introduced (s^DR_t). This constraint ensures that s^DR_t takes the value of 1 if the threshold is met and zero otherwise.
The variable P^DR_{t,m} does not represent a real quantity of load separate from P^C_{t,m}. It is a variable that is used solely to account for DR revenue when the total dispatched load meets the threshold, Σ_{m=1}^{M} |P^C_{t,m}| ≥ P^Thresh_t. To that end, these constraints ensure that if the above condition is true, P^DR_{t,m} = P^C_{t,m}. If it does not hold, P^DR_{t,m} is forced to zero.
Decision variables
The decision variables P^C_{t,m} and s_{t,m} from scenario 1 and 2 are the same here. The two new decision variables introduced in scenario 3 and 4 are P^DR_{t,m} and s^DR_t. There are (M × T) variables for P^DR_{t,m} - one for each building at every time interval. There are T s^DR_t variables - one for each time interval. The total number of decision variables is therefore 3(M × T) + T.
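The role of the large constant B can be checked with a toy numerical example (Python; the inequalities below are one possible linearization of the qualify-for-revenue logic and are not claimed to be the exact constraints (6.11)-(6.15)):

    # Toy check of a big-M style linearization of the DR revenue logic.
    # Shed power P_C is negative, so "total shed meets the threshold" reads sum(P_C) <= -P_thresh.
    P_thresh, B = 150.0, 1e4

    for total_shed, s_dr in [(-200.0, 1), (-100.0, 1), (-100.0, 0)]:
        threshold_ok = total_shed <= -P_thresh * s_dr   # s_dr may be 1 only if the threshold is met
        p_dr = max(total_shed, -B * s_dr)               # lower bound on P_DR; the MILP objective drives P_DR to it
        print(s_dr, threshold_ok, p_dr)

The middle combination is flagged as infeasible (threshold_ok is False), showing that s_dr = 1 requires the threshold to be met, while the last case shows P_DR collapsing to zero when s_dr = 0.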
The goal remains finding the optimal P^C_{t,m} that will lead to the greatest electricity cost savings. However, the difference between the dispatched controllable load P^C_{t,m} and the resulting deviation ∆P_{t,m} from the total predicted load P_{t,m} now becomes relevant. An error parameter ξ, which in practice can be derived from a probabilistic forecast, will be added to the constraints to account for the load forecast uncertainty.
    min  Σ_{t=1}^{T} Σ_{m=1}^{M} λ_{t,m} P^C_{t,m}        (6.16)

subject to, for t = 1,…,T and m = 1,…,M, the ξ-adjusted load limit and ramp rate constraints (6.17)-(6.20) together with the cycling constraint

    Σ_{t=1}^{T} s_{t,m} ≤ γ^{on}_m        (6.21)
Notice that the objective function remains unchanged. There is no need to add an
additional cost term associated with the load uncertainty. As the objective is written, the
resulting optimal day-ahead dispatch schedule will still minimize costs with respect to how much controllable load is dispatched.
Just like with scenario 1 and 2, a new parameter ξ is introduced. Following the same approach, the formulation for scenario 3 and 4 under forecast uncertainty becomes

    min  Σ_{t=1}^{T} Σ_{m=1}^{M} λ_{t,m} P^C_{t,m}  +  Σ_{t=1}^{T} Σ_{m=1}^{M} π_{t,m} P^DR_{t,m}        (6.22)

subject to, for t = 1,…,T and m = 1,…,M, the uncertainty-adjusted versions of the scenario 3 and 4 constraints. In particular, the threshold constraint (6.23) now requires the forecast-adjusted total shed, based on Σ_{m=1}^{M} (P^C_{t,m} + ξ_{t,m}), to meet P^Thresh_t before s^DR_t can take the value of 1, while constraints (6.26)-(6.27) again use the large constant B so that P^DR_{t,m} is forced to zero whenever s^DR_t = 0.
As with the deterministic case, the variable P^DR_{t,m} does not represent a real quantity of load separate from P^C_{t,m}. It is a quantity that is used solely to account for DR revenue when the total dispatched load meets the threshold P^Thresh_t. Therefore no uncertainty term is associated with that variable.
6.4. SIMULATIONS
The proposed load scheduling methods are tested on a fictitious 10 building system. The
relevant parameters for each building are based on observations of Drexel University buildings
so while the test system does not actually exist it is representative of a real world campus. It is
desirable to have a variety of building stock to observe the impact each building type has in the
proposed load scheduling approaches. Variety in this case refers to both the load footprint as
well as load flexibility. For this study the following range values are presented. Building load
footprint classification is shown in Table 6.2. Building controllable load flexibility is shown in
Table 6.3. A summary of the building types that form the base of this study are presented in
Table 6.4.
Table 6.4. Load footprint and flexibility for each building used in this case study
Building # Footprint Flexibility
1 Small Medium
2 Medium Medium
3 Medium High
4 Small High
5 Large Medium
6 Large Low
7 Large High
8 Medium Low
9 Medium High
10 Large Medium
The forecasted building load profiles are shown in Figure 6.3. These profiles define the
nominal operating behavior against which the optimal dispatched load is compared.
Figure 6.3. Forecasted hourly load profiles (demand in kW) for Buildings 1 through 10.
Electricity price information for a single day has been obtained from the PJM website
[117] for Monday, July 21, 2014. These prices are representative of the LMP prices at the nodes
in the system where Drexel University connects to the grid. For the variable pricing scenarios
and for the DR revenue pricing, these LMP prices are used as the cost of electricity. For the
constant pricing scenarios, the price of electricity is assumed to be the average of the daily LMP.
Figure 6.4. Hourly LMP ($/MWh) for July 21, 2014 and the corresponding daily mean.
Finally, the parameters that define the building operating behavior and constraints are given in Table 6.5.
The impact to the load schedule solution from changing some of these parameters is one of the
main goals of this work. Therefore some of the values presented here may be changed during
various tests. Any deviations from these numbers will be specifically noted.
It bears mentioning that the following simulations will not compare the performance of
the proposed load scheduling algorithms against any established methods. This is partly because
we are more interested in observing the impact of various DSM scenarios and model parameters
on the solution and less on the value of the solution itself. It is also partly because there is no established benchmark method for this type of multi-building load scheduling problem.
All of the following simulations are carried out using the OPTI Toolbox for Matlab
[118]. Specifically the GLPK solver is used to solve each MILP problem. This solver employs a branch-and-bound procedure built on the simplex method. All simulations are performed on a personal computer with 2
processors clocking at 2.0 GHz and 8 GB of RAM. The computational aspects of the problem
are not considered for this work but it should be noted that none of these simulations required
more than 30s to complete. This is perfectly suitable for a day-ahead scheduling algorithm.
Test 1: Find the optimal dispatch schedule for Scenario 1 in the deterministic case
The optimal dispatch schedule is given in Table 6.6. The total savings
achieved through this dispatch schedule is $33212.00. Several key observations can be made:
o Since the price of electricity is constant throughout the day, the most potential value occurs where the greatest amount of load margin exists. Considering the load forecasts in Figure 6.3, it makes sense that most of the dispatch is scheduled during the middle of the day when building load is highest.
o The amount of load dispatched per building is clearly driven by the size of the
load footprint as well as the load flexibility. The next two tests will look at how each of these factors influences the solution.
Test 2: Examine the impact of load sensitivity to the Scenario 1 dispatch schedule
For this test the β value for building 7 is varied from -1 to -0.3 in steps of 0.1. This is a
major reduction in load flexibility. No other changes are made. The optimal dispatch
schedule for β = -0.3 is given in Table 6.7. Several key observations are made:
o The savings is reduced by $4832 when compared to the schedule in Table 6.6. Reducing the flexibility of building 7 by such a large amount reduces the capacity for shedding load (and hence the achievable savings).
o It is notable that the rest of the dispatch schedule is unchanged for each building; the effect on building 7 alone is shown in Figure 6.5. The load sensitivity does not impact the timing of the schedule in this scenario, only the amount of power that is dispatched, and only for the specific building whose flexibility changed. This result should be intuitive. Load will be dispatched when the largest margin for shedding exists, and this is driven by the nominal operating level and the operating limits rather than by β. In short, the load flexibility, under the constant pricing DSM scenario, impacts how much load is dispatched for a specific building only, and has no impact on when the load is dispatched.
Table 6.7. Optimal load schedule for in kW Test 2 (Scenario 1 with reduced load flexibility)
Interval Number
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24
1 0 0 0 0 0 0 0 -10.94 0 -10.94 0 0 -10.94 0 0 -10.94 0 0 0 0 0 0 0 0
2 0 0 0 0 0 0 0 0 0 0 0 0 0 -11.91 -11.57 0 0 0 0 0 0 0 0 0
Building Number
Figure 6.5. Dispatch schedule for building 7 with decreasing load flexibility (No dispatch during the intervals not shown)
Test 3: Examine the impact of load footprint to the Scenario 1 dispatch schedule
For this test the load footprint of building 7 is varied. The nominal operating point across the
whole day is reduced to 75%, 50%, and 25% of the normal level. No other changes are
made. The complete load schedule is omitted for this test as the overall results are very
similar to Table 6.7. Several observations about the impact of varying the load footprint are
made:
o The savings is reduced from $33212.00 to $28512 as the load footprint goes
down. The response is similar to Test 2 where load flexibility is decreased. This is expected, since reducing the footprint also reduces the amount of controllable load available for dispatch.
Figure 6.6. Dispatch schedule for building 7 with decreasing load footprint (No dispatch during the intervals not shown)
Test 4: Find the optimal dispatch schedule for Scenario 2 in the deterministic case
The optimal dispatch schedule for scenario 2 is shown in Table 6.8. The total savings achieved through this dispatch schedule is $47164.00. Several key observations can be made:
o Comparing the optimal schedule under dynamic pricing (Table 6.8) to the optimal schedule under constant pricing (Table 6.6), the savings increases by $28496 with dynamic pricing. Simply observing the price curve in Figure 6.4 makes the source of this additional value apparent.
o More interesting than the overall savings, the timing of the dispatch schedule
changes drastically with dynamic pricing. This shift in dispatch timing can be
seen in Table 6.8. Load is shed during periods of higher prices even though these periods often have lower load available for dispatch. In fact, this shift is exactly the response that dynamic pricing programs are intended to encourage through consumer participation.
Test 5: Find the optimal dispatch schedule for Scenario 3 in the deterministic case
This is the first test where a market entry constraint is present. In this test, P^Thresh_t = 150 kW.
The optimal dispatch schedule for scenario 3 is shown in Table 6.9. The total savings achieved through this dispatch schedule is $71417. Several key observations can be made:
o Breaking down the savings, it can be seen that the overall load reduction saves only part of the total, with the remainder coming from DR revenue earned under this schedule. Looking at Table 6.9 compared to Table 6.6 it is apparent that the optimal solution shifts load reductions to intervals where the total can meet the DR revenue threshold.
o Similar to the results for Test 4, the overall load reduction is less for Scenario 3 than Scenario 1 (880 kW vs 931.00 kW). This underlines the point that greater cost savings does not necessarily correspond to a greater amount of load shed.
Test 6: Examine the impact of a reduced market entry threshold on the Scenario 3 dispatch schedule
In this test, the market entry constraint threshold is cut in half (P^Thresh_t = 75 kW). The optimal dispatch schedule is shown in Table 6.10. The total savings achieved through this dispatch schedule is $77538. Reducing the market entry constraint threshold allows more intervals to qualify for DR revenue, and the savings increase as the dispatch schedule shifts to make the most of the lower market entry threshold.
Test 7: Find the optimal dispatch schedule for Scenario 4 with variable pricing and DR revenue
The optimal dispatch schedule for scenario 4 is shown in Table 6.11. The total savings achieved through this dispatch schedule is $83666.66. Several key observations can be made:
o The total savings is the largest of any scenario considered. This can be attributed to the fact that higher prices occur during intervals where load is also being shed to earn DR revenue, so both sources of value are captured.
Test 8: Find the optimal dispatch schedule for Scenario 1 in the uncertain case
This is the first test to include uncertainty related to the load forecast. As a first cut, the error
parameter ξ will be held constant at 10kW for all buildings and all intervals. The resulting dispatch schedule is shown in Table 6.12.
o In this simple example, the dispatch schedule differs from Table 6.6 in all
intervals by +10kW (in other words, 10kW less load shed). While this seems
a trivial result, it is important to note that the results are showing the load shed
in relation to the baseline forecast, not just the dispatched controllable load. This shows the expected total building load reduction ∆P. The load reduction that can be counted on is therefore smaller than the controllable load actually dispatched.
o For building 4 and 8, which have very small load footprints, the forecast error represents a larger share of the available load margin, making their expected reductions especially sensitive to the uncertainty.
Test 9: Observe cost savings as forecast uncertainty varies from best case to worst case
In order to see how the overall expected cost savings are impacted with increasing forecast
uncertainty the error parameter ξ will be varied over a range of values. Unlike the previous
test, the load footprint of the building will be taken into account by scaling ξ. For each
building m, ξ will be varied at each interval t from -10% to 10% of the forecasted building
demand. Total campus expected cost savings are plotted against the range of ξ values in
Figure 6.7.
o There is a break-even point at slightly less than 5% uncertainty where the expected savings becomes an expected expenditure.
o This plot shows both the worst case and best case in total building load
dispatch. When the forecast overestimates the load (ξ<0) then the consumer
“sheds” extra load. However in the worst case, the amount of dispatchable load falls short of what was scheduled and the expected savings erode.
Figure 6.7 Cost savings as a function of load uncertainty. Break even line shown in blue
Test 11: Examine the impact of forecast uncertainty to the Scenario 3 dispatch
schedule
For each building m, the load uncertainty parameter ξ will be varied at each interval t from -
10% to 10% of the forecasted building demand. The impact of the load uncertainty is
reflected in the number of intervals where the DR revenue threshold can be met (P^Thresh_t = 100 kW). These results are shown in Figure 6.8. A small deviation from the perfect forecast can result in a great misunderstanding of how often the DR revenue constraint can be met. A 3% underestimation of the forecast gives the impression that the DR revenue constraint can never be met, while a 3% overestimation makes it look as though it can be met in far more intervals than it actually can.
Figure 6.8. Number of intervals in which the DR revenue constraint can be met, as a function of load uncertainty
Table 6.12 Optimal load schedule in kW for Test 8: +10kW uniform load forecast uncertainty
Interval Number
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24
1 0 0 0 0 0 0 0 -0.94 0 -0.94 0 0 -0.94 0 0 -0.94 0 0 0 0 0 0 0 0
2 0 0 0 0 0 0 0 0 0 0 0 0 0 -1.91 -1.57 0 0 0 0 0 0 0 0 0
Building Number
7.1. OVERVIEW
This chapter summarizes the contributions of this thesis and provides a discussion of how the work presented here can be extended in the future.
The overall vision for this work was to study demand side management (DSM) using
commercial buildings with a focus on addressing practical issues of interest to the end-user.
Specifically these contributions fall into two main areas. The first focuses on the forecasting of
building electrical demand. The second is concerned with methods for scheduling commercial
In this work a case study involving Drexel University building demand provides the basis
for studying building specific load forecasting techniques. Four popular methods for load
forecasting are implemented and tested on actual building data. These traditional approaches are
enhanced by measurements collected via a BMS which is a new contribution to this area. In
addition to point forecasts, a method for generating probabilistic forecasts of building demand
is presented. These studies demonstrate the overall improved performance (in terms of accuracy,
consistency, and overall skill) when building measurements are included in the models.
Additionally these tests have examined some performance characteristics that are important for DSM applications. These results help in understanding how building-specific behavior may be valuable in future efforts to better forecast
building loads.
In addition to the forecasting studies, the problem of scheduling building electrical
loads for DSM has been presented. A new model for the electrical demand of an HVAC chiller
was derived from Drexel building data, leveraging existing dynamic load modeling techniques.
This model captures the dynamics exhibited by the HVAC system that arises from the strong link
between the thermal and electrical behavior of the building and has the flexibility to be used
across a number of time windows (i.e.- static vs transient studies). This model has the added
benefit of being well suited for integration in a load scheduling algorithm. To that end, several
load scheduling optimization problems have been presented for determining the optimal dispatch schedule for a group of buildings. The formulations cover several different potential DSM programs, including simple minimization of usage,
dynamic pricing, and the potential to bid DR resources into the market. Three parameters, load
footprint, load flexibility, and load margin, have been introduced to help analyze building
dispatch capabilities. The analysis performed has identified the important sensitivities of the
solution to variations in model parameters and DSM scenarios. Lastly, the uncertainty related to
building forecasts is introduced into the problem. To get a feel for the impact such uncertainty
has on the dispatch schedule, potential best and worst case uncertainty scenarios are presented.
This thesis has laid the foundation for several areas of future research. First, it would be
beneficial to develop a load forecasting model that is “best” for building-specific applications.
This model would ideally account for the building-specific characteristics that have been
demonstrated in this thesis as being highly impactful. Some preliminary work has been done in
this area in [119]. In that work, a forecast model is developed in which resampling from Gaussian and non-Gaussian copulas is used in tandem with a boosting procedure to improve forecast accuracy. This approach captures some of the temporal dependencies of building demand. Another avenue is to include additional building information in the load forecast model. Data on occupancy levels and other building usage information may lead to better load characterization. This information would need to be balanced with keeping the data requirements and model complexity manageable.
There are also a number of aspects of the load scheduling problem that can be enhanced.
First is a more systematic way of including forecast uncertainty in the optimization formulation.
The work in this thesis has laid the groundwork for a scheduling method that provides practical
schedules that are sensitive to real-world building constraints. However the uncertainty of the
forecast can be better balanced in terms of the conservativeness of the solution. In addition to
the forecast uncertainty it would be good to consider price forecast uncertainty. This thesis
showed how price-dependent the dispatch schedule is under various DSM programs. However a
forecast of day-ahead prices will, just like the load forecast, include some uncertainty. Lastly, it
would be interesting to explore extensions of these formulations into other DSM scenarios. For
example, this can include the ancillary services market, where the model could try and balance
the benefits of load reduction with potential rewards for raising building load to support the
power system operation. Real-time load dispatch is also an important possible extension where
the full dynamics of the load would need to be incorporated to model the scenario accurately.
List of References
[1] "H.R. 6 (110th): Energy Independence and Security Act of 2007," 2007. [Online]. Available:
https://www.govtrack.us/congress/bills/110/hr6. [Accessed 2010].
[2] P. Cappers, C. Goldman and D. Kathan, "Demand response in U.S. electricity markets: Empirical
evidence," Energy, vol. 35, no. 4, pp. 1526-1535, 2010.
[3] S. Borenstein, "The trouble with electricity markets: Understanding California's restructuring
disaster," Journal of Economic Perspectives, vol. 16, no. 1, pp. 191-211, 2002.
[4] U.S. Congress, "Energy Policy Act of 2005," 2005. [Online]. Available:
https://www.gpo.gov/fdsys/pkg/BILLS-109hr6enr/pdf/BILLS-109hr6enr.pdf. [Accessed 2016].
[8] FERC, "A national assessment of demand response potential," Prepared by the Brattle Group;
Freeman, Sullivan & Co.; and Global Energy Partners, 2009. [Online]. Available:
https://www.ferc.gov/legal/staff-reports/06-09-demand-response.pdf. [Accessed 2016].
[9] U.S. DOE, "Benefits of demand response in electricity markets and recommendations for
achieving them," Technical report to the United States Congress, 2006. [Online]. Available:
http://energy.gov/sites/prod/files/oeprod/DocumentsandMedia/DOE_Benefits_of_Demand_Response_in_E
lectricity_Markets_and_Recommendations_for_Achieving_Them_Report_to_Congress.pdf. [Accessed
2016].
[10] M. Albadi and E. El-Saadany, "A Summary of Demand Response in Electricity Markets," Electric
Power Systems Research, vol. 78, pp. 1989-1996, 2008.
[11] EPRI, "The green grid: Energy savings and carbon reductions enabled by a smart grid.," Electric
Power Research Institute, 2008.
[12] D. Chassin and J. Fuller, "On the Equilibrium Dynamics of Demand Response in Thermostatic
Loads," in Hawaii International Conference on System Sciences (HICSS), 2011.
[13] D. Kirschen, "Demand-side View of Electricity Markets," IEEE Transactions on Power Systems,
vol. 100, no. 2, pp. 1104-1118, 2003.
[14] N. Liu, "An Evaluation of the HVAC Load Potential for Providing Load Balancing Service,"
IEEE Transactions on Smart Grid, vol. 3, no. 3, pp. 1263-1270, 2012.
[15] D. S. Callaway, "Tapping the energy storage potential in electric loads to deliver load following
and regulation, with application to wind energy," Energy Conversion and Management, vol. 50, no. 5, pp.
1389-1400, 2009.
[16] NERC, "Long-term reliability assessment 2008-2017," North American Electric Reliability
Corporation, 2008.
[17] C. De Jonghe and B. F. B. R. Hobbs, "Optimal Generation Mix With Short-Term Demand
Response and Wind Penetration," IEEE Transactions on Power Systems, vol. 27, no. 2, pp. 830-839, 2012.
[18] J. Eto, "Innovative developments in load as a reliability resource," in Proceedings of the IEEE PES
Winter Meeting, 2002.
[19] DOE/EIA, "Annual Energy Outlook 2015 with projections to 2040," U.S. Energy Information
Administration (EIA), 2015.
[20] D. Callaway and I. Hiskens, "Achieving controllability of electric loads," Proceedings of the
IEEE, vol. 99, no. 1, pp. 184-199, 2011.
[21] PG&E, "Explore the PG&E Time-of-Use plans," 2016. [Online]. Available:
https://www.pge.com/en_US/residential/rate-plans/rate-plan-options/time-of-use-base-plan/time-of-use-
plan.page. [Accessed 2016].
[22] PG&E, "Find out if Peak Day Pricing is right for your business," PG&E, 2016. [Online].
Available: https://www.pge.com/en_US/business/rate-plans/rate-plans/peak-day-pricing/peak-day-
pricing.page. [Accessed 2016].
[23] PJM, "PJM Manual 11: Energy & Ancillary Services Market Operations," August 2016. [Online].
Available: http://www.pjm.com/~/media/documents/manuals/m11.ashx. [Accessed 2016].
[26] J. Berardino and C. Nwankpa, "Dynamic Load Modeling of an HVAC Chiller for Demand
Response Applications," in Proceedings of the IEEE Smart Grid Communications Conference, 2010.
[27] D. Hill, "Nonlinear Dynamic Load Models with Recovery for voltage Stability Studies," IEEE
Transactions on Power Systems, vol. 8, no. 1, pp. 166-176, 1993.
[28] J. Lam et al., "An analysis of electricity end-use in air-conditioned office buildings in Hong
Kong," Building and Environment, vol. 38, no. 3, pp. 493-498, 2003.
[31] CAISO, "Baselines for Retail Demand Response," 2009. [Online]. Available:
https://www.caiso.com/Documents/Presentation-Baselines_RetailDemandResponsePrograms.pdf.
[Accessed 2016].
[32] M. Goldberg and G. Agnew, "Protocol development for demand response calculation - findings
and recommendations," California Energy Commission CEC-400-02-017F, 2003.
[33] K. Coughlin, M. Piette, C. Goldman and S. Kiliccote, "Statistical analysis of baseline load models
for non-residential buildings," Energy and Buildings, vol. 41, no. 4, 2009.
[34] S. Kiliccote, M. Piette and D. Hansen, "Advanced controls and communications for demand
response and energy efficiency in commercial buildings," in Proceedings of Second Carnegie Mellon
Conference in Electric Power Systems, 2006.
[35] S. Braithwait, D. Hansen and J. Reaser, "2009 load impact evaluation of California statewide
critical-peak pricing rates for non-residential customers: Ex post and ex ante report," Christensen
Associates Energy Consulting CALMAC Study ID: SDG0244.01, 2010.
[36] KEMA, "PJM Empirical Analysis of Demand Response Baseline Methods," Prepared for the PJM
Market Implementation Committee, 2011.
[38] J. Berardino and C. Nwankpa, "Interval-specific building load models for demand resource
planning," in Proceedings of the IEEE PowerCon 2012 Conference, Auckland, 2012.
[39] J. Berardino and C. Nwankpa, "Inclusion of temporal effects in forecasting building electrical
loads for demand resource planning," in Proceedings of the IEEE PowerTech 2013 Conference, Grenoble,
2013.
[40] D. Palchak, S. Suryanarayanan and D. Zimmerle, "An Artificial Neural Network in Short-Term
Electrical Load Forecasting of a University Campus: A Case Study," Journal of Energy Resources
Technology, vol. 135, no. 3, 2013.
[41] W. Mai, C. Y. Chung, T. Wu and H. Huang, "Electric load forecasting for large office building
based on radial basis function neural network," in Proceedings of the 2014 IEEE PES General Meeting,
National Harbor, MD, 2014.
[42] H. Chitsaz, H. Shaker, H. Zareipour, D. Wood and N. Amjady, "Short-term electricity load
forecasting of buildings in microgrids," Energy and Buildings, vol. 99, pp. 50-60, 2015.
[43] P. A. Gonzalez and J. M. Zamarreno, "Prediction of hourly energy consumption in buildings based
on a feedback artificial neural network," Energy and Buildings, vol. 37, no. 6, pp. 595-601, 2005.
[44] S. Karatasou, M. Santamouris and V. Geros, "Modeling and predicting building's energy use with
artificial neural networks: Methods and results," Energy and Buildings, vol. 38, no. 8, pp. 949-958, 2006.
[45] P. Kou and Gao, "A sparse heteroscedastic model for the probabilistic load forecasting in energy-
intensive enterprises," International Journal of Electric Power Energy Systems, vol. 55, pp. 144-154, 2014.
[46] B. Dong, C. Cao and S. E. Lee, "Applying support vector machines to predict building energy
consumption in tropical region," Energy and Buildings, vol. 37, no. 5, pp. 545-553, 2005.
[47] J. Massana, C. Pous, L. Burgas, J. Melendez and J. Colomer, "Short-term load forecasting in a
non-residential building contrasting models and attributes," Energy and Buildings, vol. 92, pp. 322-330,
2015.
[48] C. E. Borges, Y. K. Penya and I. Fernandez, "Optimal combined short-term building load
forecasting," in Proceedings of the 2011 IEEE PES Innovative Smart Grid Technologies (ISGT)
Conference, 2011.
[49] J. G. Jetcheva, M. Majidpour and W.-P. Chenwei, "Neural Network Model Ensembles for
Building-Level Electricity Load Forecasts," Energy and Buildings, vol. 84, pp. 214-223, 2014.
[50] A. Afram and F. Janabi-Sharifi, "Review of modeling methods for HVAC systems," Applied
Thermal Engineering, vol. 67, no. 1-2, pp. 507-519, 2014.
[51] A. Khodaei, M. Shahidehpour and S. Bahramirad, "SCUC With Hourly Demand Response
Considering Intertemporal Load Characteristics," IEEE Transactions on Smart Grid, vol. 2, no. 2, pp. 564-
571, 2011.
[53] P. Palensky and D. Dietrich, "Demand Side Management : Demand Response , Intelligent Energy
Systems , and Smart Loads," IEEE Transactions on Industrial Informatics, vol. 7, no. 3, pp. 381-388, 2011.
[54] J. S. Vardakas, N. Zorba and C. V. Verikoukis, "A Survey on demand response programs in smart
grids: pricing methods and optimization algorithms," IEEE Communications Surveys & Tutorials, vol. 17,
no. 1, pp. 152-178, 2014.
[55] R. Deng, Z. Yang, M.-Y. Chow and J. Chen, "A Survey on Demand Response in Smart Grids:
Mathematical Models and Approaches," IEEE Transactions on Industrial Informatics, vol. 11, no. 3, 2015.
[56] S. Bashash and H. Fathy, "Modeling and control insights into demand-side energy management
through setpoint control of thermostatic loads," in Proceedings of the American Control Conference
(ACC), 2011.
[57] N. Lu and D. Chassin, "A state queuing model of thermostatically controlled appliances," IEEE
Transactions on Power Systems, vol. 19, no. 3, pp. 1666-1673, 2004.
[58] J. Mathieu and D. Callaway, "State estimation and control of heterogeneous thermostatically
controlled loads for load following," in Proceedings of the Hawaii International Conference on Systems
Science, 2012.
[59] W. Burke and D. Auslander, "Development of an HVAC Load Model for Aggregates of Homes,"
California Energy Commission Multi-Campus Award No. C-08-19, 2010.
[60] I. Moghram and S. Rahman, "Analysis of five short-term load forecasting techniques," IEEE
Transactions on Power Systems, vol. 4, no. 4, pp. 1484-1491, 1989.
[61] H. Alfares and M. Nazeeruddin, "Electric load forecasting: literature survey and classification of
methods," International Journal of System Science, vol. 33, no. 1, pp. 23-34, 2002.
[62] H. Hahn, S. Meyer-Nieburg and S. Pickl, "Electric load forecasting methods: tools for decision
making," European Journal of Operational Research, vol. 199, no. 3, pp. 902-907, 2009.
[63] T. Hong, "Short term electric load forecasting," Ph.D. dissertation, North Carolina State
University, 2010.
[64] H. Hippert, C. Pedreira and R. Souza, "Neural networks for short-term load forecasting: a review
and evaluation," IEEE Transactions on Power Systems, vol. 16, no. 1, pp. 44-55, 2001.
[66] S. Li, P. Wang and L. Goel, "A Novel Wavelet-Based Ensemble Method for Short-Term Load
Forecasting with Hybrid Neural Networks and Feature Selection," IEEE Transactions on Power Systems,
vol. 31, no. 3, pp. 1788 - 1798, 2016.
[67] A. Deihimi and H. Showkati, "Application of echo state networks in short-term electric load
forecasting," Energy, vol. 39, no. 1, pp. 327-340, 2012.
[68] W.-C. Chu, Y.-P. Chen, Z.-W. Xu and W.-J. Lee, "Multiregion Short-Term Load Forecasting in
Consideration of HI and Load/Weather Diversity," IEEE Transactions on Industry Applications, vol. 47,
no. 1, pp. 232-237, 2011.
[69] N. Amjady, "Short-Term Bus Load Forecasting of Power Systems by a New Hybrid Method,"
IEEE Transactions on Power Systems, vol. 22, no. 1, pp. 333-341, 2007.
[70] R.-J. Wai, Y.-C. Chen and Y.-R. Chang, "Short-term load forecasting via fuzzy neural network
with varied learning rates," in Proceedings of the 2011 IEEE International Conference on Fuzzy Systems
(FUZZ-IEEE 2011), 2011.
[71] B.-J. Chen, M.-W. Chang and C.-J. Lin, "Load Forecasting Using Support Vector Machines: A
Study on EUNITE Competition 2001," IEEE Transactions on Power Systems, vol. 19, no. 4, pp. 1821-
1830, 2004.
[72] D. Niu, Y. Wang and D. D. Wu, "Power load forecasting using support vector machine and ant
colony optimization," Expert Systems with Applications, vol. 37, no. 3, pp. 2531-2539, 2010.
[73] S. Fan and L. Chen, "Short-term load forecasting based on an adaptive hybrid method," IEEE
Transactions on Power Systems, vol. 21, no. 1, pp. 392-401, 2006.
[74] V. H. Hinojosa and A. Hoese, "Short-term load forecasting using fuzzy inductive reasoning and
evolutionary algorithms," IEEE Transactions on Power Systems, vol. 25, pp. 565-574, 2010.
[75] S. Fan and R. J. Hyndman, "Short-term load forecasting based on a semi-parametric additive
model," IEEE Transactions on Power Systems, vol. 27, no. 1, pp. 134-141, 2012.
[76] Y. Goude, R. Nedellec and N. Kong, "Local short and middle term electricity load forecasting
with semi-parametric additive models," IEEE Transactions on Smart Grid, vol. 5, no. 1, pp. 440-446, 2014.
[77] R. Nedellec, J. Cugliari and Y. Goude, "Gefcom2012: Electricity load forecasting and backcasting
with semi-parametric models," International Journal of Forecasting, vol. 30, no. 2, p. 375–381, 2014.
[78] T. Hong, P. Pinson and S. Fan, "Global Energy Forecasting Competition 2012," International Journal
of Forecasting, vol. 30, no. 2, pp. 357-363, 2014.
[79] M. Kutner, C. Nachtsheim, J. Neter and W. Li, Applied Linear Statistical Models, McGraw-
Hill/Irwin, 2004.
[80] C. Cortes and V. Vapnik, "Support-Vector Networks," Machine Learning, vol. 20, pp. 273-297,
1995.
[81] C.-C. Chang and C.-J. Lin, "LIBSVM: A Library for Support Vector Machines," 2001. [Online].
Available: https://www.csie.ntu.edu.tw/~cjlin/libsvm/. [Accessed 2016].
[82] S. Arlot and A. Celisse, "A survey of cross-validation procedures for model selection," Statistical
Surveys, vol. 4, pp. 40-79, 2010.
[83] T. Hastie and R. Tibshirani, Generalized Additive Models, London, U.K.: Chapman & Hall, 2006.
[84] S. Wood, "mgcv: GAMs and generalized ridge regression for R," R News, vol. 1, no. 2, pp. 20-25,
2001.
[85] S. Wood, "Stable and efficient multiple smoothing parameter estimation for generalized additive
models," Journal of the American Statistical Association, vol. 99, p. 673–686, 2004.
[86] S. Wood, "Fast stable restricted maximum likelihood and marginal likelihood estimation of
semiparametric generalized linear models," Journal of the Royal Statistical Society B, vol. 73, no. 1, p. 3–
36, 2011.
[88] S. Katipamula, R. Underhill, J. Goddard, G. Liu and S. Eaton, "Instructor Manual: Re-tuning
Large Laboratory (PNNL-SA-82896)," U. S. Department of Energy, Pacific Northwest National
Laboratory, 2011.
[89] M. Muthalib and C. Nwankpa, "Physically Based Building Load Model for Electric Grid
Operation and Planning," IEEE Transactions on Smart Grid, 2016.
[90] P. Pinson, "Estimation of the uncertainty in wind power forecasting," Ph.D. dissertation, Centre
Energétique et Procédés–Ecole des Mines de Paris, Paris, France, 2006.
[91] G. Verschuuren, "Confidence Intervals vs. Prediction Intervals," 2014. [Online]. Available:
https://www.youtube.com/watch?v=o0UESA3UZss. [Accessed 2016].
[92] T. Heskes, "Practical Confidence and Prediction Intervals," in Advances in Neural Information
Processing Systems 9, M. Mozer, M. Jordan and T. Petsche, Eds., Cambridge, MIT Press, 1997.
[93] L. Wu, M. Shahidehpour and T. Li, "Stochastic security-constrained unit commitment," IEEE
Transactions on Power Systems, vol. 22, no. 2, pp. 800-811, 2007.
[94] Y. Wang, Q. Xia and C. Kang, "Unit commitment with volatile node injections by using interval
optimization," IEEE Transactions on Power Systems, vol. 26, no. 3, pp. 1705-1713, 2011.
[95] J. Hoffer and P. Dorfner, "Reliability and production cost calculation with peak load forecast
uncertainty," International Journal of Electrical Power & Energy Systems, vol. 13, no. 4, pp. 223-229,
1991.
[97] R. Billinton and D. Huang, "Effects of load forecast uncertainty on bulk electric system reliability
evaluation," IEEE Transactions on Power Systems, vol. 23, no. 2, pp. 418-425, 2008.
[98] T. Hong and S. Fan, "Probabilistic electric load forecasting: A tutorial review," International
Journal of Forecasting, vol. 32, pp. 914-938, 2016.
[100] R. Weron, "Electricity price forecasting: A review of the state-of-the-art with a look into the
future," International Journal of Forecasting, vol. 30, no. 4, pp. 1030-1081, 2014.
[101] T. Hong, J. Wilson and J. Xie, "Long term probabilistic forecasting and normalization with hourly
information," IEEE Transactions on Smart Grid, vol. 5, no. 1, pp. 456-462, 2014.
[102] R. Hyndman and S. Fan, "Density Forecasting for long-term peak electricity demand," IEEE
Transactions on Power Systems, vol. 25, pp. 1142-1153, 2010.
[103] J. Bremnes, "Probabilistic wind power forecasts using local quantile regression," Wind Energy,
vol. 7, no. 1, pp. 47-54, 2004.
[104] T. Hong, P. Pinson, S. Fan, H. Zareipour, A. Troccoli and R. Hyndman, "Probabilistic energy
forecasting: Global Energy Forecasting Competition 2014 and beyond," International Journal of
Forecasting, vol. 32, no. 3, pp. 896-913, 2016.
[105] R. Winkler, "A decision-theoretic approach to interval estimation," Journal of the American
Statistical Association, vol. 67, pp. 187-191, 1972.
[106] B. Liu, J. Nowotarski, T. Hong and R. Weron, "Probabilistic load forecasting via quantile
regression averaging on sister forecasts," IEEE Transactions on Smart Grid, in press.
[107] P. F. Christoffersen, "Evaluating interval forecasts," International Economic Review, vol. 39, no.
4, p. 841–862, 1998.
[108] "IEEE Stability Special: Voltage Stability of Power Systems: Concepts, Analytical Tools and
Industry Experience," IEEE Special Publication, 1990.
[109] C. Taylor, Power System Voltage Stability, Electric Power Research Institute, McGraw-Hill,
1994.
[110] D. Hill, "Nonlinear dynamic load models with recovery for voltage stability studies," IEEE
Transactions on Power Systems, vol. 8, no. 1, pp. 166- 176, 1993.
[111] D. Karlsson and D. Hill, "Modeling and identification of nonlinear dynamic loads in power
systems," IEEE Transactions on Power Systems, vol. 9, no. 1, pp. 157-166, 1994.
[112] S. Casper, C. Nwankpa, R. Fischl, A. Devito and S. Readinger, "On substation tests for load
modeling," in Proceedings of the 26th North American Power Symposium (NAPS) Part II, Manhattan, KS,
1995.
[113] Y. Liang, R. Fischl, A. DeVito and S. Readinger, "Dynamic reactive load model," IEEE
Transactions on Power Systems, vol. 13, no. 4, pp. 1365-1372, 1998.
[114] IEEE Task Force on Load Model Representation for Voltage Stability Studies, "Bibliography on
Load Models for Power Flow and Dynamic Performance Simulation," IEEE Transactions on Power
Systems, vol. 10, no. 1, pp. 523-538, 1995.
[115] B.-K. Choi, H.-D. Chiang, Y. Li, H. Li, Y.-T. Chen, D.-H. Huang and M. Lauby, "Measurement-
Based Dynamic Load Models: derivation, comparison, and validation," IEEE Transactions on Power
Systems, vol. 21, no. 3, pp. 1276 - 1283, 2006.
[116] I. R. Navarro, "Dynamic Load Models for Power Systems: Estimation of Time-Varying
Parameters During Normal Operation," Ph.D dissertation, Department of Industrial Electrical Engineering
and Automation, Lund University, Lund, Sweden, 2002.
[118] J. Currie and D. Wilson, "Opti: Lowering the Barrier Between Open Source Optimizers and the
Industrial MATLAB User," in Foundations of Computer-Aided Process Operations, Savannah, Georgia,
2012.
[119] B. Stephen, J. Shaw, J. Berardino, S. Galloway and C. Nwankpa, "Intra-day Error Dependency
Boosting for Short Term Building and LV Network Load Forecast Models," IEEE Transactions on Power
Systems.
APPENDICES
Appendix A. LIST OF NOMENCLATURE
In order of appearance:
Appendix B. ADDITIONAL FORECASTING RESULTS AND PREDICTION INTERVALS
This Appendix presents the additional forecasting case study results from Chapter 3 and Chapter 4. These results are presented here without commentary; refer to the corresponding chapters for discussion of the models, data, and evaluation procedures.
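For convenience, the MAPE values in Table B.1 are computed separately for each 5-minute interval of the day across the 116 out-of-sample day-ahead forecasts. The expression below is the standard form of this calculation and is given here only as a reminder; the notation (y for the measured load, y-hat for the forecast) is illustrative, and the formal definition used in this work appears in Chapter 3.

\mathrm{MAPE}_t = \frac{100}{N} \sum_{d=1}^{N} \left| \frac{y_{d,t} - \hat{y}_{d,t}}{y_{d,t}} \right|, \qquad N = 116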
Table B.1 MAPE (%) results at 5-minute resolution for all 116 day-ahead forecasts. For each model (MLR, NN, SVM, SAM), the first column ("No Bldg") is the forecast without building measurements and the second ("Bldg") is the forecast with building measurements.
Time | MLR: No Bldg, Bldg | NN: No Bldg, Bldg | SVM: No Bldg, Bldg | SAM: No Bldg, Bldg
0:00 4.46 4.34 6.45 4.50 3.80 3.62 5.64 5.42
0:05 4.14 3.99 6.62 5.02 3.88 3.81 5.67 5.34
0:10 4.35 4.22 7.00 5.29 4.13 4.14 5.92 5.35
0:15 4.51 4.36 7.21 5.51 4.39 4.39 6.13 5.44
0:20 4.94 4.86 7.84 6.28 5.09 5.13 6.77 6.25
0:25 5.09 5.04 7.93 6.67 5.35 5.38 7.12 6.43
0:30 5.34 5.35 8.28 7.08 5.87 6.02 7.46 6.70
0:35 5.54 5.53 8.35 7.11 5.96 6.10 7.78 6.88
0:40 5.79 5.76 8.84 7.48 6.57 6.69 7.85 7.02
0:45 6.13 6.16 9.31 7.97 7.04 7.18 8.19 7.25
0:50 5.72 5.84 9.47 7.76 6.89 7.05 7.90 6.79
0:55 5.89 5.96 9.42 7.79 6.94 7.14 7.98 6.85
1:00 5.54 5.48 8.44 5.81 7.08 7.13 7.59 6.70
1:05 5.45 5.40 8.37 5.80 6.95 6.98 7.61 6.74
1:10 5.38 5.32 7.96 5.84 6.80 6.80 7.53 6.74
1:15 5.25 5.18 8.00 5.74 6.69 6.66 7.29 6.49
1:20 5.39 5.29 7.75 5.74 6.74 6.71 7.41 6.62
1:25 5.23 5.19 7.78 5.74 6.68 6.63 7.38 6.60
1:30 5.32 5.25 7.74 5.58 6.74 6.71 7.49 6.60
1:35 5.24 5.21 7.75 5.54 6.73 6.70 7.37 6.47
1:40 5.16 5.16 7.68 5.28 6.78 6.66 7.21 6.32
1:45 5.60 5.57 7.56 5.51 7.01 6.92 7.70 6.88
1:50 5.36 5.33 7.40 5.33 6.79 6.73 7.41 6.57
1:55 5.52 5.62 7.80 5.51 7.39 7.12 8.09 6.70
2:00 10.40 7.66 8.46 10.10 6.40 5.65 9.85 9.41
2:05 8.49 9.87 8.33 10.66 6.48 5.98 11.37 8.48
2:10 6.93 7.99 8.01 9.32 6.10 5.44 10.42 8.05
2:15 5.87 6.59 7.77 7.89 5.76 5.38 9.89 8.04
2:20 6.20 6.46 7.77 7.09 5.53 5.29 9.61 8.07
2:25 7.37 7.55 8.02 6.80 6.03 5.93 10.17 9.05
2:30 8.76 8.81 8.57 7.02 6.60 6.59 10.72 9.56
2:35 9.24 9.18 8.63 7.20 6.75 6.70 11.16 10.12
2:40 9.40 9.49 9.35 7.19 6.71 6.63 11.34 10.45
2:45 9.73 9.64 9.31 7.29 6.93 7.02 11.62 10.41
2:50 9.98 10.16 9.66 7.62 7.21 7.26 11.90 10.69
2:55 9.98 9.90 9.47 7.25 7.15 7.01 11.53 10.39
3:00 8.22 8.41 12.92 7.99 6.95 7.27 11.52 10.00
3:05 8.09 8.43 12.53 7.89 6.69 7.01 11.92 10.10
3:10 8.33 8.45 12.93 8.39 7.14 7.45 11.94 10.46
3:15 8.90 8.91 13.10 8.21 7.07 7.34 12.10 10.50
3:20 8.72 8.76 12.74 8.05 6.97 7.18 11.64 10.27
3:25 8.78 8.70 12.51 8.13 6.99 7.24 11.79 10.36
3:30 8.50 8.71 12.25 7.92 6.57 6.79 11.78 10.32
3:35 8.58 8.60 12.85 8.29 6.86 7.18 11.22 10.17
3:40 8.16 8.30 12.19 7.93 6.48 6.76 11.14 9.91
3:45 8.00 8.33 11.83 7.60 6.15 6.49 11.24 9.78
3:50 8.15 8.44 12.23 8.24 6.47 6.93 11.06 9.96
3:55 8.06 8.67 11.87 8.25 6.24 6.65 10.91 10.05
4:00 8.38 7.81 10.36 6.20 5.87 5.93 10.59 9.54
4:05 8.19 7.77 9.73 5.97 5.62 5.67 10.69 9.38
4:10 8.24 8.02 9.73 6.17 5.87 5.88 10.80 9.52
4:15 8.32 7.98 9.94 6.46 6.03 6.03 10.45 9.14
4:20 8.14 7.89 9.85 6.56 6.02 6.05 10.35 8.87
4:25 8.09 7.95 9.62 6.60 5.98 6.18 10.50 8.78
4:30 8.33 8.10 10.25 6.83 6.27 6.46 10.61 8.99
4:35 8.61 8.54 10.46 7.16 6.32 6.55 11.00 9.40
4:40 8.37 8.72 10.63 7.30 6.57 6.97 10.62 9.36
4:45 8.02 8.83 10.64 7.51 6.17 6.73 10.43 9.49
4:50 7.75 8.72 10.63 7.56 6.22 6.81 10.30 9.90
4:55 7.75 8.75 10.97 7.90 6.41 6.99 10.41 10.17
5:00 13.51 11.53 13.30 7.89 11.77 12.03 11.70 12.91
5:05 12.43 10.96 12.42 7.64 10.96 11.55 10.91 12.04
5:10 11.73 10.64 11.59 7.52 10.26 11.06 10.54 11.37
5:15 11.03 10.30 10.95 7.56 9.90 10.58 10.33 11.24
5:20 10.42 10.04 10.59 7.32 9.07 9.81 10.26 10.90
5:25 10.05 9.94 10.25 7.49 8.73 9.41 9.83 10.72
5:30 9.53 9.65 9.97 7.52 8.10 8.91 9.80 10.24
5:35 8.92 9.18 9.22 7.77 7.77 8.43 9.89 9.87
5:40 8.76 8.98 9.21 7.75 7.41 7.99 9.68 9.49
5:45 8.28 8.50 8.92 7.51 6.92 7.36 9.20 8.86
5:50 7.88 8.04 8.88 7.97 6.62 6.80 9.08 8.23
5:55 7.63 7.59 9.19 8.75 6.44 6.21 8.87 7.88
6:00 8.56 8.77 9.13 7.74 9.07 9.34 8.84 8.32
6:05 8.01 8.14 8.27 7.09 7.98 8.20 8.25 8.00
6:10 7.66 7.75 7.29 6.75 7.14 7.26 7.75 7.58
6:15 6.70 6.75 6.27 5.92 5.93 6.02 7.11 6.77
6:20 6.42 6.45 6.21 5.90 5.43 5.42 7.03 6.39
6:25 6.33 6.38 6.50 6.08 5.45 5.34 7.04 6.30
6:30 6.06 6.11 6.65 6.18 5.28 5.15 7.17 6.36
6:35 6.07 6.13 6.58 6.02 5.37 5.20 7.23 6.44
6:40 5.77 5.81 6.12 5.65 5.09 4.95 7.00 6.09
6:45 5.70 5.71 6.18 5.68 5.01 4.85 6.83 5.99
6:50 5.66 5.65 5.95 5.36 4.84 4.74 6.54 5.99
6:55 5.61 5.66 6.04 5.36 4.84 4.72 6.35 5.73
7:00 5.27 5.26 6.33 5.25 5.20 5.18 5.96 5.47
7:05 5.35 5.34 6.41 5.48 5.51 5.55 6.33 5.69
7:10 5.45 5.40 6.50 5.52 5.58 5.64 6.29 5.89
7:15 5.45 5.51 6.39 5.43 5.59 5.62 6.18 5.83
7:20 5.08 5.12 6.12 5.17 5.15 5.17 5.94 5.55
7:25 5.10 5.11 6.23 5.15 5.05 5.15 5.94 5.65
7:30 5.38 5.46 6.32 5.38 5.20 5.30 6.30 5.93
7:35 5.07 5.12 6.36 4.95 4.77 4.95 5.79 5.64
7:40 5.35 5.41 6.58 5.30 4.96 5.06 5.90 5.91
7:45 5.25 5.30 6.72 5.05 4.84 5.00 5.95 5.90
7:50 5.20 5.26 6.45 4.95 4.62 4.72 6.00 5.81
7:55 5.36 5.45 6.78 5.14 4.64 4.71 6.04 5.89
8:00 5.32 5.42 7.49 5.59 7.20 7.05 5.90 5.83
8:05 5.66 5.73 7.66 5.99 7.30 7.07 6.39 6.36
8:10 5.24 5.36 7.30 5.56 6.66 6.48 6.34 6.07
8:15 5.32 5.33 7.31 5.70 6.82 6.62 6.38 6.16
8:20 5.43 5.44 7.45 5.65 6.74 6.57 6.04 6.20
8:25 5.15 5.10 6.79 5.44 6.18 6.02 5.74 5.84
8:30 5.27 5.28 6.52 5.30 5.79 5.65 5.83 5.92
8:35 5.14 5.14 6.17 5.26 5.58 5.44 5.77 5.76
8:40 5.57 5.48 6.55 5.37 5.89 5.78 6.15 6.17
8:45 5.92 5.75 6.90 5.58 5.91 5.83 6.44 6.56
8:50 5.69 5.62 6.66 5.37 5.73 5.65 6.61 6.57
8:55 5.86 5.82 6.43 5.49 5.81 5.66 6.71 6.64
9:00 5.97 5.87 6.35 5.82 6.74 6.61 6.53 6.36
9:05 5.78 5.65 6.40 5.65 6.51 6.42 6.28 6.20
9:10 5.46 5.32 6.06 5.36 6.08 6.00 6.07 5.97
9:15 5.27 5.16 5.78 5.25 5.92 5.88 6.04 5.88
9:20 5.07 5.02 5.82 5.31 5.67 5.53 5.65 5.49
9:25 5.17 5.15 6.06 5.26 5.73 5.64 5.79 5.68
9:30 4.97 4.95 5.98 5.25 5.49 5.37 5.78 5.71
9:35 4.99 5.01 6.11 5.31 5.56 5.49 5.79 5.87
9:40 5.11 5.09 6.06 5.41 5.36 5.26 5.80 5.94
9:45 5.04 5.00 5.88 5.30 5.13 5.04 5.88 5.84
9:50 4.99 5.02 5.59 5.49 4.97 4.83 5.77 5.64
9:55 5.13 5.05 5.39 5.34 4.67 4.55 5.88 5.79
10:00 5.37 5.29 6.49 5.75 6.44 6.14 5.43 5.58
10:05 5.30 5.18 6.19 5.46 6.08 5.83 5.45 5.44
10:10 5.17 5.10 6.05 5.54 5.95 5.66 5.40 5.53
10:15 4.95 4.86 6.11 5.19 5.65 5.35 5.38 5.43
10:20 5.03 4.93 6.19 5.20 5.55 5.33 5.41 5.42
10:25 4.85 4.76 6.06 5.06 5.41 5.14 5.27 5.22
10:30 4.83 4.74 5.87 4.88 5.18 4.90 5.34 5.20
10:35 4.72 4.60 5.76 4.95 5.15 4.86 5.24 5.31
10:40 4.70 4.59 5.65 5.09 5.04 4.79 5.37 5.34
10:45 4.79 4.68 5.80 5.16 5.05 4.81 5.40 5.38
10:50 5.01 4.90 5.91 5.39 5.23 5.02 5.52 5.52
10:55 5.22 5.09 5.82 5.32 5.16 4.93 5.82 5.74
11:00 5.43 5.34 6.82 5.11 5.66 5.71 5.97 5.72
11:05 5.39 5.26 6.77 5.06 5.46 5.47 5.85 5.61
11:10 5.10 4.98 6.40 4.90 5.09 5.12 5.81 5.56
11:15 4.97 4.86 6.44 4.79 4.97 4.97 5.54 5.30
11:20 5.02 4.97 6.39 5.03 4.87 4.89 5.71 5.48
11:25 4.95 4.90 6.42 5.01 4.69 4.70 5.64 5.47
11:30 4.89 4.84 6.44 4.86 4.76 4.74 5.58 5.38
11:35 4.84 4.82 6.09 4.95 4.69 4.62 5.68 5.37
11:40 4.77 4.72 6.42 4.94 4.78 4.71 5.58 5.26
11:45 4.78 4.68 6.28 4.76 4.56 4.50 5.57 5.28
11:50 4.53 4.38 5.99 4.68 4.33 4.21 5.29 4.99
11:55 4.45 4.31 6.02 4.57 4.56 4.42 5.19 4.92
12:00 4.50 4.41 5.82 4.82 4.35 4.54 5.15 5.00
12:05 4.41 4.31 5.82 4.83 4.46 4.69 4.99 4.89
12:10 4.51 4.45 5.62 4.75 4.49 4.68 5.07 4.88
12:15 4.49 4.40 5.80 4.82 4.52 4.75 5.14 4.95
12:20 4.45 4.34 5.78 4.77 4.46 4.60 4.95 4.79
12:25 4.38 4.31 5.57 4.88 4.35 4.49 5.07 4.94
12:30 4.25 4.15 5.48 4.66 4.49 4.66 4.87 4.75
12:35 4.42 4.29 5.65 4.87 4.39 4.59 4.89 4.85
12:40 4.21 4.06 5.55 4.79 4.30 4.45 4.90 4.83
12:45 4.46 4.41 5.52 5.00 4.40 4.58 5.20 4.95
12:50 4.37 4.34 5.43 4.76 4.35 4.55 5.38 4.81
12:55 4.46 4.45 5.67 4.73 4.54 4.68 5.11 4.71
13:00 4.31 4.33 6.12 5.25 4.13 4.20 5.03 4.76
13:05 4.43 4.46 6.29 5.20 4.11 4.07 5.04 4.76
13:10 4.45 4.50 6.41 5.02 4.07 4.04 5.00 4.72
13:15 4.46 4.47 6.35 5.04 4.17 4.10 4.97 4.74
13:20 4.44 4.42 6.22 5.10 4.13 4.00 5.13 4.80
13:25 4.44 4.44 6.12 4.88 3.89 3.78 5.25 4.70
13:30 4.49 4.49 6.25 5.24 4.12 3.98 5.09 4.64
13:35 4.55 4.57 6.16 5.16 4.16 4.04 5.09 4.66
13:40 4.42 4.36 6.12 5.12 4.12 4.01 5.01 4.64
13:45 4.35 4.33 6.23 5.06 4.05 3.96 5.00 4.60
13:50 4.43 4.33 6.31 5.10 4.02 3.99 4.93 4.63
13:55 4.44 4.34 6.29 5.21 3.98 3.89 5.09 4.71
14:00 4.66 4.59 5.79 4.60 4.02 3.86 5.18 4.86
14:05 4.79 4.64 5.72 4.70 4.01 3.80 5.21 4.96
14:10 4.65 4.51 5.46 4.65 4.02 3.95 5.21 4.90
14:15 4.91 4.74 5.63 4.67 4.05 3.93 5.40 5.03
14:20 4.94 4.79 5.66 4.64 4.09 3.90 5.32 5.14
14:25 4.84 4.59 5.46 4.50 3.91 3.72 5.30 5.04
14:30 4.90 4.69 5.53 4.59 3.81 3.63 5.25 4.97
14:35 4.83 4.72 5.48 4.69 3.87 3.71 5.31 4.83
14:40 4.89 4.76 5.59 4.99 4.09 3.95 5.28 5.04
14:45 4.91 4.68 5.58 4.81 4.18 3.96 5.18 5.08
14:50 5.14 4.86 5.47 4.66 3.89 3.77 5.35 5.24
14:55 5.20 4.95 5.69 4.91 3.98 3.75 5.56 5.31
15:00 4.84 4.71 5.67 4.99 4.27 3.96 5.23 5.06
15:05 4.86 4.65 5.65 5.07 4.08 3.80 5.26 4.99
15:10 4.93 4.67 5.74 5.13 4.12 3.93 5.31 4.96
15:15 5.07 4.80 5.65 5.07 4.06 3.81 5.40 5.07
15:20 5.08 4.88 5.63 5.22 4.30 3.94 5.55 5.24
15:25 5.22 5.04 5.60 5.21 4.37 4.05 5.57 5.38
15:30 5.26 5.07 5.83 5.10 4.19 4.00 5.52 5.21
15:35 5.12 4.96 5.91 5.06 4.54 4.24 5.44 5.26
15:40 5.18 5.02 5.81 5.06 4.23 3.93 5.48 5.17
15:45 5.14 4.99 5.87 5.07 4.26 3.97 5.50 5.22
15:50 5.05 4.80 5.93 4.92 4.10 3.86 5.35 5.04
15:55 4.92 4.68 5.90 4.76 3.86 3.72 5.34 4.97
16:00 4.85 4.69 5.59 5.57 4.10 3.78 5.34 4.87
16:05 5.01 4.80 5.52 5.70 4.10 3.76 5.47 4.95
16:10 5.05 4.86 5.44 5.70 4.27 3.92 5.58 4.99
16:15 4.80 4.64 5.41 5.57 4.22 3.82 5.57 4.91
16:20 4.81 4.72 5.57 5.57 4.17 3.81 5.58 4.88
16:25 4.77 4.74 5.68 5.61 4.28 3.95 5.60 4.89
16:30 4.83 4.72 5.83 5.57 4.37 3.96 5.65 5.08
16:35 4.91 4.82 5.88 5.89 4.40 4.02 5.60 5.23
16:40 4.96 4.93 5.96 5.87 4.37 4.00 5.75 5.08
16:45 4.94 4.88 5.79 5.62 4.30 3.96 5.69 5.05
16:50 4.88 4.75 5.77 5.55 4.07 3.70 5.66 5.04
16:55 4.89 4.78 5.67 5.42 4.14 3.88 5.48 4.92
17:00 5.03 4.89 4.93 5.77 4.41 3.71 5.42 4.92
17:05 4.64 4.45 4.85 5.58 4.22 3.51 5.34 4.75
17:10 4.75 4.55 4.90 5.43 4.06 3.42 5.23 4.76
17:15 4.70 4.60 4.93 5.43 4.04 3.41 5.33 4.73
17:20 4.69 4.59 5.06 5.17 4.21 3.57 5.20 4.65
17:25 4.60 4.53 4.92 5.18 3.99 3.21 5.30 4.50
17:30 4.49 4.52 4.96 5.01 3.93 3.29 5.26 4.52
17:35 4.41 4.38 5.00 4.88 3.80 3.06 5.14 4.49
17:40 4.51 4.54 5.36 4.92 3.91 3.29 5.34 4.65
17:45 4.36 4.42 5.22 4.83 3.88 3.26 5.18 4.53
17:50 4.39 4.57 5.27 4.52 3.79 3.21 5.22 4.51
17:55 4.18 4.31 5.29 4.56 3.78 3.31 4.93 4.37
18:00 4.55 4.22 5.01 7.36 4.56 3.93 5.32 4.74
18:05 4.59 4.27 5.02 7.26 4.65 4.07 5.21 4.62
18:10 4.37 4.07 4.89 7.32 4.57 4.03 4.99 4.46
18:15 4.49 4.23 4.70 7.12 4.34 3.87 5.03 4.53
18:20 4.51 4.29 4.85 7.29 4.29 3.83 5.30 4.62
18:25 4.45 4.16 4.36 7.04 4.07 3.52 5.18 4.62
18:30 4.57 4.42 4.67 6.99 4.21 3.74 5.49 4.92
18:35 4.32 4.17 4.66 6.91 3.94 3.56 5.47 4.81
18:40 4.37 4.37 4.90 6.66 3.96 3.72 5.59 4.79
18:45 4.10 4.00 4.64 6.60 3.65 3.35 5.31 4.47
18:50 4.28 4.16 4.96 6.65 3.75 3.43 5.40 4.56
18:55 4.31 4.30 5.26 6.53 3.79 3.59 5.56 4.61
19:00 4.60 4.28 4.47 6.87 4.18 3.84 5.54 4.87
19:05 4.65 4.39 4.79 6.83 4.22 3.90 5.67 5.06
19:10 4.58 4.29 4.62 7.02 4.06 3.80 5.63 4.86
19:15 4.92 4.66 4.80 6.75 4.26 3.93 5.98 5.24
19:20 4.79 4.51 4.73 6.68 4.08 3.82 5.78 5.14
19:25 4.88 4.64 4.95 6.79 4.30 4.01 6.05 5.21
19:30 4.83 4.65 5.03 6.77 4.23 3.96 6.02 5.24
19:35 4.80 4.67 5.04 6.55 4.17 3.98 5.90 5.11
19:40 4.69 4.54 4.97 6.58 4.14 3.90 5.67 5.02
19:45 4.58 4.48 4.86 6.42 4.16 3.95 5.58 4.92
19:50 4.46 4.32 4.79 6.29 3.93 3.71 5.58 4.79
19:55 4.40 4.33 4.87 6.05 3.81 3.67 5.62 4.91
20:00 4.60 4.53 4.86 5.06 3.89 3.73 5.60 5.04
20:05 4.35 4.25 4.58 5.01 3.85 3.58 5.44 4.83
20:10 4.49 4.48 4.70 5.00 3.72 3.55 5.68 4.86
20:15 4.60 4.57 4.79 4.87 3.62 3.45 5.82 4.88
20:20 4.50 4.49 4.86 4.82 3.70 3.58 5.73 4.79
20:25 4.62 4.64 5.17 4.78 3.95 3.85 5.66 4.83
20:30 4.69 4.74 4.96 4.61 4.06 3.99 5.97 4.91
20:35 4.50 4.60 5.06 4.66 4.01 4.03 5.85 4.88
20:40 4.51 4.66 5.10 4.51 3.83 3.78 5.76 4.76
20:45 4.64 4.80 5.40 4.65 4.24 4.19 5.96 4.82
20:50 4.74 4.98 5.58 4.70 4.41 4.35 6.14 4.82
20:55 4.56 4.69 5.54 4.57 4.27 4.11 5.95 4.83
21:00 4.78 4.95 6.44 5.41 4.12 4.17 6.19 5.09
21:05 4.98 5.15 6.75 5.53 4.20 4.23 6.36 5.24
21:10 5.10 5.24 6.71 5.54 4.40 4.48 6.44 5.40
21:15 4.92 5.10 6.79 5.08 4.05 4.17 6.43 5.31
21:20 4.97 5.19 6.96 5.16 4.15 4.26 6.40 5.25
21:25 4.86 5.19 7.85 4.98 4.32 4.41 6.46 5.27
21:30 4.74 5.06 7.60 5.24 4.19 4.33 6.34 5.26
21:35 4.90 5.32 8.05 4.85 4.44 4.52 6.55 5.11
21:40 5.16 5.60 8.38 4.92 4.82 4.85 6.72 5.26
21:45 4.99 5.46 8.42 5.05 4.66 4.68 6.62 5.23
21:50 5.17 5.58 8.75 5.12 4.82 4.84 6.78 5.43
21:55 5.21 5.71 8.81 5.31 4.77 4.79 6.83 5.40
22:00 5.07 5.25 7.48 5.11 4.30 4.40 6.49 5.17
22:05 4.97 5.24 7.71 5.04 4.15 4.23 6.51 5.27
22:10 5.09 5.32 7.62 5.23 4.21 4.28 6.58 5.21
22:15 5.22 5.50 8.03 5.20 4.24 4.34 6.77 5.38
22:20 4.99 5.20 7.79 4.84 4.17 4.24 6.56 5.37
22:25 5.00 5.25 7.91 5.08 4.25 4.41 6.65 5.41
22:30 5.20 5.51 8.38 5.10 4.44 4.57 6.87 5.56
22:35 5.06 5.35 8.28 5.07 4.47 4.50 6.51 5.40
22:40 5.04 5.34 8.83 5.26 4.57 4.65 6.63 5.68
22:45 5.15 5.46 8.77 5.27 4.66 4.78 6.68 5.62
22:50 5.19 5.49 9.03 5.50 4.60 4.78 6.54 5.69
22:55 5.10 5.46 9.18 5.68 4.53 4.68 6.69 5.71
23:00 4.85 4.94 5.74 5.65 4.59 4.81 6.39 5.57
23:05 4.84 4.97 5.85 5.32 4.54 4.74 6.50 5.55
23:10 5.03 5.07 6.14 5.75 4.70 5.00 6.49 5.83
23:15 5.01 5.05 6.03 5.53 4.53 4.76 6.56 5.83
23:20 5.15 5.22 6.25 5.63 4.72 4.99 6.85 5.93
23:25 4.86 4.96 6.16 5.33 4.69 4.83 6.53 5.66
23:30 4.80 4.95 6.22 5.54 4.69 4.94 6.54 5.60
23:35 5.00 5.11 6.68 5.62 4.75 5.13 6.49 5.87
23:40 4.97 5.09 6.38 5.60 4.67 4.85 6.41 5.69
23:45 5.11 5.27 6.45 5.54 4.86 5.05 6.73 5.96
23:50 5.17 5.38 6.47 5.28 4.87 5.03 6.87 5.88
23:55 5.04 5.20 6.89 5.44 4.81 5.11 6.67 5.94
Hourly empirical error distributions for the MLR models
Figures B.1 through B.24: MLR empirical error distributions, (L) without building measurements and (R) with building measurements, for each one-hour window of the day from 00:00-01:00 (Figure B.1) through 23:00-24:00 (Figure B.24).
Hourly empirical error distributions for the NN models
Figures B.25 through B.48: NN empirical error distributions, (L) without building measurements and (R) with building measurements, for each one-hour window of the day from 00:00-01:00 (Figure B.25) through 23:00-24:00 (Figure B.48).
Hourly empirical error distributions for the SVM models
Figures B.49 through B.72: SVM empirical error distributions, (L) without building measurements and (R) with building measurements, for each one-hour window of the day from 00:00-01:00 (Figure B.49) through 23:00-24:00 (Figure B.72).
Hourly empirical error distributions for the SAM models
Figures B.73 through B.96: SAM empirical error distributions, (L) without building measurements and (R) with building measurements, for each one-hour window of the day from 00:00-01:00 (Figure B.73) through 23:00-24:00 (Figure B.96).
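The remaining figures report the average hourly skill score of the day-ahead prediction intervals from Chapter 4, for nominal coverage rates from 5% to 95% in 5% increments. The exact scoring rule is defined in Chapter 4; as an illustrative sketch only (not necessarily the definition used in this thesis), a widely used interval score for a central prediction interval [l, u] with nominal coverage (1 - \alpha) and realized load y is the Winkler score of [105]:

S_{\alpha}(l, u, y) = \begin{cases} (u - l) + \frac{2}{\alpha}(l - y), & y < l \\ (u - l), & l \le y \le u \\ (u - l) + \frac{2}{\alpha}(y - u), & y > u \end{cases}

Skill scores of this type are typically negatively oriented, so values closer to zero in the following figures indicate sharper and better-calibrated intervals.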
Average hourly skill score for the MLR models
Figures B.97 through B.115: Average hourly skill score versus hour of day for the MLR models, without and with building measurements, at nominal coverage rates from 5% (Figure B.97) to 95% (Figure B.115) in 5% increments.
Average hourly skill score for the NN models
Figures B.116 through B.134: Average hourly skill score versus hour of day for the NN models, without and with building measurements, at nominal coverage rates from 5% (Figure B.116) to 95% (Figure B.134) in 5% increments.
Average hourly skill score for the SVM models
Figures B.135 through B.153: Average hourly skill score versus hour of day for the SVM models, without and with building measurements, at nominal coverage rates from 5% (Figure B.135) to 95% (Figure B.153) in 5% increments.
Average hourly skill score for the SAM models
Figures B.154 through B.172: Average hourly skill score versus hour of day for the SAM models, without and with building measurements, at nominal coverage rates from 5% (Figure B.154) to 95% (Figure B.172) in 5% increments.
VITA
SELECT PUBLICATIONS:
[J1]. Jonathan Berardino, Chika Nwankpa, “Enhanced Demand Forecasting Through
Inclusion of Building-Specific Measurements”, Submitted to IEEE Transactions on
Power Systems
[J2]. Jonathan Berardino and Chika Nwankpa, "Demand Scheduling for Multi-
Building Campuses Considering Building Dynamic Behavior", (In preparation)
[J3]. Bruce Stephen, Jennifer Shaw, Jonathan Berardino, Stuart Galloway, Chika
Nwankpa, “Intra-day Error Dependency Boosting for Short Term Building and LV
Network Load Forecast Models” Submitted to IEEE Transactions on Smart Grid
[C1]. Jonathan Berardino and Chika Nwankpa, "Statistics of Building-Specific Load
Forecasting Models", Proceedings of the North American Power Symposium 2013,
September 2013, Manhattan, KS
[C2]. Jonathan Berardino and Chika Nwankpa, "Inclusion of Temporal Effects in
Forecasting Building Electrical Loads for Demand Resource Planning", Proceedings of
the IEEE PowerTech 2013 Conference, June 2013, Grenoble, France
[C3]. Jonathan Berardino and Chika Nwankpa, "Economic Demand Dispatch of
Controllable Building Electrical Loads Incorporating Delayed Response Times",
Innovative Smart Grid Technologies (ISGT), 2013, Washington DC
[C4]. Jonathan Berardino and Chika Nwankpa, "Interval-Specific Building Load
Forecasting Models for Demand Resource Planning", Proceedings of the IEEE
PowerCon 2012 Conference, October 2012, Auckland, New Zealand
[C5]. Jonathan Berardino, Mohammed Muthalib, Chika Nwankpa, “Network
Constrained Economic Demand Dispatch of Controllable Building Electric Loads”,
Innovative Smart Grid Technologies (ISGT), 2012, Washington DC
[C6]. Jonathan Berardino, Chika Nwankpa, Karen Miu, “Economic Demand Dispatch
of Controllable Building Electric Loads for Demand Response”, Proceedings of the IEEE
PowerTech 2011 Conference, June 2011, Trondheim, Norway
[C7]. Jonathan Berardino and Chika Nwankpa, "Dynamic Load Modeling of an
HVAC Chiller for Demand Response Applications”, IEEE International Conference on
Smart Grid Communications, 2010, Gaithersburg, MD