IET PROFESSIONAL APPLICATIONS OF COMPUTING SERIES 18

Modeling and Simulation of Complex Communication Networks
IET Book Series on Big Data—Call for Authors
Editor-in-Chief: Professor Albert Y. Zomaya, University of Sydney, Australia
The topic of Big Data has emerged as a revolutionary theme that cuts across many
technologies and application domains. This new book series brings together topics within the
myriad research activities in many areas that analyze, compute, store, manage, and transport
massive amounts of data, such as algorithm design, data mining and search, processor
architectures, databases, infrastructure development, service and data discovery, networking
and mobile computing, cloud computing, high-performance computing, privacy and security,
storage, and visualization.
Topics considered include (but are not restricted to) Internet of Things and Internet computing;
cloud computing; peer-to-peer computing; autonomic computing; data centre computing;
multicore and many core computing; parallel, distributed, and high-performance computing;
scalable databases; mobile computing and sensor networking; green computing; service
computing; networking infrastructures; cyberinfrastructures; e-science; smart cities; analytics
and data mining; Big Data applications, and more.
Proposals for coherently integrated international coedited or coauthored handbooks and
research monographs will be considered for this book series. Each proposal will be reviewed
by the editor-in-chief and some board members, with additional external reviews from
independent reviewers. Please email your book proposal for the IET book series on Big Data to
Professor Albert Y. Zomaya at [email protected] or to the IET at
[email protected].
Modeling and Simulation of
Complex Communication
Networks
Edited by
Muaz A. Niazi

The Institution of Engineering and Technology

Published by The Institution of Engineering and Technology, London, United Kingdom

The Institution of Engineering and Technology is registered as a Charity in England &
Wales (no. 211014) and Scotland (no. SC038698).

© The Institution of Engineering and Technology 2019

First published 2019

This publication is copyright under the Berne Convention and the Universal Copyright
Convention. All rights reserved. Apart from any fair dealing for the purposes of research
or private study, or criticism or review, as permitted under the Copyright, Designs and
Patents Act 1988, this publication may be reproduced, stored or transmitted, in any
form or by any means, only with the prior permission in writing of the publishers, or in
the case of reprographic reproduction in accordance with the terms of licences issued
by the Copyright Licensing Agency. Enquiries concerning reproduction outside those
terms should be sent to the publisher at the undermentioned address:

The Institution of Engineering and Technology

Michael Faraday House
Six Hills Way, Stevenage
Herts, SG1 2AY, United Kingdom

www.theiet.org

While the authors and publisher believe that the information and guidance given in this
work are correct, all parties must rely upon their own skill and judgement when making
use of them. Neither the authors nor publisher assumes any liability to anyone for any
loss or damage caused by any error or omission in the work, whether such an error or
omission is the result of negligence or any other cause. Any and all such liability
is disclaimed.

The moral rights of the authors to be identified as authors of this work have been
asserted by them in accordance with the Copyright, Designs and Patents Act 1988.

British Library Cataloguing in Publication Data

A catalogue record for this product is available from the British Library

ISBN 978-1-78561-355-5 (hardback)

ISBN 978-1-78561-356-2 (PDF)

Typeset in India by MPS Limited

Printed in the UK by CPI Group (UK) Ltd, Croydon
Contents

Preface

Part I Modeling and simulation

1 Modeling and simulation: the essence and increasing importance
Tuncer Ören, Saurabh Mittal, and Umut Durak
1.1 Introduction
1.2 Experimentation aspects of simulation
1.3 Experience aspects of simulation
1.3.1 Simulation for training
1.3.2 Simulation for entertainment
1.4 Taxonomies and ontologies of simulation
1.4.1 Background
1.4.2 Taxonomies of simulation
1.4.3 Ontologies of simulation
1.5 Evolution and increasing importance of simulation
1.6 Conclusion
Appendix A – A list of over 750 types of simulation
Appendix B – A list of 120 types of input
References

2 Flexible modeling with Simio
David T. Sturrock and C. Dennis Pegden
2.1 Overview
2.2 Simio object framework
2.3 Simio object classes
2.4 Modeling movements
2.5 Modeling physical components
2.6 Processes
2.7 Data tables
2.8 Experimentation with the model
2.9 Application programming interface
2.10 Applications in scheduling
2.11 Summary
Glossary
References

3 A simulation environment for cybersecurity attack analysis based on network traffic logs
Salva Daneshgadeh, Mehmet Uğur Öney, Thomas Kemmerich, and Nazife Baykal
3.1 Introduction
3.1.1 Network simulation
3.1.2 Network emulation
3.1.3 The application of network simulation and emulation in network security
3.1.4 Virtualization
3.1.5 Virtualization using hypervisor
3.1.6 Virtualization using container
3.1.7 Virtual machines and simulation
3.2 Literature review
3.2.1 Network anomalies and detection methods
3.2.2 Network workload generators
3.2.3 Network simulation for security studies
3.3 Methodology
3.4 Defining a simulated and virtualized test bed for network anomaly detection researches
3.4.1 GNS3
3.4.2 Ubuntu
3.4.3 Network interfaces
3.5 Simulated environment for network anomaly detection researches
3.5.1 Victim machine
3.5.2 Attacker machine
3.5.3 pfSense firewall
3.5.4 NAT and VMware host-only networks
3.5.5 Traffic generator machine
3.5.6 NTOPNG tool
3.5.7 Repository machine
3.6 Discussion and results
3.7 Summary
References

Part II Surveys and reviews

4 Demand–response management in smart grid: a survey and future directions
Waseem Akram and Muaz A. Niazi
4.1 Overview
4.2 Introduction
4.3 Backgrounds
4.3.1 Smart grid
4.3.2 Demand–response management
4.3.3 Complex systems
4.3.4 Learning-based approaches
4.4 A review of demand–response management in SG
4.4.1 Learning-based approaches
4.4.2 Complex system
4.4.3 Other techniques
4.5 Open-research problems and discussion
4.5.1 Open-research problems in learning system
4.5.2 Open-research problems in complex system
4.5.3 Open-research problems in other techniques
4.6 Conclusions
References

5 Applications of multi-agent systems in smart grid: a survey and taxonomy
Waseem Akram and Muaz A. Niazi
5.1 Overview
5.2 Introduction
5.3 A review of multi-agent system to smart-grid application
5.3.1 Communication management
5.3.2 Demand–response management
5.3.3 Fault monitoring
5.3.4 Power scheduling
5.3.5 Storage and voltage management
5.4 Open research problems and discussion
5.5 Conclusions
References

6 Shortest path models for scale-free network topologies: literature review and cross comparisons
Agnese V. Ventrella, Giuseppe Piro, and Luigi Alfredo Grieco
6.1 Mapping the Internet topology
6.1.1 Interface level
6.1.2 Router level
6.1.3 AS level
6.1.4 Geographic network topologies
6.2 Internet models based on the graph theory
6.2.1 Fundamental notions from the graph theory
6.2.2 Topology models
6.2.3 Topology generator tools
6.3 Shortest path models
6.3.1 Parameters definition
6.3.2 Shortest path models
6.3.3 Cross-comparison among shortest path models
6.3.4 Shortest path models applications
6.4 Conclusion
Acknowledgment
References

Part III Case studies and more

7 Accurate modeling of VoIP traffic in modern communication networks
Homero Toral-Cruz, Al-Sakib Khan Pathan, and Julio C. Ramírez Pacheco
7.1 Introduction
7.2 Modern communication networks: from simple packet network to multiservice network
7.3 Voice over IP (VoIP) and quality of service (QoS)
7.3.1 Basic structure of a VoIP system
7.3.2 VoIP frameworks: H.323 and SIP
7.3.3 Basic concepts of QoS
7.3.4 QoS assessment
7.3.5 One-way delay
7.3.6 Jitter
7.3.7 Packet-loss rate
7.4 Self-similarity processes in modern communication networks
7.4.1 Self-similar processes
7.4.2 Haar wavelet-based decomposition and Hurst index estimation
7.5 QoS parameters modeling on VoIP traffic
7.5.1 Jitter modeling by self-similar and multifractal processes
7.5.2 Packet-loss modeling by Markov models
7.5.3 Packet-loss simulation and proposed model
7.6 Conclusions
References

8 Exploratory and validated agent-based modeling levels case study: Internet of Things
Komal Batool and Muaz A. Niazi
8.1 Introduction
8.1.1 Agent-based modeling framework
8.1.2 Agent-based simulator
8.1.3 Case study: 5G networks and Internet of Things
8.1.4 Results and discussion
8.1.5 Conclusion
8.2 Validated agent-based modeling level case study: Internet of Things
8.2.1 Introduction
8.2.2 Validated agent-based level
8.2.3 Case study: 5G networks and Internet of Things
8.2.4 Results and discussion
8.2.5 Validation discussion
8.2.6 Conclusion
References

9 Descriptive agent-based modeling of the “Chord” P2P protocol
Hasina Attaullah, Urva Latif, and Kashif Ali
9.1 Introduction
9.2 Background and literature review
9.2.1 CAS literature
9.2.2 Modeling and simulation of CACOONS
9.2.3 Chord P2P protocol
9.2.4 Hashing and key mapping
9.2.5 Node joining
9.2.6 Finger table
9.2.7 Stabilization
9.2.8 Performance of Chord
9.2.9 PeerSim
9.2.10 Literature review
9.3 ODD model of a “Chord”
9.3.1 Purpose
9.3.2 Entities, state variables, and scales
9.3.3 Process overview and scheduling
9.3.4 Design concepts
9.3.5 Initialization
9.3.6 Input data
9.3.7 Sub-models
9.4 DREAM model of a “Chord”
9.4.1 Agent design
9.4.2 Activity diagrams
9.4.3 Flowchart
9.4.4 Pseudo-code based specification
9.5 Results and discussion
9.5.1 Metrics (table and description)
9.5.2 PeerSim results
9.5.3 ABM results
9.5.4 Comparison of PeerSim and ABM
9.5.5 DREAM network models
9.5.6 Discussion (ODD vs. DREAM pros and cons of both) and which is more useful for modeling the chosen P2P protocol
9.5.7 Chord and theory of computation
9.6 Conclusions and future work
References

10 Descriptive agent-based modeling of Kademlia peer-to-peer protocol
Hammad-Ur-Rehman and Muhammad Qasim Mehboob
10.1 Introduction
10.2 Background and literature review
10.2.1 Complex adaptive systems
10.2.2 Cognitive agent-based computing
10.2.3 Complex network modeling
10.2.4 Architecture of the “Kademlia” protocol
10.2.5 Literature review
10.3 Model design
10.3.1 ODD model of “Kademlia”
10.3.2 Overview
10.3.3 Design concept
10.3.4 Details
10.3.5 Activity diagrams of “Kademlia”
10.3.6 DREAM model of “Kademlia”
10.3.7 Network model
10.3.8 Pseudo-code description
10.4 Results and discussion
10.4.1 Evaluation metrics
10.4.2 Power law plots of centrality measures
10.4.3 PeerSim simulation using existing code in PeerSim
10.4.4 ABM simulation
10.4.5 Comparison of PeerSim and ABM results
10.4.6 Discussion
10.5 Conclusion and future work
References

11 Descriptive agent-based modeling of the “BitTorrent” P2P protocol
Abdul Saboor, Nasir Khan, and Mubariz Rehman
11.1 Introduction
11.1.1 Contributions
11.2 Background and literature review
11.2.1 Complex adaptive systems
11.2.2 Modeling and simulation of CACOONS
11.3 BitTorrent peer-to-peer protocol
11.3.1 BitTorrent history overview
11.3.2 Content publishing in BitTorrent
11.3.3 Joining swarm and peers discovery in BitTorrent
11.3.4 Delivery procedure BitTorrent
11.3.5 BitTorrent architecture and working
11.4 BitTorrent literature review
11.4.1 PeerSim
11.5 Model design
11.5.1 ODD approach
11.5.2 Overview of the proposed model
11.5.3 DREAM model
11.5.4 Pseudocode-based specification
11.5.5 Globals
11.5.6 Procedures
11.5.7 Experiments
11.5.8 Results and discussions
11.5.9 PeerSim results
11.5.10 ABM results
11.5.11 Comparison of both
11.5.12 DREAM network models
11.6 Discussion (ODD vs DREAM)
11.7 Conclusion
References

12 Social networks: a scientometric visual survey
Bisma S. Khan and Muaz A. Niazi
12.1 Introduction
12.2 Background
12.2.1 Social networks: an overview
12.2.2 Citation networks
12.2.3 Co-citation networks
12.2.4 Bibliographic coupling
12.2.5 Coauthorship networks
12.2.6 Co-occurrence networks
12.3 Materials and methods
12.3.1 Data collection
12.3.2 CiteSpace: a science mapping tool
12.4 Results and discussion
12.4.1 Cited-references co-citation network analysis
12.4.2 Authors collaboration network analysis
12.4.3 Institution collaboration network analysis
12.4.4 Country collaboration network analysis
12.4.5 Keywords co-occurrence network analysis
12.4.6 Category co-occurrence network analysis
12.4.7 Journal co-citation network analysis
12.5 Summary of results
12.6 Conclusions and future work
References

Index
Preface

Thank you for choosing “Modeling and Simulation of Complex Communication Networks.” This
book offers a unique set of chapters and case studies that employ various disparate
techniques for the modeling and simulation of complex communication networks.
Rather than focus on simplistic models using simple numerical simulations, the book
focuses instead on tools and techniques which can be used for the realistic modeling of
large-scale and complex communication networks, collectively termed Complex
Adaptive COmmunicatiOn Networks and environmentS (CACOONS).
The book is divided into three parts. The first part focuses on
the importance of modeling and simulation and also gives two varied examples of
unconventional but powerful tools which can be useful for any type of modeling and
simulation, in general, and modeling and simulation of CACOONS, in particular.
This is followed by the second part, which presents three critical reviews and
surveys in the domain of modeling and simulation. The third part of the book focuses
on practical case studies of modeling and simulation using different techniques. Of
interest here is a focus on the use of the Cognitive Agent-based Computing (CABC1)
framework. The CABC framework can be used to model any type of complex physical
system or complex adaptive system (CAS). As such, it can be a very useful approach to
model and simulate any type of CACOONS. The third part presents several practical
case studies employing the CABC framework in various areas of CACOONS.
Next, I give an overview of the various chapters in a bit more detail.
The first part starts with the essence and importance of “Modeling and Simula-
tion,” presented by Ören et al. The chapter not only gives an overview of modeling
and simulation but also presents taxonomies. The chapter first gives an overview of
why simulation could be needed instead of an actual system. Then it moves on to
taxonomies and ontologies for use in modeling and simulation.
The next chapter in Part I presents a detailed overview of using Simio, a modern,
sophisticated object-oriented tool for developing simulations of complex real-world
systems. The chapter starts out with an overview of the Simio object framework.
This is followed by a description of Simio object classes. Next, concepts related
to modeling movement are presented. These concepts can be adapted to model not only
messages in the Internet of Things but also people, mobile devices, and more. A
description of modeling physical components is also presented. This is followed by
techniques for modeling processes. The chapter concludes by giving an overview of
data tables, the API, experimentation, and useful applications in scheduling.

1 Pronounced as Ka-bek.

The third chapter in Part I, by Daneshgadeh et al., presents a simulation environment
for cybersecurity attack analysis based on network traffic logs. The chapter gives an
overview of simulation, emulation, and virtualization. After this, a network case study
focusing on network anomaly detection is presented.
In the second part, there are three critical reviews, each analyzing key literature
related to modeling and simulation in the domain of modeling CACOONS of various
kinds. The first survey, by Akram and Niazi, presents demand–response management
in the domain of smart grid. The chapter first gives an overview of smart grid and how
this particular domain offers unique challenges to develop and understand by means
of modeling and simulation. It next presents an overview of the problem domain of
demand–response management in large-scale CACOONS in the domain of smart
grid. In terms of approaches, the chapter first presents learning-based approaches.
Subsequently, it focuses on the complexity inherent to the smart grid domain. Before
concluding, the chapter moves on to open research problems and directions in
the domain.
The second chapter in the second part, also by Akram and Niazi, focuses on the
use of agent-based computing, multiagent systems, and agent-based modeling in
the domain of smart grid. It starts with an overview of the concepts and moves to
applications ranging from learning to more. It concludes after giving an overview of
key open problems and issues in the domain.
The final chapter in the second part is by Ventrella et al. This chapter focuses on
scale-free network topologies, giving a detailed overview as well as a literature review
and more. The chapter starts with an overview of concepts pertaining to mapping
the Internet. It then covers the use of traceroute for mapping. This is followed
by IP options, subnet discovery, and router-level mapping. The chapter then presents
Internet models with a focus on graph theoretic approaches. It gives an overview
of relevant concepts ranging from the basics to topological concepts such as scale-free
and power-law networks, among others. An overview of network topology generation is
also presented. Afterwards, the chapter moves on to the key topic of interest, namely
shortest path models.
In the final part of the book, six modeling and simulation case studies are presented.
The studies have been selected on the criterion that they will be
of interest not only to modelers and simulation experts but also to researchers and
practitioners in the domain of complex communication and social networks.
The first chapter in this part, by Toral-Cruz et al., is focused on the important topic
of accurate modeling of VoIP traffic in modern communication networks. The chapter
starts by giving an overview of the importance and complexity of VoIP traffic in
large-scale networks. It next explains why modern networks have
evolved from simple packet networks to multiservice networks. Subsequently, the
chapter moves on to the importance of QoS in VoIP networks besides presenting VoIP
frameworks such as H.323 and SIP. It then formally describes and models concepts
related to QoS, one-way delay, jitter, self-similar processes, and more.
The second chapter in the last part presents an implementation of two framework
levels from the CABC framework in the domain of the Internet of Things. The chapter
starts with concepts related to the CABC framework and the simulator of choice.
It then poses research questions in the domain of 5G and the IoT, followed by
detailed results and discussion.
The third chapter in this part is by Attaullah et al. and focuses on the use of the
DescRiptivE Agent-based Modeling (DREAM) level from the CABC framework for the
modeling and simulation of the Chord peer-to-peer (P2P) protocol. The chapter first
introduces the Chord protocol, describing its inherent complexities, which require the
use of more advanced modeling and simulation techniques. After the description of the
Chord protocol, the chapter presents the DREAM model for the protocol, allowing for a
quantitative description using complex network centralities. The chapter also presents
detailed results from both PeerSim and NetLogo-based simulations, besides comparing
DREAM with the previous approach, the so-called ODD approach, which originates from
the domain of ecology and has traditionally been used to model agent-based
and individual-based models.
This is followed by another chapter that employs DREAM and ODD to
model a P2P protocol commonly known as the Kademlia protocol. The chapter first
gives an overview of the protocol and the challenges associated with the complexity
of P2P protocols. This is followed by ODD and DREAM models, results, discussion,
and a detailed comparison.
BitTorrent is a very commonly used P2P protocol in the real world. The next
chapter in this part presents the use of the DREAM modeling level of the CABC
framework for the modeling and simulation of the BitTorrent protocol. After presenting
the background and an overview of the BitTorrent protocol, the chapter presents a
BitTorrent case study for use in the simulation model. Next, ODD and DREAM are
presented before a detailed discussion on the utility of the CABC framework
in the modeling and simulation of CACOONS.
The final chapter in the book is by Khan and Niazi and presents the application of
CABC level 1, the complex network modeling level, using complex citation networks
to analyze the domain of “Social networks.” The chapter starts by introducing
related concepts focused on measuring impact, citations, and scientometrics. It then
presents the dataset retrieved for developing the complex citation networks. This is
followed by a detailed network analysis demonstrating how this approach can be used
to model, simulate, transform, and analyze various types of complex network data.
The book presents first steps toward consolidating material specifically
focused on the modeling and simulation of complex communication networks
(CACOONS). It presents a selection of key case studies as well as concepts with a
primary focus on making the concepts accessible to a wide audience. However, as with
any text in such a large and vibrant domain, we were only
able to present a sampling of key case studies and modeling paradigms in the domain.
Readers are further recommended to follow the Springer Nature journal Complex Adaptive
Systems Modeling for access to more case studies and applications in the domain of
modeling and simulation of CACOONS.
While we have tried our level best to minimize errors, it is impossible to eliminate
all errors. If the book looks nice, it is all due to the efforts put in by the IET staff. And
if there are any mistakes, I humbly accept them to be mine. Readers are kindly requested
to keep sending their valuable feedback and comments to the book
editor at [email protected].
Part I
Modeling and simulation
Chapter 1
Modeling and simulation: the essence and
increasing importance
Tuncer Ören¹, Saurabh Mittal², and Umut Durak³

¹ School of Electrical Engineering and Computer Science, University of Ottawa, Ontario, Canada
² The MITRE Corporation, United States
³ German Aerospace Center (DLR), Institute of Flight Systems, Germany

The technical aspects of the essence of simulation are elaborated based on the
following definition: simulation is performing goal-directed experimentation or
gaining experience under controlled conditions by using dynamic models either to
develop/enhance skills or for entertainment; where a dynamic model denotes a model
for which behaviour and/or structure is variable over time. Hence, experimentation
and experience aspects are explained. Several taxonomies, ontologies, and some
ontology-based dictionaries are cited for a comprehensive and integrative perception
of simulation. Finally, the evolution and increasing importance of simulation is
explained.

1.1 Introduction
‘Simulation as a discipline is like mathematics and logic. It can be studied per se to
develop its own theories, methodologies, and tools, and it can be used in a multitude
of problem areas in many disciplines. The uses of simulation involve this second
aspect and make it a vital enabling technology for many disciplines’. This passage is
from the conclusion section of another publication [1]. A recent publication ‘Guide
to Simulation-Based Disciplines: Advancing our Computational Future’ elaborates
on the universality of simulation [2], and another publication ‘The Profession of
Modeling and Simulation’ casts light on the professional aspect of simulation [3].
The clarifications given in this chapter on many aspects of simulation are relevant to
the universality of simulation.
The term simulation has been in existence in English since the fourteenth century.
Its meaning is based on the concept of similarity. Depending on the goal of the
similarity, the original non-technical use of the term simulation has positive and
negative connotations. From a positive point of view, simulation implies imitation
such as simulated leather or simulated pearl. From a negative point of view, simulation
implies disguised reality, e.g. counterfeit, feigning, false show, and hypocrisy. Later,
the term simulation acquired technical meanings. However, the term is still also used
with its original non-technical connotations. To denote its technical aspects, we use
the following concise and comprehensive definition:

Simulation is performing goal-directed experimentation, or gaining experience
under controlled conditions by using dynamic models either to
develop/enhance skills, or for entertainment; where a dynamic model
denotes a model for which behaviour and/or structure is variable over time.

Due to its many aspects, there are many definitions of simulation. About 100 defini-
tions of simulation were compiled and presented in nine categories by Ören [4], and
a critical review of them was offered in a sequel publication [5]. As a testimony of the
variety of simulation, Appendix A lists over 750 types of simulation. Appendix B, a
list of 120 types of input variables, is yet another testimony of the richness of the field.
M&S is essentially composed of two separate activities: modelling and simulation.
While modelling necessitates abstraction, simulation is purely an engineering
activity that involves expertise from the computer science and engineering disciplines [2].
When we talk of simulation as a singular activity, it subsumes model building. From
historical evidence, model building has been attempted by various non-technical
means and a constant engagement with the problem-at-hand or the question under
exploration. Model development has been undertaken in different disciplines in
diverse ways. Some examples are as follows:
● Engineering: Model building is done for two purposes: design and control.
The design aspect involves creating model(s) of a ‘would be’ system. The con-
trol aspect necessitates building the model of an ‘existing system’ that needs
exploration of various control algorithms and mechanisms.
● Science: Model building is done to understand a natural phenomenon. New
nomenclature, taxonomy, vocabulary and abstractions are developed. The critical
part is the specification of assumptions that limit the complexity of the real
world in the model description.
● Education: Model building is done to explain, teach, understand, or learn a real-
world phenomenon. The abstraction level is dependent upon the audience that is
undergoing learning.
● Training: Model building is done to impart training or enhance skills (motor,
decision-making and communication, and operation) of the trainee in a specific
complex environment where it is cost prohibitive to involve real-world assets and
systems.
● Entertainment: Model building is done to provide a fictional reality in real or
staged environments for amusement purposes.
● Decision support: Model building is done to evaluate various courses of action
of a real-world state of a system on an existing model. In such cases, it is cost
prohibitive to perform real-world evaluation due to danger to life and property.
In all the disciplines mentioned above, model building incorporates the skill of
developing abstractions. The determination of an abstraction level is contingent upon
various factors such as the problem-at-hand, the desired goal, the available tools,
and the available knowledge. For example, in each economic era, from the age of
farming to the Industrial Age and to the Information Era we are currently in, the
problems, the desired goals, the tools, and the availability of knowledge have evolved,
leading to new representations of models. Some models that describe natural laws
have withstood the test of time, for example Newton’s laws, developed in the seventeenth
century; and sometimes a completely new theory developed in the twentieth century,
such as quantum mechanics, fundamentally changes the perception of reality. Each
economic era has led to the evolution of these four aspects and, consequently, model
building has evolved accordingly. Model building takes its refuge in mathematics
at the core level and involves constant subject–environment engagement to keep the
developed abstractions attuned to the problem-at-hand. Today, much of the
model development has moved to computerized workbenches, often called integrated
development environments, that bridge the gap between the model builder and the
model representation.
The simulation activity builds upon the model-building activity and presents
the challenge of running the model over time. In a computational environment, a
simulator (a software entity) is tasked with managing the advancement of time. In
a non-computational environment, the perception of movement of time becomes a
critical factor in determining how effective the simulation is. For example, on a stage
or in a theatre, if the modelled ‘act’ is executed slower or faster than real time, it would
yield a completely different experience to the audience. Likewise, in a computational
environment, the advancement of time delivers results that may or may not address
the problem-at-hand. Handling time on an appropriate time base then becomes a
paramount activity in simulation.
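
As a rough illustration of a simulator managing the advancement of time, the following Python sketch keeps a clock and a time-ordered event queue. It is a minimal sketch under stated assumptions, not a description of any particular simulation tool: the names `Simulator`, `schedule`, `run`, `make_source`, and `run_demo` are hypothetical and chosen only for this example, and the event-driven time base is one of several possible choices.

```python
import heapq


class Simulator:
    """Hypothetical minimal simulator: it owns the clock and advances it event by event."""

    def __init__(self):
        self.now = 0.0
        self._queue = []  # heap of (event time, sequence number, action)
        self._seq = 0     # tie-breaker so same-time events run in scheduling order

    def schedule(self, delay, action):
        """Register `action` to run `delay` time units after the current time."""
        if delay < 0:
            raise ValueError("cannot schedule an event in the past")
        heapq.heappush(self._queue, (self.now + delay, self._seq, action))
        self._seq += 1

    def run(self, until):
        """Run events in time order while the next event time does not exceed `until`."""
        while self._queue and self._queue[0][0] <= until:
            self.now, _, action = heapq.heappop(self._queue)
            action(self)


def make_source(interval, log):
    """Build an action that records the current time and reschedules itself."""
    def emit(sim):
        log.append(sim.now)
        sim.schedule(interval, emit)
    return emit


def run_demo():
    """Run a periodic source for 10 time units and return the recorded event times."""
    log = []
    sim = Simulator()
    sim.schedule(0.0, make_source(2.5, log))
    sim.run(until=10.0)
    return log
```

In this sketch the model (the periodic source) never touches the clock directly; only the simulator advances `now`, jumping from one event time to the next rather than following wall-clock time.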
The remainder of the chapter is organized as follows. Experimentation aspects of
simulation are discussed in Section 1.2. In Section 1.3, experience aspects to
develop/enhance three types of skills or for entertainment are discussed. Taxonomies
and ontologies of simulation are mentioned in Section 1.4. The evolution and increasing
importance of simulation is discussed in Section 1.5, and Section 1.6 concludes the chapter.

1.2 Experimentation aspects of simulation

From the experimentation aspect, a concise definition of simulation is as follows:
simulation is goal-directed experimentation with dynamic models. Experimental
conditions can be formally specified by experimental frames [6] and by experimentation
scenarios.
Since the publication of Francis Bacon’s Novum Organum (The New Organon) in 1620,
experimentation has been the essence of the scientific approach. Simulation extends
the scope of experimentation to many cases:

1. The real system may not exist (as in engineering problems where new systems are
to be built).
2. The real system may not be reachable for experimentation (e.g. testing lunar
vehicles).
3. Experimentation on the real system may be dangerous (e.g. testing nuclear detonation
or simulation of forest fires).
4. Experimentation on the real system may not be convenient (e.g. urban traffic
simulation, instead of experimenting on the real system).
5. Experimentation on the real system may not be convenient time-wise (economic
systems would necessitate a long time for experimentation, and some natural
phenomena would be too fast to observe).
6. Experimentation on the real system may not be cost-effective. Furthermore, testing
a virtual prototype (via simulation) allows flexibility in finding an optimal design.
Simulated experiments are used for description/explanation, decision support,
exploration, and teaching/learning. For decision support alone, simulation can be used
for the prediction of behaviour and/or performance, testing of hypotheses about models
or experimental conditions, sensitivity analysis of behaviour or performance, virtual
prototyping, planning, acquisition, testing, and proof of concepts, as well as for
evaluation of alternative models and experimental conditions. Since the experimentation
aspect of simulation is very widely used, several publications exist [7,8].
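Goal-directed experimentation with a dynamic model can be sketched in a few lines. The example below is only an illustration under stated assumptions: the model is a toy first-order decay, dx/dt = -k x, integrated with explicit Euler steps, and `ExperimentalFrame` is a hypothetical container for experimental conditions, not the formal experimental frame of [6]. The function `sensitivity` repeats the same experiment for several values of the parameter k, which is the simplest form of the sensitivity analysis mentioned above.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class ExperimentalFrame:
    """Hypothetical container for the conditions of one simulated experiment."""
    initial_level: float  # state at time zero
    horizon: float        # how long the model is run
    step: float           # integration step size


def run_model(decay_rate: float, frame: ExperimentalFrame) -> float:
    """Run the toy model dx/dt = -decay_rate * x with explicit Euler steps."""
    level = frame.initial_level
    steps = round(frame.horizon / frame.step)
    for _ in range(steps):
        level += frame.step * (-decay_rate * level)
    return level


def sensitivity(rates: List[float], frame: ExperimentalFrame) -> Dict[float, float]:
    """Repeat the experiment for each parameter value and collect the final state."""
    return {rate: run_model(rate, frame) for rate in rates}


if __name__ == "__main__":
    frame = ExperimentalFrame(initial_level=100.0, horizon=10.0, step=0.01)
    for rate, final in sensitivity([0.1, 0.2, 0.4], frame).items():
        print(f"decay_rate={rate:.1f} -> final level {final:.2f}")
```

Keeping the experimental conditions in a separate object from the model means that the same model can be reused under different conditions, and different models can be compared under the same ones.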

1.3 Experience aspects of simulation

Gaining experience under controlled conditions using dynamic models is one of the
major motivations of simulation [9]. The experience aspect includes two major use
cases: (a) training to develop or enhance skills and (b) entertainment. These use cases
are elaborated in the following subsections.

1.3.1 Simulation for training

Simulation is used in training to enhance or develop three types of skills, namely motor
skills, decision-making skills, and operational skills. These application areas lead
the community to a taxonomy of simulations: live simulations for operational skills,
virtual simulations for motor skills, and constructive simulations for decision-making
skills.
In live simulations, real people use real (or imitation) equipment in the real world.
The real equipment is put to work interconnected with computers, where the assessment
of trainee actions is conducted by computer algorithms [10]. Live simulations are used
to train operational skills by providing real-life-like experience under controlled
conditions. Live simulation has been a major method in defence for training warfighting
techniques and tactics, which inevitably require a large number of assets and personnel.
With the advances in information and communication technologies, training started to
make use of virtual and constructive simulations to achieve a cost-effective mixture [11].
Following the definitions given by Ören in [7], in virtual simulations, real people
use virtual equipment in virtual environments in order to enhance motor skills and gain
proficiency in using the equipment. Typical examples of this type are flight simulators,
which have long been established training aids in aviation. The Link trainer (sometimes
called the Blue Box) (Figure 1.1) is recognized as one of the first flight simulators. Ten
thousand Link trainers were manufactured from 1934 to 1950 [12]. They provided a
means of training basic motor skills for pilots. The current flight simulator market is
about USD 6 billion and the 2021 forecast is about USD 7.5 billion [13].

Figure 1.1 Link trainer in a US Army Air Force field

In constructive simulation, simulated people use simulated equipment in a virtual
environment. The aim is to enhance the decision-making and communication skills of
trainees through interactions with the simulation systems. Air traffic control
simulation systems are typical examples [14]. Simulated pilots use simulated
aircraft in air traffic control simulation systems, where the simulation provides the
possibility to train controllers in decision-making and communication skills. One of
the commercial off-the-shelf products is MaxSim, an air traffic control simulator from
Adacel, which can generate realistic air traffic based on defined scenarios and provides
direct voice communication with virtual pilots via speech recognition
features [15].
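The three definitions above differ only in who acts and what they act with, so the classification can be written down as a small lookup. This is a sketch for illustration; the names `Kind`, `TARGET_SKILL`, and `classify` are hypothetical, and the two boolean inputs are a simplification of the definitions (for example, the imitation equipment used in live simulation is treated here as real equipment).

```python
from enum import Enum
from typing import Optional


class Kind(Enum):
    LIVE = "live"
    VIRTUAL = "virtual"
    CONSTRUCTIVE = "constructive"


# Skill each kind of simulation is said to target in the text above.
TARGET_SKILL = {
    Kind.LIVE: "operational skills",
    Kind.VIRTUAL: "motor skills",
    Kind.CONSTRUCTIVE: "decision-making skills",
}


def classify(real_people: bool, real_equipment: bool) -> Optional[Kind]:
    """Map (who acts, what they use) to a kind of training simulation.

    Returns None for the one combination the text does not describe
    (simulated people operating real equipment).
    """
    if real_people and real_equipment:
        return Kind.LIVE
    if real_people and not real_equipment:
        return Kind.VIRTUAL
    if not real_people and not real_equipment:
        return Kind.CONSTRUCTIVE
    return None


if __name__ == "__main__":
    kind = classify(real_people=True, real_equipment=False)
    print(kind.value, "->", TARGET_SKILL[kind])
```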
One of the key issues of simulation-based training is transfer of training, which
is defined as the degree to which trainees effectively apply the trained skills in real
operation [16]. Research on transfer of training in flight simulators has quite
a long history. Valverde published a paper in 1973 that reviews
flight simulator transfer of training studies since the 1950s [17]. In one of the recent
studies, Pool and Zaal present a cybernetic approach to assess the transfer of training
for manual control skills in flight simulators using multi-channel pilot models [18].
Fidelity, immersion, presence, and buy-in are defined as the four factors that drive
the transfer of training [19]. Fidelity is defined as the extent to which the simulation
matches the real world. Immersion is the individual’s feeling of being
absorbed by the experience, while, in situated immersion, presence is defined as the
subjective experience of existence within the simulation [20]. Finally, buy-in is
the user’s acceptance of the experience as a useful training event.

1.3.2 Simulation for entertainment

Simulation is one of the key concepts of entertainment. It is used not only for
interactive entertainment purposes such as computer games but also for computer-generated
animation. With the advances in computer architectures, the interactive entertainment
industry started to work on high-quality motion synthesis, and physics-based
simulation emerged as a field of study [21]. It was first referred to as the study of the
motion of virtual objects using the laws of motion. It evolved over the years and now
covers topics ranging from rigid-body dynamics, impact, and collision to deformable
and soft bodies, fluids and gases, and crowd simulation [22]. During the last decade,
physics engines, which can be defined as reusable simulation libraries for interactive
entertainment, became popular [23]. One of the widely used open-source physics engines
is the Bullet Physics Library [24]. Its features include rigid-body and soft-body
simulation and collision detection. Integration plug-ins are available for Bullet in
order to use it with the Autodesk® Maya [25] or Blender [26] 3D computer graphics
software. Well-known computer games that utilize Bullet for physics simulation include
Red Dead Redemption [27] from Rockstar Games and Toy Story 3 [28] from Disney
Interactive Studios. PhysX [29] has been introduced as a proprietary real-time
multi-platform game physics solution by NVIDIA. It also provides simulation-driven
effects such as destruction or cloth simulation. Havok [30] is another major commercial
multi-platform, multi-threaded physics simulation library for interactive entertainment.
It has been used in more than 400 published games. In an ecosystem where
various physics simulation application programming interfaces exist, there is also
an effort, namely the Physics Abstraction Layer, which provides a high-level interface
for various physics simulation libraries [31].
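To make ‘the study of the motion of virtual objects using the laws of motion’ concrete, the following is a minimal sketch of the kind of fixed-step update loop that a physics engine performs, reduced to a single ball bouncing on a ground plane. It is not the API of Bullet, PhysX, or Havok; the names `Ball`, `step`, and `simulate`, the restitution value, and the choice of semi-implicit Euler integration are assumptions made for this illustration.

```python
from dataclasses import dataclass

GRAVITY = -9.81  # m/s^2, acting along the vertical axis


@dataclass
class Ball:
    height: float             # metres above the ground plane
    velocity: float           # vertical velocity in m/s (positive is up)
    restitution: float = 0.8  # fraction of speed kept after a bounce (assumed value)


def step(ball: Ball, dt: float) -> None:
    """Advance the ball by one fixed time step using semi-implicit Euler."""
    ball.velocity += GRAVITY * dt      # integrate acceleration first
    ball.height += ball.velocity * dt  # then integrate the updated velocity
    if ball.height < 0.0:              # crude collision detection with the ground
        ball.height = 0.0              # push the ball back onto the plane
        ball.velocity = -ball.velocity * ball.restitution  # collision response


def simulate(ball: Ball, duration: float, dt: float = 1.0 / 60.0) -> int:
    """Run the step loop for `duration` seconds and return the number of bounces."""
    bounces = 0
    for _ in range(round(duration / dt)):
        falling = ball.velocity < 0.0
        step(ball, dt)
        if falling and ball.velocity > 0.0:
            bounces += 1
    return bounces


if __name__ == "__main__":
    ball = Ball(height=1.0, velocity=0.0)
    count = simulate(ball, duration=2.0)
    print(f"bounces={count}, final height={ball.height:.3f} m")
```

An interactive application would call `step` once per rendered frame, whereas an offline animation could afford a much smaller `dt` for the same scene.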
Physics-based simulation methods that are employed in interactive entertainment
are also well applicable in computer-generated animation to synthesize the motion
of objects. The non-real-time characteristics of physics-based simulations in animation
applications allow further resolution in modelled behaviour. One of the recent
examples is the artistic simulation of curly hair by Disney/Pixar [32]. It has been used
for animating Merida [33] in the animated film Brave [34]. The open-source 3D animation
software Blender also provides many physics-based simulation features such
as fluid, smoke, cloth, or hair simulation. Mullen’s book ‘Bounce, tumble, and splash!:
simulating the physical world with Blender 3D’ [35] can be recommended as a
reference book on physics-based simulation in Blender.

1.4 Taxonomies and ontologies of simulation

1.4.1 Background
Taxonomy, as the science of classification, is an indispensable aspect of scientific
studies and is concerned with finding, describing, classifying, and naming things.
For example, taxonomies of plants and animals identify logical relationships among
different species. In animal taxonomy, a living organism is assigned successively to a
kingdom, phylum, class, order, family, genus, and species. Other examples are the
taxonomy of learning and Bloom’s taxonomy of educational objectives [36], which are
particularly important in simulation-based learning/teaching [37]. This importance is
due to the fact that simulation-based disciplines are proliferating and teaching/learning
them may satisfy different goals and hence should be targeted properly [2,9].
As a branch of metaphysics, ontology is concerned with the nature and relations
of beings and deals with abstract entities.
Ontology-based dictionaries combine classification of entities and their def-
initions [38]. They show logical relationships of several related terms as well as
their definitions; hence, they are helpful in teaching/understanding even the subtle
differences of related terms.
1.4.2 Taxonomies of simulation
Perceiving the big picture and the richness of simulation [7,39,40], as well as
discriminating the subtle differences in its many aspects, would help in appreciating
the many possibilities it offers. Due to the increasing importance, versatility, and high
variety of simulation, several classification studies already exist. These taxonomies can
be grouped as follows: taxonomies of simulation, simulation
languages, simulation software, simulation models, behaviour and processing of
simulation models, simulation quality assurance, and elements of simulation models
and simulation-based experimentation as well as simulation-based experiences. The
taxonomies represent the state of the art at the time of their preparation, with room for
future developments.
Taxonomies of simulation were developed by Ören [41] and Sulistio et al. [42]. The latter
was reviewed by Ören [43]. Maier and Größler [44] developed a taxonomy of
computer simulations to support learning about socio-economic systems. Roeder’s
Ph.D. thesis is on a taxonomy for discrete event simulations [45].
Taxonomies of simulation languages: An early taxonomy of simulation languages
was developed by Ören [46]. Two updated versions of taxonomy of simulation
languages are also by Ören [47,48].
Taxonomies of simulation software: A survey on taxonomies of discrete simulation
software was recently prepared by Rachidi [49].
Taxonomies of simulation models: An early taxonomy of simulation models was pub-
lished by Ören [50]. A taxonomy of symbolic processing of simulation models was
published in 1987 [51]. Fishwick prepared a taxonomy of simulation modelling
[52]. Hare and Deadman published a taxonomy of agent-based simulation models
in environmental management [53]. A review of it was prepared by Ören [54].
Taxonomies of multimodels were prepared by Yilmaz and Ören [55] and by Ören
et al. [56]. Lynch and Diallo prepared a taxonomy for classifying terminologies
that describe simulations with multiple models [57]. Smith elaborates on the value
of taxonomy in modelling [58].
Taxonomies of behaviour of simulation models: A taxonomy of types, as well as gen-
eration and processing techniques of model behaviour was prepared by Ören [59].
Taxonomies of more specific topics: As an example, an early taxonomy of input
variables can be cited [60]. Wenzel et al. published a taxonomy of visualization
techniques for simulation in production and logistics [61]. Le Digabel and Wild
have a taxonomy of constraints in simulation-based optimization [62]. Goldstein
and Khan prepared a taxonomy of event-time representations [63].
Due to the richness of modelling and simulation and its relationship with other
relevant disciplines, several other taxonomies of specific topics will be useful. Even
the most fundamental concepts have several terms to represent nuances. For example,
there are over 150 terms related to ‘variables’, over 90 terms related to ‘values’,
and over 1,000 terms related to or representing types of models (M&S Bok Index
studies). To attest to the richness of the field, two appendices are given. Appendix A is
a list of over 750 types of simulation and Appendix B is a list of 120 types of input.
1.4.3 Ontologies of simulation
Silver et al. prepared an ontology for discrete-event modelling and simulation [64].
The book edited by Tolk [65] is a very good source of information about simulation
ontologies. From Tolk’s book, the following are noteworthy contributions to simu-
lation ontology: Partridge et al. [66]; Hofmann [67]; Heath and Jackson [68]; and
Wang et al. [69]. An ontology for simulation systems engineering was developed by
Durak and Ören [70].
An ontology-based dictionary of multimodels was prepared by Ören, Mittal, and
Durak [9]. An ontology-based dictionary of machine understanding can be used for
simulating systems with understanding abilities including systems able to understand
emotions [38].
As a normative view, we think that development of new and updated as well
as more diversified taxonomies, ontologies, and ontology-based dictionaries may be
useful for learning several aspects of simulation, since the field is progressing very
rapidly and becoming an infrastructure for many disciplines.

1.5 Evolution and increasing importance of simulation

A recent publication [9] clarifies nine phases of the evolution of simulation and
documents the vital role of simulation for many disciplines:
1. In the pre-computer days, simulation started as non-computerized simulation
such as thought experiments, role-playing, and sand-box simulation.
2. Computerized simulation was an important step in the evolution of simulation.
At the beginning, the role of computers was limited to generation of model’s
behaviour (mostly trajectory and, sometimes, structural). With the advent of
more powerful computers and maturity of simulation, computers were also used
for the specification of simulation study as well as for other functions [71].
3. Contribution of system theories to simulation was the essence of formal
simulation [72] especially in DEVS (Discrete Event System Specification) [73].
4. Contribution of machine intelligence (or artificial intelligence – AI) opens a new
dimension for the full synergy of simulation and AI which consists of contribution
of simulation to AI and contribution of AI to simulation. At the beginning, AI
studies started as the simulation of natural intelligence [74,75]. Contribution of
AI to simulation provided advanced (intelligent) knowledge-processing abilities
to simulation systems [76].
5. Software agents are associated with the concept of autonomy (or quasi-autonomy).
Similar to the synergy of simulation and AI, the full synergy of simulation
and software agents opened the possibility of agent-directed simulation, which
consists of the contribution of simulation to agents, or agent-based simulation,
where intelligent and autonomous entities can be simulated, and the contributions
of agents to simulation. The second aspect consists of the contribution of agents
during run-time, such as agent-monitored simulation, and the contribution of agents
to the simulation systems, such as simulation systems with several types of
understanding abilities [77].
6. When advances of computing reached the level of soft computing, fuzzy
computation, evolutionary computation including genetic algorithms, Bayesian
computation, and machine learning were part of the possibilities of computation.
The contribution of soft computing to simulation is soft simulation.
7. Two types of complexity, namely complexity of simulation systems and especially
complexity of simulands, laid the path to the synergy of simulation systems engi-
neering [78]. Indeed, simulation systems engineering provides a proper paradigm
for simulation to tackle complex problems.
8. Many advantages of experimentation and experience aspects of simulation are
the essence of simulation-based activities which provide an indispensable and
often a vital infrastructure for many disciplines. A recent volume, ‘Guide to
Simulation-Based Disciplines: Advancing our Computational Future’ by Mittal
et al. [2] covers in depth this phase of simulation.
9. Widespread availability and hence usage of simulation-as-a-service (SaaS) is a
desirable phase for the maturity of the simulation discipline. This would also
increase the usability and hence usefulness of simulation among many who are
expert in their field of specialization but not in simulation techniques [79]. SaaS
can be used by human users as well as by advanced agent-monitored systems.
The evolution of simulation at each phase of advancement brought to its users more
and more advanced and powerful possibilities in simulation-based problem solving.
Hence, the shift of paradigm from model-based approach [80] to simulation-based
approach [9] appears to be very beneficial for many disciplines.

1.6 Conclusion
Experimentation and experience aspects of simulation have already made it an
invaluable infrastructure for many disciplines and application areas. In this chapter,
evolution and increasing importance of simulation are elaborated after clarifications
of its experimentation and experience aspects. A comprehensive and integrative view
of simulation would be helpful to appreciate many advantages it offers. For this rea-
son, many taxonomies and ontologies and some ontology-based dictionaries are also
presented.

Disclaimer
The author’s affiliation with The MITRE Corporation is provided for identification
purposes only, and is not intended to convey or imply MITRE’s concurrence with,
or support for, the positions, opinions or viewpoints expressed by the author(s).
Approved for Public Release, Distribution Unlimited [Case Number: PR_17-3254-2].

Appendix A – A list of over 750 types of simulation

3-d simulation agent-monitored artistic simulation
simulation as-fast-as-possible
A– agent simulation simulation
ab initio simulation agent-supported asymmetric simulation
abstract simulation multisimulation asynchronous simulation
academic simulation agent-supported atomistic simulation
accelerated simulation simulation audio simulation
accurate simulation agent-triggered augmented live
acoustic simulation simulation simulation
activity-based simulation aggregate-level augmented physical
actor-based simulation simulation simulation
ad hoc distributed agile simulation augmented-reality
simulation AI-controlled simulation simulation
adaptive simulation AI-directed simulation augmented simulation
adaptive symbiotic all-digital analog autopoietic simulation
multisimulation simulation autosimulation
adaptive-system all-digital simulation autotelic system
simulation all-software simulation simulation
adiabatic-system allopoietic simulation
simulation allotelic system B–
advanced distributed simulation backward simulation
simulation ALSP-compliant base case simulation
advanced numerical simulation baseline scenario
simulation alternative simulation simulation
advanced simulation analog-computer baseline simulation
agent-based simulation batch simulation
multisimulation analog multilevel behaviourally adaptive
agent-based participatory simulation simulation
simulation analog simulation behaviourally
agent-based simulation analytic simulation anticipatory
agent-controlled analytical simulation multisimulation
simulation ancestor simulation behaviourally
agent-coordinated anticipatory anticipatory simulation
simulation multisimulation big simulation
agent-directed simulation anticipatory perceptual bio-inspired simulation
agent-initiated simulation simulation bio-nano simulation
agent-mediated anticipatory simulation biologically inspired
simulation appropriate simulation simulation
agent-monitored approximate simulation biomimetic simulation
anticipatory approximate biosimulation
multisimulation zero-variance simulation bisimulation
agent-monitored array simulation blended learning
multisimulation art-directed simulation simulation
blended simulation combined conservative parallel

block-oriented simulation continuous/discrete simulation
bond graph simulation simulation conservative simulation
boundary value combined simulation constrained simulation
simulation combined-system constructive simulation
branched simulation simulation constructive training
built-in simulation common-use simulation simulation
communal simulation context-free simulation
competition simulation continuous-change
C–
competitive simulation simulation
case-based simulation
complete simulation continuous simulation
catastrophic simulation
component-based continuous-system
cellular automaton
collaborated simulation simulation
simulation
component-based continuous-time
centre-based simulation
distributed simulation simulation
chaotic simulation
component-based continuous-time
classical simulation
simulation continuous simulation
closed-form simulation component simulation
closed-loop real-time conventional simulation
composable simulation convergence simulation
simulation composite simulation
closed-loop simulation convergent boundary
compressed-time simulation
cloud-based simulation simulation
cloud-hosted simulation convergent simulation
computational simulation converging simulation
cloud simulation computer-aided
cluster simulation cooperation simulation
simulation cooperative simulation
coercible simulation computer-assisted
coercion simulation coopetition simulation
simulation coopetitive simulation
coercivity simulation computer-based
coersing simulation cosimulation
simulation
cognitive simulation coupled simulation
computer-mediated
cokriging simulation credible simulation
simulation
collaborative critical-event simulation
computer-network
component-based crowd simulation
simulation
simulation customizable simulation
computer simulation
collaborative DEVS customized simulation
computerized simulation
simulation cyber-physical system
conceptual simulation
collaborative simulation
concurrent simulation
distributed-simulation cycle-based simulation
condensed-time
collaborative simulation simulation
collaborative virtual conditional Monte Carlo D–
simulation simulation data-driven simulation
collocated cokriging conditional simulation data-intensive simulation
simulation conjoint simulation decision simulation
collocated cosimulation conservative event demon-controlled
collocated simulation simulation simulation
DES (Discrete Event distributed event-driven evaluative simulation

Simulation) simulation event-based agent
descriptive simulation distributed heterogeneous simulation
detached eddy simulation simulation event-based discrete
deterministic simulation distributed interactive simulation
DEVS simulation simulation event-based simulation
difference equation distributed lazy event-driven simulation
simulation simulation event-following
digital-analog simulation distributed-parameter simulation
digital-computer system simulation event-oriented simulation
simulation distributed real-time event-scheduling
digital quantum simulation simulation
simulation distributed simulation evolutionary simulation
digital simulation distributed web-based evolutionary-system
direct numerical simulation simulation
simulation DNA-based simulation ex ante simulation
direct simulation dynamic data-driven ex post simulation
DIS (Distributed simulation ex situ simulation
Interactive Simulation) dynamic simulation exascale simulation
disconnected simulation dynamic system expanded-time simulation
discontinuous simulation simulation explanatory simulation
discrete-arithmetic-based dynamically composable exploration simulation
simulation simulation exploratory
discrete-change multisimulation
simulation
E– exploratory-
discrete-event line
eddy simulation multisimulation
simulation
edge simulation methodology
discrete-event simulation
emergence simulation exploratory simulation
discrete simulation
emergency egress extensible simulation
discrete-system
simulation extreme scale simulation
simulation
emergent behaviour extrinsic simulation
discrete-time continuous
simulation simulation
discrete-time simulation emergent simulation F–
discrete-variable emulation fast simulation
simulation optimization endomorphic simulation fair simulation
display-based simulation engineering simulation fault simulation
dissimilar simulation entertainment simulation fault-tolerant simulation
dissimulation entity-level simulation faulty simulation
distributed agent equation-oriented federated simulation
simulation simulation field simulation
distributed asymmetric error-controlled finite-state machine
simulation simulation simulation
distributed DEVS escapist simulation first-degree simulation
simulation ethical simulation fixed-topology simulation
flattened G-DEVS goal-processing system human-machine

simulation simulation simulation
flattened simulation goal-setting system hybrid computer
fluid simulation simulation simulation
forward multisimulation goal-determined hybrid simulation
full cosimulation simulation
full-immersive simulation goal-regression I–
full-system simulation simulation ideal-seeking simulation
fully coupled simulation goal-seeking simulation identity simulation
functional simulation graphical simulation immersive simulation
fuzzy simulation grid-based simulation imperative-driven
fuzzy [system] simulation grid simulation simulation
G– importance-sampling-
gate-level simulation H– based
Gaussian copula hand simulation simulation
simulation hands-on simulation in-basket simulation
Gaussian distribution hard simulation in context simulation
simulation hardware-in-the-loop in silico simulation
Gaussian random simulation in situ simulation
function simulation heterogeneous simulation in the large simulation
Gaussian sequential hierarchical simulation in the small simulation
simulation high-fidelity simulation in vitro simulation
Gaussian simulation high-level simulation in vivo simulation
gedanken simulation high-performance inappropriate simulation
general purpose simulation incremental simulation
distributed simulation high-resolution indirect simulation
generalized mixed-mode simulation individual-based
simulation high-speed simulation simulation
generalized simulation historical simulation individual simulation
generative HLA-based simulation inductive simulation
multisimulation HLA-compliant industrial scale
generative parallax simulation simulation
simulation holistic simulation instructional simulation
generative simulation holographic simulation integrated simulation
generic simulation holonic agent simulation intelligent simulation
genetic-algorithm holonic simulation intelligent-system
simulation holonic [system] simulation
goal-directed system simulation interaction-based
simulation human-centred simulation
goal-free simulation simulation interactive simulation
goal-generating system human-in-the-loop interactive graphical
simulation simulation simulation
goal-oriented system human-initiated intermittent simulation
simulation simulation interoperable simulation
interpretational loosely coupled federated mobile-device initiated

simulation simulation simulation
interpretive simulation low-fidelity simulation mobile-device-triggered
interval-oriented low-level simulation simulation
simulation ludic simulation mobile simulation
intractable simulation mock simulation
intrinsic simulation M– model-based simulation
introspective simulation machine-centred modular simulation
inverse ontomimetic simulation Monte Carlo simulation
simulation machine simulation moving-boundary
inverse simulation maintenance simulation simulation
man-centred simulation multiagent-based
J– man-in-the-loop simulation
joint simulation simulation multiagent
man-machine simulation participatory-simulation
man-machine [system] multiagent-based
K– simulation simulation
knowledge-based manual simulation multiagent simulation
simulation Markov chain simulation multiagent-supported
kriging simulation Markov simulation simulation
massive-scale simulation multiaspect
L– massively multiplayer multisimulation
L-system simulation simulation multiaspect simulation
laboratory simulation mathematical simulation multibody simulation
large eddy simulation mental simulation multilevel simulation
large-scale simulation mesh-based simulation multilingual simulation
large simulation meshfree simulation multimedia-enriched
lazy simulation mesoscale simulation simulation
lean simulation metamodel-based multimedia simulation
legacy simulation simulation multimethod simulation
library-driven simulation metamorphic simulation multiparadigm simulation
Lindenmayer [system] metasimulation multiperspective
simulation microanalytic simulation simulation
line-of-sight simulation microcomputer multiphysics simulation
linear [system] simulation simulation multiplayer simulation
live instrumented microgrid simulation multiple-fidelity
simulation microsimulation simulation
live simulation mission-level simulation multiple-run simulation
live system-enriching mission rehearsal multiprocessor simulation
simulation simulation multirate simulation
live system-supporting mixed-mode simulation multiresolution
simulation mixed-signal simulation multisimulation
live training simulation mixed simulation multiresolution
logic simulation mobile-device activated simulation
logical simulation simulation multiscale simulation
multisimulation
multistage multisimulation
multistage simulation
multi-user simulation
mutual simulation

N–
N-body simulation
nanoscale simulation
nano simulation
narrative simulation
nested multisimulation
nested simulation
netcentric simulation
network-oriented simulation
networked simulation
non-anticipatory simulation
non-convergent simulation
non-equation-oriented simulation
non-HLA-compliant simulation
non-line-of-sight simulation
non-stiff simulation
non-terminating simulation
non-zero-sum simulation
nonconvergent simulation
nondeterministic simulation
nondeterministic Turing machine simulation
nonlinear [system] simulation
nonnumerical simulation
normative simulation
null Turing machine simulation
numerical simulation

O–
object-oriented simulation
offline simulation
on-demand simulation
online role-play simulation
online simulation
ontology-based agent simulation
ontology-based multiagent simulation
ontology-based simulation
ontomimetic simulation
open form simulation
open loop simulation
open source simulation
optimistic parallel simulation
optimistic simulation
optimizing simulation
ordinary differential equation simulation
ordinary kriging simulation
outcome-driven simulation
outcome-oriented simulation

P–
parallax simulation
parallel discrete-event simulation
parallel distributed simulation
parallel simulation
parallelized simulation
partial differential equation simulation
partial equilibrium simulation
partial simulation
participative simulation
participatory agent simulation
participatory simulation
pathway simulation
peace simulation
pedagogical simulation
peer-to-peer simulation
pen and paper simulation
perceptual simulation
pervasive simulation
petascale simulation
Petri net simulation
physical simulation
physical [system] simulation
physics-based simulation
portable simulation
prediction simulation
predictive biosimulation
predictive simulation
prescriptive simulation
probabilistic bisimulation
process-based discrete-event simulation
process interaction simulation
process-oriented simulation
process simulation
program-based simulation
program-oriented simulation
proof-of-concept simulation
proxy simulation
pseudo-analytical simulation
pseudosimulation
public domain simulation
pure software simulation
purposeful simulation
Q–
qualitative simulation
qualitative mixed simulation
quantitative mixed simulation
quantitative simulation
quantum simulation
quasi-analytic simulation
quasi-continuous simulation
quasi-identity simulation
quasi-Monte Carlo simulation
queue simulation

R–
random simulation
rare-event simulation
rate-based simulation
ratio simulation
real-life simulation
real-system enriching simulation
real-system support simulation
real-time continuous simulation
real-time data-driven simulation
real-time decision-making simulation
real-time simulation
reasonable simulation
reasoning simulation
reconfigurable simulation
recursive simulation
reflective simulation
regenerative simulation
regular simulation
related simulation
reliable simulation
remote simulation
replicative simulation
reproducible simulation
resimulation
retrosimulation
retrospective simulation
reverse simulation
rigid-body simulation
risk simulation
role-play simulation
role playing simulation
rollback-based discrete simulation
rollback-based parallel discrete-event simulation
rollback-based simulation
rule-based simulation
rule-based system embedded simulation

S–
sandbox-style simulation
scalable simulation
scaled real-time simulation
scenario simulation
scientific simulation
second degree simulation
self-adaptive simulation
self-driven simulation
self-learning simulation
self-organizing simulation
self-organizing system simulation
self-regulating simulation
self-replicating system simulation
self-simulation
self-stabilizing system simulation
semiotic simulation
sensitivity simulation
sequential Gaussian simulation
sequential simulation
serial simulation
serious simulation
service-based simulation
shape simulation
simulation
simultaneous simulation
single-aspect simulation
single-component simulation
single-processor simulation
single-run simulation
single-user simulation
skeleton-driven simulation
smart phone activated simulation
smoothness simulation
soft body simulation
soft computing simulation
soft simulation
software-based continuous system simulation
software-based discrete-event simulation
software-based simulation
software-in-the-loop simulation
spatial simulation
spatiotemporal simulation
spreadsheet simulation
stand-alone simulation
state-maintaining simulation
static simulation
statistical simulation
steady-state simulation
stiff simulation
stochastic simulation
strategic-decision simulation
strategic simulation
strategy simulation
strong bisimulation
strong simulation
strong-strong simulation
strong two-way coupling simulation
structural simulation
structure simulation
successor simulation
suitable simulation
survivability simulation
sustainable simulation
swarm simulation
symbiotic simulation
symbolic simulation
symmetric simulation
system dynamics simulation
system-level simulation
system of systems simulation
system simulation
system-theory-based continuous system simulation
system-theory-based discrete-event simulation
system-theory-based simulation
systematic simulation

T–
t-simulation
tactical decision simulation
tactical simulation
tandem simulation
technical simulation
technology-enhanced simulation
teleological system simulation
teleonomic [system] simulation
teleozetic simulation
terascale simulation
terminating simulation
test and evaluation simulation
texture simulation
third degree simulation
thought controlled simulation
thought experiment simulation
thought simulation
throttled time-warp simulation
time-domain simulation
time-driven simulation
time-interval simulation
time-slice simulation
time-slicing simulation
time-stepping simulation
time-varying system simulation
time-warp simulation
timing simulation
trace-driven simulation
tractable simulation
training simulation
trajectory simulation
transfer function simulation
transparent reality simulation
trial simulation
trustworthy simulation
Turing machine simulation
turning bands conditional simulation
turning bands simulation
two-level simulation

U–
ubiquitous computing simulation
ubiquitous simulation
ultrascale simulation
uncertainty simulation
unconditional simulation
unconstrained simulation
uncoupled simulation
unified discrete and continuous simulation
unified simulation
uniformization simulation
universal Turing machine simulation
unsuitable simulation
utilitarian simulation

V–
value-free simulation
variable fidelity simulation
variable resolution simulation
variable-topology simulation
very large eddy simulation
very large simulation
virtual reality simulation
virtual simulation
virtual system simulation
virtual time simulation
virtual training simulation
virtualization simulation
visual interactive simulation
visual simulation

W–
war simulation
warfare simulation
weak bisimulation
weak classical simulation
weak simulation
weak-timed mutual simulation
wearable computer-based simulation
wearable simulation
web-based multi-user simulation
web-based simulation
web-centric simulation
web-enabled simulation
web-service-based simulation

Y–
yoked simulation

Z–
zero-sum simulation
zero-variance simulation

Appendix B – A list of 120 types of input

3–
3-D positional input

A–
acoustic input
actively perceived input
adequate stimulus
admissible input
alphabetical input
alphanumeric input
alternative input
ambiguous input
analog input
AND input
antenna input
anticipated fact
anticipated input
assumed goal
asynchronous input
audio input
autostimulant
aversive stimulus

B–
balanced input
batch input
bounded input
brain controlled input
brain signal
buffered input

C–
clock input
command-driven input
common-mode input
composite video input
conditioned stimulus
constant input
context-aware input
context-sensitive input
context-unaware input
conventional input
converted sensory data
counter input
credible input

D–
data input
delimiterless input
detected event
digital input
direct input
discriminative stimulus
distracting input

E–
emergent input
emotional input
endogenous input
equivalent input
evaluated input
excitation
exogenous input
external event
external excitation
external input
externally generated input

F–
fixed input
forced input

G–
geopositional input
gesture input
global position sensing input

H–
hand-gesture input
haptic input
high-level stimulus

I–
imposed input
impulse
inadequate stimulus
indirect input
inflow
input
intake
intermediate input
internal excitation
internal input
internally generated input
inverted input
inverting input
irrelevant input
irrelevant stimulus

K–
keyboard input
L–
limited input

M–
manual input
marginal input
microphone input
monotonous input
motive
multimodal input
multiple input
multisensory input

N–
neutral input
nociceptive input
nociceptive stimulus
noisy input
non-inverting input
non-speech audio input
numerical input

O–
OR input

P–
paltry input
passively accepted input
perceived endogenous input
perceived exogenous input
perceived external input
perceived input
perceived internal input
perceptual input
positional input
primary input

Q–
quantized input

R–
radar input
real-time input
reference input
relevant input
relevant stimulus

S–
self-excitation
self-test input
semantic input
sensed input
sensor input
sensory input
simulated input
single input
soft sensor input
sonar input
speech input
standard input
step input
stimulant
stimulator
stimulus
stylus input
subliminal stimulus
subthreshold stimulus
synchronized input
synchronous input
syntactic input

T–
tactile input
trigger input
two directional input

U–
unambiguous input
unconventional input
uniform input
user input

V–
variable input
video input
virtual sensor input
vision input
visual input
voice input

W–
wireless input

References
[1] Ören T.I. ‘Uses of simulation’ in Sokolowski J.A., Banks C.M. (eds.). Princi-
ples of Modeling and Simulation: A Multidisciplinary Approach. New Jersey:
John Wiley; 2009. pp. 153–179.
[2] Mittal S., Durak U., Ören T. (eds.). Guide to Simulation-Based Disciplines:
Advancing our Computational Future. Cham: Springer; 2017.
[3] Tolk A., Ören T. (eds.). The Profession of Modeling and Simulation: Discipline,
Ethics, Education, Vocation, Societies, and Economics. Hoboken, NJ: John
Wiley & Sons; 2017.
[4] Ören T.I. ‘The many facets of simulation through a collection of about 100
definitions’. SCS M&S Magazine. 2011, vol. 2(2), pp. 82–92.
[5] Ören T.I. ‘A critical review of definitions and about 400 types of modeling and
simulation’. SCS M&S Magazine. 2011, vol. 2(3), pp. 142–151.
[6] Ören T.I., Zeigler B.P. ‘Concepts for advanced simulation methodologies’.
Simulation. 1979, vol. 32(3), pp. 69–82.
[7] Ören T.I. ‘Modeling and simulation: A comprehensive and integrative view’
in Yilmaz L., Ören T.I. (eds.). Agent-Directed Simulation and Systems
Engineering. Berlin: Wiley; 2009. pp. 3–36.
[8] Ören T.I., Yilmaz L. ‘Philosophical aspects of modeling and simulation’ in
Tolk A. (ed.). Ontology, Epistemology, and Teleology of M&S: Philosophical
Foundations for Intelligent M&S Applications. Berlin, Heidelberg (Germany):
Springer-Verlag; 2013. pp. 157–172.
[9] Ören T., Mittal S., Durak U. ‘The evolution of simulation and its contribu-
tions to many disciplines’ in Mittal S., Durak U., Ören T. (eds.). Guide to
Simulation-Based Disciplines: Advancing our Computational Future. Cham
(Switzerland): Springer; 2017. pp. 3–24.
[10] Bruzzone A.G., Massei M. ‘Simulation-based military training’ in Mittal S.,
Durak U., Ören T. (eds.). Guide to Simulation-Based Disciplines: Advancing
our Computational Future. Cham (Switzerland): Springer; 2017. pp. 315–362.
[11] Bezdek W.J., Maleport J., Olshon R. ‘Live, virtual & constructive simulation
for real time rapid prototyping, experimentation and testing using network
centric operations’. AIAA Modeling and Simulation Technologies Conference
and Exhibit, Honolulu, HI, 2008.
[12] De Angelo J., George L.S., Moody J. The Link Flight Trainer: An Historic
Mechanical Engineering Landmark. ASME International, History and Her-
itage Committee, & Roberson Museum & Science Center, Binghamton, New
York, 2000.
[13] MarketsandMarkets. Flight Simulator Market by Application (Military,
Commercial), by Type of Flight (Fixed Wing, Rotary Wing, Unmanned
Aircraft), Military Component (FFS, FMS, FTD), Commercial Compo-
nent (FFS, FBS, FTD), Geography – Global Forecast to 2021 [online].
Available from http://www.marketsandmarkets.com/Market-Reports/flight-simulator-market-22246197.html
[Accessed 09 Sep 2017].
[14] Hopkin V.D. Human Factors in Air Traffic Control. Bristol, PA: CRC Press;
1995.
[15] Adacel. ATC Simulation and Training [online]. Available from
http://www.adacel.com/solutions_services/downloads/brochures/2017_MaxSim_WEB.pdf
[Accessed 11 Sep 2017].
[16] Baldwin T.T., Ford J.K. ‘Transfer of training: A review and directions for future
research’. Personnel Psychology. 1988, vol. 41(1), pp. 63–105.
[17] Valverde H.H. ‘A review of flight simulator transfer of training studies’. Human
Factors. 1973, vol. 15(6), pp. 510–522.
[18] Pool D.M., Zaal P.M.T. ‘A cybernetic approach to assess the training of manual
control skills’. IFAC-PapersOnLine. 2016, vol. 49(19), pp. 343–348.
[19] Alexander A.L., Brunyé T., Sidman J., Weil S.A. From Gaming to Training:
A Review of Studies on Fidelity, Immersion, Presence, and Buy-In and Their
Effects on Transfer in PC-Based Simulations and Games. DARWARS Training
Impact Group, Woburn, MA: 2005.
[20] Witmer B., Singer M. Measuring presence in virtual environments. U.S. Army
Research Institute for the Behavioral and Social Sciences Tech. Report No.
1014, 1994.
[21] Yeh T.Y., Faloutsos P., Reinman G. ‘Enabling real-time physics simulation in
future interactive entertainment’. Proceedings of the 2006 ACM SIGGRAPH
Symposium on Videogames; Boston, MA; 2006.
[22] Eberly D.H. Game Physics. 2nd edition, Boca Raton, FL: CRC Press; 2010.
[23] Millington I. Game Physics Engine Development. San Francisco, CA: Morgan
Kaufmann Publishers; 2007.
[24] Bullet Physics Library [online]. Available from http://bulletphysics.org/wordpress/
[Accessed 11 Sep 2017].
[25] Autodesk® Maya [online]. Available from https://www.autodesk.de/products/maya
[Accessed 11 Sep 2017].
[26] Blender [online]. Available from https://www.blender.org/ [Accessed 11 Sep
2017].
[27] Red Dead Redemption [online]. Available from
http://www.rockstargames.com/games/info/reddeadredemption [Accessed 11 Sep 2017].
[28] Toy Story 3 [online]. Available from
http://games.disney.com.au/toy-story-3-video-game [Accessed 11 Sep 2017].
[29] GameWorks PhysX Overview [online]. Available from
https://developer.nvidia.com/gameworks-physx-overview [Accessed 11 Sep 2017].
[30] Havok [online]. Available from https://www.havok.com/ [Accessed 11 Sep
2017].
[31] Boeing A., Bräunl T. ‘Evaluation of real-time physics simulation systems’.
Proceedings of the 5th International Conference on Computer Graphics and
Interactive Techniques in Australia and Southeast Asia; Perth, Australia; 2007.
[32] Iben H., Meyer M., Petrovic L., Soares O., Anderson J., Witkin A. Artistic
simulation of curly hair. Pixar Animation Studios Technical Memo 12-03a,
2012.
[33] Merida [online]. Available from http://princess.disney.com/merida [Accessed
11 Sep 2017].
[34] Brave [online]. Available from http://movies.disney.com/brave [Accessed 11
Sep 2017].
[35] Mullen T. Bounce, Tumble, and Splash!: Simulating the Physical World with
Blender 3D. Indianapolis, IN: John Wiley & Sons; 2008.
[36] Anderson L.W. A Taxonomy for Learning, Teaching, and Assessing: Pearson
New International Edition: A Revision of Bloom’s Taxonomy of Educational
Objectives, Abridged Edition. London, England: Pearson Education Limited;
2013.
[37] Ören T., Mittal S., Turnitsa C., Diallo S.Y. ‘Simulation-based learning and
education disciplines’ in Mittal S., Durak U., Ören T. (eds.). Guide to
Simulation-Based Disciplines: Advancing our Computational Future. Cham
(Switzerland): Springer; 2017. pp. 293–314.
[38] Ören T.I., Ghasem-Aghaee N., Yilmaz L. ‘An ontology-based dictionary of
understanding as a basis for software agents with understanding abilities’. Pro-
ceedings of the Spring Simulation Multiconference (SpringSim’07). Norfolk,
VA; 2007.
[39] Ören T.I. ‘Simulation and reality: The big picture’. International Journal of
Modeling, Simulation, and Scientific Computing. 2010, vol. 1(1). pp. 1–25.
[40] Ören T.I. ‘The richness of modeling and simulation and an index of its body of
knowledge’ in: Obaidat M.S., Filipe J., Kacprzyk J., Pina N. (eds.). Simulation
and Modeling Methodologies, Technologies and Applications, Advances in
Intelligent Systems and Computing. Springer; 2014. pp. 3–24.
[41] Ören T.I. ‘Simulation: Taxonomy’ in Singh M.G. (ed.). Systems and Control
Encyclopedia. Oxford, England: Pergamon Press; 1987. pp. 4411–4414.
[42] Sulistio A., Yeo C.S., Buyya R. ‘A taxonomy of computer-based simula-
tions and its mapping to parallel and distributed systems simulation tools’.
Software – Practice and Experience. 2004, vol. 34(7), pp. 653–673.
[43] Ören T.I. ‘Review of the article: A taxonomy of computer-based simulations
and its mapping to parallel and distributed systems simulation tools by Sulistio,
A., Yeo, C., Buyya, R.’. Software – Practice & Experience. ACM Computing
Reviews. 2005, vol. 34(7), pp. 653–673. February issue.
[44] Maier F.H., Größler A. ‘What are we talking about? A taxonomy of com-
puter simulations to support learning about socio-economic systems’. System
Dynamics Review. 2000, vol. 16(2), pp. 135–148.
[45] Roeder T.M.K. An information taxonomy for discrete event simulations. Ph.D.
Thesis, University of California, Berkeley, 2004.
[46] Ören T.I. ‘A basis for the taxonomy of simulation languages’. Proceedings of
the 1971 Summer Computer Simulation Conference; Boston, MA, 1971.
[47] Ören T.I. ‘Simulation and model-oriented languages: Taxonomy’ in Singh M.G.
(ed.). Systems and Control Encyclopedia. Oxford, England: Pergamon Press;
1987. pp. 4303–4306.
[48] Ören T.I. ‘Simulation languages: Taxonomy’ in Morris D., Tamm B. (eds.).
Concise Encyclopedia of Software Engineering. Oxford, England: Pergamon
Press; 1993. pp. 306–312.
[49] Rachidi H. ‘Discrete simulation software: A survey on taxonomies’. Journal
of Simulation. 2017, vol. 11(2), pp. 174–184.
[50] Ören T.I. ‘Simulation models: Taxonomy’ in Singh M.G. (ed.). Systems and
Control Encyclopedia. Oxford, England: Pergamon Press; 1987. pp. 4381–
4388.
[51] Ören T.I. ‘Simulation models symbolic processing: Taxonomy’ in Singh M.G.
(ed.). Systems and Control Encyclopedia. Oxford, England: Pergamon Press;
1987. pp. 4377–4381.
[52] Fishwick P.A. ‘A taxonomy for simulation modeling based on programming
language principles’. IIE Transactions. 1998, vol. 30(9), pp. 811–820.
[53] Hare M., Deadman P. ‘Further towards a taxonomy of agent-based simulation
models in environmental management’. Mathematics and Computers in
Simulation. 2004, vol. 64(1), pp. 25–40.
[54] Ören T.I. ‘Review of the article: Further towards a taxonomy of agent-based
simulation models in environmental management by Hare M. and Deadman P.’.
Mathematics and Computers in Simulation. ACM Computing Reviews. 2004,
vol. 64(1), pp. 25–40, May issue.
[55] Yilmaz L., Ören T.I. ‘Dynamic model updating in simulation with multimodels:
A taxonomy and a generic agent-based architecture’. Proceedings of
SCSC 2004 – Summer Computer Simulation Conference; 2004, San Jose, CA,
pp. 3–8.
[56] Ören T., Mittal S., Durak U. ‘Induced emergence in social system engineering:
Multimodels and dynamic couplings as methodological bases’ Chapter 9 in:
Mittal S., Diallo S., Tolk A. (eds.) (2018). Emergent Behavior in Complex
Systems Engineering: A Modeling and Simulation Approach. Hoboken, NJ:
Wiley; 2018, April.
[57] Lynch C.J., Diallo S.Y. ‘A taxonomy for classifying terminologies that
describe simulations with multiple models’. Proceedings of the 2015 Winter
Simulation Conference, Huntington Beach, CA, 2015, pp. 1621–1632.
[58] Smith R. ‘On the value of taxonomy in modeling’ in Tolk A. (ed.). Ontology,
Epistemology, and Teleology of M&S: Philosophical Foundations for Intel-
ligent M&S Applications. Berlin, Heidelberg (Germany): Springer-Verlag;
2013.
[59] Ören T.I. ‘Model behavior: Type, taxonomy, generation and processing tech-
niques’. in Singh M.G. (ed.). Systems and Control Encyclopedia. Oxford,
England: Pergamon Press; 1987. pp. 3030–3035.
[60] Ören T.I. ‘Software agents for experimental design in advanced simulation
environments’. Proc. of the 4th St. Petersburg Workshop on Simulation; St.
Petersburg, Russia, 2001. pp. 89–95.
[61] Wenzel S., Bernhard J., Jessen U. ‘A taxonomy of visualization techniques
for simulation in production and logistics’. Proceedings of the 2003 Winter
Simulation Conference; 2003, pp. 729–736.
[62] Le Digabel S., Wild S.M. A Taxonomy of Constraints in Simulation-based Opti-
mization. Argonne National Laboratory, Mathematics and Computer Science
Division, Preprint ANL/MCS-P5350-0515, 2015.
[63] Goldstein R., Khan A. ‘A taxonomy of event time representations’. Proceedings
of the Symposium on Theory of Modeling and Simulation (TMS/DEVS’17);
Virginia Beach, VA, 2017.
[64] Miller J.A., Baramidze G.T., Sheth A.P., Fishwick P.A. ‘DeMO: An ontology
for discrete-event modeling and simulation’. Simulation. 2011, vol. 87 (9), pp.
747–773.
[65] Tolk A. (ed.). Ontology, Epistemology, and Teleology for Modeling and Simulation:
Philosophical Foundations for Intelligent M&S Applications. Berlin,
Heidelberg (Germany): Springer; 2013.
[66] Partridge C., Mitchell A., de Cesare S. ‘Guidelines for developing ontological
architectures in modelling and simulation’ in Tolk A. (ed.). Ontology, Epis-
temology, and Teleology of M&S: Philosophical Foundations for Intelligent
M&S Applications. Berlin, Heidelberg (Germany): Springer-Verlag; 2013, pp.
27–57.
[67] Hofmann M. ‘Ontologies in modeling and simulation: An epistemological
perspective’ in Tolk A. (ed.). Ontology, Epistemology, and Teleology of
M&S: Philosophical Foundations for Intelligent M&S Applications. Berlin,
Heidelberg (Germany): Springer-Verlag; 2013. pp. 59–87.
[68] Heath B.L., Jackson R.A. ‘Ontological implications of modeling and simulation
in postmodernity’ in Tolk A. (ed.). Ontology, Epistemology, and Teleology
of M&S: Philosophical Foundations for Intelligent M&S Applications. Berlin,
Heidelberg (Germany): Springer-Verlag; 2013. pp. 89–103.
[69] Wang W., Wang W., Li Q., Yang F. ‘Ontological, epistemological, and tele-
ological perspectives on service-oriented simulation frameworks’ in Tolk A.
(ed.). Ontology, Epistemology, and Teleology of M&S: Philosophical Foun-
dations for Intelligent M&S Applications. Berlin, Heidelberg (Germany):
Springer-Verlag; 2013, pp. 335–358.
[70] Durak U., Ören T. ‘Towards an ontology for simulation systems engineering’.
Proceedings of the SpringSim’16; Pasadena, CA, 2016.
[71] Ören T.I. ‘Computer-aided modelling systems’ in Cellier F.E. (ed.). Progress
in Modelling and Simulation. London: Academic Press; 1982. pp. 189–203.
[72] Ören T.I., Zeigler B.P. ‘System theoretic foundations of modeling and simula-
tion: A historic perspective and the legacy of A. Wayne Wymore’. Simulation.
2012, vol. 88(9), pp. 1033–1046.
[73] Zeigler B.P. Multifacetted Modeling and Discrete Event Simulation. London:
Academic Press; 1984.
[74] Simon H.A., Newell A. ‘Simulation of human thinking’ in Greenberger M.
(ed.). Computers and the World of the Future. Cambridge, MA: The MIT Press;
1962. pp. 94–131.
[75] Feigenbaum E.A., Feldman J. (eds.). Computers and Thought. McGraw-Hill
Book Company; 1963.
[76] Ören T.I. ‘Artificial intelligence and simulation: A typology’. Proceedings of
the 3rd Conference on Computer Simulation; Mexico City, 1995.
[77] Yilmaz L., Ören T.I. (eds.). Agent-Directed Simulation and Systems Engineer-
ing. Berlin: Wiley-Berlin; 2009.
[78] Ören T.I., Yilmaz L. ‘Synergy of systems engineering and modeling and
simulation’. Proceedings of the 2006 International Conference on Modeling
and Simulation – Methodology, Tools, Software Applications (M&S MTSA);
Calgary, AB, Canada, 2006.
[79] NATO-SaaS. Modeling and Simulation as a service: New concepts and
service-oriented architectures. NATO STO Technical Report AC/323(MSG-
131)TP/608, 2015.
[80] Ören T.I., Zeigler B.P., Elzas M.S. (eds.). Simulation and model-based
methodologies: An integrative view. Berlin: Springer-Verlag; 1984.
Chapter 2
Flexible modeling with Simio
David T. Sturrock¹ and C. Dennis Pegden¹

2.1 Overview

Simio is an object-oriented (OO), general-purpose modeling tool that can be applied in
a broad set of applications including manufacturing, transportation, logistics,
healthcare, and communication networks. Simio includes a comprehensive tool set for
building models, providing 3D animation, verifying and validating the model,
experimenting and optimizing results, and real-time planning and scheduling. Figure 2.1
illustrates a student model [1] of a rental car parking lot that has taken advantage of
3D animation to illustrate the solution. In this chapter, we describe the
general-purpose modeling features and capabilities of Simio and, where appropriate,
relate these concepts to the modeling of communication networks.

2.2 Simio object framework

Simio is a simulation modeling framework based on intelligent objects. The intelligent
objects are built by modelers and then may be reused in multiple modeling projects.
Simio comes with pre-built libraries of objects. For example, the Standard Library
is a set of general-purpose objects (source, server, path, sink, etc.) that is commonly
used to model a wide range of discrete systems. Likewise, the Flow Library is a
set of general-purpose objects (e.g., tank, pipe, filler) that is used to model systems
involving material flows such as liquids, sand, gravel, etc. Many other libraries are
also available, such as the Extras library that represents cranes, elevators, robots,
and more.

Figure 2.1 3D animation of rental car operation

¹ Simio LLC, USA

In many cases, a modeling project is approached by first building a custom library
of special purpose objects, and then those objects are used as building blocks for creat-
ing a model. For example, a complex communication network involving ships, tanks,
airplanes, command centers, satellites, etc. can be modeled by first creating objects
representing each of the physical components and then placing multiple instances of
these objects into the final model. Objects can be stored in libraries and easily shared.
A beginning modeler may prefer to use pre-built objects from libraries; however, the
system is designed to make it easy for even beginning modelers to build their own
intelligent objects.
As noted above, a Simio model is built by combining objects that represent the
physical components of the system. A Simio model looks like the real system. The
model logic and animation are built in a single step. An object is animated in 3D to
reflect the physical object and its changing state. For example, a robot opens and
closes its gripper, and a battle tank turns its turret. The animated model provides a
moving picture of the system in operation. To simplify the effort of building animated
3D models, Simio can import 2D and 3D background objects as well as 2D and 3D
object representations from the target domain. Simio also provides a direct link to
Trimble 3D Warehouse, a free massive online library of 3D graphic symbols that
contains high-quality 3D symbols from virtually every domain.
Objects are built using graphical processes and the concepts of object-orientation.
There is no need to write programming code to create new custom objects. The activity
of building an object in Simio is identical to the activity of building a model—in fact,
there is no difference between an object and a model. This concept is referred to as
the equivalence principle and is central to the design of Simio. Whenever you build a
model, it is an object that can be instantiated into another model. For example, if you
combine two satellite dishes and six missile launchers into a missile defense battery,
the missile defense battery model is itself an object (see Figure 2.2) that can then
be instantiated any number of times into other models. The missile defense battery
model is an object just like the satellite dish and missile launchers are objects. In
Simio, there is no way to separate the idea of building a model from the concept
of building an object. Every model that is built in Simio is automatically a building
block that can be used in building higher level models.
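
The equivalence principle can be sketched in ordinary code, although Simio itself
builds objects graphically rather than programmatically. The following Python sketch
is an illustration only: the class ModelObject, its method leaf_count, and the
function make_battery are hypothetical names and are not part of Simio. It shows a
single type serving as both "model" and "object", so that a finished model can be
placed inside a higher level model.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class ModelObject:
    """Hypothetical stand-in for a Simio model; every model is also an object."""
    name: str
    components: List["ModelObject"] = field(default_factory=list)

    def leaf_count(self) -> int:
        """Count the primitive (component-free) objects inside this model."""
        if not self.components:
            return 1
        return sum(c.leaf_count() for c in self.components)


def make_battery() -> ModelObject:
    """Build a battery model from two satellite dishes and six launchers."""
    dishes = [ModelObject("SatelliteDish") for _ in range(2)]
    launchers = [ModelObject("MissileLauncher") for _ in range(6)]
    return ModelObject("MissileDefenseBattery", dishes + launchers)


# The battery model is itself instantiated, twice, into a higher level model.
theater = ModelObject("Theater", [make_battery(), make_battery()])
print(theater.leaf_count())  # 16
```

Because the battery and the theater are instances of the same type, no separate
notion of "model" is needed; nesting can continue to any depth.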

Figure 2.2 Placing a missile defense battery object in a model

Composite objects: The previous example in which we defined a new object
definition (missile defense battery) by combining other objects (satellite dish and
missile launcher) is one example of how we can create object definitions in Simio.
This type of object is called a composed object because we create this object by
combining two or more component objects. This object-building approach is fully
hierarchical, i.e., a composed object can be used as a component object in building
higher level objects. This is only one way of building objects in Simio; there are
two other important methods.
Base objects: The most basic method for creating objects in Simio is by defining
the logical processes that alter their state in response to events. For example, a
router object might be built by defining the processes that alter the router state as
events occur such as packet arrivals, breakdowns, etc. This type of modeling is
like the process modeling done in traditional modeling systems in use today such
as Arena™ or GPSS™. A base object can in turn be used as a component object
for building higher level objects.
Derived objects: The final method for building objects in Simio is based on the
concept of inheritance. In this case, we create an object from an existing object
by overriding (i.e., replacing) one or more processes within the object, or adding
additional processes to extend its behavior. In other words, we start with an object
that is almost what we want, and then we modify and extend it as necessary to
make it serve our own purpose. Whereas in a programming language we extend or
override behavior by writing methods, in Simio we extend or override behavior by
adding and overriding graphically defined process
models. With Simio, the skills required to define and add new objects to the system
are modeling skills, not programming skills.
As an example of the idea of inheritance, we might build a specialized satellite
dish from a generalized satellite dish object by adding additional processes to handle
the failure and replacement of one of its critical components. An object that is built
in this way is referred to as a derived object because it is subclassed from an existing
object.
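
For readers who think in code, the relationship between a base object and a derived
object can be approximated with class inheritance. This is an analogy under stated
assumptions, not Simio's mechanism: Simio processes are defined graphically, and
every name below (SatelliteDish, FailureProneDish, the event names, and the state
keys) is hypothetical. The base object maps events to processes that change its
state; the derived object overrides one process and adds two more to handle failure
and replacement of a component.

```python
from typing import Callable, Dict


class SatelliteDish:
    """Hypothetical base object: state plus event-driven processes."""

    def __init__(self) -> None:
        self.state = {"packets_received": 0, "operational": True}
        # Each process is looked up by the name of the event that triggers it.
        self.processes: Dict[str, Callable[[], None]] = {
            "packet_arrival": self.on_packet_arrival,
        }

    def on_packet_arrival(self) -> None:
        self.state["packets_received"] += 1

    def handle(self, event: str) -> None:
        """Run the process registered for this event; ignore unknown events."""
        process = self.processes.get(event)
        if process is not None:
            process()


class FailureProneDish(SatelliteDish):
    """Hypothetical derived object: overrides one process and adds two."""

    def __init__(self) -> None:
        super().__init__()
        self.processes["component_failure"] = self.on_component_failure
        self.processes["component_replaced"] = self.on_component_replaced

    def on_packet_arrival(self) -> None:
        # Overridden process: packets are dropped while the dish is down.
        if self.state["operational"]:
            super().on_packet_arrival()

    def on_component_failure(self) -> None:
        self.state["operational"] = False

    def on_component_replaced(self) -> None:
        self.state["operational"] = True


dish = FailureProneDish()
for event in ["packet_arrival", "component_failure", "packet_arrival",
              "component_replaced", "packet_arrival"]:
    dish.handle(event)
print(dish.state["packets_received"])  # 2
```

The packet that arrives while the component is failed is not counted, which is the
only behavioral difference between the derived dish and the general one.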
Regardless of which method is used to create an object, once created it is used
in the same way. An object can be instantiated any number of times into a model. You
simply select the object of interest and place it (instantiate it) into your model.
Entity is a class of objects that represents physical items in the system that can be
created and destroyed and moved through 3D space, such as ships, satellites, tanks,
etc., or packets of information that move between physical items. When modeling
systems that comprise the Internet of Things (IoT), the objects can represent the
“things” or the “messages” that are sent back and forth between the things.
Objects in Simio have static inputs referred to as properties, and dynamic vari-
ables referred to as states. For example, a missile might have a property that specifies
its maximum travel speed, and states that specify its current 3D position, direction,
and speed. The number and types of properties and states are defined by the creator
of the object. Both properties and states are strongly typed and can be defined as
numeric, Boolean, list, etc. There are two basic types of states: discrete and con-
tinuous. A discrete state is a value that only changes at event times (packet arrival,
machine breakdown, etc.). A continuous state (e.g., degrees of tank turret rotation,
position of a satellite, etc.) has a value that may change continuously over time;
the change is described by specifying its rate of change.
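The distinction between properties, discrete states, and continuous states can be sketched in a few lines of Python. This is an illustrative model under the assumption that the rate is constant between events; the class and attribute names are hypothetical and do not come from Simio.

```python
from dataclasses import dataclass


@dataclass
class ContinuousState:
    """A state whose value changes linearly between events at a given rate."""
    value: float
    rate: float = 0.0
    last_update: float = 0.0

    def at(self, now):
        # Value at time `now`, integrating the constant rate since the last update.
        return self.value + self.rate * (now - self.last_update)

    def set_rate(self, now, rate):
        # An event changes the rate: first bring the stored value up to date.
        self.value = self.at(now)
        self.last_update = now
        self.rate = rate


class Turret:
    def __init__(self, max_rate_deg_per_s):
        self.max_rate_deg_per_s = max_rate_deg_per_s  # property: static input
        self.shots_fired = 0                          # discrete state
        self.angle = ContinuousState(value=0.0)       # continuous state


turret = Turret(max_rate_deg_per_s=10.0)
turret.angle.set_rate(now=0.0, rate=turret.max_rate_deg_per_s)
turret.angle.set_rate(now=3.0, rate=0.0)  # event at t=3: stop rotating
turret.shots_fired += 1                   # discrete change at the same event
print(turret.angle.at(5.0))  # 30.0
```

Only the rate is touched at event times; the value in between is derived on demand, which is why a continuous state does not need an event for every change.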

2.3 Simio object classes

There are six basic classes of objects in Simio. These six classes of objects provide
a starting point for creating intelligent objects within Simio. By default, all six of
these classes of objects (Figure 2.3) have very little native intelligence, but all can
gain intelligence. You build intelligent versions of these objects by modeling their
behavior as a collection of event-driven processes. The Standard Library included
with Simio provides a rich and customizable set of objects that are derived from these
six basic classes of objects.
The first class is Fixed Objects. This object has a fixed location in the model
and is used to represent the things in your system that do not move from one location
to another. Fixed objects are used to represent stationary equipment such as routers,
servers, etc.
Links and Nodes are objects that are used to build networks over which entities
may flow. Note that links and nodes can model networks for both communication
movements and physical movements. A link defines a pathway for entity movement
between objects. A node defines a starting or ending point for a link. Links and
nodes can be combined into complex communication and physical networks. Although
the base link has little intelligence, we can add behavior to allow it to model
unconstrained flow, congested traffic flow, or complex material handling systems
such as accumulating conveyors or power and free conveying systems.

Figure 2.3 Basic Simio object classes (Intelligent object; Fixed, Link, Node, Agent; Entity; Transporter)

Agents are objects that can freely move through three-dimensional space. Agents
are also typically used for developing agent-based models. This modeling view is
useful for studying systems that are composed of many independently acting
intelligent objects that interact with each other and in so doing create the
overall system behavior. Examples of applications include market acceptance of a
new product or service, or population growth of competing species within an
environment. Note that in Simio, graphically defined processes provide the
intelligence that controls an object’s behavior, rather than requiring Java or
other programming code as in most other products.
Entities are objects that can freely move through three-dimensional space. Entities
can move through the system from object to object over a network of links and nodes
or move directly between objects through free space. Examples of entities include
communications such as information packets, or physical items such as tanks, satel-
lites, ships, etc. Note that in traditional modeling systems, the entities are typically
passive and are acted upon by the model processes. However, in Simio, the entities
can have intelligence and control their own behavior.
The final class of object is a Transporter and is subclassed from the entity class.
A transporter is an entity that has the added capability to pick up, carry, and drop-off
one or more other entities. By default, transporters have none of this behavior, but by
adding model logic to this class, we can create a wide range of transporter behaviors.
A transporter can model an airplane, ship, subway car, automated guided vehicle
(AGV), or any other object that can carry other entities from one location to another.
The Standard Library contains a vehicle object and a worker object, both of which
are derived from a transporter object.
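As a rough analogy for how a transporter extends an entity, the sketch below adds pick-up, carry, and drop-off behavior to a minimal entity class. The names, the capacity rule, and the same-location check are assumptions for illustration, not Simio's definitions.

```python
class Entity:
    def __init__(self, name, location):
        self.name = name
        self.location = location

    def move_to(self, location):
        self.location = location


class Transporter(Entity):
    """An entity that can also carry other entities (hypothetical sketch)."""

    def __init__(self, name, location, capacity=1):
        super().__init__(name, location)
        self.capacity = capacity
        self.riders = []

    def pick_up(self, entity):
        # Only pick up entities at the same location, and only if there is room.
        if entity.location != self.location or len(self.riders) >= self.capacity:
            return False
        self.riders.append(entity)
        return True

    def move_to(self, location):
        # Carried entities move together with the transporter.
        super().move_to(location)
        for rider in self.riders:
            rider.move_to(location)

    def drop_off(self):
        dropped, self.riders = self.riders, []
        return dropped


agv = Transporter("AGV1", location="A", capacity=2)
pallet = Entity("Pallet1", location="A")
agv.pick_up(pallet)
agv.move_to("B")
agv.drop_off()
print(pallet.location)  # B
```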
A key feature of Simio is the ability to create a wide range of object behaviors
from these six basic classes. The Simio modeling framework is application domain
neutral—i.e., these basic classes are not specific to communications, manufacturing,
service systems, healthcare, military, etc. However, it is easy to build
application-focused libraries comprising intelligent objects from these classes
designed for a specific application. For example, it is relatively simple to build an object (in this
case a link) that represents a complex accumulating conveyor for use in manufacturing
applications. The design philosophy of Simio directs that this type of domain-specific
logic belongs to the objects that are built by users, and not programmed into the core
system.

2.4 Modeling movements

A major focus in modeling typical communication networks is representing the
movement of both physical items and information packets through the system. In Simio,
both items are modeled with entities, which move through the 3D model in one of two
ways. The first is to simply move in free space with no constraints in movement. In
this case, the entity can set its own direction, speed, and acceleration. In free space, the
entity is in complete control of its own movement. The second method is to move over
a network of nodes and links, where the network may control and limit the movements
of the entities. Networks are very useful for modeling complex movements.

Figure 2.4 Example network

Networks comprise one or more links, where each link starts and ends at a
node. A node can have any number of incoming and outgoing links. Links can be
unidirectional or bidirectional, have a capacity that limits traffic on the link, and can
have a maximum speed to limit traffic speed. Links also have a selection weight that
can be used in decision rules for routing entities through the network. The example
network in Figure 2.4 has six nodes (labeled A–F) and ten links connecting the nodes,
where the triangles are entities moving through the network.
The complete set of all links in a model is referred to as the global network.
However, links can also belong to one or more subnetworks. For example, the com-
munication links between a set of satellite dishes might be represented by a subnetwork
that is limited for use by signals traveling between satellite dishes, and pathways where
ships travel may be specified by a separate network.
The Standard Library contains four link objects and two node objects. The con-
nector, path, time path, and conveyor are derived from the link object and the basic
node and the transfer node are derived from the node object. The connector moves
entities across the link in zero-time. This type of link is used to model movements
such as signals that travel at the speed of light, for which the travel time is negligible
and can be ignored. The path is a type of link used to model entity movements where
each entity can travel at its own speed and either pass or not pass other entities based
on a property that is specified on the path. The time path is used to model situations
where the travel time on the link is specified by an expression (perhaps involving
random variables and other system status variables). The conveyor is a type of link
that is used to model both accumulating and non-accumulating conveyors that are
found in typical manufacturing and warehousing applications. Although the links and
nodes provided by the Standard Library work for many applications, users can also
create their own custom nodes and links.
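The difference between the connector, path, and time path comes down to how travel time is obtained. A minimal sketch, assuming a dictionary-based link description with hypothetical keys (`kind`, `length`, `max_speed`, `time`); the conveyor is omitted because its accumulation logic does not reduce to a single formula.

```python
import random


def travel_time(link, entity_speed, rng):
    """Time for one entity to cross a link, by link kind (illustrative only)."""
    kind = link["kind"]
    if kind == "connector":
        return 0.0  # zero-time transfer
    if kind == "path":
        # The entity uses its own speed, capped by the link's speed limit.
        speed = min(entity_speed, link.get("max_speed", float("inf")))
        return link["length"] / speed
    if kind == "time_path":
        return link["time"](rng)  # an expression, possibly random
    raise ValueError(f"unknown link kind: {kind}")


rng = random.Random(42)
uplink = {"kind": "connector"}
road = {"kind": "path", "length": 100.0, "max_speed": 4.0}
ferry = {"kind": "time_path", "time": lambda r: 20.0 + r.uniform(0.0, 5.0)}

print(travel_time(uplink, 5.0, rng))  # 0.0
print(travel_time(road, 5.0, rng))    # 25.0
```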
Each entity in the network can have a single destination where it is headed, or
it can follow a specified travel sequence through the network. A travel sequence is
an ordered list of nodes (e.g., A, C, D, F in Figure 2.4) that must be visited in the
specified order on the way to its last node in the sequence. In either case, an entity
may have more than one possible routing to its next destination. For example, an
entity traveling from C to E could either take the direct path from C to E or travel
from C to D and then D to E. This might be advantageous, for example, if the link
from C to E were congested, and the travel speed on the alternate route through D was
faster and warranted the extra travel distance. The decision for which route to take
when moving to its next intermediate or final destination is based on properties that
are specified on the transfer node. The Outbound Link Rule property specifies whether
the link is selected based on the shortest path or on decision weights that can
be assigned to each link. The Link Preference property specifies whether all links
are to be considered, only links that are currently available, or a specific link.
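The two outbound link rules can be sketched for the C-to-E example above. The link lengths and selection weights below are invented for illustration, and the weighted rule shown here (random choice in proportion to weight) is one plausible reading of "decision weights", not a statement of how Simio evaluates them.

```python
import heapq
import random

# Hypothetical directed links: (from, to) -> (length, selection weight).
links = {
    ("C", "E"): (10.0, 1.0),
    ("C", "D"): (4.0, 3.0),
    ("D", "E"): (5.0, 1.0),
}


def shortest_distance(start, goal):
    """Dijkstra's algorithm over the link lengths."""
    best = {start: 0.0}
    queue = [(0.0, start)]
    while queue:
        dist, node = heapq.heappop(queue)
        if node == goal:
            return dist
        if dist > best.get(node, float("inf")):
            continue  # stale queue entry
        for (src, dst), (length, _) in links.items():
            if src == node and dist + length < best.get(dst, float("inf")):
                best[dst] = dist + length
                heapq.heappush(queue, (best[dst], dst))
    return float("inf")


def next_link_shortest_path(node, goal):
    """Pick the outbound link that lies on a shortest path to the goal."""
    outbound = [link for link in links if link[0] == node]
    return min(outbound,
               key=lambda link: links[link][0] + shortest_distance(link[1], goal))


def next_link_by_weight(node, rng):
    """Pick an outbound link at random, in proportion to its selection weight."""
    outbound = [link for link in links if link[0] == node]
    weights = [links[link][1] for link in outbound]
    return rng.choices(outbound, weights=weights, k=1)[0]


print(next_link_shortest_path("C", "E"))  # ('C', 'D')
```

With these assumed lengths the detour through D (4 + 5 = 9) beats the direct link (10), so the shortest-path rule sends the entity toward D first.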

2.5 Modeling physical components

When modeling communication networks, it is also important to be able to model the
physical components of the system, such as vehicles, machines, workers, etc. This
can be done using objects in the Standard Library, Flow Library, or other custom
libraries. Although custom libraries are the most flexible, many applications can be
done using the Standard Library. The Standard Library consists of pre-built objects
to model a wide range of systems. We have already briefly discussed the four link
objects and two node objects that are included in this library. Table 2.1 summarizes
the 15 objects in the Standard Library.
The Flow Library is designed to represent situations where flow is continuous
(e.g., fluid in a pipe or ore on a conveyor) or so fast that it can best be modeled
as continuous (e.g., pills on a conveyor). The Flow Library also handles conversion
between discrete and flow (e.g., a filling machine that converts continuous fluid flow
into filled bottles). Table 2.2 summarizes the ten objects in the Flow Library.

Table 2.1 Objects in Simio Standard Library

Name           Class        Description
Source         Fixed        Creates entities that arrive to the system
Sink           Fixed        Destroys entities and records statistics
Server         Fixed        Models a multichannel service process with input/output queues
Resource       Fixed        Models a resource that can be used by other objects
Combiner       Fixed        Combines entities in batches
Separator      Fixed        Separates entities from batches
Workstation    Fixed        Models a three-phase workstation with setup, processing, and teardown
Vehicle        Transporter  Carries entities between objects and serves entities at a fixed location
Worker         Transporter  Carries entities between objects and serves entities at a fixed location
Basic node     Node         A simple intersection of links
Transfer node  Node         An intersection where entities set destination and wait on transporters
Connector      Link         A zero-time connection between two nodes
Path           Link         A pathway between two nodes where entities travel based on speed
Time path      Link         A pathway with a specified travel time
Conveyor       Link         An accumulating/non-accumulating conveyor device

Table 2.2 Objects in Simio Flow Library

Name                    Class   Description
Flow source             Fixed   Generates a flow of fluid or other mass of a specified entity type
Flow sink               Fixed   Destroys flow entities representing quantities of fluids or other mass that have finished processing in the model
Tank                    Fixed   Models a volume or weight capacity-constrained location for holding entities representing quantities of fluids or other mass
Container entity        Entity  Models a type of simple moveable container (e.g., barrels or totes) for carrying flow entities representing quantities of fluids or other mass
Filler                  Fixed   Fills containers with flow entities representing quantities of fluids or other mass
Emptier                 Fixed   Empties the flow contents of container entities
Item to flow converter  Fixed   Converts entities representing discrete items into flow entities representing quantities of fluids or other mass
Flow to item converter  Fixed   Converts flow entities representing quantities of fluids or other mass into entities representing discrete items
Flow node               Node    Regulates the flow of entities representing quantities of fluid or other mass
Flow connector          Link    A zero-time connection between two flow nodes

Figure 2.5 Selected properties of source object

Every object has properties that control its behavior. For example, the source
object properties view shown in Figure 2.5 has an arrival mode property that specifies
the mechanism for creating entities (either based on an interarrival time, data in a table,
an event, or based on a time-varying arrival rate). Each property value may in turn
switch on additional properties; e.g., if the arrival mode is specified as interarrival
time, then a property is switched on that specifies the expression for computing the
interarrival time (typically a random variable). All properties can be used for either
deterministic or stochastic arrivals. Other properties provide flexibility in terminating
the arrival stream, for example, after a specified time or specified number of arrivals.
Many objects also use events to customize their behavior. Events let objects easily
communicate with other objects. For example, an event triggered elsewhere in the
model might cause a source object to create an arrival or entirely stop creating new
arrivals.
Figure 2.6 shows a simple model built using the source, server, and sink objects,
along with path links to define the movements between these objects. In this example,
entities are created at the source, travel to the server where they queue up and wait
for processing, and then travel to the sink where they depart the model.
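To make the source, server, and sink flow concrete, here is a minimal discrete-event sketch in plain Python. It assumes a single-capacity server with a first-in, first-out queue, exponential interarrival and service times, and zero travel time on the paths; none of these settings come from the model in Figure 2.6, and the function and parameter names are hypothetical.

```python
import heapq
import random


def simulate(num_entities, mean_interarrival, mean_service, seed=1):
    """Single-server FIFO queue: Source -> Server -> Sink."""
    rng = random.Random(seed)
    events = []  # min-heap of (time, sequence, kind)
    t = 0.0
    for i in range(num_entities):  # Source: schedule all arrivals up front
        t += rng.expovariate(1.0 / mean_interarrival)
        heapq.heappush(events, (t, i, "arrive"))

    waiting = []                # arrival times of queued entities
    busy = False
    in_service_arrival = None   # arrival time of the entity being served
    times_in_system = []        # statistic recorded at the Sink
    seq = num_entities

    while events:
        now, _, kind = heapq.heappop(events)
        if kind == "arrive":
            waiting.append(now)
        else:  # "depart": the entity leaves the model through the Sink
            times_in_system.append(now - in_service_arrival)
            busy = False
        if not busy and waiting:  # Server: start the next entity if idle
            in_service_arrival = waiting.pop(0)
            busy = True
            seq += 1
            service = rng.expovariate(1.0 / mean_service)
            heapq.heappush(events, (now + service, seq, "depart"))
    return times_in_system


results = simulate(num_entities=1000, mean_interarrival=1.0, mean_service=0.8)
print(len(results), sum(results) / len(results))
```

The three roles are visible as three pieces of logic: arrivals are scheduled, the server starts work whenever it is idle and something is waiting, and the sink records time in system on departure.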
The objects shown in Figure 2.6 all have their default generic graphics; in a
typical model, these would be replaced by more appropriate graphics. For example,
if the server represented an ATM at a bank, we would typically replace
the rectangular server symbol with a graphic symbol of an ATM from 3D
Warehouse. We could also replace the triangles representing entities with animated
walking people. Figure 2.7 enhances that same model with the default graphics for
the server and entity replaced, which required only a few minutes to create.

Figure 2.6 Example model using source, server, and sink objects (Source1, Server1, Sink1)

Figure 2.7 Example model with domain-specific 3D animation (Source1, Server1, Sink1)

The server object that is used in this simple model is one of the most powerful
and commonly used objects in the Standard Library. It can model a wide range of
physical elements of a system that constrain the movement of entities based on one
or more activities that must take place, secondary resources that may be required, and
material that may be consumed. Figure 2.8 illustrates many of the common properties
of the server object as well as an optional attached pie chart in the facility view that
indicates possible resource states.

Figure 2.8 Selected server properties and optional graphics (pie chart of resource states: Starved, Processing, Blocked, Failed, Off shift, Failed processing, Off shift processing, Setup, Off shift setup)

The server can model multichannel processors, follow complex work schedules,
and incorporate failure/repair patterns. The server can also model complex operations
composed of a network of tasks that follow precedence relationships and operate
in parallel and/or sequentially. For example, Figure 2.9 illustrates a generic six-step
task sequence where each task has prerequisite tasks. Not only are the number and
relationships of tasks unlimited, but each individual task could require resources or
materials, or even be defined to execute one or more other objects, which themselves
might have networks of tasks.
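A task network with precedence relationships can be evaluated by visiting tasks in topological order. The sketch below computes the earliest finish time of each task, assuming unlimited resources. The precedence arrows and durations are assumptions (the extracted figure does not preserve them), and `graphlib` requires Python 3.9 or later.

```python
from graphlib import TopologicalSorter

# Assumed precedence (task -> prerequisites) and durations in minutes.
prerequisites = {
    "Task1": [],
    "Task2": ["Task1"],
    "Task3": ["Task1"],
    "Task4": ["Task2"],
    "Task5": ["Task3"],
    "Task6": ["Task4", "Task5"],
}
duration = {"Task1": 2, "Task2": 3, "Task3": 1, "Task4": 2, "Task5": 5, "Task6": 1}

finish = {}
for task in TopologicalSorter(prerequisites).static_order():
    # A task starts as soon as its slowest prerequisite has finished.
    start = max((finish[p] for p in prerequisites[task]), default=0)
    finish[task] = start + duration[task]

print(finish["Task6"])  # 9
```

Here the two branches run in parallel, and Task6 waits for the slower one (Task1, Task3, Task5), which finishes at minute 8.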
The worker and vehicle are two other objects that are commonly used in Simio
models. The worker object is used to model operators or crew members that move
around the system and perform tasks. For example, a server may request that a worker
come to the server to set it up before processing an entity. The vehicle object
is used to model ships, trucks, AGVs, etc., that travel through the model, picking up
and dropping off entities. Vehicles have flexible work selection and allocation logic,
reliability logic, and many options to control both behavior and animation such as
load and unload time, dwell logic, and automatic parking and homing options.

Figure 2.9 Generic parallel/serial task sequence (Task1 through Task6)

While each object has object-specific properties as mentioned above, each object
also has categories of properties that are found across many different objects. For
example, objects that incorporate buffers or queues typically have a Buffer Logic
category that contains properties to describe the capacity of those buffers, as well
as the logic that governs balking (bypass queue entry) and reneging (abort queue
waiting). Most objects have a Financials category that specifies the properties to
support comprehensive activity-based costing in all world currencies.
Objects that typically represent some type of machine or equipment have a
Reliability category where failure-related properties such as downtime mode, period
between downtimes, and time to repair are specified. Downtime modes include
calendar or processing time between failures, processing count between failures, and
event-based failures.
Many objects also have categories to provide higher level interaction with other
objects such as state assignments, statistics, customized animation, and data log-
ging. Two broad interaction mechanisms—processes and data tables—are discussed
in Sections 2.6 and 2.7, respectively.

2.6 Processes

The use of library objects permits fast, highly productive modeling. But unless the
library is designed to closely match your application, you will often have to customize
objects in order to model accurately enough to meet project objectives. In most OO
simulation products, this customization can only be accomplished by modifying the
object definition using programming code like Java, C++, or a proprietary language.
Doing so takes a level of expertise often not readily available. Simio provides two
alternatives, both based on the patented concept of processes.
A process is a graphical way of defining the logic behind an object. Processes can
be used to make decisions, seize or release resources, search collections of objects or
data, wait for or trigger communications events, assign state variables, record custom
statistics, and much more. Figure 2.10 illustrates a process that makes a decision, then
either seizes, delays, and releases a resource or waits for an event, assigns a state, and
then records an observational statistic. Although over 60 steps are available, the most
commonly used steps are described in Table 2.3.

Figure 2.10 Example process logic (steps shown: Decide, Seize, Delay, Release, Wait, Assign, Tally)

Table 2.3 Commonly used process steps

Assign step is used to assign a new value to a state variable
Create step is used to generate objects into the system
Decide step may be used to determine the flow of a token through process logic
Delay step delays the arriving token in the step for the specified time duration
Destroy step destroys either the parent object or the executing token’s associated object
EndTransfer step may be used to indicate that the entity object associated with the executing token has completed transfer into an object or station
Execute step may be used to execute a specified process
Find step may be used to search the value of an expression over a specified range of one or more index variables. The expression will typically involve array variables (vectors or multidimensional arrays) or indexing-related functions
Fire step may be used to fire an object event
Move step may be used to request a move from one or more moveable resources that have been seized by either the parent object or object associated with the executing token. The executing token will be held in the Move step until the resources have arrived to the requested locations
Release step releases capacity of one or more objects on behalf of the parent object or the object associated with the executing token
Scan step may be used to hold a process token at the step until a specified condition is true
Search step may be used to search a collection of objects
Seize step may be used to seize capacity of one or more objects on behalf of the parent object or the object associated with the executing token
SetNode step may be used to set the destination node of any entity object
Tally step tallies an observation for each token arriving to this step
Transfer step may be used to transfer the entity object associated with the executing token between objects and between free space and objects
Wait step may be used to hold the arriving token in the step until a specified event occurs

Figure 2.11 Server add-on process triggers

A process can be used in an object definition to define the logic in a new object
or customize the logic in a subclassed object. But in many cases an even simpler
alternative is available. Most library objects have “hooks” called add-on process triggers
that can be used to supplement the logic in a specific object instance. Figure 2.11
illustrates the add-on process triggers available in the server object. These support the
application of custom logic based on triggers related to starting/ending the
processing, starting/ending a failure, going on/off shift, and much more. This allows users to
graphically specify custom logic, for example, examining the current system state
before deciding what to do with any entities in process when a server is about to
transition off shift.
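In programming terms, an add-on process trigger behaves like a per-instance callback hook. The sketch below is an analogy only: the trigger names, the `Server` class, and the rule for which entities stay are hypothetical, and in Simio the attached logic would be a graphical process rather than a function.

```python
class Server:
    """Hypothetical server with named hooks that a modeler can fill in."""

    TRIGGERS = ("processing", "processed", "failed", "repaired", "off_shift")

    def __init__(self, name):
        self.name = name
        self.in_process = []
        self.add_ons = {trigger: [] for trigger in self.TRIGGERS}

    def add_on(self, trigger, func):
        # Attach extra logic to this one instance without changing the class.
        self.add_ons[trigger].append(func)

    def fire(self, trigger):
        for func in self.add_ons[trigger]:
            func(self)

    def go_off_shift(self):
        # Run any add-on logic before the built-in off-shift behavior.
        self.fire("off_shift")


def finish_or_eject(server):
    # Custom logic: keep entities that are nearly done, eject the rest.
    server.in_process = [e for e in server.in_process if e["remaining"] <= 1.0]


server1 = Server("Server1")
server1.add_on("off_shift", finish_or_eject)
server1.in_process = [{"id": 1, "remaining": 0.5}, {"id": 2, "remaining": 4.0}]
server1.go_off_shift()
print([e["id"] for e in server1.in_process])  # [1]
```

Because the hook is attached to `server1` only, other server instances keep the default behavior, which is the point of customizing an instance rather than subclassing.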
In addition to the flexibility in easily customizing logic, processes provide two
other important benefits: spanning time and allowing parallel processing. In most
programming approaches to object customization, the logic cannot span time, so the
seize-delay-release sequence and the alternate wait sequence illustrated in
Figure 2.10 would not be possible in a single program function. Another advantage is
that while entities or agents are the most primitive construct in most simulation
products, Simio has the concept of Tokens, which are delegates of an object; tokens
execute processes. Since an object can have many tokens, an object can take multiple
actions at once by using multiple tokens. For example, a server object might execute
a setup process while at the same time summoning a mechanic to do a concurrent repair
operation.

2.7 Data tables

Simio contains a set of process steps that provide comprehensive input/output
capabilities including the ability to read/write CSV files, Excel™ files, and a wide
variety of database files. But processing such files incrementally during a model run
can often be slow and inconvenient. So Simio extends this commonly available
capability to also create in-memory data repositories called data tables. Because they
are held in memory, data tables can be accessed very quickly during a run.

Figure 2.12 Relational tables with partial master-detail view expansion

The schema or design of data tables is under user control; you can have any
number of columns of different data types, in any order you want. A table can be
designed to be most convenient to the modeler or could be designed to perfectly
match an external data source to avoid transforming the data on each use. Data tables
can be simple tables, like a spreadsheet, or can be comprehensive sets of hierarchical
relational tables linked by keys and foreign keys. Figure 2.12 illustrates a set of
three tables, Job Table, Process Plan, and WIP, that are related by a key field. The
master-detail view is expanded on the first part type to show the relationships.
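The key and foreign-key relationship between a master table and its detail rows can be illustrated with an in-memory SQLite database from the Python standard library. SQLite is a stand-in here, not what Simio uses, and the column names and sample rows are invented for the example.

```python
import sqlite3

db = sqlite3.connect(":memory:")  # in-memory, so lookups need no file I/O
db.executescript("""
    CREATE TABLE job_table (part_type TEXT PRIMARY KEY, priority INTEGER);
    CREATE TABLE process_plan (
        part_type TEXT REFERENCES job_table(part_type),
        step INTEGER,
        station TEXT,
        minutes REAL
    );
    INSERT INTO job_table VALUES ('PartA', 1), ('PartB', 2);
    INSERT INTO process_plan VALUES
        ('PartA', 1, 'Cut', 3.0), ('PartA', 2, 'Weld', 5.5),
        ('PartB', 1, 'Cut', 2.0);
""")

# Master-detail expansion: the plan rows that belong to one job row.
rows = db.execute(
    "SELECT step, station, minutes FROM process_plan "
    "WHERE part_type = ? ORDER BY step",
    ("PartA",),
).fetchall()
print(rows)  # [(1, 'Cut', 3.0), (2, 'Weld', 5.5)]
```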
Tables can be built and used entirely within Simio, but it is more common to
import the table data from an external source. Simio incorporates sophisticated table-
input mechanisms. In addition to CSV, Excel, and databases, Simio directly supports
reading data tables in the Business to Manufacturing Markup Language (B2MML).
Since B2MML is used to integrate business systems such as ERP and software such
as SAP [2] with manufacturing systems such as manufacturing execution systems
(MESs), it is a rich source of predefined information for use in simulations. Simio
can also generate tables directly from Wonderware™, a leading MES software. Tables
can be configured to import on demand, or tables with frequently changing data can be
configured to automatically import with each run. Simio not only has extensive
built-in support for data import (see Figure 2.13) but also provides the capability,
including sample source code, to customize data import with programming in any of
over 60 .NET languages.

Figure 2.13 Data table binding and import options

Simio tables are key to implementing two important modeling strategies that
are having a major impact on the simulation industry.
Data-driven modeling is a way of structuring a model so that much of the model
data resides in data tables rather than being scattered throughout the model, and so
that the model can be configured in data tables (or associated external
files). The combination of these features makes models easier for the modeler to
understand, maintain, and share with others, and makes it easier (i.e., “lowers the
bar”) for stakeholders to use and update the model without comprehensive knowledge
of simulation.
Data-generated modeling is a mechanism for building all or most of a model
entirely from external data. For example, a fairly complete model can be built directly
from data using the B2MML, ISA 95, or Wonderware™ import mechanisms men-
tioned above. Alternatively, the Simio application programming interface (API) can
be used to import model data from virtually any database, spreadsheet, or other data
source. Importing major parts of system configuration and descriptive information
can dramatically lower the time and expertise required to create a model-based solution
to a pressing problem.
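The idea of generating a model from external data can be sketched with two small tables, one listing objects and one listing links. The CSV layout, column names, and the dictionary-based model structure are assumptions for illustration; Simio's own import mechanisms and API are not shown.

```python
import csv
import io

# Stand-ins for external files; the column names are assumptions.
objects_csv = io.StringIO(
    "name,kind,capacity\n"
    "Source1,Source,\n"
    "Server1,Server,2\n"
    "Sink1,Sink,\n"
)
links_csv = io.StringIO("from,to\nSource1,Server1\nServer1,Sink1\n")

model = {"objects": {}, "links": []}
for row in csv.DictReader(objects_csv):
    capacity = int(row["capacity"]) if row["capacity"] else None
    model["objects"][row["name"]] = {"kind": row["kind"], "capacity": capacity}
for row in csv.DictReader(links_csv):
    # Reject links that point at objects the data never defined.
    if row["from"] not in model["objects"] or row["to"] not in model["objects"]:
        raise ValueError(f"unknown endpoint in link {row}")
    model["links"].append((row["from"], row["to"]))

print(model["objects"]["Server1"], model["links"])
```

Changing the server capacity or adding a station then means editing a data row, not the model-building logic.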

2.8 Experimentation with the model

When you build a model in the facility view, you can run it, view real-time 3D
animation, interact with the model, use sophisticated debugging techniques such as
step, break, watch, and trace, and view sample output results. But in stochastic models,
it is vitally important to run multiple replications for statistical analysis and validity.
Simio has a built-in experiment window to very efficiently and quickly run multiple
replications, configure controls (what changes) and responses (KPIs), configure and
compare multiple scenarios, and even run automatic optimization.
While the experiment window offers standard textual reports, most people prefer
the built-in Pivot Grid reports. Like the pivot tables featured in many top data analysis
packages, Pivot Grid reports allow you to filter, sort, and recategorize the data. This
allows you to generate concise, custom reports in just a few clicks; you can then save
those reports and reuse them anytime. The details of all scenarios are shown along
with statistical measures like mean, minimum, maximum, and half-width. Both the
summary and the detailed results can also be exported for additional analysis in
external programs.
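The summary measures named above can be computed from a set of replication results as follows. This sketch assumes the half-width is that of a 95% confidence interval on the mean based on the Student t distribution, which is the usual convention but is not stated in the text; `replicate` is a hypothetical stand-in for one model run.

```python
import math
import random
import statistics


def replicate(seed):
    """Stand-in for one model run; returns a single response value."""
    rng = random.Random(seed)
    return statistics.fmean(rng.expovariate(1.0) for _ in range(500))


responses = [replicate(seed) for seed in range(10)]
n = len(responses)
mean = statistics.fmean(responses)
t_crit = 2.262  # Student t, 97.5th percentile, n - 1 = 9 degrees of freedom
half_width = t_crit * statistics.stdev(responses) / math.sqrt(n)

print(f"mean={mean:.3f} min={min(responses):.3f} "
      f"max={max(responses):.3f} half-width={half_width:.3f}")
```

The half-width shrinks roughly with the square root of the number of replications, which is one reason running many replications quickly matters.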
Simio’s experiment window will automatically run any number of replications,
using all your available processors (up to 16 by default). If you have the common
configuration of a dual-threaded quad core processor, you can run eight replications
in about the same time it would take to run one. With higher versions of Simio, you
can also take full advantage of other computers in your workgroup, and you can even
extend the limit of 16 to make full use of a server farm or network of workgroup
computers. Another approach to running replications is to use the Simio Portal. This
Azure™-based software as a service offering lets you draw on the processing power
of the cloud to run massively parallel replications and instantly distribute
the results across the internet.
Running many replications quickly is most important when you want to compare
multiple scenarios. Simio allows you to define referenced properties in your model,
which are displayed as controls in your experiment. Controls describe how one sce-
nario differs from another, for example, number of workers or number of servers.
Simio also allows you to define Responses in the experiment. A response is like a
key performance indicator (KPI) that is a quick measure of the performance of each
scenario. Additional statistical information is also recorded on responses to support
the response results view. The response results view is an enhancement of the measure
of risk and error (MORE) analysis technique described by Nelson [3] that makes it
easier for people without a strong statistical background to gain important insight into
their data.
When you have many possible scenarios to evaluate, manually generating them
can be tedious. It is also important to minimize or completely avoid the execution
of poor scenarios. Simio is tightly integrated with OptQuest®, the leading
simulation-based optimization product. OptQuest uses metaheuristics to guide the
search algorithm to quickly find better solutions. OptQuest combines Tabu search,
scatter search, integer programming, and neural networks into a single composite
search algorithm that is orders of magnitude faster than other approaches [4].

2.9 Application programming interface

The Simio libraries are comprehensive, especially when supplemented with add-on
processes. Many users also choose to create custom objects, either by modifying
the open-source Simio libraries or using processes to graphically create entirely new
objects. But in rare instances, users desire even more customization.
Simio has an extensive API that allows customization of virtually all aspects of
Simio in any of over 60 .NET languages. If you have a programming
background, you can create new tools and customize the menus to display those tools
to your stakeholders. You can create new steps and elements, the fundamentals of
Simio processes. You can create design-time add-ins that support building models
from external data. You can build in new import/export capabilities to support a
unique or proprietary data source for importing individual items, entire data tables, or
even generating models directly from external data. With the API, you can even add
custom experimentation. For example, both OptQuest and the Select Best Scenario
tools were implemented as experiment add-ins.
As an example of highly customized menu items, a customer who was schedul-
ing weather-sensitive operations used the API to add a “Get Weather” menu item
which would log on to a weather subscription service and download regional weather
forecasts into a Simio data table that was directly accessed during planning. This
was combined with other application-specific items to create a custom ribbon using
customer terminology.
While Simio already includes a comprehensive set of scheduling rules (see
Section 2.10), you can customize these or create your own rules. On the analysis end,
although Simio already includes extensive support for experimentation and optimiza-
tion, the API supports creation of custom design of experiments and add-ins such as
a custom optimization algorithm.
In addition to providing comprehensive documentation of the API, Simio supplies
extensive sets of sample code for all the items mentioned above.

2.10 Applications in scheduling

Although simulation tools in the past have been primarily used in the design of complex
networks, Simio is specifically designed to also support applications for scheduling
these same systems. In scheduling applications, the focus is on simulating the actual
flow of entities through the network, given an initial starting state for the system. The
simulation of the entity flow then produces an operational schedule of the system, for
a given starting state and a given set of entities. The purpose of the simulation is to
forecast the operational performance of the network in an actual real-time setting.
An example of a complex network scheduling application is the scheduling of
computational tasks to be executed on a network of processors. This application
can be represented as a directed graph, where each node on the graph is a processor
that can perform one or more independent computational tasks. However, because
of data interdependencies, the tasks have precedence relationships that define the
permissible sequences for execution for these tasks. This is a complex scheduling
application; however, it is relatively simple to model the basic network system using
Simio (using routing sequences or task sequences).
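The precedence-constrained task scheduling problem described above can be sketched outside Simio as a greedy list scheduler. This is only an illustrative Python sketch, not Simio's routing or task sequence logic; the function name `schedule_tasks` and the rule "assign each ready task to the processor that can start it earliest" are assumptions made for the example.

```python
def schedule_tasks(durations, predecessors, num_processors):
    """Greedy list scheduling of precedence-constrained tasks.

    durations:    dict mapping task -> processing time
    predecessors: dict mapping task -> set of tasks that must finish first
    Returns a dict mapping task -> (processor index, start time, end time).
    """
    indegree = {t: len(predecessors.get(t, ())) for t in durations}
    successors = {t: [] for t in durations}
    for task, preds in predecessors.items():
        for p in preds:
            successors[p].append(task)

    ready = sorted(t for t, d in indegree.items() if d == 0)
    free_at = [0.0] * num_processors   # time at which each processor becomes idle
    plan = {}
    while ready:
        task = ready.pop(0)
        # A task cannot start before all of its predecessors have finished.
        data_ready = max((plan[p][2] for p in predecessors.get(task, ())), default=0.0)
        # Pick the processor on which this task can start earliest.
        proc = min(range(num_processors), key=lambda i: max(free_at[i], data_ready))
        start = max(free_at[proc], data_ready)
        end = start + durations[task]
        plan[task] = (proc, start, end)
        free_at[proc] = end
        for s in successors[task]:
            indegree[s] -= 1
            if indegree[s] == 0:
                ready.append(s)
    if len(plan) != len(durations):
        raise ValueError("precedence graph contains a cycle")
    return plan


# Hypothetical example: C needs A's output, D needs the outputs of B and C.
example_plan = schedule_tasks(
    durations={"A": 2, "B": 3, "C": 1, "D": 2},
    predecessors={"C": {"A"}, "D": {"B", "C"}},
    num_processors=2,
)
for name, (proc, start, end) in sorted(example_plan.items()):
    print(f"{name}: processor {proc}, start {start}, end {end}")
```

A simulation-based scheduler replaces the fixed durations and the simple assignment rule with the model's own logic, but the resulting schedule has the same shape: a start and end time for every task on a specific resource.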
Traditional simulation tools lack the necessary features to use them effectively
in a real-time scheduling environment. However, Simio has been designed from the
ground up to support these applications by incorporating key features to support
critical scheduling functions. These features include the ability to
1. Drive the model from relational data tables.
2. Initialize the model state—particularly with entities in process.
3. Incorporate complex decision logic—particularly involving dynamic rules that
must be executed each time a decision must be made.
4. Log detailed transactional data for creating reports, dashboards, and Gantt charts
that depict the planned schedule.
5. Evaluate schedule robustness and the associated risk.
In the following paragraphs, we describe each of these key features.
As discussed in Section 2.7, Simio has extensive capabilities in representing,
storing, and importing flat and relational data files. This is particularly useful in
scheduling applications. Resource data (e.g., machines, machine groupings, fix-
tures, manpower) must be mapped to model constructs. Dynamic data (e.g., order
information, material arrivals, man power schedules) must drive system arrivals.
Simio’s data-generated modeling features are designed to “reach into” existing data
repositories and configure or entirely build a simulation model from that data.
Up-to-date WIP information must be used to initialize the scheduling system
before each schedule is generated. This data is usually tracked in a MES or other
real-time data system. Simio data tables have user-configurable schemas and com-
prehensive importing capabilities that support flexible mapping to existing data. It
also features the capability to configure data import, so sensitive data is automatically
refreshed. Figure 2.14 illustrates a high-end deployment with ERP and MES integra-
tion. While typically such deployments include a human scheduler in the loop, some
deployments are done with the MES and Simio automatically detecting problems and
taking corrective actions.
Most planning and scheduling tools are highly limited in the complexity of the
system that can be represented [5]. As a general-purpose simulation language, Simio
provides the capability to model devices such as electronic switches, AGVs, cranes,
tanks, ovens, and conveyors that are often critical to facility operation. Most planning
and scheduling tools are also limited in the decision logic controlling device inter-
action. Figure 2.15 illustrates the dynamic ranking and selection rules that form a
part of Simio’s decision logic. Simio also includes a comprehensive set of standard
dispatching rules (Table 2.4) that can be used to provide both local and global system
optimization through routing and resource allocation. In addition to these built-in
rules, the Simio API allows creation of custom rules and optimization algorithms
unique to a facility.
While design-focused simulation is often concerned mainly with summary
statistics like cost and utilization, scheduling models need to record and display
transactional data. Logs of the start and end times of every significant transaction (like
a resource, entity, or material state change) must be maintained along with important
transactional details. These logs are then used to generate custom reports like
Resource Dispatch Reports and Workflow Constraints Analysis. Those same logs are
used to create interactive displays like the resource plan Gantt (Figure 2.16) and the
entity plan Gantt. These two forms of Gantt charts graphically display activity from
a resource or entity perspective. The Simio implementations provide extra tracking
options such as graphical material inventory, resource states, downtime, schedules,
constraints, and detailed background information on any Gantt item. Data logs are also
the basis for user-designed Dashboard Reports (Figure 2.17). In addition to extensive
tracking and diagnostics, using a drag and drop interface, the Gantt charts can be used
to interact with the plan by such actions as specifying overtime or downtime periods,
or selecting alternate process flows. While these Gantt and log-related features are
primarily designed for planning and scheduling applications, many users have found
them to also be extremely valuable in providing debugging and clear communication
for traditional design applications.

Figure 2.14 Example scheduling deployment integrated with ERP and MES
[Diagram with three areas (Master planning, Scheduling, Production floor) exchanging new order data, orders and WIP, production data, and production schedules; labeled components include Simio, a Simio portal schedule view, and a Wonderware operator screen]

Figure 2.15 Simio ranking and selection

A common problem with most scheduling systems is that the schedule must be
created deterministically. There is no good way to generate a schedule that accu-
rately predicts system downtime, material delays, extended processing times, and
other commonly encountered variability. One approach is to ignore such variability
entirely, which results in an optimistic schedule that becomes infeasible the first time
something goes wrong. Another approach is to build in extra processing time or idle
time to allow for when things go wrong. But unfortunately, this is time that is wasted
when things go well.

Table 2.4 Simio standard dispatching rules

Dispatching rule            Selection criteria

FirstInQueue                The entity ranked nearest the front of the queue
LargestPriorityValue        The entity with the largest priority state value
SmallestPriorityValue       The entity with the smallest priority state value
EarliestDueDate             The entity with the earliest due date
CriticalRatio               The entity with the smallest critical ratio. Critical ratio is
                            the time remaining until the entity’s due date divided by
                            the total operation time remaining
LeastSetupTime              The entity with the least setup time
LongestProcessingTime       The entity with the longest operation time
ShortestProcessingTime      The entity with the shortest operation time
LeastSlackTime              The entity with the least slack time. Slack time is the time
                            remaining until the entity’s due date minus the total
                            operation time remaining
LeastSlackTimePerOperation  The entity with the least average slack time per its
                            remaining operations
LeastWorkRemaining          The entity with the least total operation time remaining to
                            complete its assigned sequence
FewestOperationsRemaining   The entity with the fewest number of operations remaining
                            to complete its assigned sequence
LongestTimeWaiting          The entity that has been waiting longest in the queue
ShortestTimeWaiting         The entity that has been waiting the least time in the queue
LargestAttributeValue       The entity with the largest value of the specified expression
SmallestAttributeValue      The entity with the smallest value of the specified expression
CampaignSequenceUp          The entity that has a campaign value equal to or next largest
                            compared to the value of the last processed entity
CampaignSequenceDown        The entity that has a campaign value equal to or next
                            smallest compared to the value of the last processed entity
CampaignSequenceCycle       Alternates back and forth between a campaign sequence up
                            and a campaign sequence down

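The critical ratio and slack time definitions in Table 2.4 translate directly into arithmetic. The following Python sketch shows how a few of the listed rules could rank a queue. It is not Simio's implementation or API; the `Job` class, its field names, and the `select_next` helper are hypothetical and exist only for illustration.

```python
from dataclasses import dataclass


@dataclass
class Job:
    name: str
    due_date: float        # absolute time at which the job is due
    remaining_work: float  # total operation time remaining
    remaining_ops: int     # number of operations remaining


def critical_ratio(job, now):
    # Time remaining until the due date divided by the operation time remaining.
    return (job.due_date - now) / job.remaining_work


def slack_time(job, now):
    # Time remaining until the due date minus the operation time remaining.
    return (job.due_date - now) - job.remaining_work


# Each rule maps a job to a score; the job with the smallest score is selected.
RULES = {
    "EarliestDueDate": lambda job, now: job.due_date,
    "CriticalRatio": critical_ratio,
    "LeastSlackTime": slack_time,
    "LeastSlackTimePerOperation": lambda job, now: slack_time(job, now) / job.remaining_ops,
    "LeastWorkRemaining": lambda job, now: job.remaining_work,
}


def select_next(queue, rule, now):
    """Return the job that the named rule would dispatch next."""
    return min(queue, key=lambda job: RULES[rule](job, now))


queue = [
    Job("A", due_date=20, remaining_work=5, remaining_ops=1),
    Job("B", due_date=12, remaining_work=2, remaining_ops=2),
    Job("C", due_date=15, remaining_work=10, remaining_ops=5),
]
for rule in RULES:
    print(rule, "->", select_next(queue, rule, now=10).name)
```

At time 10, job C has only 5 time units left before its due date but 10 units of work remaining, so its critical ratio is 0.5 and its slack is negative. Both the critical ratio and slack rules therefore select C, while the earliest due date rule selects B.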
Simio initially creates a deterministic plan based on no variability, then it makes
additional stochastic replications that consider all the potential problems and cal-
culates the risk of key milestones or targets being missed. The colored markers on
each order in Figure 2.18 indicate the risk associated with each order. For example,
even though Order-01 and Order-02 have similar slack time, the lower likelihood
of Order-02 achieving its release date target might be due to utilizing an unreliable
machine or consuming materials that are often late. With the knowledge of this risk
before deploying the schedule, the scheduler can use Simio to objectively evaluate
the most cost-effective ways to reduce the risk. This in turn makes the schedule
more robust, i.e., it stays useful for a longer time. A related benefit is the ability to
quickly replan. When a major event (e.g., an equipment failure) invalidates the plan,
in Simio, the replan time is typically a few minutes versus the hours required in most
other approaches.
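The idea of a deterministic plan followed by stochastic replications can be illustrated with a small Monte Carlo sketch in Python. This is not how Simio computes its risk measures internally; the triangular delay model, the `(nominal, spread)` representation of an operation, and the function names are assumptions made for the example.

```python
import random


def deterministic_finish(operations):
    """Finish time of an order when every operation takes its nominal time."""
    return sum(nominal for nominal, _spread in operations)


def on_time_probability(operations, due, replications=1000, seed=0):
    """Estimate the likelihood that an order finishes by its due time.

    operations: list of (nominal_time, spread) pairs. In each replication an
    operation takes between nominal_time and nominal_time + spread, drawn from
    a triangular distribution whose mode is the nominal time (an assumption).
    """
    rng = random.Random(seed)
    met = 0
    for _ in range(replications):
        finish = 0.0
        for nominal, spread in operations:
            finish += rng.triangular(nominal, nominal + spread, nominal)
        if finish <= due:
            met += 1
    return met / replications


# Hypothetical order with three operations and a due time of 12.
order = [(3.0, 1.0), (4.0, 3.0), (2.0, 0.5)]
print("deterministic finish:", deterministic_finish(order))
print("estimated on-time probability:", on_time_probability(order, due=12.0))
```

In the deterministic plan this order finishes at time 9 and looks safe, but the replications expose how often delays push it past the due time, which is the kind of information the risk markers convey.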

Figure 2.16 Resource plan Gantt

Figure 2.17 Interactive custom dashboard

Figure 2.18 Entity Gantt with target risk analysis
[Gantt chart of Order-01 through Order-05 over Monday to Wednesday, showing operations such as Drill, Weld, Paint, Cut, and Shape, with a percentage marker on each order (e.g., 72% for Order-01 and 47% for Order-02)]

2.11 Summary
Library-based modeling has long provided a faster way to build models, but unless the
library was closely matched to your application, it was often necessary to make model
approximations which made your solutions less accurate. The flexibility promised
through OO technology has the potential for dramatic improvements but often has
problems scaling to large models, and the customization of objects and libraries still
required high programming expertise.
Simio was invented with two primary goals in mind. The first was to bring new
technology to the OO simulation field to allow users to more effectively build objects,
libraries, and models without programming. The second goal was to extend the field
of discrete-event simulation beyond the traditional system design applications into
planning and scheduling. Rather than “bolting on” features as needed, Simio was
designed from the ground up to incorporate all the features needed to solve problems
in design, planning, and scheduling, in a single tool using a single model.
The creation of Simio with its data-driven and data-generated modeling features
was timely—just as the concepts of the smart factory promise a new way of operating
our production systems. The smart factory [5], also referred to as the fourth industrial
revolution or Industry 4.0 (Figure 2.19), represents the concept of physical systems
where the components are monitored and connected to a virtual system model to
predict and improve system performance. The virtual factory model provided by
Simio is a key component of the smart factory of the future.

Figure 2.19 Simio is a key technology to implement smart factory [5]
[Timeline: Industry 1.0 (mechanization, steam power, weaving loom); Industry 2.0 (mass production, assembly line, electrical energy); Industry 3.0 (automation, computers, and electronics); Industry 4.0 (cyber physical systems, internet of things, networks)]

Glossary
B2MML: Business to Manufacturing Markup Language as defined in [6] is a set of
XML schemas implementing the ISA-95, Enterprise-Control System Integration
family of standards, known internationally as IEC/ISO 62264.
Dispatching rule: An algorithm for deciding which job to process next in a production
facility, such as which job has the earliest due date or which requires a minimum
changeover.
Enterprise resource planning (ERP): Enhancements of the original material require-
ments planning (MRP) functions to bring together accounting, human resources,
and other functions into a fully integrated IT system. ERP also incorporated sup-
ply chain management (SCM) to extend inventory control over a broader scope,
including distribution.
Entity: Part of an object model that can have its own intelligent behavior. Entities can
make decisions, reject requests, decide to take a rest, etc. Entities have object
definitions just like the other objects in the model. Entity objects can be dynam-
ically created and destroyed, moved across a network of links and nodes, move
through 3D space, and move into and out of fixed objects. Examples of entity
objects include customers, parts, or workpieces.
Event: A notification that can be given by one object and responded to by several. It
alerts other objects that an action has occurred.
Experiment: Part of the project that is used for output analysis. The user defines one
or more sets of inputs/outputs (scenarios) and runs multiple replications to get
statistically valid results from which to draw conclusions.
Finite capacity scheduling (FCS): A scheduling approach that accounts for the limited
production capacity of the system. This contrasts with the enterprise resource
planning system that typically assumes an infinite capacity.
Gantt chart: A chart used in scheduling applications for showing activities over a
timeline. A resource Gantt and an entity Gantt show the same information, but
from two different perspectives.
Manufacturing execution system (MES): A computerized system used to track and
document the transformation of raw materials into finished goods, including the
status of resources and the flow of work.
Model: A representation of a real-world object or collection of objects that interact
with each other. Models are usually used to make decisions and are defined by
their properties, states, events, external view, and logic. A model is an object that
is executable.
Optimization: The process of defining and evaluating experiment scenarios to deter-
mine the overall “best” scenario. OptQuest is a tool that is highly integrated with
Simio that will automatically optimize against a single objective, multi-objective
weighted, or pattern frontier.
Planning: The process of creating a high-level production plan that identifies the work
that needs to be done, the materials that are required to perform that work, and
where the work will take place.
Risk-based planning and scheduling (RPS): The use of a custom-built stochastic
simulation model for both system design and scheduling. RPS incorporates risk
measures for assessing the robustness of a production schedule.
Scheduling: The process of turning a master production plan into a detailed, action-
able schedule that can be followed to produce required items while meeting key
objectives. Scheduling requires a detailed model of all the critical constraints in
a system, and a good schedule is dependent upon good planning.
SimBit: A brief, well-documented example of solving a specific modeling issue.
Basic and advanced SimBit search engines are provided via the Support Ribbon,
and descriptive information is in Help.
Smart factory (Industry 4.0): A fully connected and automated production system,
based on digital part data, interconnected devices, and a virtual factory model to
plan and project the future of products and production facilities. The Industry 4.0
initiative is focused on creating the smart factory of the future.
Table: A table is a set of rows and columns to hold data during a run. It may be
relational and may contain special columns like times or destinations in addition
to general model data. Tables may be imported and exported. An object may be
associated with a specific table and row.
Transporter: A transporter object is a special type of entity that can pick up entity
objects at a location, carry those entities through a network of links or free space,
and then drop the entities off at a destination.

References
[1] Student Models, Student Simulation Competition [online]. Available from
https://www.simio.com/academics/student-competition.php. Accessed Nov 2018.
[2] Junot Systems, Inc. Advanced SAP MES Integration [online]. Available from
http://mes-to-sap.com/. Accessed Nov 2018.
[3] Nelson, B. L. 2008. “The MORE Plot: Displaying Measures of Risk & Error
From Simulation Output.” In Proceedings of the 2008 Winter Simulation Conference,
edited by S. J. Mason, R. R. Hill, L. Mönch, O. Rose, T. Jefferson,
J. W. Fowler, Piscataway, New Jersey: Institute of Electrical and Electronics
Engineers, Inc.
[4] OptTek Systems, OptQuest [online]. Available from
http://www.opttek.com/products/optquest/. Accessed Nov 2018.
[5] Pegden, C. D. Deliver on Your Promise, How Simulation-Based Scheduling Will
Change Your Business. Pittsburgh, Simio LLC, 2017.
[6] MESA International, Business to Manufacturing Markup Language (B2MML)
[online]. Available from http://www.mesa.org/en/B2MML.asp. Accessed Nov 2018.
Chapter 3
A simulation environment for cybersecurity
attack analysis based on network traffic logs
Salva Daneshgadeh1, Mehmet Uğur Öney2,
Thomas Kemmerich3, and Nazife Baykal1

The continued and rapid progress of network technology has revolutionized all modern
critical infrastructures and business models. Today's technologies rely heavily on
network and communication facilities, which in turn makes them dependent on network
security. Network-security investments do not always guarantee the security of
organizations, and evaluating security solutions requires designing, testing
and developing sophisticated security tools which are often very expensive. Simulation
and virtualization techniques empower researchers to try out experimental
scenarios of network security in a more cost- and time-effective manner before deciding
on the final security solution. This study presents a detailed guideline to model
and develop a combined virtualized and simulated environment for computer networks
in which to practice different network attack scenarios. The primary objective of this
study is to create a test bed for network anomaly detection research. The datasets
required for anomaly or attack detection studies can be prepared using the proposed
environment. We used the open-source GNS3 emulation tool, Docker containers,
the pfSense firewall, the NTOPNG network traffic-monitoring tool, the BoNeSi DDoS
botnet simulator, the Ostinato network workload generation tool and a MySQL database
to collect simulated network traffic data. With minor changes, this simulation
environment can also be used in a variety of cybersecurity studies such as vulnerability
analysis, attack detection, penetration testing and monitoring.

1 Department of Information Systems, Informatics Institute, Middle East Technical University, Turkey
2 Department of Computer Engineering, Atılım University, Turkey
3 Department of Information Security and Communication Technology, Norwegian University of Science
and Technology, Norway

3.1 Introduction

A computer network comprises the connected devices at the edge of the network
that are used in personal and professional life, such as PCs, tablets, iPads and
smartphones. It also encompasses the network core, which performs functions such as
switching and routing. Additionally, computer networks offer dedicated network services to
the connected devices as well as to applications. These network services can be DHCP,
NTP and security services like encryption or packet filtering [1]. Recently, network
researchers have also started to use simulation tools to test and evaluate communi-
cation protocols and network behavior. Every network simulation requires network
topologies, traffic models between senders and receivers, background noises intro-
duced by other devices and possible dynamic events. Moreover, researchers require
virtualization tools to understand, verify and analyze the complicated behavior in
network simulation [2].

3.1.1 Network simulation

In the age of information and telecommunication technologies, our personal
and professional lives depend heavily on Internet-based technologies. Secure,
innovative network topologies and networking protocols can be
seen as a key to the success of business operations. Therefore, there is a growing
trend toward designing, developing and managing novel, high-performance
and security-enabled computer networks. However, designing and validating each
new network technology requires huge investments. Consequently, network designers
and researchers rely on network simulation. Generally, there are two approaches to
network simulation: analytical modeling and computer simulation. Analytical modeling
uses mathematical analysis to characterize a network, but the resulting model is usually
too simple to capture the dynamic features of the network. Computer simulation
models the behavior of real events in a real-life scenario as it evolves over time [3].
Simulation tools are also able to model the interaction between different network entities
(e.g., routers, switches, links) and their related events such as link changes, route
changes, link failures and link overloading [4]. Network simulation empowers network
designers to investigate different design options before settling
on a final network design.
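The event-driven view of computer simulation described above can be sketched as a minimal discrete-event loop in Python. This is a toy illustration and not the design of any particular simulator; the event names, the single shared link, and the fixed `repair_delay` parameter are assumptions made for the example.

```python
import heapq


def run_events(initial_events, until, repair_delay=5.0):
    """Process (time, name) events in time order up to the time `until`.

    Returns a log of (time, outcome) tuples. A link failure schedules its own
    repair event, and packets arriving while the link is down are dropped.
    """
    queue = list(initial_events)
    heapq.heapify(queue)          # priority queue ordered by event time
    log = []
    link_up = True
    while queue and queue[0][0] <= until:
        time, name = heapq.heappop(queue)
        if name == "link_failure":
            link_up = False
            heapq.heappush(queue, (time + repair_delay, "link_repair"))
        elif name == "link_repair":
            link_up = True
        elif name == "packet_arrival":
            name = "packet_forwarded" if link_up else "packet_dropped"
        log.append((time, name))
    return log


events = [
    (1.0, "packet_arrival"),
    (2.0, "link_failure"),
    (3.0, "packet_arrival"),
    (8.0, "packet_arrival"),
]
for time, outcome in run_events(events, until=10.0):
    print(time, outcome)
```

The simulation clock jumps from one event to the next rather than advancing in fixed steps, which is what lets such tools model link failures and route changes efficiently.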
Evaluating network performance is one of the primary reasons network
designers simulate a network topology before building it. Other motivating
factors of network simulation include failure analysis, network design and network
resource planning [5]. Network simulation not only saves a lot of time and money
but also increases efficiency by allowing multiple network behaviors to be tested
in a controlled and reproducible manner.
Network simulation began in the early 1990s with the advent of the
Network Simulation Testbed (NEST) tool. NEST can be seen as the backbone of
many modern simulation tools [6]. It had a graphical environment for simulation and
rapid prototyping of distributed networked systems and protocols. In the 1990s, it was
used by designers of distributed networked systems to measure the performance of
those systems under different conditions, such as the failure of links or switches. The
NEST simulation tool was composed of a network server and monitors acting as clients.
Its client/server architecture allowed multiple remote users to access a shared test bed.
NEST was based on standard UNIX, and its server and clients were UNIX libraries
and functions. Therefore, users could easily modify its functions based on their own
needs [7].
The REAL (realistic and large) network simulation tool was based on a modified
version of the NEST 2.5 simulation test bed. The initial motivation for developing it was
to compare the “fair queuing” gateway algorithm with first-come-first-served scheduling
and with competing proposals from Digital Equipment Corporation. REAL was
composed of two parts: a simulated server and a display client, connected through a
Berkeley UNIX socket. It supported packet-switched, store-and-forward
networks similar to the existing Xerox corporate net and the DARPA
Internet. REAL was able to model many details of the flow in the network and
transport layers [8].
In general, each network simulation or emulation study requires a simulation
scenario which defines the input configuration. According to Bajaj et al. [9], each
simulation scenario is usually made up of four components:
1. Network topology: which defines the physical interconnects between nodes and
the static characteristics of links and nodes.
2. Traffic model: which defines the network usage patterns and locations of unicast
and multicast senders.
3. Test generation: which creates events such as flooding traffic toward a specific
node.
4. Network dynamics: such as node and link failures.
Additionally, NS2, NS3, OMNeT++, SSFNet, J-Sim, OPNET and QualNet are
some other examples of well-known network simulation tools [10]. According
to Wehrle et al. [11], simulation tools have to model the following network
elements:
● Network nodes: which represent end nodes such as PCs, laptops, servers, tablets
and network devices such as routers, hubs and switches.
● Network devices: which represent the physical devices that connect nodes to
communication channels, such as an Ethernet network interface card, a wireless
IEEE 802.11 device, etc.
● Communication channels: which represent the medium for sharing information
among network devices, such as fiber-optic point-to-point links, shared broadcast
media, wireless spectrum, etc.
● Communication protocols: which model the implementation of standardized and
experimental network protocols such as User Datagram Protocol (UDP), Domain
Name System (DNS), etc.
● Protocol headers: which illustrate the special data related to the specific protocol
in the network packets.
● Network packets: which are the main parts of the information exchange in
computer networks. Network packets consist of protocol header and payload
data.
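The relationship between a protocol header and a packet's payload in the list above can be made concrete with UDP, whose header consists of four 16-bit fields (source port, destination port, length, checksum). The Python sketch below builds and parses such a datagram with the standard `struct` module. The helper names are hypothetical, and the checksum is left at zero rather than computed, which is a simplification made for the example.

```python
import struct

# UDP header: source port, destination port, length, checksum.
# "!" selects network (big-endian) byte order; "H" is an unsigned 16-bit field.
UDP_HEADER = struct.Struct("!HHHH")


def build_udp_datagram(src_port, dst_port, payload):
    """Return header + payload; the length field covers both parts."""
    length = UDP_HEADER.size + len(payload)
    header = UDP_HEADER.pack(src_port, dst_port, length, 0)  # checksum not computed
    return header + payload


def parse_udp_datagram(datagram):
    """Split a datagram back into its header fields and payload."""
    src_port, dst_port, length, checksum = UDP_HEADER.unpack_from(datagram)
    return {
        "src_port": src_port,
        "dst_port": dst_port,
        "length": length,
        "checksum": checksum,
        "payload": datagram[UDP_HEADER.size:length],
    }


datagram = build_udp_datagram(5353, 53, b"query")
print(len(datagram), parse_udp_datagram(datagram))
```

A simulator that models packets at this level of detail can feed the same bytes to real protocol implementations, whereas a more abstract simulator might only track the packet's size and destination.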
Wehrle et al. [11] also emphasize the importance of realism over
abstraction in network simulation, as a high level of abstraction might cause
simulation results to diverge substantially from experimental results.

3.1.2 Network emulation

Network emulation is the integration of simulated networks with real end-systems
such as computers, routers, switches, etc. The connections between the real and
simulated environments are made seamlessly, so that they behave like connections
among real network objects.

3.1.3 The application of network simulation and emulation in network security

Conducting real experiments such as attack scenarios in an operational network
environment is highly risky. Therefore, simulation environments have become more popular
in network security research. Network anomaly and intrusion detection is one of the
interesting research subjects in the area of network security. Most of the existing
studies in the field of network anomaly detection validate their methods using simulated
datasets, because practicing attack scenarios on real, live networks may cause a
network crash [12].

3.1.4 Virtualization
Computer virtualization techniques were first developed in the 1960s by IBM [13].
Virtualization techniques enable users to divide a physical computer into multiple isolated
environments called virtual machines or guest machines. Virtual machines can also
be seen as an emulation of physical machines, and they offer another way
to model networks. There are two types of virtualization: virtual
machines which are powered by hypervisors and container-based virtualization
(e.g., Docker).

3.1.5 Virtualization using hypervisor

All virtual machines running on a physical computer (host) share its physical
resources such as the memory, disk and network devices by means of software
called a hypervisor or control program. VMware, KVM, Xen, VirtualBox and Hyper-V
are the most well-known hypervisors. The hardware resources are allocated to the
virtual machines on request. Resource sharing and isolation are two prominent
advantages of virtual machines. Virtualization empowers system administrators to create
a simplified abstract view of the system as a working space for a software application.
Additionally, most cybersecurity phenomena cannot be studied experimentally
due to the potential risk of collapsing or infecting the experimental environment, high
cost and legal issues. Therefore, virtual machines are essential for cybersecurity
studies. Virtual machines empower researchers to develop intrusion prevention
and intrusion detection systems (IDSs) by safely testing suspicious activities in a
virtual environment.

Figure 3.1 Architecture of Docker and hypervisor-based virtual machines
[Three stacks: applications with their bins/libs on guest OSs over a hypervisor and host OS; applications with their bins/libs over the Docker engine and host OS; and applications with their bins/libs inside a guest Linux over a hypervisor and host OS]

3.1.6 Virtualization using container

Linux containerization is a virtualization method which enables multiple
processes to run in multiple isolated environments using only a
single kernel. It is an operating system-level virtualization approach which takes
advantage of Linux cgroups1 and namespaces2 to allow different containers to run on
a single host [14]. Docker provides very lightweight virtualization, because it does not
require hardware-level virtualization. It wraps software and its dependencies such as shared
applications and services into a standardized container [15]. In other words, Docker
provides a transparent and independent workspace for each application running on
it by dividing operating system resources [16]. Figure 3.1 shows the difference between
traditional virtual machines and Docker.
The advantages of using a Docker image rather than a traditional virtual machine
image are agility, portability and controllability of the application environments. It
enables users to easily maintain customized execution environments in the form of
lightweight Docker images instead of bulky virtual machine images. This paves the
way for the rise of the microservices architectural pattern.

3.1.7 Virtual machines and simulation

Virtual machines are suitable for modeling mid-scale networks, but representing
huge networks with thousands of network objects is not practical using virtual
machines [17]. In this chapter, we took advantage of both simulation and virtualization
techniques to create a safe virtual lab for imitating different attack scenarios and
collecting traffic logs. These network traffic logs can be used to develop novel detection and
defense methods in the field of network security.
The remainder of the chapter is organized as follows: Section 3.2 describes the
background of network simulation and virtualization in the field of anomaly and
attack detection; Section 3.3 provides the methodology used; Section 3.4 describes
the network topology of our test bed; Section 3.5 provides detailed information about
the way that different objects of the network topology were configured; Section 3.6
presents discussion and results; and finally, Section 3.7 summarizes the study and
presents a road map for the future work.

1 A control group (cgroup) is a Linux kernel feature which provides an isolated workspace with limited resources, called a container.
2 Namespaces are a Linux feature that prevents observation of resources used by different groups.

3.2 Literature review

This section presents a short overview of network anomalies and their detection
techniques, network workload generators and some simulation practices in the field of
network security.

3.2.1 Network anomalies and detection methods

In a large dataset, nonconforming patterns are variously called anomalies, outliers,
exceptions, aberrations, surprises, peculiarities or discordant observations in different
domains. However, outliers and anomalies are the most frequently used terms in the
context of network intrusion detection [18]. Network events which are far from normal
or expected behavior are suspicious from a security perspective.
Not all anomalies necessarily reflect malicious activity in the network. Anomalies
can also be observed when the meaning or scope of normality changes. Therefore, the
number of anomalies is always equal to or higher than the number of malicious points in
a given dataset. On the other hand, anomalies which result from unauthorized
attempts to access information, unauthorized manipulation of information or attempts to
make a system unreliable or unusable are categorized as malicious activities. Numerous
studies in the literature investigate network anomalies using different
methods, including statistical methods, classification, clustering and information theory.
References [19–22] are examples of the many survey studies which present the state of
the art in the field. All network anomaly detection research requires datasets to validate
the detection methods. Nevertheless, only a few real datasets are
publicly available, such as the FIFA World Cup Dataset 1998, DARPA Intrusion Detection
Data Sets 1998, KDD Cup Dataset 1999, UCLA Dataset 2001, CAIDA DDoS
Attack Dataset 2007 and TUIDS DDoS Dataset 2012. The KDDCUP’99 dataset is
the most widely used benchmark dataset in network anomaly and attack detection
studies. On the other hand, some researchers have criticized it for its inherent problems,
as follows [23]:

● Both the background and the attack data were synthesized for privacy reasons.
● The false alarm characteristics of the data were neglected; therefore, it is difficult to claim
that the available dataset is similar to observed data.
● The workload of the synthesized data does not seem to be similar to the traffic in
real networks.
● The TCPdump data collector was most probably overwhelmed during
heavy traffic load and dropped packets.
● There is no exact definition of the attacks in some cases, such as probing or buffer
overflow.
A simulation environment for cybersecurity attack analysis 61

Gogoi et al. [24] emphasize the nature of the input data as the key aspect of
any anomaly detection system. Input data is defined as a collection of data with
attributes of the same or different types, such as binary, categorical or continuous. As the
nature of the attributes determines the applicability of an anomaly detection technique,
it is important to employ a dataset with the desired attributes. It is unlikely that
any publicly available real dataset perfectly matches the attribute requirements of
all anomaly detection studies. In a nutshell, a combination of real data and a realistic
synthetic dataset which represents the real environment can be seen as a sound
choice for validating novel anomaly detection engines in the rapidly growing computer
and information technology area.
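
To make the idea of a statistical detection method concrete, the following minimal sketch flags values that lie far from the mean of a series of per-second packet counts. It is only an illustration under stated assumptions: the function name, the sample counts and the threshold of 2.5 are hypothetical and are not taken from any of the cited studies.

```python
from statistics import mean, stdev


def zscore_anomalies(values, threshold=2.5):
    """Return the indices of values whose absolute z-score exceeds threshold.

    The z-score is (value - mean) / sample standard deviation. With only a
    handful of samples the largest possible z-score is bounded, which is why
    this sketch uses 2.5 rather than the more common 3.0.
    """
    if len(values) < 2:
        return []
    mu = mean(values)
    sigma = stdev(values)
    if sigma == 0:
        return []  # all values identical: nothing stands out
    return [i for i, v in enumerate(values) if abs(v - mu) / sigma > threshold]


# Hypothetical packets-per-second samples with one obvious burst.
packets_per_second = [210, 198, 205, 190, 202, 207, 195, 3000, 201, 199]
anomalous = zscore_anomalies(packets_per_second)
print(anomalous)
```

As the text notes, a flagged point is only an anomaly, not necessarily a malicious event; deciding which anomalies are attacks requires labeled data such as the datasets discussed above.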

3.2.2 Network workload generators

There are only a few studies in the literature on realistic network workload
generation, in contrast to the large number of studies on the characterization, modeling
and simulation of computer networks. In general, a synthetic network workload
generator should be able to capture the complexity of real workloads in
different scenarios, modify the properties of the workload based on the specific demands of
the scenario and, finally, measure performance indicators for the workload at the network
level [25]. OSTINATO [26], SEAGULL [27], Tmix [28], RUDE/CRUDE [29],
MGEN [30], KUTE [31] and BRUTE [32] are some examples of network workload
generation tools. Network workload generation approaches can be classified into
two groups, as follows:
● Trace-based generation: In this approach, the content and the timings of traffic
traces are mimicked based on previously collected data in the real scenarios.
● Analytical model-based generation: In this approach, flows and packets are
generated based on statistical models.
A comprehensive network workload generation tool should employ both
approaches depending on the characteristics of the various scenarios [25].
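
The two approaches can be contrasted with a small sketch. It assumes, purely for illustration, that a trace is a list of packet timestamps in seconds and that the analytical model is a Poisson process (exponentially distributed inter-arrival times); the function names are hypothetical and do not belong to any of the tools listed above.

```python
import random


def trace_based_arrivals(trace_timestamps):
    """Trace-based generation: replay the recorded inter-arrival gaps.

    The output starts at time 0 and reproduces the timing of the trace.
    """
    if not trace_timestamps:
        return []
    start = trace_timestamps[0]
    return [t - start for t in trace_timestamps]


def model_based_arrivals(rate_pps, count, seed=None):
    """Analytical model-based generation: Poisson arrivals.

    Inter-arrival times are drawn from an exponential distribution whose
    mean is 1 / rate_pps seconds.
    """
    rng = random.Random(seed)
    now, arrivals = 0.0, []
    for _ in range(count):
        now += rng.expovariate(rate_pps)
        arrivals.append(now)
    return arrivals


replayed = trace_based_arrivals([10.0, 10.5, 12.0])
modeled = model_based_arrivals(rate_pps=200, count=5, seed=1)
print(replayed)
print(modeled)
```

The replayed schedule is only as varied as the recorded trace, whereas the modeled schedule can be regenerated at any rate and length, which is one reason a tool may want both.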

3.2.3 Network simulation for security studies

In recent years, simulation and virtualization have gained popularity in network security
research as well. They enable researchers to run vulnerability-related programs
against IT systems and IT applications and then develop solutions to detect and mitigate
vulnerabilities. Network anomaly detection is one of the well-known research
focuses in the area of network security. Enormous research effort has been spent on
anomaly-based network intrusion detection using mathematical, machine learning,
artificial neural network, fuzzy set, knowledge-based and combination learning techniques
[33]. In general, all these techniques have one thing in common:
they need network traffic datasets to validate their anomaly detection approaches.
Real and simulated datasets are the two major types of datasets used in
network anomaly detection studies, and each has advantages and
disadvantages. Most of the real datasets in the field are outdated and anonymized
because of privacy concerns. Furthermore, no existing dataset can meet all the
requirements of researchers for the various attack detection methods [34]. Therefore,
researchers struggle to validate their approaches using real datasets in experiments or
simulations. On the other hand, experiments on real systems are very complicated,
time consuming and expensive. As a result, researchers have begun to prepare their
own datasets in simulation environments because of the controllability, reproducibility and
scalability of simulated data [35]. In this section, we present some recent examples
of studies which have used simulation tools to investigate network
anomalies.
Kuhl et al. [36] used the ARENA simulation software to model a network setup,
model and simulate cyberattacks and generate IDS data. Their
simulation environment consisted of three main components: machines, connectors
and subnets. Machines were individual client computers or servers which were
employed as attacker or target machines. Target machines were equipped with IDS
sensors to detect cyberattacks and raise alarms. In this study, attack scenarios were
defined in 5 major groups and 23 subgroups. The simulated environment enabled
users to choose different types of attacks to occur over a period of time along
with a specified quantity of network noise. The main goal of this study was to generate
automated attacks and produce IDS alert files which included alerts from both attack
actions and noise. These alert files were then used to test and evaluate
cybersecurity systems.
Elejla et al. [37] proposed a flow-based IDS for detecting ICMPv6-based DDoS
attacks. Since no existing dataset met their criteria for detecting
ICMPv6-based DDoS attacks, the researchers created their own dataset in a virtual
environment using the GNS3 emulation tool. GNS3 allowed them to collect normal
traffic from the real-life network of a university and generate attack data in a simulated
environment. Their simulated environment consisted of one victim and two attacker
machines: a Kali OS machine with the THC toolkit and an Ubuntu OS machine with the SI6 attacking
tool. In this study, different ICMPv6 DDoS attack scenarios were performed, and the
related network traffic was captured using the Wireshark³ tool in PCAP file format.
Balyk et al. [38] used the GNS3 tool to simulate DDoS attacks. The experimental
network topology consisted of three virtual PCs of regular users, one Fedora Core 22
64-bit Linux system running the Apache 2.4.12 web server and one attacker host. The
network was built from Cisco Ethernet switches and Cisco routers. For the DDoS attack
simulation, a simple Perl script was used to create multiple parallel connections to
destination port 80 of the web server. Wireshark was used to capture the traffic flows
on the switch closest to the web server. This study simulated only a single DDoS
attack scenario; its focus was on web server parameter
settings and defense module settings in GNS3 simulations.
Al Kaabi et al. [39] developed a virtual lab called DoS_VLab to allow students
to practice five types of DDoS attacks, namely ARP cache poisoning, switch CAM
table corruption, TCP SYN flood, Land and ARP storm attacks, in a secure academic
environment. The lab was based on virtualization and GNS3 network simulation for

3
https://www.wireshark.org/.

building virtual networks. It consisted of two Windows XP virtual machines (VMs):
one acted as the attacker host and the other as the victim host. The study only
mentioned that appropriate tools were installed on the attacker machine and that different
types of DDoS attacks were performed; the names of the tools were not explicitly
indicated.
To develop a traffic analyzer for detecting DDoS attacks, Ojeniyi et al. [40]
proposed a simulated environment using the GNS3 tool. The network topology consisted
of an attacker and a victim machine, both running Kali Linux. To simulate
the DDoS attack, the Hping3 tool built into Kali Linux was used on the attacker’s machine
to launch a DDoS attack against the victim.
To propose a method for discriminating DDoS attacks from flash events (FE),
Behal and Kumar [41] developed a test bed to generate datasets of low-rate and high-rate
DDoS attacks and of FE. Their test bed was a combination of real and emulated
systems. It consisted of 75 physical nodes running Ubuntu and Windows
OS, Ethernet switches, routers and a Linux server. The Core emulator tool was used
to increase the number of identical virtual nodes. The httperf and D-ITG traffic
generator tools were used to generate legitimate HTTP traffic, and the BoNeSi botnet
simulator tool was used to generate DDoS attack traffic.
Zhao et al. [42] evaluated different methodologies for detecting network anomalies.
To prepare datasets for their experiments, they defined the topology of the
simulation network in NS2 format and then used the Malicious traffic Composition
Environment (MACE) tool⁴ as the malicious traffic generator and the LTProf tool⁵
as the legitimate traffic generator. Moreover, the Scalable URL Reference Generator
(SURGE) tool⁶ was used to produce extra workload on the web servers to test the stress
tolerance of the network. NetFlow records were generated for all network traffic passing in
both directions of the link (in the form of compressed nfdump files) and passed to the
anomaly detection engine for further analysis.
Sieklik et al. [34] developed a simulated environment to investigate the amplification
DoS attack based on the Trivial File Transfer Protocol (TFTP). Their network
topology consisted of three routers, a computer and two servers. The GNS3 simulation
software was used to simulate the network topology, and the VMware virtualization
software was used to implement more flexible simulation systems. The three
virtual machines (attacker, target and amplifier) were connected to the router running
in GNS3. The attacker server ran Kali Linux 5 R3 OS with various penetration
testing tools. Both the amplifier and the target ran Windows XP SP3. The attacker
computer ran several TFTP services, and the target machine ran the default Windows XP
UDP services, although any other UDP service could be employed. They investigated
a realistic amplification scenario by creating spoofed packets using Scapy⁷; the
attacking machine then used a loop command to send these crafted packets to
the amplifying server.

4
MACE is a toolkit to generate a diverse set of attacks [43].
5
LTProf collects legitimate traffic samples from public traces [44].
6
SURGE is a web workload generation tool which mimics a set of real users accessing a server [45].
7
It is a special network analysis tool written in Python to create network packets [46].

3.3 Methodology
The most challenging aspect of simulation-based anomaly detection research is
proving the reliability and dependability of simulated datasets in comparison to
real-life datasets. On the other hand, a well-designed simulation environment offers
repeatability, programmability and extensibility of the validation instrument [12].
The main purpose of this chapter is to introduce a flexible and reliable simulation
environment built with VMware virtualization software. To realize the simulated
environment, some software and hardware were required, such as VMware Workstation,
the GNS3 software and an Ubuntu Docker image. We also used the open-source pfSense
firewall, NTOPNG and MySQL to apply network rules, collect network flow data and
store network flow data, respectively. Moreover, we utilized a botnet simulator and
network traffic generator tools to create DDoS attack and normal traffic data samples.
We used the GNS3 simulator to develop our experimental environment; as mentioned in
[38], the results of the GNS3 simulation tool matched the results obtained from a Cisco
network. Additionally, GNS3 is a well-tested and established network simulation tool,
which is also used by many companies such as Exxon, Walmart, AT&T and NASA [38].

3.4 Defining a simulated and virtualized test bed for network anomaly detection research

We have implemented a virtual lab named Cyber Security Simulated Lab (CSSL)
in order to create an isolated platform to simulate, test and analyze different types
of security threats. Our infrastructure was built with the VMware virtualization
software on one physical machine. To connect the virtual machine to the
network, we mapped the external Internet connection of our host machine to the internal
VM network. The CSSL allows us to configure different network topologies for
simulating different attack scenarios. Our virtual test bed is an isolated environment
mainly used to fabricate and collect simulated DDoS attack data. As network technologies
are growing rapidly, we primarily employed open platforms so that different
efforts and packages can be included whenever there is a need [10]. Moreover, using
open-source tools and applications facilitates the repeatability of the study. We initially
defined the network topology shown in Figure 3.2 using GNS3. The attacker and target
machines are Ubuntu Docker appliances for GNS3, and pfSense is also a
GNS3 appliance. VMnet8 is our exit point to the Internet. We disabled all incoming and
outgoing traffic to/from VMnet8 using firewall rules during the experimental phase
for security reasons. (For more information, refer to Section 3.5.3.)

3.4.1 GNS3
GNS3 is a graphical network emulation tool which can simulate/emulate
entire networks and many network devices such as links, switches, routers and
firewalls. As can be seen in Figure 3.3, GNS3 has a similar architecture to
Linux computers, based on internal interfaces (network to device driver) and application
interfaces (sockets) [11]. All communication in the GNS3 tool takes place over
HTTP using JSON; therefore, HTTP basic authentication can be used to secure
access to the application programming interfaces [47]. Additionally, GNS3 enables
packet filtering and raw-packet capturing in the network through its direct interface to
the Wireshark application [48]. We installed both the GNS3 Windows application and the GNS3
virtual machine image [49]. As can be seen in Figure 3.4, we connected them to
each other by setting the remote main server address of the GNS3 Windows application
to the IP address of the GNS3 virtual machine.

Figure 3.2 Network topology of the test bed (labels: Simulated outside, Real outside, Simulated inside, Attacker zone, victim zone; networks: VMnet1, VMnet2, VMnet8; nodes: Attacker, PfSense_Firewall, Switch, Switch (MirroringFunction), Service_Machine, Victim, NTOPNG, Data_Repository)

Figure 3.3 GNS3 architecture (components: GNS3 GUI, GNS3 WEB, Controller, Computer server 1, Computer server 2, QEMU, IOU, Dynamips; QEMU is an open-source platform for hardware virtualization, IOU is IOL or IOS on Linux, and Dynamips is an emulator program for Cisco routers)

Figure 3.4 Connecting GNS3 application to the remote server
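
Because the GNS3 server is driven over HTTP with JSON and can be protected with basic authentication, a client request can be sketched with the Python standard library alone. The sketch below only builds the request object and does not send it. The port 3080, the `/v2/version` path and the credentials are assumptions for illustration (they are not stated in this chapter), and `build_gns3_request` is a hypothetical helper name; 10.5.5.5 is the GNS3 virtual machine address used later in this chapter.

```python
import base64
import json
import urllib.request


def build_gns3_request(server_ip, path, user, password, payload=None, port=3080):
    """Build an HTTP request with basic authentication and an optional JSON body.

    port and path are assumptions about the GNS3 server, not facts from the text.
    """
    url = f"http://{server_ip}:{port}{path}"
    data = None if payload is None else json.dumps(payload).encode("utf-8")
    method = "GET" if data is None else "POST"
    request = urllib.request.Request(url, data=data, method=method)
    token = base64.b64encode(f"{user}:{password}".encode("utf-8")).decode("ascii")
    request.add_header("Authorization", f"Basic {token}")
    if data is not None:
        request.add_header("Content-Type", "application/json")
    return request


# Hypothetical credentials; nothing is sent over the network here.
req = build_gns3_request("10.5.5.5", "/v2/version", "admin", "secret")
print(req.get_method(), req.full_url)
```

Passing the returned object to `urllib.request.urlopen` would perform the call against a reachable server. Note that basic authentication over plain HTTP only encodes the credentials, so it is best confined to an isolated lab network such as the one described here.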

3.4.2 Ubuntu
We used the following command to pull the GNS3 Ubuntu Docker container onto the GNS3
VM from the Docker registry.

docker pull gns3/ubuntu:xenial

The Ubuntu Docker container includes networking tools such as net-tools,
iproute2, ping and traceroute, curl (data transfer utility), host (DNS lookup utility),
iperf3, mtr (full-screen traceroute), socat (utility for reading from and writing to network
connections), ssh client, tcpdump and telnet [50].

3.4.3 Network interfaces

A virtual machine allows different network adapters to be defined and configured. As can
be seen in Figure 3.5, we created three network interfaces, as follows [51]:

● NAT (network address translation): The virtual machine does not have its own IP
address on the external network. Instead, NAT translates the addresses of virtual
machines in a private VMnet network to that of the host machine, so that
the virtual machine uses the host computer’s network connection to reach the Internet.
The VMware virtual DHCP server assigns an address to the virtual machine.
NAT provides a transparent and easy-to-configure way to access network
resources.
● Host-only: It provides a network connection between the virtual machine and the
host computer. The virtual machine is connected to the host operating system
using a virtual Ethernet adapter that is visible to the host operating system on a
virtual private network. It is not visible to outside hosts.
● Custom: It is a more complicated networking configuration option which provides
a customized setup for virtual network adapters. After selecting the “Custom” option,
the user chooses a virtual switch to which the virtual machine’s adapter
is connected.
Accordingly, we created three corresponding network adapters for the GNS3
virtual machine, as shown in Figure 3.6. We also assigned IP addresses to each of the interfaces
of the GNS3 virtual machine. Figure 3.7 shows the assignment of IP addresses to the
interfaces of VMnet8.

Figure 3.5 Host-only, NAT and custom virtual network interfaces

Figure 3.6 Virtual machine–adapter setting (VMware networks: VMnet1, 10.5.6.x; VMnet2, 10.5.7.x; VMnet8, 10.5.5.x, connected to the Internet)

Figure 3.7 IP address assignment for VMnet8 (VMnet8: 10.5.5.2; PfSense_Firewall: 10.5.5.3; connected through a switch)
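
The addressing plan of the three virtual networks can be written down and checked with Python's `ipaddress` module. This is a sketch under assumptions: the chapter gives 10.5.6.0/24 explicitly for the victim network, while the /24 prefix for the other two networks and the zone labels attached to each VMnet are inferred from the figures; `zone_of` is a hypothetical helper.

```python
import ipaddress

# Assumed /24 prefixes for the 10.5.5.x, 10.5.6.x and 10.5.7.x ranges.
ZONES = {
    "VMnet1 (victim zone)": ipaddress.ip_network("10.5.6.0/24"),
    "VMnet2 (attacker zone)": ipaddress.ip_network("10.5.7.0/24"),
    "VMnet8 (NAT, real outside)": ipaddress.ip_network("10.5.5.0/24"),
}


def zone_of(address):
    """Return the name of the virtual network containing address, or None."""
    ip = ipaddress.ip_address(address)
    for name, network in ZONES.items():
        if ip in network:
            return name
    return None


for host in ("10.5.6.50", "10.5.7.70", "10.5.5.3", "192.168.1.10"):
    print(host, "->", zone_of(host))
```

A table like this is handy when labeling collected flows, since the zone of the source address already separates attacker-side from victim-side traffic.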

3.5 Simulated environment for network anomaly detection research

For the simplest topology, we required an attacker, a target, a firewall, normal and attack
traffic generators, a log collector and a data repository.

3.5.1 Victim machine

As a victim machine, we installed the Apache2 server on an Ubuntu Docker container
and made a new container called Victim_Machine_Template.

3.5.2 Attacker machine

As the attacker machine, we installed the attack tool BoNeSi [52] on an Ubuntu
Docker container and made a new container called Attacker_Machine_Template.
1. We were able to add any penetration testing or attack simulation tool to the
attacker machine based on our needs. As we needed to simulate a DDoS attack,
we used the BoNeSi botnet simulator tool with 50k bots.
2. BoNeSi is able to generate different types of flooding attacks using the ICMP,
UDP and Transmission Control Protocol (TCP) protocols from 50k different source
IP addresses. BoNeSi lets users configure rates, data volume, source
IP addresses, URLs, target port, time to live, etc. It also supports the simulation
of HTTP-GET flood attacks. As cited in [52], BoNeSi can generate up to
150,000 packets per second on an AMD Opteron at 2 GHz, and this rate can be
doubled on a more recent AMD Phenom II X6 1100T at 3.3 GHz. We simulated
the following DDoS attacks:
bonesi -r 3000 -p tcp -d eth0 10.5.5.50:80
(Sends 3,000 packets per second to port 80 of the victim machine)
bonesi -r 3000 -s 320 -p tcp -d eth0 10.5.5.50:80
(Sends 3,000 packets per second, each with a size of 320 bytes, to port 80 of the victim
machine)
bonesi -r 3000 -i ip_list -p tcp -d eth0 10.5.5.50:80
(Sends 3,000 packets per second to port 80 of the victim machine using source IPs
in the ip_list text⁸ file)
bonesi -r 3000 -i ip_list -p tcp -d eth0 10.5.5.80:80
(Sends 3,000 packets per second to port 80 of the victim machine using source
IPs in the ip_list text file)
bonesi -i ip_list -p tcp -d eth0 10.5.5.80:80
(Sends as many packets as it can to port 80 of the victim machine using source IPs in
the ip_list text file)

Figure 3.8 IP address assignment for WAN and LAN interfaces of pfSense (VMnet2; Attacker: 10.5.7.70; PfSense_Firewall interfaces: 10.5.7.3, 10.5.6.3 and 10.5.5.3)
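
When several attack scenarios differ only in rate, payload size or source list, it can help to assemble the command lines programmatically and to estimate the offered load in advance. The sketch below only builds argument lists and does not run anything; it uses just the options that appear in the commands above (-r, -s, -i, -p, -d), and the helper names `bonesi_argv` and `payload_mbit_per_s` are hypothetical.

```python
import shlex


def bonesi_argv(target, device="eth0", protocol="tcp",
                rate=None, payload_size=None, ip_list=None):
    """Assemble a BoNeSi argument list from the options used in this section."""
    argv = ["bonesi"]
    if rate is not None:
        argv += ["-r", str(rate)]          # packets per second
    if payload_size is not None:
        argv += ["-s", str(payload_size)]  # payload size in bytes
    if ip_list is not None:
        argv += ["-i", ip_list]            # file with source IP addresses
    argv += ["-p", protocol, "-d", device, target]
    return argv


def payload_mbit_per_s(rate_pps, payload_bytes):
    """Payload throughput in Mbit/s, ignoring protocol header overhead."""
    return rate_pps * payload_bytes * 8 / 1_000_000


command = shlex.join(bonesi_argv("10.5.5.50:80", rate=3000, payload_size=320))
print(command)
print(payload_mbit_per_s(3000, 320), "Mbit/s of payload")
```

For the second scenario above, 3,000 packets per second with 320 bytes each correspond to 7.68 Mbit/s of payload; the traffic on the wire is somewhat higher because of the Ethernet, IP and TCP headers.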

3.5.3 pfSense firewall

We added the pfSense appliance of GNS3 to this project. We then assigned
the WAN and LAN interfaces of pfSense and their corresponding IP addresses as
shown in Figure 3.8.

3.5.3.1 Firewall configuration

We accessed the web configurator of the firewall using the http://10.5.6.3
URL, as set in the LAN interface settings of the firewall. Subsequently,
we defined rules for floating, WAN, LAN and attacker zone. For
instance, we allowed and logged all traffic between the victim zone and the attacker zone.
We also blocked all traffic from/to the real outside, because the virtualized environment
is vulnerable to all the traditional attacks and exploits, even more so than normal
environments [52]. Moreover, we wanted an isolated environment to create and test attacks
without affecting real systems.

8
The list of source IP addresses participating in the DDoS attack can be provided in a text file and then
passed to BoNeSi using the ‘-i’ parameter.

Figure 3.9 IP address assignment for victim zone (VMnet1; Switch (MirroringFunction); Service_Machine: 10.5.6.60; Ostinato: 10.5.6.70; Victim: 10.5.6.50; NTOPNG: 10.5.6.90; Data_Repository: 10.5.6.40)

3.5.4 NAT and VMware host-only networks

The network interfaces of the real outside, attacker zone and victim zone in our project
were mapped to the corresponding network interface adapters. Figure 3.9 shows
the IP address assignment of the interfaces in the victim zone.

3.5.5 Traffic generator machine

Ostinato is a packet crafter, network traffic generator and analyzer with
a user-friendly graphical user interface (GUI) [26]. Ostinato allows users to create
desired data streams by manually configuring each packet at different layers of the
Open Systems Interconnection (OSI) model. Ostinato uses a client/server architecture.
The GUI is used to create the desired packets, which are then sent out of the traffic
interfaces by Ostinato’s server (drone) [26]. The Ostinato GUI can be used to
configure the IP and MAC addresses of the interfaces (eth1, eth2, eth3, etc.). By default,
the interface eth0 is used to connect to the local machine. Figure 3.10 shows
the main user interface of the Ostinato tool.

Figure 3.10 GUI of Ostinato

We added the Ostinato appliance of GNS3 to this project in order to fabricate
and send packets of several streams. Ostinato generates stateless streams, which are
dropped by the pfSense firewall. Therefore, we placed it behind the firewall. To
use the GUI of Ostinato, we set the console type to virtual network computing
(VNC) in the node configuration window of GNS3. GNS3 supports the TightVNC [53]
viewer, so we installed the TightVNC⁹ viewer on our local machine. The IP address of
the remote host in TightVNC was set to the IP address of the GNS3 virtual machine
(10.5.5.5). When a new stream was created, we set several parameters at different OSI
layers, as follows:
1. Configuring protocols in different layers (MAC, Ethernet, IPv4, TCP and Text
for the physical, data link, network, transport and application layers, respectively).
2. Configuring the MAC addresses of the sender and receiver to the MAC addresses of the attacker
and victim machines, respectively.
3. Configuring the source and destination IP addresses to 10.5.7.70 (attacker machine)
and 10.5.6.50 (victim machine).
4. Configuring the destination port number to 80.
5. Writing “Network Traffic Generation” in the payload of the data.
6. Configuring the number of packets to 10,000 and the transmission rate to 200 pps.
Ostinato also allows the checksum and different packet flags, including URG,
ACK, PSH, RST, SYN and FIN, to be set. Figure 3.11 shows the properties of the crafted
stream in detail.
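
The stream parameters listed above can be recorded in a small data structure, which also makes it easy to work out how long the stream runs. This sketch does not use Ostinato's own API; `StreamSpec` and its field names are hypothetical and only mirror the GUI settings described in the steps.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class StreamSpec:
    """Plain record of the stream settings entered in the Ostinato GUI."""
    src_ip: str
    dst_ip: str
    dst_port: int
    payload: str
    num_packets: int
    rate_pps: float

    def duration_seconds(self):
        """Time needed to send all packets at the configured rate."""
        return self.num_packets / self.rate_pps


stream = StreamSpec(
    src_ip="10.5.7.70",        # attacker machine
    dst_ip="10.5.6.50",        # victim machine
    dst_port=80,
    payload="Network Traffic Generation",
    num_packets=10_000,
    rate_pps=200,
)
print(stream.duration_seconds())  # 10,000 packets / 200 pps = 50.0 seconds
```

Knowing that the stream lasts 50 seconds helps when aligning the generated traffic with the time window of the collected flow logs.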

3.5.6 NTOPNG tool

We added the Docker image of NTOPNG as an appliance to our GNS3 project.
We used the NTOPNG tool to collect the processed network traffic data in our GNS3
environment. NTOPNG is a network traffic probe that monitors network usage.
NTOPNG is based on libpcap and can run on virtually every Unix
platform, on MacOSX and on Windows as well.

9
VNC is a graphical desktop sharing system based on the Remote Frame Buffer protocol to remotely access
and control another computer [54].

Figure 3.11 Detailed packet view in Ostinato

Tools like TCPdump and Wireshark collect raw data: whenever a packet
is received or sent, its data is captured and logged. NTOPNG, on the other hand, is able
to collect data in a connection-based format. It enabled us to capture the main features
of network traffic, including source IP address, destination IP address, source port,
destination port, duration and the number of bytes and packets sent and received.
It also provides geographical information about the IP addresses [55].
Figure 3.12 shows the summary of active hosts in the non-attack period, and Figure
3.13 shows the time-based graph of traffic flows on the NTOPNG interface (eth0)
which is connected to the victim host during the simulated DDoS attack period.
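
The difference between raw packet capture and a connection-based format can be illustrated by grouping packet records into flow records. The sketch below is a simplification and not NTOPNG's implementation: it assumes each packet is a tuple of timestamp, source IP, source port, destination IP, destination port, protocol and size, and it keeps the two directions of a connection as separate flows. The sample packets are hypothetical.

```python
def aggregate_flows(packets):
    """Group packet records into unidirectional flows keyed by the 5-tuple.

    Each flow record keeps packet and byte counters plus the first and last
    timestamps, from which a duration can be derived.
    """
    flows = {}
    for ts, src_ip, src_port, dst_ip, dst_port, proto, size in packets:
        key = (src_ip, src_port, dst_ip, dst_port, proto)
        flow = flows.get(key)
        if flow is None:
            flows[key] = {"packets": 1, "bytes": size, "first": ts, "last": ts}
        else:
            flow["packets"] += 1
            flow["bytes"] += size
            flow["first"] = min(flow["first"], ts)
            flow["last"] = max(flow["last"], ts)
    return flows


# Hypothetical capture: two packets toward the victim and one reply.
packets = [
    (0.00, "10.5.7.70", 40000, "10.5.6.50", 80, "tcp", 60),
    (0.01, "10.5.7.70", 40000, "10.5.6.50", 80, "tcp", 380),
    (0.02, "10.5.6.50", 80, "10.5.7.70", 40000, "tcp", 60),
]
flows = aggregate_flows(packets)
for key, record in flows.items():
    print(key, record)
```

Three packets collapse into two flow records here; during a flood the reduction is far larger, which is what makes flow logs practical to store and analyze.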

Figure 3.12 Sent/Received traffics in interface of NTOPNG (eth0) during attack

[Time-series plot: Traffic (eth0); vertical axis from 0 to 4.76 Mbit/s, with a "Network link saturated" marker near 4.50 Mbit/s; horizontal axis from 14:47:57 to 14:57:56; status bar: uptime 8 min 55 sec, 1 alert, 407 hosts, 129,716 flows]

Figure 3.13 Active network hosts during non-attack time period

Figure 3.14 shows a time-based graph of the packet arrival
rate during the simulated DDoS attack in NTOPNG.

3.5.6.1 NTOPNG configuration

As Figure 3.12 shows, we attached NTOPNG to the same switch to which the victim
machine was connected. In addition, we configured port mirroring, copying
all traffic on the switch port connected to the victim to the mirror port.
We used this technique so that all network traffic sent to the victim machine also reaches
NTOPNG [56].

3.5.6.2 NTOPNG configuration to dump logs to Mysql machine

We set the start command of the NTOPNG machine using its configuration interface
by entering the following connection string.

(mysql;<host>;<dbname>;<table name>;<user>;<pw>)

Figure 3.14 Sent/Received traffics in victim host during attack (time-series plot of TCP (rcvd) and TCP (sent) traffic; vertical axis ticks at 500 Kbit/s, 1 Mbit/s and 2 Mbit/s)

Table 3.1 Fields of NTOPNG logs and their corresponding data types

Field                      Type        Key
idx                        Int         Yesᵃ
VLAN_ID                    Small Int   No
L7_Protocol                Small Int   No
Source IP address          Int         No
Source port number         Small Int   No
Destination IP address     Int         No
Destination port number    Small Int   No
Protocol                   Tiny Int    No
Bytes                      Int         No
Packets                    Int         No
First_Switched             Int         No
Last_Switched              Int         No
Info                       Int         No
Json                       Blob        No
Profile                    Varchar     No
NTOPNG_Instance_Name       Varchar     No
Interface_ID               Small Int   No

ᵃ Auto incremental.

Consequently, NTOPNG created a database and table on the Data_Repository
machine, as Table 3.1 shows.
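
Table 3.1 lists the source and destination IP addresses with type Int. Assuming these columns hold IPv4 addresses as 32-bit unsigned integers (the usual convention, and an assumption here rather than something stated in the table), they can be converted to and from dotted notation with the standard library. The helper names are hypothetical.

```python
import ipaddress


def decode_ipv4(value):
    """Convert a 32-bit integer into dotted-quad notation."""
    return str(ipaddress.IPv4Address(value))


def encode_ipv4(text):
    """Convert dotted-quad notation into its 32-bit integer form."""
    return int(ipaddress.IPv4Address(text))


victim_as_int = encode_ipv4("10.5.6.50")
print(victim_as_int)               # 10*2**24 + 5*2**16 + 6*2**8 + 50 = 168101426
print(decode_ipv4(victim_as_int))  # 10.5.6.50
```

Storing addresses as integers keeps the table compact and lets range queries select a whole subnet, at the cost of a conversion step whenever the logs are read by a person.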

3.5.7 Repository machine

As NTOPNG supports exporting monitoring data to MySQL, Elasticsearch
and Logstash, we installed MySQL on an Ubuntu Docker container and made a
new container called Data_Repository_Template. We connected it to the NTOPNG
machine to collect network traffic from NTOPNG. As Figure 3.9 shows, the
Data_Repository machine was placed in the same network as NTOPNG and
the victim machine (10.5.6.0/24).

3.5.7.1 Repository machine configuration

We altered the listening configuration of the Data_Repository machine. By
default, MySQL listens on port 3306 on the local host only. If the “dump traffic
data” option of NTOPNG is set, NTOPNG automatically sends the log data to
MySQL. In this situation, MySQL must accept connections from outside the local host. We
therefore changed the bind_address variable (which names the local address on which the
server listens) so that MySQL also accepts connections from the NTOPNG machine, then
stopped and restarted the Data_Repository machine.

3.5.7.2 Giving remote root access to the Data_Repository machine

By default, remote root access to MySQL is disabled for security reasons. We
enabled it locally using the following SQL statements in order to authorize NTOPNG to
write network flow logs to the data repository machine.

GRANT ALL ON *.* TO root@'10.5.6.90' IDENTIFIED BY 'test';
FLUSH PRIVILEGES;

3.6 Discussion and results

To the best of our knowledge, there is no comprehensive study in the literature which
concentrates solely on the step-by-step implementation of a simulated lab based on
virtual machines and Docker containers to produce directional (sent/received) DDoS
attack data and collect these data according to many criteria, including IP address,
port, L7 protocol, throughput, RTT, TCP statistics (retransmissions, out-of-order
packets, packet loss), bytes/packets transmitted, etc. We think this chapter fills
this gap by providing guidelines for cybersecurity researchers and practitioners to
develop an isolated simulation environment. The use of open-source tools ensures the
repeatability and comparability of results among studies which apply the proposed
simulated environment in their research. The simulated environment is suitable for creating
normal and malicious network traffic and collecting flow-based traffic logs for network
anomaly detection analyses. Minor changes might be required for different
attack scenarios. For example, different attack tools can be installed on the attacker
machine to launch a variety of exploits against the victim machine.
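
As an illustration of how the collected flow-based logs might feed an anomaly detection analysis, the sketch below ranks the sources that sent the most packets to a given destination. It uses an in-memory SQLite database purely as a stand-in for the MySQL repository so that it runs without a server; the simplified table layout, the sample rows and the address 10.5.7.71 are hypothetical.

```python
import sqlite3


def top_sources(conn, dst_ip, limit=5):
    """Return (source IP, packets, bytes) per source for one destination,
    ordered by packet count in descending order."""
    cursor = conn.execute(
        "SELECT src_ip, SUM(packets) AS pkts, SUM(bytes) AS byts "
        "FROM flows WHERE dst_ip = ? "
        "GROUP BY src_ip ORDER BY pkts DESC LIMIT ?",
        (dst_ip, limit),
    )
    return cursor.fetchall()


# Stand-in repository with a reduced set of the columns from Table 3.1.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE flows (src_ip TEXT, dst_ip TEXT, dst_port INTEGER, "
    "packets INTEGER, bytes INTEGER)"
)
conn.executemany(
    "INSERT INTO flows VALUES (?, ?, ?, ?, ?)",
    [
        ("10.5.7.70", "10.5.6.50", 80, 900, 288000),
        ("10.5.7.71", "10.5.6.50", 80, 300, 96000),
        ("10.5.7.70", "10.5.6.50", 80, 100, 32000),
        ("10.5.6.60", "10.5.6.40", 3306, 10, 1200),
    ],
)
ranking = top_sources(conn, "10.5.6.50")
print(ranking)
```

The same aggregation, expressed against the real flow table, gives a quick first view of which sources dominate the traffic toward the victim before more elaborate detection methods are applied.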

3.7 Summary

In this chapter, we provided information on some key concepts of network simulation
and virtualization and their differences. Subsequently, we presented
a network simulation environment with a focus on network attack and anomaly
detection scenarios. We took advantage of both hypervisor- and Docker-based virtual
environments. We utilized preconfigured GNS3, NTOPNG, Docker containers,
Ostinato and pfSense virtual images to realize our network topology. In addition,
we made our own containers, including Data_Repository, Victim_template and
Attacker_template, which can be used in future industry and academic studies. Virtualization
and simulation play a significant role not only in reducing operational
costs but also in keeping operational environments safe from threats. Therefore, they
have intrinsic potential for use in the field of network security, attack and
anomaly detection. In this study, we opted almost exclusively for open-source tools to encourage
adoption, modification and improvement of this simulated environment by future
researchers. This is an example of using GNS3 and computer virtualization for network
simulation for the investigation of network security and attack scenarios. Based
on this example, several different simulation scenarios can be realized.
The simulation results depend strongly on the performance of the hardware
used.¹⁰,¹¹ The same simulation on slower/faster hardware could deliver divergent
results. This also depends on bus speed, NIC speed, etc. This has to be taken into
account for performance testing purposes. However, the fundamental results concerning
security investigation using this simulation method are not affected.

References
[1] Kurose JF, Ross KW. Computer networking: A top-down approach. Addison-
Wesley, Reading; 2010.
[2] Breslau L, Estrin D, Fall K, et al. Advances in network simulation. Computer.
2000;33(5):59–67.
[3] Sarkar NI, Halim SA. A review of simulation of telecommunication networks:
Simulators, classification, comparison, methodologies, and recommendations.
Cyber Journals: Multidisciplinary Journals in Science and Technology, Journal
of Selected Areas in Telecommunications (JSAT). 2011;2(3):10–17.
[4] Chang X. Network simulations with OPNET. In: Proceedings of the 31st Con-
ference on Winter Simulation: Simulation—A Bridge to the Future-Volume 1.
ACM; 1999. p. 307–314.
[5] Gyires T. Network simulation. In: Iványi A, editor. Algorithms of informatics.
vol. 2. Budapest: MondAt Kiadó. 2007.
[6] Chan KFP, De Souza P. Transforming network simulation data to semantic
data for network attack planning. In: ICMLG 2017 5th International Confer-
ence on Management Leadership and Governance. Academic Conferences and
Publishing Limited; 2017. p. 74.
[7] Dupuy A, Schwartz J, Yemini Y, Bacon D. NEST: A network simulation and
prototyping testbed. Communications of the ACM. 1990;33(10):63–74.
[8] Keshav S. REAL: A network simulator. University of California Berkeley,
Berkeley, CA, USA; 1988.

10 Host: Intel(R) Core(TM) i7-7700HQ CPU @ 2.80GHz, 16.0GB RAM, 64-bit OS.
11 Virtual Appliance: 1×4 Core processor, 6.1GB RAM.

[9] Bajaj S, Breslau L, Estrin D, et al. Improving simulation for network research.
University of Southern California, Tech. Rep; 1999.
[10] Pan J, Jain R. A survey of network simulation tools: Current status and future
developments. Washington University in St. Louis, Tech. Rep; 2008.
[11] Wehrle K, Günes M, Gross J. Modeling and tools for network simulation.
Aachen: Springer Science & Business Media; 2010.
[12] Behal S, Kumar K. Trends in validation of DDoS research. Procedia Computer
Science. 2016;85:7–15.
[13] Bitner B, Greenlee S. z/VM a brief review of its 40 year history. Available from:
http://www.vm.ibm.com/vm40hist.pdf (accessed 26 April 2016). 2012.
[14] Preeth E, Mulerickal FJP, Paul B, Sastri Y. Evaluation of Docker containers
based on hardware utilization. In: Control Communication & Computing India
(ICCC), 2015 International Conference on. IEEE; 2015. p. 697–700.
[15] Morris D, Voutsinas S, Hambly N, Mann R. Use of Docker for deployment
and testing of astronomy software. Astronomy and Computing. 2017;20:
105–119.
[16] Geng X, Zeng X, Hu L, Guo Z. An novel architecture and inter-process com-
munication scheme to adapt chromium based on Docker container. Procedia
Computer Science. 2017;107:691–696.
[17] Grunewald D, Lützenberger M, Chinnow J, Bye R, Bsufka K, Albayrak
S. Agent-based network security simulation. In: The 10th International
Conference on Autonomous Agents and Multiagent Systems-Volume 3. Inter-
national Foundation for Autonomous Agents and Multiagent Systems; 2011.
p. 1325–1326.
[18] Bhattacharyya DK, Kalita JK. Network anomaly detection: A machine learning
perspective. New York: Chapman and Hall/CRC; 2013.
[19] Gogoi P, Bhattacharyya D, Borah B, Kalita JK. A survey of outlier detec-
tion methods in network anomaly identification. The Computer Journal.
2011;54(4):570–588.
[20] Jyothsna V, Prasad VR, Prasad KM. A review of anomaly based intru-
sion detection systems. International Journal of Computer Applications.
2011;28(7):26–35.
[21] Chandola V, Banerjee A, Kumar V. Anomaly detection: A survey. ACM
Computing Surveys. 2009;41(3), Article 15:1–58.
[22] Ahmed M, Mahmood AN, Hu J. A survey of network anomaly detection
techniques. Journal of Network and Computer Applications. 2016;60: 19–31.
[23] Tavallaee M, Bagheri E, Lu W, Ghorbani AA. A detailed analysis of the KDD
CUP 99 data set. In: Computational Intelligence for Security and Defense
Applications, 2009. CISDA 2009. IEEE Symposium on. IEEE; 2009. p. 1–6.
[24] Gogoi P, Bhuyan MH, Bhattacharyya D, Kalita JK. Packet and flow based
network intrusion dataset. In: International Conference on Contemporary
Computing. Springer; 2012. p. 322–334.
[25] Botta A, Dainotti A, Pescapé A. A tool for the generation of realistic net-
work workload for emerging networking scenarios. Computer Networks.
2012;56(15):3531–3547.
[26] Ostinato. Github; 2018. Available from: https://ostinato.org/.
[27] Seagull: an Open Source Multi-protocol traffic generator. sourceforge; 2018.
Available from: http://gull.sourceforge.net/.
[28] Weigle MC, Adurthi P, Hernández-Campos F, Jeffay K, Smith FD. Tmix: A tool
for generating realistic TCP application workloads in ns-2. ACM SIGCOMM
Computer Communication Review. 2006;36(3):65–76.
[29] RUDE & CRUDE. SourceForge; 2018. Available from: http://rude.sourceforge.net/.
[30] Multi-Generator (MGEN). U.S. Naval Research Laboratory; 2018. Available
from: https://www.nrl.navy.mil/itd/ncs/products/mgen.
[31] Kernel-based Traffic Engine (KUTE). Swinburne University of Technology;
2018. Available from: http://caia.swin.edu.au/genius/tools/kute/.
[32] Brawny and RobUst Traffic Engine (BRUTE). MIUR project EURO
“University Experiment of an Open Router”; 2018. Available from:
https://code.google.com/archive/p/brute/.
[33] Bhuyan MH, Bhattacharyya DK, Kalita JK. Network anomaly detection:
Methods, systems and tools. IEEE Communications Surveys & Tutorials.
2014;16(1):303–336.
[34] Sieklik B, Macfarlane R, Buchanan WJ. Evaluation of TFTP DDoS amplifi-
cation attack. Computers & Security. 2016;57:67–92.
[35] Ringberg H, Roughan M, Rexford J. The need for simulation in evaluat-
ing anomaly detectors. ACM SIGCOMM Computer Communication Review.
2008;38(1):55–59.
[36] Kuhl ME, Kistner J, Costantini K, Sudit M. Cyber attack modeling and sim-
ulation for network security analysis. In: Proceedings of the 39th Conference
on Winter Simulation: 40 years! The Best is Yet to Come. IEEE Press; 2007.
p. 1180–1188.
[37] Elejla OE, Anbar M, Belaton B, Alijla BO. Flow-based IDS for ICMPv6-based
DDoS attacks detection. Arabian Journal for Science and Engineering. 2018;
43(12):1–19.
[38] Balyk A, Karpinski M, Naglik A, Shangytbayeva G, Romanets I. Using graphic
network simulator 3 for DDoS attacks simulation. International Journal of
Computing. 2017;16(4):219–225.
[39] Al Kaabi S, Al Kindi N, Al Fazari S, Trabelsi Z. Virtualization based ethical
educational platform for hands-on lab activities on DoS attacks. In: Global
Engineering Education Conference (EDUCON), 2016 IEEE. IEEE; 2016.
p. 273–280.
[40] Ojeniyi JA, Balogun MO, Sanjo F, Ugochukwu O. Development of a traffic
analyzer for the detection of DDoS attack source. 2016. In: International Con-
ference on Information and Communication Technology and Its Applications
(ICTA 2016). 2016. p.111–117.
[41] Behal S, Kumar K. Detection of DDoS attacks and flash events using informa-
tion theory metrics—An empirical investigation. Computer Communications.
2017;103:18–28.

[42] Zhao X, Qian YK, Wang CS. A framework of evaluation methodologies for
network anomaly detectors. In: Advanced Materials Research. vol. 756. Trans
Tech Publ; 2013. p. 3005–3010.
[43] Sommers J, Yegneswaran V, Barford P. A framework for malicious work-
load generation. In: Proceedings of the 4th ACM SIGCOMM Conference
on Internet Measurement. ACM; 2004. p. 82–87.
[44] Mirkovic J. D-WARD: Source-end defense against distributed denial-of-
service attacks. University of California, Los Angeles, CA; 2003.
[45] Barford P, Crovella M. Generating representative web workloads for network
and server performance evaluation. In: ACM SIGMETRICS Performance
Evaluation Review. vol. 26. ACM; 1998. p. 151–160.
[46] Scapy’s documentation; 2018. Available from: https://scapy.readthedocs.io.
[47] GNS3 Architecture. GNS3 Academy; 2018. Available from:
http://api.gns3.net/en/latest/general.html#architecture.
[48] Neumann JC. The book of GNS3: Build virtual network labs using Cisco,
Juniper, and More. San Francisco: No Starch Press; 2015.
[49] GNS3 Software. GNS3 Inc.; 2018. Available from: https://www.gns3.com/software.
[50] Goldstein M. BoNeSi – The DDoS Botnet Simulator; 2016. Available from:
https://github.com/Markus-Go/bonesi.
[51] Ali I, Meghanathan N. Virtual machines and networks-installation, performance
study, advantages and virtualization options. arXiv preprint arXiv:1105.0061. 2011.
[52] Reuben JS. A survey on virtual machine security. Helsinki University of
Technology. Tech. Rep; 2007.
[53] TightVNC Software. TightVNC Group; 2018. Available from:
https://www.tightvnc.com.
[54] Virtual Network Computing. AT&T Laboratories Cambridge; 2018. Available
from: http://www.hep.phy.cam.ac.uk/vnc_docs/protocol.html.
[55] NTOP Software. NTOP Inc.; 2018. Available from:
https://www.ntop.org/products/traffic-analysis/ntop/.
[56] DrayTek. Vigor3300V user guide V3.0; 2009. Available from:
https://www.draytek.com/en/products/products-a-z/router.all/2016/03/30/vigor3300v/.
Part II
Surveys and reviews
Chapter 4
Demand–response management in smart grid:
a survey and future directions
Waseem Akram1 and Muaz A. Niazi1

Demand–response management (DRM) is currently one of the key areas of research in
the smart grid (SG). DRM simplifies interactions between customers and
utility-service providers. It also helps to improve energy efficiency and supports
load balancing. Studies on DRM have produced a number of interesting technical
discussions and research contributions. Many of these studies work toward making
systems energy efficient. However, customer satisfaction has received less
attention and needs considerable new advances. Over the past few decades, a number
of studies have been carried out in SG regarding DRM, but no work presents a
comprehensive analysis of them. There is a need to investigate the different
techniques, their advantages, and their limitations. Focusing on DRM from a
customer satisfaction perspective, this chapter presents a detailed overview of
different solutions for developing DRM. We also group existing solutions and
identify trends and challenges in the SG domain from a DRM perspective.

4.1 Overview

We start with an introduction to SG. Then the background and basic concepts are
given. Next, we present a detailed review of the literature on SG from a DRM
perspective. Then open research problems are given. Finally, we present conclusions
at the end of this chapter.

4.2 Introduction
The traditional power system provides one-way power flow to the consumers. At the
same time, energy demand from consumers is continuously growing. This makes it
difficult for the traditional power system to respond to the ever-changing and
rising energy demand of consumers. Because of this issue, the energy sector has
started working toward an efficient and sustainable energy system. This effort
introduced the SG concept into the energy domain.

1 Computer Science Department, COMSATS Institute of Information Technology, Pakistan
The SG introduced a two-way dialog in which electricity and information can be
exchanged between the utility and consumers. It integrates advanced information and
communication technology (ICT), smart meters, smart appliances, and other sensing
mechanisms [1]. It is a developing network of distributed nodes, where all operations
of the system are controlled by an intelligent and autonomous system [2]. The SG
involves the transmission of energy to the consumers in a controlled and smart way,
which benefits both the utility and end users [3].
DRM plays an important role in the SG environment. It enables the dynamic adjustment
of energy demand from consumers in response to price signals and incentives.
This process shifts demand from high-demand to low-demand periods, thus reducing
energy cost [4,5]. It assists the interaction between end users, appliances, and the
utility service provider, which minimizes end-user effort in controlling
power-consuming devices [6,7]. It also helps in fault detection and prevention in
the system, thus improving system reliability and sustainability [2].
There are several research challenges related to DRM. The deployment of ICT,
smart meters, and renewable energy resources is a challenging task [1]. Renewable
energy resources have unpredictable fluctuations in power generation, so it is
difficult to predict energy for the day ahead [8]. Another big challenge is
decision-making for demand and consumption at the consumer side. Consumers must
decide how much energy is required for a certain type of appliance in a particular
time period, which makes the consumer decision more complex. The users’ demand for
energy changes with time (variable demand); this requires an adaptive strategy in
which grid units can modify their capacity according to the user demand [9].
Reliability is another issue in an SG environment [10]. Some naturally occurring
events lead to the cascading failure of the SG [6,11–14], where a supervisory
control and data-acquisition system is used to detect and prevent faults in the
system [3]. The SG presents heterogeneous structures composed of distributed nodes.
All operations are controlled through a communication network. The current
communication techniques are inefficient for such large and complex systems.
The deployment of renewable energy resources needs more coordination and
control techniques to achieve a reliable and efficient system. A multi-agent system
(MAS) is a useful tool for coordinating and controlling all operations within the SG,
owing to its distributed and autonomous nature. MAS is widely used in SG
applications. In articles [13,15–17], MAS is adopted for DRM [18], fault handling
[14], and voltage and storage control [17,19]. In the last couple of decades,
researchers have made a number of contributions to DRM and have made systems in
the SG environment more efficient. However, there is still a need for improvement
in the consumer satisfaction domain.
Over the past few decades, a number of studies have been carried out in SG regarding
DRM. However, no work presents a comprehensive analysis of these studies. There is a
need to investigate the different techniques, their advantages, and their
limitations. In this chapter, we therefore present a comprehensive and detailed
review of DRM techniques. We review different scientific publications and
investigate their features as well as open research problems.
Our expected contributions are listed as follows:
1. To review a large body of literature in the domain of SG from a DRM perspective.
2. To propose a classification of the DRM techniques used in previous literature.
3. To highlight their key features as well as open research problems.

4.3 Backgrounds

In this section, we present the basic background and concepts needed to
understand DRM in the SG.
4.3.1 Smart grid
The traditional power system is responsible for the generation and transmission of
energy to end users. However, user demand changes with time (variable demand), and
a static approach cannot deal with variable demand. This problem gained the attention
of researchers and led to the introduction of SG technology. The SG is a complex
system that is being formed from the traditional power system [20]. It integrates
advanced communication and control technology that enables the system to operate
automatically. It also comprises various other technologies such as smart meters,
smart homes, generators, storage devices, appliances, and loads. Together these form
a network of distributed nodes in which all operations of the system are controlled
intelligently and autonomously. The key benefit of an SG is an efficient energy
system [2].
The National Institute of Standards and Technology (NIST), US Department of
Commerce, presented a conceptual model for the SG domain called the NIST SG
framework 1.0 [21]. This model represents seven different actors/applications that
interact with each other. The conceptual model for SG is shown in Figure 4.1.
Each actor in this model is described below:
1. Customer: End users that consume and store energy. They may be residential,
industrial, or commercial.
2. Market: Operators in the electricity market.
3. Operation: Manages all energy transmissions.
4. Service provider: Provides services and facilities to the utility and customers.
5. Generation: Generates and stores energy.
6. Transmission: Carries energy over large distances.
7. Distribution: Carries energy to and from customers.
The model components are of two kinds:
1. Social components: Electricity consumers, producers, grid operators.
2. Technical components: Loads (consuming devices), generators, power lines,
buses.
The interaction and behavior of these actors influence, and are influenced by,
the technical system. Changes in the configuration of the technical system
affect the actors’ behavior, and changes in the actors’ behavior affect the technical
system configuration. Therefore, the coupled socio-technical system needs to be
considered in order to achieve a reliable, sustainable, and resilient power system.

Figure 4.1 SG conceptual model adapted from [21] (domains shown: Market, Operation,
Service provider, Generation, Transmission, Distribution, Customer)

4.3.2 Demand–response management

DRM plays an important role in the SG environment. It refers to the process
in which end users reduce energy consumption in response to incentives [22].
This improves the scalability and efficiency of the system. DRM offers different
strategies such as time-of-use (TOU) pricing, real-time pricing (RTP), critical peak
pricing, incentive-based pricing, and capacity market [23]. The authors of articles
[24,25] defined these strategies as follows:

1. TOU: It uses different unit prices for different time periods.
2. RTP: It involves changing the energy cost on an hourly basis.
3. Critical peak pricing: Combines both the TOU and RTP strategies.
4. Incentive-based: This strategy involves three different schemes: direct
load control, in which power-consuming devices are directly controlled
(on/off); interruptible service, in which the service provider offers a discounted
rate to the end users; and bidding strategy, in which end users bid for a
consumption pattern.
5. Capacity market: End users have to send energy back to the grid unit.
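
The pricing strategies listed above can be made concrete with a small calculation.
The following Python sketch is illustrative only: the tariff values, the peak
window, the hourly price series, and the function names are assumptions made for
this example and are not taken from [24,25]. It bills the same 24-hour load profile
under a flat tariff, a TOU tariff, and an RTP tariff, and then bills a profile in
which the extra evening load has been moved out of the peak window.

```python
from typing import Sequence

HOURS = 24
PEAK_HOURS = range(17, 21)  # assumed evening peak window (17:00-21:00)


def tou_price(hour: int, peak: float = 0.30, off_peak: float = 0.10) -> float:
    """Time-of-use: the unit price depends only on the time period (assumed values)."""
    return peak if hour in PEAK_HOURS else off_peak


def bill(load_kwh: Sequence[float], prices: Sequence[float]) -> float:
    """Energy cost: sum over hours of consumption times unit price."""
    if len(load_kwh) != len(prices):
        raise ValueError("load and price series must have the same length")
    return sum(l * p for l, p in zip(load_kwh, prices))


# Hypothetical household: 1 kWh every hour plus 2 kWh extra in each peak hour.
load = [1.0] * HOURS
for h in PEAK_HOURS:
    load[h] += 2.0

flat_prices = [0.15] * HOURS
tou_prices = [tou_price(h) for h in range(HOURS)]
# Real-time pricing: a new price every hour (made-up series rising during the day).
rtp_prices = [0.08 + 0.01 * h for h in range(HOURS)]

# Demand response: move the extra 8 kWh from the peak window to hours 0-3.
shifted = list(load)
for h in PEAK_HOURS:
    shifted[h] -= 2.0
for h in range(0, 4):
    shifted[h] += 2.0

flat_cost = bill(load, flat_prices)
tou_cost = bill(load, tou_prices)
tou_cost_shifted = bill(shifted, tou_prices)
rtp_cost = bill(load, rtp_prices)
rtp_cost_shifted = bill(shifted, rtp_prices)

print(f"flat: {flat_cost:.2f}")
print(f"TOU:  {tou_cost:.2f} -> {tou_cost_shifted:.2f} after shifting")
print(f"RTP:  {rtp_cost:.2f} -> {rtp_cost_shifted:.2f} after shifting")
```

Total consumption is identical before and after shifting; only its timing changes,
which is why the flat tariff gives the consumer no reason to shift while the
time-varying tariffs do.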

DRM is an important resource for the SG system. It integrates modern technologies
such as sensors, ICT, smart meters, and appliances. These technologies enable end
users to interact with the power system. They also allow users to reduce energy
consumption, which has a considerable impact on energy-cost reduction [26].
The integration of modern technology and renewable energy resources faces great
challenges in the SG system. Recently, DRM has gained a great deal of attention from
the research community, which has produced a number of research contributions and
technical discussions. During our literature study, we found two main types of
approach used to address DRM in the SG: learning-based techniques and complex systems.

4.3.3 Complex systems

A complex system is a system with many interconnected elements or agents and
comparatively many relations among them. The behavior of each element depends on
the behavior of others. Another term, emergence, is also used to describe complex
systems [27]. A complex adaptive system is a type of complex system that is
concerned with agents’ behavior. These agents are capable of learning and adapting
in response to interaction with other agents. They have nonlinear and hierarchical
properties and behaviors. Complex adaptive systems include natural and artificial
complex systems such as a market environment [28], a cell organism, the internet
with users and servers, and a power system with consumers and grid units.
Recently, a number of studies have examined DRM in the power system environment
from a complex system perspective. In articles [4,16,29,30], a complex system
approach was adopted, and complex models were developed by integrating consumer
feedback, agent behavior, generation units, distribution units, storage, and
consumption devices.

4.3.4 Learning-based approaches

The learning-based approach is one of the major domains of artificial intelligence
(AI), in which agents learn from past experience [31]. Agents can act according to
the situation; hence, they can behave dynamically in a dynamic situation.
Experience is gained from previous state-action pairs for existing events and from
training data. The learning-based approach is used in agent-based modeling for
optimization and predictive analysis [32].
Reinforcement learning (RL) is a type of machine learning and therefore also a
branch of AI. It is concerned with agents’ behaviors and actions in an environment.
These agents learn the states and available actions as well as other agents’ actions
and states. An agent perceives input from the environment, performs a specific
action, and receives a reward corresponding to that action in that state. In
articles [32–34], the RL approach is adopted to train agents to obtain an optimal
policy that balances demand and supply in the power-system environment.
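
The perceive, act, and reward loop described above can be sketched with tabular
Q-learning, one common RL algorithm. This is a minimal illustration and not the
method of [32–34]: the toy environment (an agent that must run a one-hour appliance
job in one of six hours with known prices), the reward definition, and all names
and parameter values are assumptions made for this example.

```python
import random

PRICES = [0.30, 0.25, 0.10, 0.20, 0.35, 0.40]  # assumed hourly unit prices
MISS_PENALTY = 1.0  # assumed cost of never running the job
WAIT, RUN = 0, 1


def step(hour, done, action):
    """Toy environment: returns (next_hour, next_done, reward)."""
    reward = 0.0
    if action == RUN and not done:
        reward = -PRICES[hour]  # paying for energy is a negative reward
        done = True
    next_hour = hour + 1
    if next_hour == len(PRICES) and not done:
        reward -= MISS_PENALTY  # the day ended and the job was never run
    return next_hour, done, reward


def train(episodes=5000, alpha=0.1, gamma=1.0, epsilon=0.2, seed=0):
    """Learn Q[(hour, done)][action] with epsilon-greedy exploration."""
    rng = random.Random(seed)
    q = {(h, d): [0.0, 0.0]
         for h in range(len(PRICES) + 1) for d in (False, True)}
    for _ in range(episodes):
        hour, done = 0, False
        while hour < len(PRICES):
            state = (hour, done)
            if rng.random() < epsilon:
                action = rng.choice((WAIT, RUN))  # explore
            else:
                action = max((WAIT, RUN), key=lambda a: q[state][a])  # exploit
            hour, done, reward = step(hour, done, action)
            # Q-learning update: move toward reward + best value of next state.
            target = reward + gamma * max(q[(hour, done)])
            q[state][action] += alpha * (target - q[state][action])
    return q


def greedy_run_hour(q):
    """Follow the learned greedy policy and report the hour in which it runs the job."""
    hour, done = 0, False
    while hour < len(PRICES):
        action = max((WAIT, RUN), key=lambda a: q[(hour, done)][a])
        if action == RUN:
            return hour
        hour, done, _ = step(hour, done, action)
    return None


q_table = train()
print("learned run hour:", greedy_run_hour(q_table))
```

In this toy setting the agent never sees the price list directly; it only observes
rewards, which is the sense in which the policy is learned from experience.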

4.4 A review of demand–response management in SG

In this section, we present a detailed overview of the surveyed work on DRM in the
SG. We have grouped DRM approaches into three main categories: learning-based
approaches, complex systems, and other techniques. Figure 4.2 shows the
classification of the reviewed work on the SG from a DRM perspective. The
learning-based approach uses learning techniques such as artificial neural networks
(ANN) and RL. The complex system category consists of collaborative, adaptive,
particle swarm optimization (PSO), and game-theory approaches. Other techniques
comprise communication management, home-energy management (HEM), renewable energy
sources, electric vehicles (EVs), energy market, and microgrid.

Figure 4.2 Classification of literature for DRM in smart grid. Learning-based: ANN,
RL. Complex system: collaboration, CAS, demand integration, PSO, game theory. Other
techniques: security management, HEM, EVs, renewable energy resources, energy
market, microgrid.

4.4.1 Learning-based approaches

In the learning-based approach, agents learn and adapt their behavior to a
dynamic environment. The learning-based approaches are categorized into ANN and
RL techniques.

4.4.1.1 Artificial neural network

Hernandez et al. [35] proposed a MAS for the virtual power plant. The virtual power
plant consists of small elements or a single unit. An ANN is applied for efficient
control and management of operations in the virtual power plant. The experimental
results showed a 1.5% error rate. This model works at a small scale, and it needs
sufficient information to predict the future state of the system.

4.4.1.2 Reinforcement learning approach

An intelligent and accurate model is needed for predicting energy consumption
in an SG environment. In [36], Mocanu et al. presented a model for energy prediction
based on RL that does not use any historical data. This model integrates RL with
a deep belief network. It estimates the state space and then finds an optimal policy
using the RL technique. Experimental results showed 91.42% accuracy.
However, this model has not been implemented for the different levels of the SG.
In [10], Lakić et al. presented an agent-based model using the SA-Q learning
technique. It learns how much reserve the system should offer at different times.
It also improves the ratio between economic cost and benefit. The results showed
an improvement in performance and economic outcome for users. However, this
method has not been applied to a multi-agent framework.
In [37], Dusparic et al. proposed a multi-agent scheme based on RL for the
demand–response problem in the SG. This method uses current load information and
predicts the load for the next day using a load forecasting approach. The agents
learn how to fulfill user demands from the available energy. The results showed
that peak usage was reduced by 33% and off-peak usage increased by 50%. However,
in this method, there is no collaboration or communication among agents.
Wen et al. [38] addressed the demand–response problem in SG and proposed a
framework based on the RL technique. In this method, the demand–response problem is
decomposed over each device. This technique performs self-initiated jobs and
handles many flexible requests. The complexity of the proposed algorithm is linear.
The results showed that it achieves better performance over a broad range of
trade-off parameters. However, this technique focuses on the demand–response
problem of a single unit or building.
In [33], Ruelens et al. studied the demand–response problem using a batch RL
technique. The batch RL technique addresses the inefficient use of information in
the standard RL method: it uses a batch of experiences to find an optimal policy.
In this work, two agents are used, a water agent and a residential building agent.
A closed-loop policy is followed for dynamic pricing, to minimize cost, and an
open-loop policy for day-ahead scheduling. The results showed that energy cost
is reduced by 19% and the consumption rate is increased by 4%.
When economic dispatch and demand response are treated as separate and
sequential operations, energy efficiency decreases. Zhang et al. [39] presented an
optimal energy-management strategy to maximize social welfare. This method
operates through the coordination of demand response and economic dispatch. Economic
dispatch is provided by the generators and demand response by the customers. This
method is also used to discover power demand–supply mismatch. The simulation
results showed convergence in 40 iterations.
Another approach to demand response was studied by O’Neill et al. [40], who
proposed a consumer-automated energy system. This technique reduces residential
energy cost and usage. It uses online energy-cost estimation and a user decision
policy. The approach is independent of energy price and system behavior. In this
method, users decide which device will use energy and how much. The results showed
a 40% cost reduction using price-unaware energy scheduling.

4.4.2 Complex system

In this section, we present related work carried out in the SG domain from a complex
system perspective. We group the existing techniques into collaborative, complex
adaptive system, demand-side integration, PSO, and game-theory approaches.

4.4.2.1 Collaborative approach

MASs are widely used for control and management in the SG. In [41], Manickavasagam
proposed and developed an intelligent energy control center (ECC) mechanism
for the SG. This technique consists of two layers: the distributed energy resource
(DER), which serves as a client, and the ECC, which serves as a server. The ECC is
controlled and monitored by a fuzzy logic controller (FLC). Communication and
negotiation between clients and the server take place through the internet protocol.
The simulation results are stored in an Excel database acting as a monitoring agent.
The ECC uses these results for decision-making in the DERs. However, communication
between the results and the FLC is not taken into account.
A mismatch between supply and demand reduces system performance. Parallel
Monte Carlo tree search (P-MCTS) can produce an optimal solution for power
balancing, but it has no coordination support. In [18], Golpayegani et al. extended
the P-MCTS work by introducing collaboration and coordination. Agents
negotiate with each other and present their proposals. This method resolves
problems of agent conflict, load shifting, and charging capacity. The results
showed that charge capacity increased from 33% to 50%. However, this model does not
deal with the prediction of data.
In [42], Le Cadre and Bedo worked on uncertainty in an SG environment and
presented a decentralized hierarchy based on a learning-game approach. It is composed
of supplier, generator, and consumer agents. Agents forecast the demand and
production of the grid in a collaborative manner. The model determines the price
that balances power and demand. The results showed that, in a shared information
network, cooperative learning achieves a faster convergence rate than individual
learning.
In [43], Huang et al. presented a novel model for the demand–response problem
in conjunction with the elastic dispatch process. In the studied system, the elastic
economic dispatch process is used as a feedback controller and the flexible load
cost as a control signal. The control signal balances demand and response. In this
method, max-min and interval mathematics techniques are used for boundary
calculation. This estimates the uncertainty, which is the difference between the
present and the target value. Simulation results showed that the interval
mathematics technique is more efficient than the Monte Carlo approach; its
convergence time is 1% less than that of the Monte Carlo approach. However, this
technique does not handle probability distributions.
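
To illustrate the difference between the two ways of bounding uncertainty mentioned
above, the sketch below propagates uncertain inputs through a simple power-imbalance
expression, once with interval arithmetic and once with Monte Carlo sampling. It is
not the model of [43]: the imbalance expression, the input ranges, and all names are
assumptions chosen for illustration.

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class Interval:
    """Closed interval [lo, hi] with the two operations needed below."""
    lo: float
    hi: float

    def __add__(self, other):
        return Interval(self.lo + other.lo, self.hi + other.hi)

    def __sub__(self, other):
        # Smallest result: our low minus their high; largest: our high minus their low.
        return Interval(self.lo - other.hi, self.hi - other.lo)


def imbalance(generation, wind, load):
    """Hypothetical power imbalance; works for floats and for Interval objects."""
    return generation + wind - load


# Assumed uncertainty ranges in kW.
GEN = Interval(95.0, 105.0)
WIND = Interval(10.0, 30.0)
LOAD = Interval(110.0, 130.0)

# Interval arithmetic: one evaluation gives guaranteed bounds.
bounds = imbalance(GEN, WIND, LOAD)


def monte_carlo_bounds(samples, seed=0):
    """Estimate the bounds by sampling each input uniformly from its range."""
    rng = random.Random(seed)
    values = [
        imbalance(rng.uniform(GEN.lo, GEN.hi),
                  rng.uniform(WIND.lo, WIND.hi),
                  rng.uniform(LOAD.lo, LOAD.hi))
        for _ in range(samples)
    ]
    return min(values), max(values)


mc_lo, mc_hi = monte_carlo_bounds(10_000)
print("interval bounds:   ", bounds)
print("Monte Carlo bounds:", (round(mc_lo, 2), round(mc_hi, 2)))
```

The interval result encloses every sampled value, while the sampled range only
approaches it from the inside as the number of samples grows. The price of the
single-pass interval evaluation is that it carries no information about how likely
each value is, which matches the limitation noted above.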

4.4.2.2 Complex adaptive system

In [44], Kremers et al. presented a bottom-up approach for the SG. It consists of two
layers: a physical layer for electrical power transmission and a logical layer for
communication. This model can integrate new devices into an SG environment.
It provides dynamic load management as well as power and communication control and
monitoring. Experimental results showed a 40% reduction factor in energy
consumption. However, this model is not capable of handling high-load management.
In [4], Thimmapuram and Kim proposed an agent-based model that applies the
electricity market complex adaptive system to the SG domain. This technique handles
elastic user demand and lowers cost. This method reduced peak load by 5%–8%.
However, the cost of energy for some users increased.

4.4.2.3 Demand-side integration

Demand-side integration in SG results in security, quality, efficiency, and reduction
in cost. In [45], Mocci et al. proposed a MAS for integration of demand and EVs. The
load agents calculate power demand and act as master agents. The master agents, together
with cooperative agents, send power load and global data to the demand side. Its
demand–response rate is 85%, and it also reduced the flow of data. However, this technique
is not able to calculate the state of the batteries of different storage units at
different times.
In [22], Nunna et al. proposed a priority banking scheme that is concerned with
users’ demands. This method gives users some share of the available resources;
it monitors users’ demand and updates their priority. This method reduced network
loss by 50% and also reduced dependency on the overall grid. However, this technique
provides fewer shares to users.

4.4.2.4 Particle swarm optimization

Advanced power grids are shifting from a vertical to a horizontal structure, which
requires an efficient management system. In [46], Hurtado et al. proposed a building
energy management system. This technique provides interaction between different
environments. It used the PSO approach for maximizing comfort level and energy efficiency.
In this hierarchical infrastructure, lower level agents abstract information and provide
it to higher level agents. Performance is described by a weight factor; the fair scenario
showed a weight of 0.5, while the biased scenario resulted in a weight of 0.3. However,
this technique generates some unbalanced situations.

4.4.2.5 Game-theory approach

Retailer and market-price management in SG has gained great attention in research
work. In [47], Wei et al. focused on the energy price and dispatch problem in an SG
environment. This study proposed a two-stage two-level model. Customer demand and
price are considered in the first stage using the Stackelberg game approach, while the
operation of storage devices is considered in the second stage as a linear max–min
problem. The model is then translated into a mixed integer linear programming (MILP)
problem. The results showed a 5% improvement in system performance, and it also increased
retailer profit. However, this method is very sensitive, and the knowledge-gathering
process is very difficult.
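The leader–follower structure of a Stackelberg pricing game can be sketched in a few lines of Python. This is only an illustration of the first-stage idea, not the model of [47]: the quadratic customer utility, the unit cost, the grid search over prices, and every name below are assumptions introduced here, and the second-stage storage operation and the MILP reformulation are not covered.

```python
def follower_demand(price, a, b):
    """Best response of a customer with assumed utility a*d - 0.5*b*d**2 - price*d."""
    return max(0.0, (a - price) / b)

def leader_profit(price, customers, unit_cost):
    """Retailer profit when every customer best-responds to the posted price."""
    total_demand = sum(follower_demand(price, a, b) for a, b in customers)
    return (price - unit_cost) * total_demand

def solve_stackelberg(customers, unit_cost, p_max, steps=1000):
    """Leader anticipates the followers' reaction and grid-searches its price."""
    best_price, best_profit = 0.0, float("-inf")
    for i in range(steps + 1):
        price = p_max * i / steps
        profit = leader_profit(price, customers, unit_cost)
        if profit > best_profit:
            best_price, best_profit = price, profit
    return best_price, best_profit

# One hypothetical customer with utility parameters a=10, b=1; retailer unit cost 2.
customers = [(10.0, 1.0)]
price, profit = solve_stackelberg(customers, unit_cost=2.0, p_max=10.0)
print(price, profit)
```

For this single-customer case the profit is (p - 2)(10 - p), which peaks at p = 6, so the grid search should return a price close to 6 and a profit close to 16.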
In [48], Chai et al. addressed the demand–response issue and presented a two-level game
approach. This technique handles multiple utility companies and users. The utility
companies are modeled as a noncooperative game and the communication between users as
an evolutionary game. The results showed that the proposed technique reduced cost payment
from 3,197.7 to 2,425.6, while the energy demand increased from 1,224.9 to 1,478.4.
However, this work does not handle constraints on power consumption.
In [49], Song et al. proposed another framework for optimal nonstationary demand-side
management in an SG environment. In this method, users select their
energy usage pattern according to their priorities and needs. The authors used a repeated
game approach, which provides interaction among foresighted, price-anticipating users.
This method showed a 50% reduction in energy cost and robustness to errors. However, a
higher threshold value results in a trade-off between cost and peak-to-average ratio.
In [50], Nunna and Doolla carried out research work on management of demand
response in multiple microgrid networks. In this work, customers participate in
demand–response strategy. This study proposed a priority index approach through
which customers participate in the market. This method reduced peak demand. It is
found that customers with high priority index get power at low cost.
In [51], O’Brien et al. focused on DRM in SG applications. In this work, demand
response is modeled as a game-theoretic environment, and the Shapley value (SV) is
used for the payment distribution process. An RL technique is used to estimate the SV.
Simulation results showed that for random sampling, 1,000,000 samples take 58.2 s of
execution time, while for sigmoid sampling, 51,129 samples take 6.5 s. The results also
showed that uniform sampling balances demand and response. However, this method is not
suitable for a distribution scheme, and its direct estimation is difficult. The literature
summary of DRM is shown in Table 4.1.
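As a rough illustration of why sampling is needed to estimate the Shapley value, the following Python sketch uses plain random permutation sampling. It is not the RL-based estimator of [51]; the three participants, their load reductions, and the payment function are hypothetical.

```python
import random

def shapley_monte_carlo(players, value, n_samples=2000, seed=0):
    """Estimate Shapley values by averaging marginal contributions
    over randomly sampled join orders."""
    rng = random.Random(seed)
    totals = {p: 0.0 for p in players}
    for _ in range(n_samples):
        order = list(players)
        rng.shuffle(order)
        coalition = set()
        previous = value(coalition)
        for p in order:
            coalition.add(p)
            current = value(coalition)
            totals[p] += current - previous  # marginal contribution of p
            previous = current
    return {p: total / n_samples for p, total in totals.items()}

# Hypothetical DR participants and the load reduction (kW) each can offer.
reductions = {"A": 3.0, "B": 2.0, "C": 1.0}

def payment(coalition):
    """Assumed payment: grows with the square of the joint reduction."""
    return sum(reductions[p] for p in coalition) ** 2

shapley = shapley_monte_carlo(list(reductions), payment)
print(shapley)
```

The exact computation enumerates all join orders, which grows factorially with the number of participants; sampling trades that cost for an estimate whose shares still add up to the payment of the full coalition.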

4.4.3 Other techniques

Apart from learning-based and complex-system approaches, several
other studies have been carried out in the SG domain. We grouped these studies into
security management, HEM, EVs charging, renewable energy sources, energy market, and
microgrid.

4.4.3.1 Security management

In the demand–response (DR) process, communication occurs between consumers and
suppliers in order to balance demand and supply. During the communication process,
sensitive information is exchanged between consumers and suppliers. This
communication process needs to be secured against unauthorized users. To address this
issue, Rahman et al. [52] proposed a private secure bidding protocol for DR using an
incentive-based strategy. The proposed technique uses cryptographic primitives. It
comprises three entities: a supplier acting as registration manager (RM), a DR automation
server acting as bidding manager (BM), and a bidder. The proposed protocol is implemented
in Java by using primitive operations. The empirical results showed the feasibility of the
protocol in terms of computation cost and the number of primitives. However, in the
proposed scheme, the RM and BM are assumed to be secure.
There are some issues with the existing communication frameworks in SG: no ideal
communication architecture is currently available, and no work analyzes and
investigates the bit error rate during the communication process. In this regard,
in [53], Moghaddam et al. proposed a cloud-based DR in an SG environment. In this
work, a communication model is developed for the smart distributed system. The
evaluation criteria are considered as cloud DR and distributed DR. The simulation results
showed that, on cluster creation, the bit error is very large. It has been shown that, by
using the UDP protocol for communication, the broadcast showed no optimal solution,
while the TCP protocol showed a high bandwidth capacity. However, the convergence
rate is increased. With high DR usage, effective communication is achieved at a high
cost. Another disadvantage of this study is that the distributed DR is applied over large
distances; there is no local communication among neighboring channels.

Table 4.1 Literature summary of demand–response management

Ref | Technique | Strength | Limitations
Hernandez et al. [35] | Energy forecasting in VPP, ANN | 1.5% error rate | Works on single unit
Mocanu [36] | Energy prediction for building, RL + belief neural network | 91.42% accuracy | Not implemented on different level
Lakić [10] | Learning how much system reserve to offer at different time, SA-Q learning | Ratio between cost and benefit increased | Not implemented on MAS framework
Dusparic [37] | Energy prediction of next day-ahead for EVs, RL + load forecasting | 33% usage reduced | No communication
Wen [38] | Segmentation of each device, RL | Linear complexity | Single unit
Ruelens [33] | Thermostatic load controlling, batch RL | 19% cost reduction | Single unit
Zhang [39] | Finding mismatch between demand and supply, economic dispatch | Convergence rate 40 iteration | Only local communication
Golpayegani [18] | Conflict management between EVs, CP-MCTS | Battery capacity increased 17% | No prediction
Kremers [44] | Agent-based model of simple smart grid, CAS | 40% consumption reduced, and peak load lies in the range 5%–8% | Not handle high load
Mocci [45] | Controlling of integrated demand and EVs, DSI | Response rate achieved 85%, network loss rate reduced 50% | No calculation of battery state
Hurtado [46] | Controlling of interoperation of smart building, PSO | Performance in the form of weight factor, which is achieved 0.5 | Unbalance situation
Wei [47] | Management of energy price and dispatch problem, MILP | 5% performance improved | Difficult to gather information
Chai [48] | Controlling multiple utility centers and end users, two-level game | Cost reduced from 3,197.0 to 2,425.6 RS | No constraints on power consumption
O’Brien [51] | Payment distribution process, Shapley value distribution | Converge in 58.2 s | Not suitable for energy distribution

Table 4.2 Literature summary of DR security management

Ref | Technique | Strength | Limitations
Rahman [52] | Secure communication process, incentive-based DR | Computation cost is reduced | Some entities are assumed pre-secure
Moghaddam [53] | Investigation of bit error rate during communication, cloud-based DR | TCP protocol showed high bandwidth capacity | No local communication
Tsai [54] | Large-scale communication network, randomized alternative technique | 50% balance state is achieved | Not handle corrupt data
Wada [55] | Privacy management in distributed system, RTP scheme | Balance state of the system is achieved | Not tested on other DR scheme

In [54], Tsai et al. worked on distributed DR for large-scale consumer
loads in conjunction with renewable energy resources. In this work, a
neighbor-communication strategy is applied, which results in low communication cost. They
used a randomized alternating direction method of multipliers (ADMM) for distributed DR.
In this method, there is no need for communication synchronization. With few
messages, the balance state can be achieved by the system. The results showed that a 50%
balance state is achieved. Using an RTP scheme, the method outperformed the
existing distributed DR. However, the authors assumed that all consumers involved in the
communication process are trustworthy. The proposed scheme cannot handle wrong data
transmission.
DR creates an energy-efficient system by shifting energy demand
from peak hours to off-peak hours. In this process, consumers and utility service
providers always communicate with each other. Consumers transmit their energy demand
profile to the grid unit, while from the grid side, the energy-cost information is routed
to the consumers. In this communication process, this information can be accessible to
unauthorized users, so there is a need to make the communication system secure. In
this regard, Wada et al. [55] worked on privacy management and proposed a masking
method to secure the privacy of each individual in a smart distributed energy system.
In this scheme, every agent uses a mask signal along with its state. Then, during
the communication process, each agent exchanges its mask with other agents. To
obtain the correct signal, agents subtract the obtained signal from their own state.
The RTP scheme of DR is applied. The results showed that this method can protect
the information of each agent while keeping the system in a balanced state. The literature
summary of security management in SG is shown in Table 4.2.
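The masking idea described for [55] can be illustrated with a generic pairwise-mask sketch in Python. This is an assumption-laden illustration, not the construction of [55]: each pair of agents is assumed to share one random mask that one agent adds and the other subtracts, so individual reports are hidden while the aggregate is preserved. The agent states and the mask range are hypothetical.

```python
import random

def masked_reports(states, seed=0):
    """Return one masked report per agent; pairwise masks cancel in the sum."""
    rng = random.Random(seed)
    n = len(states)
    masks = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            m = rng.uniform(-100.0, 100.0)
            masks[i][j] = m    # agent i adds the mask shared with agent j
            masks[j][i] = -m   # agent j subtracts the same mask
    return [states[i] + sum(masks[i]) for i in range(n)]

# Hypothetical private demand values (kW) of three agents.
states = [2.5, 4.0, 1.5]
reports = masked_reports(states)

print("masked reports:", reports)
print("true total:    ", sum(states))
print("masked total:  ", sum(reports))
```

A coordinator that only sees the masked reports can still compute the total demand needed for balancing, while no single report reveals the underlying state.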

4.4.3.2 Home-energy management system

Nowadays, DR is gaining importance in HEM systems (HEMSs). However,
a dynamic pricing scheme is not useful without a combination of DR and HEM. In [56],
Ghazvini et al. proposed a new HEM algorithm. The proposed scheme schedules
appliances, EVs, and an electric water heater (EWH) in combination with energy
storage. The EV is considered as a dispatchable energy source. In this work, a renewable
energy resource, namely photovoltaic (PV) generation, is also used. They used a simple
rule-based algorithm under different pricing schemes, which schedules the EV charging
and EWH heating processes. The simulation results showed a 29.5%–31.5% energy-cost
reduction.
In [57], Luo et al. worked on a large-scale ice-thermal storage system and
investigated how to use it for a fast voltage-control strategy in
conjunction with renewable energy resources. The work presents a modified version
of the conventional system for thermal load management. In this work, a
refrigerator is used for the ice-thermal load. This work showed that the proposed
technique can effectively reduce the ratio of power imbalance in smart homes. The proposed
technique is implemented in a computer-simulation tool. The results showed that the total
fluctuation in voltage frequency is reduced. A possible extension of this work is
the use of the proposed scheme on a large-scale distributed power system.
In smart homes, a smart meter monitors the user load and demand
profiles. However, load forecasting for individuals at large scale is a challenging
task due to the stochastic nature of individual demand. In this regard, in [58], Yu et al.
proposed the use of the sparse coding technique to
model individual loads at a large scale in the distributed power system. In this work,
data from 5,000 homes, based on a project in collaboration with the electrical power
board in Chattanooga (2011–13), was used. The objective was to forecast
the next-day and next-week total load. The results showed that the accuracy
of the system improved by 10%. However, the proposed scheme needs to be tested against
other sparse methods, such as change point detection, in a distributed system to obtain
a more accurate system.
In most previous studies, the game theoretic approach has been used for DRM.
However, the computation cost of finding the Nash equilibrium is very large. In [59],
Li et al. proposed a sparse load-shifting based DRM that schedules different smart
home appliances. In this work, bidirectional communication is used, which improves the
search for the Nash equilibrium. The objective function minimizes the peak-to-average
ratio (PAR). The proposed algorithm showed a linear cost for
finding the Nash equilibrium. The results showed a convergence rate of 500 iterations.
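Since PAR is the objective in several of the works surveyed here, a short Python sketch of the metric may help. PAR is computed as the peak load divided by the average load over the scheduling horizon; the two four-slot load profiles below are hypothetical and only show how load shifting lowers the ratio.

```python
def peak_to_average_ratio(load_profile):
    """PAR = peak load / average load over the scheduling horizon."""
    if not load_profile:
        raise ValueError("load profile must not be empty")
    average = sum(load_profile) / len(load_profile)
    if average <= 0:
        raise ValueError("average load must be positive")
    return max(load_profile) / average

# Hypothetical load (kW) in four time slots, before and after load shifting.
before = [1.0, 1.0, 4.0, 2.0]
after = [2.0, 2.0, 2.0, 2.0]   # same total energy, flattened profile

print(peak_to_average_ratio(before))
print(peak_to_average_ratio(after))
```

A perfectly flat profile gives the minimum possible PAR of 1, which is why load-shifting schemes that keep total energy fixed aim to flatten the peak.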
The deployment of DR needs appropriate policy design and new technology. In this
regard, in [60], a MAS is developed for residential DR in a distributed energy network.
In this work, two agents, i.e., a home agent and a retailer agent, are used. The home agent
predicts the load profile of consumers. The RTP scheme of DR is used in this work.
Convex programming is used to model the consumption pattern of consumers.
They used two objective functions, i.e., energy-cost minimization and users’ waiting
time. In this work, two case studies were considered. The simulation results showed
that in Case 1, PAR and cost are reduced by 2.32 and $62.7, respectively. In Case
2, PAR and cost are reduced by 1.54 and $51.82, respectively.
In [61], Huang et al. introduced the use of the smart-gateway network in
the SG. In this work, a single home with multiple rooms is considered along with a
single power grid and one PV. They used the multi-agent framework. First, the
energy-demand pattern of each room is extracted under some uncertainty assumptions. Each
room is considered an agent. In this work, a dataset of a single building is used. A
minority-game-based DR is used for peak demand reduction. The simulation results
showed that peak load is reduced by 38.5% in summer and 5.8% in winter. The literature
summary of HEM in SG is shown in Table 4.3.

Table 4.3 Literature summary of HEM

Ref | Technique | Strength | Limitations
Ghazvini [56] | HEM with EWH scheduling, rule-based algorithm | Cost reduced 29.5% | Single home
Luo [57] | Voltage management of large-scale, ice-thermal energy scheduling | Reduced voltage fluctuation | Not implemented for distributed system
Yu [58] | Load forecasting of appliances on large scale, sparse coding technique | Improved 10% accuracy | Not used other sparse methods like change point detection and clustering making for distributed system
Li [59] | Scheduling load of different smart homes, sparse load shifting technique | Linear computation cost, convergence rate 500 iterations | Energy cost was ignored
Wang [60] | Designing appropriate policy for DR, RTP scheme | PAR reduced 2.32, cost reduced $62.7 | Single home is considered
Huang [61] | Appliances scheduling of multiple rooms, minority-game based DR | Peak load reduced 38.5% | Single home is considered

4.4.3.3 Electric vehicles charging

In SG, EVs are used to store energy for later use. In this regard,
in [62], Yao et al. proposed a real-time charging scheme for EVs in
conjunction with DR. The scenario of a parking station is considered. The charging and
discharging of EVs is modeled as a binary optimization problem. The binary (on, off)
formulation speeds up the charging and discharging process, but exhaustive search
is expensive in terms of computational cost. Therefore, a convex relaxation technique
is applied that searches and schedules the charging and discharging periods. The
simulation demonstrated satisfactory results for charging EVs. With the maximum
number of EVs, minimization of total energy cost is achieved. The computation time
is noted as 0.19 s.
Regarding EVs charging, in [63], Le Floch et al. worked on two types of EV load
management. The first is a fixed power load that can be changed, while the second is a
flexible load that always remains fixed. They used a hierarchical control scheme with the
objective function of minimizing cost. This technique comprises two steps. First,
it computes the voltage capacity of the system. Then, it customizes the load profile of the
consumers by using a price-based DR. The proposed scheme showed feasibility for
some specific cases; for example, if the voltage remains within fixed limits, the flexible
load is achieved. The work is implemented on an IEEE 55-bus radial distribution network.
The results showed a 40% reduction in PAR.

Table 4.4 Literature summary of EVs

Ref | Technique | Strength | Limitations
Yao [62] | EVs charging, binary optimization | Computation time 0.19 s is achieved | Energy cost is ignored
Le Floch [63] | Two type of EV load management, price-based DR | PAR reduced 40% | Only feasible under limited threshold voltage
Jannati [64] | Optimal management of EV with parking lots, time-of-use DR | Operational cost reduced 4.30% | Not tested on other DR strategies

The main benefit of renewable energy resources is to reduce the air pollution
produced by fuel consumption in the power grid. In this context, the number of EVs, as well
as their parking lots, is also increasing to reduce the burden on the power grid. However,
these EVs need to be operated optimally in conjunction with parking lots.
In this regard, in [64], Jannati and Nazarpour proposed an optimal management
system for EVs and their parking lots. They integrated the model with wind, PV,
and local generators. They also used a hydrogen and fuel cell storage system. The TOU
DR strategy is applied to schedule the charging and discharging process of EVs. The
objective function minimizes the operational cost along with the cost of charging and
discharging EVs. Mixed integer linear programming is then used in four case
studies. In Case 1, hydrogen storage and DR were not applied. In Case 2, hydrogen
storage was integrated with the model. In Case 3, the DR strategy was integrated, and in
Case 4, both hydrogen storage and the DR strategy were used. The simulation results of
Case 2, Case 3, and Case 4 were compared with Case 1. The obtained results showed
that Case 2, Case 3, and Case 4 reduced operation cost by 1.79%, 4.07%, and 4.30%,
respectively. The literature summary of electric vehicles is shown in Table 4.4.

4.4.3.4 Renewable energy sources

Nowadays, DERs such as PV play an important role in the SG. Therefore, there is a
need to promote the use of PV in urban as well as rural areas. In [65], Wang et al.
worked on PV promotion by using a game theoretic approach. The proposed
scheme was analyzed at different levels of DR with RTP along with different
numbers of PV units and batteries. In this work, 5 levels of consumers and 32 PV units
were considered. The optimization problem was modeled as a mixed complementarity problem.
The simulation results showed that consumers with a high response get larger PV; they
need fewer batteries and meet energy demand in real time at lower cost.
With the advent of smart homes, the electricity sector encourages users to use
renewable energy, which benefits both users and the grid as the total energy cost can be
reduced. A buy-back strategy encourages users to generate more power from renewable energy
resources, which reduces the load on the main power grid. In [66], Chiu et al. worked
on a buy-back scheme with a dynamic pricing technique. Dynamic pricing is modeled
as a convex optimization dual problem. In this work, a day-ahead time-dependent
pricing scheme is used. It also integrates wind, PV, and battery storage in the system.
The objective function maximizes user and company benefit. The
simulation results showed that a PAR of 1.28 was achieved and peak load was reduced
from 881.11 to 754.18 kW h.
Nowadays, the use of wind energy resources is increasing. However, due to
their stochastic nature, the mismatch between energy demand and power generation is
also increasing. This led to the introduction of micro-combined heat and power (CHP), a
hybrid energy system. However, there is a need to analyze the impact of DR in conjunction
with CHP at large scale with wind-energy resources. In [67], Jiang et al. addressed
this issue and proposed an operation model representing the residential hybrid energy
system. The proposed scheme uses price response, micro-CHP, smart appliances, and
a load aggregator. The load aggregator is used to centralize the loads of different
consumers. The scheme is implemented on the IEEE 118-bus system. The simulation results
showed that wind power curtailment is reduced by 78% on 6 buses. It also reduced energy
cost by 10.7% and operation cost by 11.7% on 118 buses.
HEMSs use DR to schedule home appliances.
However, currently, there is no accurate method that predicts the load consumption of
appliances within a residential building. In [68], Hu and Xiao worked on load
prediction within the residential sector. In this work, the air conditioner appliance is
used to train the thermal model. Historical data of indoor and outdoor temperature
was used. Optimization algorithms such as the trust region algorithm, genetic algorithm
(GA), and PSO were used to schedule the load of the air conditioner. They also used
two strategies for the temperature, namely set point and precooling. The simulation
results showed a power reduction of 26%. However, they used a single-speed compressor.
The proposed methodology should be tested on an inverter-driven air conditioner.
In [69], Amrollahi and Bathaee worked on modeling a stand-alone microgrid
that is far from the main power grid. This work investigates DR in the component-size
optimization of the microgrid. They considered only wind and solar energy systems.
In this work, component size optimization and cost reduction are done by
time-shift and load scheduling. The simulation results showed that the number of batteries,
inverters, PV capacity, and energy cost were reduced by 35.6%, 35%, 1.8%, and 17.1%,
respectively.
In SG, thermostatic loads such as heaters and air conditioners also help in reducing
energy cost. In [82], Behboodi et al. worked on thermostatic load control while
controlling the real-time energy market by using the transactive control paradigm. In this
work, an ABM is developed that models DR for thermostatic loads. The proposed
scheme can control thermostatic load under heating and cooling conditions. The
simulation results showed a 10% energy cost reduction. However, this work ignored other
appliances and focused only on the heater and air conditioner. The proposed work can
be extended to integrate other appliances as well as renewable energy resources.

Table 4.5 Literature summary of renewable energy sources

Ref | Technique | Strength | Limitations
Wang [65] | Promotion the use of PV, game-theoretic approach, RTP scheme of DR | Energy cost is reduced, fewer batteries, consumers with high response get larger PV | Not implemented on other DR strategies
Chiu [66] | Modeling renewable energy as buy-back scheme, dynamic pricing technique of DR | PAR achieved 1.28 | Maintain centralize communication infrastructure, not useful in the case of blackout
Jiang [67] | Modeling large-scale CHP with DR. Price response DR | Reduced energy cost 10.7% | PAR is ignored
Hu [68] | Load profile prediction, trust region algorithm, GA, PSO | Power reduction 26% | Only modeled single speed compressor
Amrollahi [69] | Modeled stand-alone microgrid, time-shift DR scheme | Energy cost reduced 35.6% | Only wind and solar energy is considered
Behboodi [82] | Optimization of thermostatic load, transactive control scheme, agent-based model | Energy cost reduced 10% | Only focused on heater and air conditioner
Shakeri [70] | Control scheme for thermal energy storage, novel optimization algorithm | Energy cost reduced 20% | Demand and PV capacity prediction was ignored

Regarding HEMS, in [70], Shakeri et al. proposed a new control strategy for a
thermal and storage-management system. The proposed algorithm
receives price information in advance and purchases energy at off-peak hours.
This work also integrated batteries and PV in a residential home. A total of 26 appliances
are used. Results showed a 20% energy cost reduction. However, the proposed scheme
was not tested for demand forecasting and prediction of PV capacity. The literature
summary of renewable energy sources is shown in Table 4.5.

4.4.3.5 Energy market

Samimi et al. [71] proposed a stochastic framework for coupling the active and
reactive markets in SG applications. Active and reactive power is provided by distributed
energy resources (DERs). A distribution company buys active and reactive power. The
wholesaler sells this power via the market environment. A demand buyback program
(DBP) is used in which aggregators participate in the market. The scheduling process
is modeled as an optimization problem using mixed integer linear programming. In this
work, the evaluation criteria are the cost of energy, reactive power from DER, CO2
emission, cost of DBP, and minimization of total cost. The simulation results showed
the effectiveness of the DBP scheme, which also reduced energy cost. However,
aggregators only offer load reduction at a price they want to be reduced.
A dynamic pricing strategy is used to implement DRM for optimization of the energy
consumption pattern. It also helps in reduction of peak load. In [72], Srinivasan et al.
proposed a game theoretic approach for a dynamic pricing scheme using the special case
of the Singapore energy market. They focused on the residential and industrial sectors.
In this work, five different loads along with a price dataset were used. The pricing
strategies half-hourly, RTP, TOU, and day–night pricing were used. The simulation
results showed that the RTP scheme achieved a high reduction of peak load. The
peak-load reduction for the residential and commercial sectors was noted as 10% and 5%,
respectively. However, in this work, the dataset used was not sufficient.
In some cases, power may be transmitted from the generation unit to
consumers at large scale. Such scenarios require a transmission system with dynamic
behavior, so that whenever a new consumer enters the system, the system
remains compatible with this new addition. However, this area has received little
research attention. A DR strategy can be applied to control the whole system, from small-
to large-scale power systems. Regarding this issue, in [73], Hajebrahimi et al. proposed
a nonlinear economic model of consumer load. In this method, the price elasticity of
demand and customer benefit are used. A multi-objective function consisting of transmission
planning and a DR program is used. The power model is integrated with wind energy resources.
The proposed scheme is implemented on the IEEE reliability test system and the Iran 400-kV
network. The simulation results showed that the total energy cost was successfully reduced
with the proposed scheme. A future direction of this work is the testing of other
DR programs.
Although a number of studies on DR in SG have been carried out in the previous
literature, some challenges remain, such as the selection of optimal buses
in the power system using a DR strategy. Regarding this issue, in [74], Dehnavi and Abdi
worked on the selection and searching of optimal locations for the distributed power
system. In this work, the IEEE 39-bus dataset is used. The power
transfer distribution factor (PTDF) optimization technique is applied. This technique also
searches for available power transmission capacity and optimal flow on the network. The
simulation results showed that congestion and the number of clusters are reduced. It also
prevents black-out events. The computation time of the proposed technique is noted as 6.7 s.
Previous studies showed how to effectively model the stochastic process of
appliance loads. However, no prior work schedules consumer demand online, as
consumers may not be aware of the energy cost ahead of time. In this regard, in [75],
Bahrami et al. worked on the long-term scheduling problem of appliances along with the
varying behavior of energy. The energy price is modeled as a Markov decision
problem. This enables the model to observe the behavior of each interactive entity. The
Markov perfect equilibrium technique is applied for optimal load scheduling. They
also developed an online load scheduling learning (LSL) technique to find the users’
equilibrium policy. The LSL showed cost and PAR reductions of 28% and 13%,
respectively. However, this work is only applied to a single home. This work can be extended
to multiple homes along with multiple electricity markets. The literature summary of
energy market is shown in Table 4.6.

Table 4.6 Literature summary of energy market

Ref | Technique | Strength | Limitations
Samimi [71] | Stochastic model for active and reactive energy market, MILP, demand buy-back DR | Reduced energy cost | Load reduction at specific time
Srinivasan [72] | Optimization for Singapore energy market, game-theoretic approach, dynamic pricing scheme | Peak load reduced 10% | Small dataset
Hajebrahimi et al. 2017 [73] | Energy transmission at large scale, nonlinear economic model, price elasticity DR | Reduced energy cost | Not implemented on other DR schemes
Dehnavi [74] | Selection of optimal buses, PTDF | Computation time is 6.7 s |
Bahrami [75] | Online long-term appliances scheduling, online LSL | PAR reduced 28%, cost reduced 13% | Single home was considered

4.4.3.6 Microgrid

Over the past few decades, networked microgrids have been shown to play an important role
in making an energy-efficient and reliable system. However, due to the unpredictable
nature of renewable energy resources, they impose new challenges on the smart
distributed energy system. To address this issue, some papers use stochastic techniques.
In [76], Nikmehr et al. proposed another scheme for networked microgrids to
schedule consumer loads. In this work, the intermittent nature of load and generation units
is considered. They used time-of-use and real-time pricing of DRM. The PSO optimization
technique is used for scheduling consumer load under uncertainty. The
simulation results showed that the execution time of PSO is 241 s, while another stochastic
technique took 2,763 s. The operational cost is reduced by 17.3% and 30.6% with TOU
and RTP, respectively.
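PSO appears in several of the schemes surveyed in this section, so a minimal Python sketch of the basic algorithm applied to a toy load-scheduling problem is given here. This is not the formulation of [76]: the base load, the amount of flexible energy, the penalty weight, and the PSO parameters are all hypothetical choices made for illustration.

```python
import random

def pso_minimize(objective, dim, lower, upper, n_particles=20, n_iters=100,
                 inertia=0.7, c1=1.5, c2=1.5, seed=0):
    """Basic global-best PSO with box constraints enforced by clipping."""
    rng = random.Random(seed)
    pos = [[rng.uniform(lower, upper) for _ in range(dim)] for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    best_pos = [p[:] for p in pos]               # personal best positions
    best_val = [objective(p) for p in pos]       # personal best values
    g = min(range(n_particles), key=lambda i: best_val[i])
    gbest_pos, gbest_val = best_pos[g][:], best_val[g]
    for _ in range(n_iters):
        for i in range(n_particles):
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                vel[i][d] = (inertia * vel[i][d]
                             + c1 * r1 * (best_pos[i][d] - pos[i][d])
                             + c2 * r2 * (gbest_pos[d] - pos[i][d]))
                pos[i][d] = min(upper, max(lower, pos[i][d] + vel[i][d]))
            val = objective(pos[i])
            if val < best_val[i]:
                best_pos[i], best_val[i] = pos[i][:], val
                if val < gbest_val:
                    gbest_pos, gbest_val = pos[i][:], val
    return gbest_pos, gbest_val

# Hypothetical problem: place 2 kWh of flexible energy over four slots
# so that the resulting peak load is as low as possible.
BASE_LOAD = [3.0, 1.0, 2.0, 4.0]
FLEXIBLE_ENERGY = 2.0

def schedule_cost(x):
    """Peak load plus a penalty for not delivering the required energy."""
    peak = max(b + xi for b, xi in zip(BASE_LOAD, x))
    mismatch = abs(sum(x) - FLEXIBLE_ENERGY)
    return peak + 10.0 * mismatch

schedule, cost = pso_minimize(schedule_cost, dim=4, lower=0.0, upper=2.0)
print(schedule, cost)
```

The energy requirement is handled with a penalty term rather than a hard constraint, which keeps the sketch short; a practical scheduler would likely use a repair step or a constrained variant.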
In [77], a peer-to-peer network of prosumers with their own generation units, i.e., PV,
is considered. A price-based DR strategy is used. The energy-sharing problem is
modeled with a dynamic internal pricing scheme based on the supply and demand
ratio. The flexibility of consumers' consumption is considered in this work. The
objective function combines economic cost and the user's willingness. The performance
of the system is evaluated in terms of prosumers' cost and energy sharing. The
simulation results showed that total power loss is reduced from 3,321 to 3,187 kW h.
Convergence is reached in 60 iterations. However, this work was not tested on
a distributed network.
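To make the idea of an internal price tied to the supply and demand ratio more concrete, the sketch below computes the ratio for one time slot and maps it to a price between the grid selling and buying prices. The linear rule, the function names, and the price arguments are hypothetical and chosen only for illustration; they are not the pricing formula of [77].

```python
def supply_demand_ratio(total_generation_kw, total_demand_kw):
    """Ratio of local PV generation to local demand for one time slot."""
    if total_demand_kw <= 0:
        raise ValueError("total demand must be positive")
    return total_generation_kw / total_demand_kw


def internal_price(sdr, grid_buy_price, grid_sell_price):
    """Hypothetical internal price: linear in the ratio, bounded by the grid prices."""
    # Clamp the ratio to [0, 1]: surplus beyond full coverage does not lower the price further.
    s = min(1.0, max(0.0, sdr))
    return grid_buy_price - s * (grid_buy_price - grid_sell_price)
```

With no local generation the internal price equals the (assumed) price of buying from the grid, and with enough generation to cover demand it falls to the (assumed) price of selling to the grid, which is the qualitative behavior a supply-and-demand-based scheme aims for.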
In [78], the authors worked on the smart microgrid and proposed a stochastic optimization
model whose objective function minimizes operational cost and CO2 emission in the
presence of renewable energy resources. In this work, probability density functions
are used to predict wind speed and solar irradiance. Three types of consumers
were considered, i.e., residential, industrial, and commercial. The incentive-based
DR strategy is applied in three different case studies, i.e., (1) operational cost
and emission; (2) operational cost, emission, and DR; (3) multi-objective function,
operational cost, and emission. The simulation results showed that by using DR, the
operational cost is reduced by 21% and emission by 14%. The literature summary of
microgrid is shown in Table 4.7.

Table 4.7 Literature summary of microgrid

Ref | Technique | Strength | Limitations
Nikmehr [76] | Load scheduling of network-based microgrid, PSO, real-time and TOU DR | Cost reduced 17.3% | PAR was ignored
Liu [77] | Peer-to-peer network based microgrid, dynamic pricing scheme | Power loss reduced 4% | Not tested on distributed network
Aghajani [78] | Stochastic optimization problem for smart microgrid, probability density function, incentive-based DR | Operational cost reduced 21% | PAR was ignored
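
The scenario-generation idea behind the stochastic model of [78] can be sketched as follows. The text states only that probability density functions are used for wind speed and solar irradiance; the Weibull and Beta families used below, the function name, and every parameter value are assumptions made for illustration and are not taken from [78].

```python
import random


def sample_scenarios(n, seed=0, wind_scale=7.0, wind_shape=2.0,
                     beta_a=2.0, beta_b=2.0, max_irradiance=1000.0):
    """Draw n (wind speed in m/s, irradiance in W/m^2) pairs from assumed PDFs."""
    rng = random.Random(seed)
    scenarios = []
    for _ in range(n):
        # Wind speed from a Weibull distribution (scale, shape are assumed values).
        wind_speed = rng.weibullvariate(wind_scale, wind_shape)
        # Irradiance as a Beta-distributed fraction of an assumed maximum.
        irradiance = max_irradiance * rng.betavariate(beta_a, beta_b)
        scenarios.append((wind_speed, irradiance))
    return scenarios
```

Each sampled pair can then be fed to a cost and emission model; averaging the results over many scenarios is one simple way to approximate an expected operational cost.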

4.5 Open-research problems and discussion

In this section, we discuss the methodologies and techniques applied to DRM in
SG applications, together with their open-research problems. The SG brings many
benefits to users, e.g., energy efficiency, customer satisfaction, reduction in energy
cost, and load balancing, to mention a few. However, a number of challenges
remain to be researched.
How should demand response be handled in an SG environment? To address this question,
different approaches have been applied to SG applications. We surveyed three types
of literature (learning-based approach, complex system, and other techniques) that
are applied to handle demand response in SG applications.

4.5.1 Open-research problems in learning system

The learning-based approach involves ANN and RL. ANN is applied to the virtual power
plant by Hernández et al. in article [35]. This showed a 1.5% error rate. However, it
needs enough information to predict future demand. RL combined with a deep belief
network is applied by Mocanu et al. in article [36] for energy prediction
and showed a 91.42% accuracy rate. However, this model showed poor performance
in different scenarios. Lakić et al. [10] applied SA-Q to learn system reservation to
offer power at different times. This increased the cost-benefit rate. In article [37],
demand response was addressed by Dusparic et al. to predict load for the next day. This
reduced peak usage by 33%; however, there is no collaboration among agents. Wen
et al. [38] worked on DRM for a single unit or building. The technique proposed
by Zhang et al. [39] maximizes social welfare and converges in 40
iterations.

4.5.2 Open-research problems in complex system

The complex system literature involves collaborative approaches, CAS, demand-side
integration, PSO, and game theory. A collaborative approach was adopted in several
works. Manickavasagam, in article [41], proposed an ECC mechanism that
consists of a layer-based model. The FLC is used for ECC controlling. However,
there is no defined link or relation between monitoring and fuzzy logic. Golpayegani
et al. in article [18] extend the previous work on P-MCTS by integrating
negotiation and collaboration among agents. This study resolved the
agent conflict issue. However, this model is not able to predict available energy. In
article [42], the authors addressed the energy prediction issue discussed in the
previous literature.
The CAS approach is used in articles [4,44]. Kremers et al. in article [44] presented
a layer-based model for energy transmission and communication. This model offers
dynamic load management. The simulation results showed 40% energy consumption.
Another approach from the CAS perspective was studied by Thimmapuram and Kim in
article [4], which deals with the elastic demand of consumers. This work reduced peak
load. However, energy cost increased for some end users. Mocci et al. in article [45]
worked on integrating EVs into the SG. This technique achieved an 85% response rate.
However, this model was unable to predict the state of the battery. In article [79],
the authors presented a priority banking scheme that offers energy shares to users.
In this work, network loss is reduced by 50%. However, it offers only a small share to
users. Hurtado et al. in article [46] proposed a bidding energy management system
technique that maximizes the user comfort level. However, this creates an imbalance
between energy cost and consumption. Wei et al. in article [47] proposed the
two-stage two-level model. The simulation results showed that performance
improved by up to 5%. However, this model is not robust to error. Song et al. in
article [49] proposed a non-stationary demand scheme and achieved a 50% energy cost
reduction. However, there exists a trade-off between cost and PAR. Nunna and Doolla [50]
deal with multiple microgrids, allowing users to participate in the market through a
priority index number. Simulation results showed that users with higher priority get
energy at lower cost.

4.5.3 Open-research problems in other techniques

The other technique category comprises security management, HEM, renewable
energy resources, EVs, energy market, and microgrid. A number of studies have been
carried out to address different problems such as minimization
of energy cost, user discomfort, peak load, etc. Next, we discuss the work that falls
under each part of the other technique category.
Regarding security management, different approaches have been presented in the
domain of SG to protect users' confidential data from unauthorized users. However,
some issues still exist in current security management approaches:
some parts of the system are ignored, corrupt data is sometimes not addressed, etc.
Security in the SG is a critical issue that needs to be addressed to obtain a sustainable
and secure smart power system. Because the volume of data and information transmitted
within the SG is very large, the communication infrastructure of the SG needs new and
advanced enhancements.
In the context of HEM, different smart appliances consume energy. However, the
demand patterns and available power do not remain the same at all times; their values
change with time. This fluctuation tends to create an imbalance
between energy demand and available energy. To some extent, this issue is addressed
by a number of research works. The study presented in [57] worked on the ice-thermal
storage system. The main purpose of the work was to control voltage in
conjunction with renewable energy resources. A possible extension of this work
is to apply the proposed scheme to a large-scale distributed power system. In the
real world, the power system is a complex system, so the scheme needs to be tested at a
large scale to observe the behavior of each component in the system.
In [80,81], the authors presented work on appliance scheduling. These
studies show how scheduling the power demand of different appliances with
different heuristic approaches can be effective. However, these studies
focused only on a single home. A possible extension is to
test the proposed techniques on multiple homes along with different DR strategies to
investigate the load as well as the energy cost pattern.
Nowadays, the concept of EVs has been introduced in the SG domain. EVs
have the capability to store and transmit energy. They store energy from the grid unit
whenever the energy cost is low and then sell it at low cost when
the load on the main grid is high. This reduces the load burden on the grid as well as
high energy cost. In this context, a number of studies such as papers [62–64] have
proposed different models that show how to use EVs effectively in SG scenarios for
load and energy-cost reduction. However, there are still open-research issues:
some schemes ignore energy cost, some are feasible only under limited voltage, and
some are not tested on different DR strategies.
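The charge-when-cheap, discharge-when-loaded behavior described above can be written as a simple rule. This is a minimal sketch under stated assumptions, not a model from [62–64]: the function name, the price and load thresholds, and the state-of-charge limits are hypothetical.

```python
def ev_action(price, grid_load, state_of_charge,
              low_price, high_load, min_soc=0.2, max_soc=0.9):
    """Return 'charge', 'discharge', or 'idle' for one EV in one time slot."""
    # Support the grid first: discharge when the grid is heavily loaded
    # and the battery is above its (assumed) minimum state of charge.
    if grid_load >= high_load and state_of_charge > min_soc:
        return "discharge"
    # Otherwise store energy when it is cheap and the battery is not full.
    if price <= low_price and state_of_charge < max_soc:
        return "charge"
    return "idle"
```

A rule of this kind ignores voltage limits and DR program details, which are exactly the open issues listed above.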
Renewable energy resources offer an alternative energy source in the form of wind
and PV energy. Users can fulfill their energy demands from these resources. However,
their energy production is unpredictable because it depends on weather conditions.
Over the past few decades, different heuristic optimization techniques have been used
to handle the unpredictable nature of these renewable energy resources. The study
presented in [65] worked on PV promotion. However, it focused only on the RTP scheme and
ignored other DR strategies; the effectiveness of this work on other DR strategies
still needs to be studied. The study in paper [66] proposed a buy-back scheme for
renewable energy resources. It used a centralized communication
infrastructure, which is not useful in the case of a blackout: a fault in one part can
disturb the whole system. Other studies such as
papers [67–69,82] also worked on renewable energy resources and demonstrated
energy cost reduction. However, they focused only on cost reduction; other parameters
such as user discomfort and PAR reduction are ignored.
The energy market is responsible for buying energy from power sources and then
selling it to consumers. Different studies in the literature have also addressed this
area of research. They demonstrated peak load and energy-cost reduction. However, the
current work was presented on a small scale. Studies such as papers [76–78] worked on
the microgrid. By using optimization techniques, energy cost is successfully reduced.
However, they ignored the PAR as well as the users' comfort level.
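Since PAR is named repeatedly as a neglected metric, a short sketch of how it is computed may help. PAR is the peak load divided by the average load over the same horizon; the function name and the example profile are illustrative only.

```python
def peak_to_average_ratio(load_profile):
    """Peak-to-average ratio (PAR) of a load profile given as per-slot demand values."""
    if not load_profile:
        raise ValueError("load profile must not be empty")
    average = sum(load_profile) / len(load_profile)
    if average <= 0:
        raise ValueError("average load must be positive")
    return max(load_profile) / average
```

A perfectly flat profile has a PAR of 1, so a scheme that cuts cost by concentrating load in a few cheap slots can lower the bill while raising PAR, which is the trade-off noted for several of the works above.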

4.6 Conclusions
DRM plays an important role in the SG environment. It offers a broad range of
advantages for system operation by reducing energy cost and improving load
balancing. In this chapter, we covered the different approaches applied to DRM in the
SG and proposed a classification of DRM models according to the techniques used for
their implementation. The current SG literature on DRM is categorized into
three main research directions: the learning-based approach,
complex systems, and other techniques. Finally, we described each
technique and its model in detail and highlighted the open-research problems that
exist in each solution.

References
[1] Gungor VC, Sahin D, Kocak T, et al. Smart grid technologies: Communication
technologies and standards. IEEE Transactions on Industrial Informatics.
2011;7(4):529–539.
[2] Bollinger LA, van Blijswijk MJ, Dijkema GP, et al. An energy systems modelling
tool for the social simulation community. Journal of Artificial Societies
and Social Simulation. 2016;19(1):1.
[3] Siano P. Demand response and smart grids: A survey. Renewable and
Sustainable Energy Reviews. 2014;30:461–478.
[4] Thimmapuram PR, Kim J. Consumers’ price elasticity of demand modeling
with economic effects on electricity markets using an agent-based model. IEEE
Transactions on Smart Grid. 2013;4(1):390–397.
[5] Kamyab F, Amini M, Sheykhha S, et al. Demand response program in smart
grid using supply function bidding mechanism. IEEE Transactions on Smart
Grid. 2016;7(3):1277–1284.
[6] Rahman M, Mahmud M, Pota H, et al. A multi-agent approach for enhancing
transient stability of smart grids. International Journal of Electrical Power &
Energy Systems. 2015;67:488–500.
[7] Giraldo J, Mojica-Nava E, Quijano N. Synchronization of isolated microgrids
with a communication infrastructure using energy storage systems.
International Journal of Electrical Power & Energy Systems. 2014;63:71–82.
[8] Lawrence TM, Boudreau MC, Helsen L, et al. Ten questions concerning
integrating smart buildings into the smart grid. Building and Environment.
2016;108:273–283.
[9] Haider HT, See OH, Elmenreich W. Residential demand response scheme
based on adaptive consumption level pricing. Energy. 2016;113:301–308.
[10] Lakić E, Artač G, Gubina AF. Agent-based modeling of the demand-side
system reserve provision. Electric Power Systems Research. 2015;124:85–91.
[11] Babalola A, Belkacemi R, Zarrabian S. Real-time cascading failures prevention
for multiple contingencies in smart grids through a multi-agent system. IEEE
Transactions on Smart Grid. 2016;9(1):373–385.
[12] Chen C, Wang J, Qiu F, et al. Resilient distribution system by microgrids
formation after natural disasters. IEEE Transactions on Smart Grid.
2016;7(2):958–966.
[13] Eriksson M, Armendariz M, Vasilenko OO, et al. Multiagent-based distribution
automation solution for self-healing grids. IEEE Transactions on Industrial
Electronics. 2015;62(4):2620–2628.
[14] Ghorbani MJ, Choudhry MA, Feliachi A. A multiagent design for power
distribution systems automation. IEEE Transactions on Smart Grid. 2016;7(1):
329–339.
[15] Kahrobaee S, Rajabzadeh RA, Soh LK, et al. A multiagent modeling and
investigation of smart homes with power generation, storage, and trading features.
IEEE Transactions on Smart Grid. 2013;4(2):659–668.
[16] de Durana JMG, Barambones O, Kremers E, et al. Agent based modeling of
energy networks. Energy Conversion and Management. 2014;82:308–319.
[17] Li Q, Chen F, Chen M, et al. Agent-based decentralized control method
for islanded microgrids. IEEE Transactions on Smart Grid. 2016;7(2):
637–649.
[18] Golpayegani F, Dusparic I, Taylor A, et al. Multi-agent collaboration for
conflict management in residential demand response. Computer Communications.
2016;96:63–72.
[19] Teleke S, Baran ME, Bhattacharya S, et al. Rule-based control of battery energy
storage for dispatching intermittent renewable sources. IEEE Transactions on
Sustainable Energy. 2010;1(3):117–124.
[20] Nardelli PHJ, Rubido N, Wang C, et al. Models for the modern power grid.
The European Physical Journal Special Topics. 2014;223(12):2423–2437.
[21] Greer C, Wollman DA, Prochaska DE, et al. NIST framework and roadmap for
smart grid interoperability standards, release 3.0. Special Publication (NIST
SP)-1108r3. 2014.
[22] Nunna HK, Saklani AM, Sesetti A, et al. Multi-agent based demand response
management system for combined operation of smart microgrids. Sustainable
Energy, Grids and Networks. 2016;6:25–34.
[23] Yu M, Hong SH. Supply–demand balancing for power management in smart
grid: A Stackelberg game approach. Applied Energy. 2016;164:702–710.
[24] Valogianni K, Ketter W. Effective demand response for smart grids: Evidence
from a real-world pilot. Decision Support Systems. 2016;91:48–66.
[25] Fera M, Macchiaroli R, Iannone R, et al. Economic evaluation model for the
energy demand response. Energy. 2016;112:457–468.
[26] Labeodan T, Aduda K, Boxem G, et al. On the application of multi-agent
systems in buildings for improved building operations, performance and smart
grid interaction—A survey. Renewable and Sustainable Energy Reviews.
2015;50:1405–1414.
[27] Niazi MA. Towards a novel unified framework for developing formal, network
and validated agent-based simulation models of complex adaptive systems.
PhD Dissertation, University of Stirling, Scotland, UK; 2011.
[28] Niazi MA. Complex adaptive systems modeling: A multidisciplinary roadmap.
Complex Adaptive Systems Modeling. 2013;1(1):1.
[29] Santos G, Pinto T, Praça I, et al. MASCEM: Optimizing the performance of a
multi-agent system. Energy. 2016;111:513–524.
[30] Haghnevis M, Askin RG, Armbruster D. An agent-based modeling optimization
approach for understanding behavior of engineered complex adaptive
systems. Socio-Economic Planning Sciences. 2016;56:67–87.
[31] Weiss G. Multiagent systems: a modern approach to distributed artificial
intelligence. Cambridge, MA: MIT Press; 1999.
[32] Rayati M, Sheikhi A, Ranjbar AM. Applying reinforcement learning method
to optimize an energy hub operation in the smart grid. In: Innovative Smart
Grid Technologies Conference (ISGT), 2015 IEEE Power & Energy Society.
IEEE; 2015. p. 1–5.
[33] Ruelens F, Claessens BJ, Vandael S, et al. Residential demand response of
thermostatically controlled loads using batch reinforcement learning. IEEE
Transactions on Smart Grid. 2017;8(5):2149–2159.
[34] Li D, Jayaweera SK. Reinforcement learning aided smart-home decision-making
in an interactive smart grid. In: Green Energy and Systems Conference
(IGESC), 2014 IEEE. IEEE; 2014. p. 1–6.
[35] Hernández L, Baladron C, Aguiar JM, et al. A multi-agent system architecture
for smart grid management and forecasting of energy demand in virtual power
plants. IEEE Communications Magazine. 2013;51(1):106–113.
[36] Mocanu E, Nguyen PH, Kling WL, et al. Unsupervised energy prediction
in a smart grid context using reinforcement cross-building transfer learning.
Energy and Buildings. 2016;116:646–655.
[37] Dusparic I, Harris C, Marinescu A, et al. Multi-agent residential demand
response based on load forecasting. In: Technologies for Sustainability
(SusTech), 2013 1st IEEE Conference on. IEEE; 2013. p. 90–96.
[38] Wen Z, O’Neill D, Maei H. Optimal demand response using device-based
reinforcement learning. IEEE Transactions on Smart Grid. 2015;6(5):
2312–2324.
[39] Zhang W, Xu Y, Liu W, et al. Distributed online optimal energy management
for smart grids. IEEE Transactions on Industrial Informatics. 2015;11(3):
717–727.
[40] O’Neill D, Levorato M, Goldsmith A, et al. Residential demand response using
reinforcement learning. In: Smart Grid Communications (SmartGridComm),
2010 First IEEE International Conference on. IEEE; 2010. p. 409–414.
[41] Manickavasagam K. Intelligent energy control center for distributed
generators using multi-agent system. IEEE Transactions on Power Systems.
2015;30(5):2442–2449.
[42] Le Cadre H, Bedo JS. Dealing with uncertainty in the smart grid: A learning
game approach. Computer Networks. 2016;103:15–32.
[43] Huang H, Li F, Mishra Y. Modeling dynamic demand response using Monte
Carlo simulation and interval mathematics for boundary estimation. IEEE
Transactions on Smart Grid. 2015;6(6):2704–2713.
[44] Kremers E, de Durana JG, Barambones O. Multi-agent modeling for the
simulation of a simple smart microgrid. Energy Conversion and Management.
2013;75:643–650.
[45] Mocci S, Natale N, Pilo F, et al. Demand side integration in LV smart grids
with multi-agent control system. Electric Power Systems Research. 2015;125:
23–33.
[46] Hurtado L, Nguyen P, Kling W. Smart grid and smart building inter-operation
using agent-based particle swarm optimization. Sustainable Energy, Grids and
Networks. 2015;2:32–40.
[47] Wei W, Liu F, Mei S. Energy pricing and dispatch for smart grid retailers under
demand response and market price uncertainty. IEEE Transactions on Smart
Grid. 2015;6(3):1364–1374.
[48] Chai B, Chen J, Yang Z, et al. Demand response management with multiple
utility companies: A two-level game approach. IEEE Transactions on Smart
Grid. 2014;5(2):722–731.
[49] Song L, Xiao Y, Van Der Schaar M. Demand side management in smart
grids using a repeated game framework. IEEE Journal on Selected Areas in
Communications. 2014;32(7):1412–1424.
[50] Nunna HK, Doolla S. Demand response in smart distribution system with
multiple microgrids. IEEE Transactions on Smart Grid. 2012;3(4):1641–1649.
[51] O’Brien G, El Gamal A, Rajagopal R. Shapley value estimation for
compensation of participants in demand response programs. IEEE Transactions on
Smart Grid. 2015;6(6):2837–2844.
[52] Rahman MS, Basu A, Kiyomoto S, et al. Privacy-friendly secure bidding for
smart grid demand-response. Information Sciences. 2017;379:229–240.
[53] Moghaddam MHY, Leon-Garcia A, Moghaddassian M. On the performance of
distributed and cloud-based demand response in smart grid. IEEE Transactions
on Smart Grid. 2017;9:5403–5417.
[54] Tsai SC, Tseng YH, Chang TH. Communication-efficient distributed demand
response: A randomized ADMM approach. IEEE Transactions on Smart Grid.
2017;8(3):1085–1095.
[55] Wada K, Sakurama K. Privacy masking for distributed optimization and its
application to demand response in power grids. IEEE Transactions on Industrial
Electronics. 2017;64(6):5118–5128.
[56] Ghazvini MAF, Soares J, Abrishambaf O, et al. Demand response
implementation in smart households. Energy and Buildings. 2017;143:129–148.
[57] Luo X, Lee CK, Ng WM, et al. Use of adaptive thermal storage system as
smart load for voltage control and demand response. IEEE Transactions on
Smart Grid. 2017;8(3):1231–1241.
[58] Yu CN, Mirowski P, Ho TK. A sparse coding approach to household
electricity demand forecasting in smart grids. IEEE Transactions on Smart Grid.
2017;8(2):738–748.
[59] Li C, Yu X, Yu W, et al. Efficient computation for sparse load shifting in demand
side management. IEEE Transactions on Smart Grid. 2017;8(1):250–261.
[60] Wang Z, Paranjape R. Optimal residential demand response for multiple
heterogeneous homes with real-time price prediction in a multiagent framework.
IEEE Transactions on Smart Grid. 2017;8(3):1173–1184.
[61] Huang H, Cai Y, Xu H, et al. A multiagent minority-game-based demand-response
management of smart buildings toward peak load reduction. IEEE
Transactions on Computer-Aided Design of Integrated Circuits and Systems.
2017;36(4):573–585.
[62] Yao L, Lim WH, Tsai TS. A real-time charging scheme for demand response
in electric vehicle parking station. IEEE Transactions on Smart Grid.
2017;8(1):52–62.
[63] Le Floch C, Bansal S, Tomlin CJ, et al. Plug-and-play model predictive control
for load shaping and voltage control in smart grids. IEEE Transactions on Smart
Grid. 2017;1(1):1–10.
[64] Jannati J, Nazarpour D. Optimal energy management of the smart parking
lot under demand response program in the presence of the electrolyser and
fuel cell as hydrogen storage system. Energy Conversion and Management.
2017;138:659–669.
[65] Wang G, Zhang Q, Li H, et al. Study on the promotion impact of
demand response on distributed PV penetration by using non-cooperative game
theoretical analysis. Applied Energy. 2017;185:1869–1878.
[66] Chiu TC, Shih YY, Pang AC, et al. Optimized day-ahead pricing with renewable
energy demand-side management for smart grids. IEEE Internet of Things
Journal. 2017;4(2):374–383.
[67] Jiang Y, Xu J, Sun Y, et al. Day-ahead stochastic economic dispatch of wind
integrated power system considering demand response of residential hybrid
energy system. Applied Energy. 2017;190:1126–1137.
[68] Hu M, Xiao F. Investigation of the demand response potentials of
residential air conditioners using grey-box room thermal model. Energy Procedia.
2017;105:2759–2765.
[69] Amrollahi MH, Bathaee SMT. Techno-economic optimization of hybrid
photovoltaic/wind generation together with energy storage system in a stand-alone
micro-grid subjected to demand response. Applied Energy. 2017;202:66–77.
[70] Shakeri M, Shayestegan M, Abunima H, et al. An intelligent system
architecture in home energy management systems (HEMS) for efficient demand
response in smart grid. Energy and Buildings. 2017;138:154–164.
[71] Samimi A, Nikzad M, Siano P. Scenario-based stochastic framework for
coupled active and reactive power market in smart distribution systems with
demand response programs. Renewable Energy. 2017;109:22–40.
[72] Srinivasan D, Rajgarhia S, Radhakrishnan BM, et al. Game-theory based
dynamic pricing strategies for demand side management in smart grids.
Energy. 2017;126:132–143.
[73] Hajebrahimi A, Abdollahi A, Rashidinejad M. Probabilistic multiobjective
transmission expansion planning incorporating demand response resources and
large-scale distant wind farms. IEEE Systems Journal. 2017;11(2):1170–1181.
[74] Dehnavi E, Abdi H. Determining optimal buses for implementing demand
response as an effective congestion management method. IEEE Transactions
on Power Systems. 2017;32(2):1537–1544.
[75] Bahrami S, Wong VW, Huang J. An online learning algorithm for demand
response in smart grid. IEEE Transactions on Smart Grid. 2017;9(5):4712–4725.
[76] Nikmehr N, Najafi-Ravadanegh S, Khodaei A. Probabilistic optimal scheduling
of networked microgrids considering time-based demand response programs
under uncertainty. Applied Energy. 2017;198:267–279.
[77] Liu N, Yu X, Wang C, et al. An energy sharing model with price-based demand
response for microgrids of peer-to-peer prosumers. IEEE Transactions on
Power Systems. 2017;32(5):3569–3583.
[78] Aghajani G, Shayanfar H, Shayeghi H. Demand side management in a smart
micro-grid in the presence of renewable generation and demand response.
Energy. 2017;126:622–637.
[79] Ellabban O, Abu-Rub H. Smart grid customers’ acceptance and engagement:
An overview. Renewable and Sustainable Energy Reviews. 2016;65:1285–1298.
[80] Manzoor A, Javaid N, Ullah I, et al. An intelligent hybrid heuristic scheme
for smart metering based demand side management in smart homes. Energies.
2017;10(9):1258.
[81] Ahmad A, Khan A, Javaid N, et al. An optimized home energy management
system with integrated renewable energy and storage resources. Energies.
2017;10(4):549.
[82] Behboodi S, Chassin DP, Djilali N, et al. Transactive control of fast-acting
demand response based on thermostatic loads in real-time retail electricity
markets. Applied Energy. 2018;210:1310–1320.
Chapter 5
Applications of multi-agent systems in smart
grid: a survey and taxonomy
Waseem Akram¹ and Muaz A. Niazi¹

Multi-agent systems (MASs) in the smart-grid area have received a great deal of
attention from the research community in recent years. Studies applying MAS to the
smart grid have produced a number of interesting technical discussions and research
contributions on simulation and modeling of the smart grid. Researchers are trying to
bring energy efficiency and load balancing to the smart grid. Many of these research
works have achieved efficiency in the power-system domain, while the social system
and consumer satisfaction still need improvement. Focusing on MAS in the smart
grid, in this chapter we survey the body of knowledge and discuss the challenges of
simulation and modeling of MAS in the smart grid. We investigate and group the
existing solutions and highlight open-research problems.

5.1 Overview

We first give an overview of the smart-grid concept. Next, we present a
detailed review of the literature in the smart-grid domain. This is followed by
open-research problems and discussion. The chapter ends with a conclusion.

5.2 Introduction

The traditional power system provides one-way power flow and is responsible for
the generation and transmission of energy to end users. However, user demand changes
with time (variable demand), and one-way power flow cannot deal with variable
demand. This problem gained the attention of researchers, who introduced smart-grid
technology by integrating information and communication technology (ICT) with the
traditional system. The smart grid is a power system consisting of various technologies
such as smart meters, ICT, smart homes, generators, storage devices, appliances, loads, etc.

¹ Computer Science Department, COMSATS Institute of Information Technology, Pakistan

The smart grid is a network composed of distributed nodes in which all operations of
the system are controlled intelligently and autonomously in order to achieve an
efficient energy system [1].
Fuel consumption changes the climate. This change motivated researchers
to introduce renewable energy resources such as solar and wind. However, the output
of these resources is unpredictable due to its fluctuating behavior. To achieve
the future sustainability, reliability, and resilience of the smart grid, the research
community is working to deploy renewable energy resources in the power system. The
structure of the power system is now shifting toward a more bottom-up approach. This
means that all decisions related to power generation and transmission are taken by
various actors (agents) in the generation unit in a distributed manner. The various
actors interact with the technical system (power system), and they are dependent on
each other.
The smart-grid system is made up of two main components, i.e., the technical system
and the social system. The technical system consists of power plants, power lines,
loads, transformers, and buses. The social system consists of consumers, operators,
and electricity retailers. Each component of the social system interacts with each
component of the technical system.
The deployment of renewable energy resources needs more coordination, management,
and control techniques to achieve a reliable and efficient system.
A MAS is a useful tool for the coordination and management of all operations within
the smart grid, owing to its distributed and autonomous properties. MASs are widely
used for smart-grid applications. They are responsible for the management and control
of all smart-grid activities. They can perform various tasks such as communication
among different agents, fault detection and prevention, power scheduling, voltage
control, and energy storage.
In the previous literature, a number of research works have been carried out in the
smart-grid domain. However, there is currently no work that investigates and analyzes
these works. There is a need to find out which technique is feasible in which scenario.
In this chapter, we provide a detailed survey and comparison of the different techniques
available for the smart-grid system over the period 2010–16. The aim of this study is
to present a comprehensive understanding of the smart-grid domain and its applications,
as well as the open-research problems that need to be addressed to obtain a sustainable
and reliable system. We have cited a large number of scientific publications,
about 100 papers. To the best of our knowledge, this is the first comprehensive
survey on MAS in the smart grid. During our literature review, we found one
paper [2] that presents a survey on a specific aspect of MAS in the smart grid.
In [2], the author focuses on demand-side management and on generation and
transmission management. Although the author discussed important issues in the domain,
there is no discussion of other relevant aspects such as communication, self-healing,
power scheduling, and storage management.
In this chapter, we aim to present a more comprehensive and concise up-to-date
overview by targeting five aspects: communication, demand-side management,
fault detection and prevention, power scheduling, and storage and voltage
management.

5.3 A review of multi-agent system to smart-grid application

In this section, we briefly discuss different approaches and solutions in
the smart grid from the MAS perspective. We grouped the existing literature into five
categories: communication management, demand–response management, fault
control, power scheduling, and storage management. The proposed taxonomy is
shown in Figure 5.1.

5.3.1 Communication management

In a smart-grid environment, different agents communicate with each other to share
information about power demand and capacity. This process is categorized into group
communication and learning-based communication.

5.3.1.1 Group communication

In group communication, different entities are connected through a common purpose
and interact with each other. In this section, we present some of the previous studies
in the smart grid that make use of the group communication paradigm. We divide these
studies into subcategories according to the technique used and discuss each technique
in turn.

Hierarchal framework
Li et al. [3] presented an agent-based decentralized control scheme for a distributed
smart-grid network. It consists of two layers. The bottom layer is a communication
network composed of agents that act as controllers and collect information about
grid status. The top layer represents the distribution process of the power grid
network. The agents at the bottom layer control the power produced by distributor
grids. This scheme balanced power supply and demand, and it also reduced
communication complexity and voltage variation.
One-way power communication in the smart grid is considered slow to respond.
In [4], Al-Agtash presented a novel agent-based model for two-way power
communication in the smart grid. This model provides two-way power flow between
user demand and power generators. The architecture consists of three layers: power
generators, middleware, and electricity agents. Agents operate in an integrated manner
within the smart grid. They control and monitor demand variations and the selling of
power at the customer side. These agents provide reliability, security, and stability of
the system. Simulation results showed that the market price decreased from 80 to
50/MWh. However, some design issues remain, namely the API, integrity, and
consistency of agent operation.
A decentralized management system makes each part of the smart grid intelligent and
autonomous. Palicot et al. [5] presented a hierarchal cognitive radio network
architecture for the smart grid. The framework focuses on the hierarchal position of
each element of the system. The results showed that peak power was reduced from
55,000 W to 900 W. This method reduced pressure on the system and also reduced the
risk of failure.
114 Modeling and simulation of complex communication networks

[Figure 5.1: taxonomy tree rooted at MAS-SG with five branches: communication (group
communication, learning), demand response (complex model, learning), fault control
(self-organizing, algorithm), power scheduling (complex model, learning), and
storage/voltage (monitoring, search).]

Figure 5.1 Proposed taxonomy of literature review in smart grid


In a smart-grid environment, the balance between power demand and supply can be
achieved by enabling agents to share information. In [6], Larsen et al. proposed an
information-sharing model for power imbalance in the smart grid. In this model, agents
communicate with their neighbors and exchange information about the power imbalance.
A comparison showed that the centralized algorithm requires 29 times more time than
the proposed method for 150 households. However, in this method, agents keep
information only about their own power imbalance, and they communicate directly only
with their neighbors.
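The neighbor-only exchange described above can be illustrated with a minimal sketch.
It is not the algorithm of [6]: the ring of six households, the synchronous rounds,
the step size, and the function name share_imbalance are assumptions made here for
illustration. Each agent repeatedly moves its estimate toward the estimates of its
direct neighbors, so that every agent approaches the network-wide average imbalance
without a central collector.

```python
def share_imbalance(imbalance, neighbors, rounds=50, step=0.3):
    """Synchronous neighbor averaging (a generic consensus rule, assumed here)."""
    est = list(imbalance)
    for _ in range(rounds):
        nxt = []
        for i, x in enumerate(est):
            # Only values held by direct neighbors are used.
            nxt.append(x + step * sum(est[j] - x for j in neighbors[i]))
        est = nxt
    return est

# Hypothetical ring of 6 households; positive = surplus (kW), negative = deficit.
imbalance = [2.0, -1.0, 0.5, -3.0, 1.5, 0.6]
n = len(imbalance)
neighbors = {i: [(i - 1) % n, (i + 1) % n] for i in range(n)}
estimates = share_imbalance(imbalance, neighbors)
average = sum(imbalance) / n   # every local estimate approaches this value
```

Because the update is symmetric between neighbors, the sum of the estimates is
preserved in every round, which is why the common limit is the true average.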
For communication management, Yan et al. [7] proposed another framework based on
zero correlation zones. In this technique, machine-to-machine communication is
initiated through mutual authentication. The technique maintains physical-layer
security and reduces traffic overhead. The performance of the system is described
in terms of the time–efficiency ratio. The results showed that the proposed technique
performs better in scenarios with a large number of nodes and low load impedance,
whereas it gives worse results in scenarios with a small number of nodes.
Coalition formation
Power scheduling and transmission become challenging tasks as the structure of the
smart grid becomes complex. In [8], Ye et al. presented a multi-agent coalition
formation–based dispatch technique. This technique is decentralized and does not
require any global information. Each node acts as an autonomous agent, negotiates
with the other agents, and takes part in the decision-making process. Agents work
in groups to generate the output. The results demonstrated good average utility.
However, this model requires a long time for negotiation between agents.
In [9], Dagdougui and Sacile addressed the optimization of cost and power exchange
in a network of smart microgrids. They proposed a decentralized control and
monitoring strategy for the network. In the studied system, each microgrid is able to
generate, store, and transmit energy to other microgrids and to the main grid. This
method allows storage devices to operate around a reference value by cooperatively
sharing power information among microgrids. For a network of fixed size, the method
needs few iterations, but the number of iterations is influenced by the size of the
network. It also does not handle the stochastic scenario.
In [10], Nguyen and Flueck addressed the communication latency problem in
smart-grid applications. Communication latency has a great impact on system
performance. In this technique, latency is modeled as a random parameter with a
probability density function, and the sending and receiving of messages are generated
randomly. The results showed a restoration time of 3.983 s. However, this method does
not focus on individual communication and communication bandwidth.
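To illustrate what treating latency as a random parameter can look like, the sketch
below draws each message delay from a probability distribution and estimates the
resulting completion time by simulation. The exponential distribution, the mean delay
of 0.05 s, the chain of 12 sequential messages, and the function name are assumptions
chosen for illustration; none of them is taken from [10].

```python
import random

def simulate_restoration_time(n_messages=12, mean_delay=0.05, trials=20000, seed=0):
    """Monte Carlo estimate of the time needed for a chain of sequential messages.

    Each delay is an independent exponential random variable (an assumption).
    """
    rng = random.Random(seed)
    total = 0.0
    for _ in range(trials):
        total += sum(rng.expovariate(1.0 / mean_delay) for _ in range(n_messages))
    return total / trials

estimate = simulate_restoration_time()
expected = 12 * 0.05   # mean of a sum of 12 delays with mean 0.05 s each
```

With a different density function, only the sampling call inside the loop would change.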
Census
As the smart-grid environment becomes complex, it is difficult to manage it using a
census-based approach because of the lack of information. In [11], Zhang presented a
robust incremental cost estimation scheme for power dispatch in the smart grid. It
consists of two layers that are executed in parallel. The first layer is a gossip update
rule that calculates the average power mismatch. The second layer is a consensus
incremental cost estimation that calculates the system incremental cost. This method
improves the robustness of the system and requires less information. The results
showed that the number of iterations increases with information loss. The method
gives better results when information loss is 5%.
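A gossip update rule of the kind mentioned above can be sketched as follows. This is
a generic pairwise gossip averaging scheme, not the exact rule of [11]: the ring
topology, the 5% loss probability, and the choice to drop an exchange for both agents
at once (so that the sum of the estimates is preserved) are assumptions made for
illustration.

```python
import random

def gossip_average(values, edges, steps=2000, loss=0.05, seed=0):
    """Pairwise gossip: a random edge is picked and both ends take their mean.

    With probability `loss` the exchange is dropped and nothing changes.
    """
    rng = random.Random(seed)
    x = list(values)
    for _ in range(steps):
        i, j = rng.choice(edges)
        if rng.random() < loss:
            continue  # lost exchange: both agents keep their old estimates
        x[i] = x[j] = (x[i] + x[j]) / 2.0
    return x

mismatch = [4.0, -2.0, 1.0, -5.0, 3.0]          # hypothetical local mismatches (kW)
ring = [(i, (i + 1) % 5) for i in range(5)]      # hypothetical communication links
estimates = gossip_average(mismatch, ring)       # each entry approaches the mean, 0.2
```

A higher loss probability leaves the limit unchanged in this sketch but slows
convergence, since fewer exchanges take effect.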

Particle swarm optimization

Integrating the smart grid with green buildings has attracted great research interest
recently. In this area, communication is bidirectional. In [12], Wang worked on the
communication protocol and proposed a negotiation-based agent adaptive attitude
bidding strategy. The agents adjust their behavior in response to variations
in the communication process. This work presents the particle swarm optimization
agent adaptive bidding strategy (PSO-AABS), which trader agents use to make decisions
in response to opponent behavior. It handles bidirectional communication between the
smart grid and green buildings. This method maximizes the trader’s payoff and reduces
negotiation time. The results showed that the cost of buying was reduced by 7% and
the corresponding negotiation time by 27%, while the cost of selling was reduced by
17% and the corresponding negotiation time by 9%. However, the customer comfort
level is also reduced.
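For readers unfamiliar with PSO, the following minimal sketch shows the basic particle
update on a one-dimensional bidding problem. It is not PSO-AABS: the payoff function,
the cost and price limits, and the inertia and acceleration coefficients (0.7, 1.5,
1.5) are assumptions chosen for illustration.

```python
import random

def pso_maximize(payoff, lo, hi, particles=20, iters=100, seed=1):
    """Basic global-best PSO in one dimension."""
    rng = random.Random(seed)
    pos = [rng.uniform(lo, hi) for _ in range(particles)]
    vel = [0.0] * particles
    best_pos = pos[:]                  # personal best position of each particle
    g = max(pos, key=payoff)           # global best position found so far
    for _ in range(iters):
        for k in range(particles):
            r1, r2 = rng.random(), rng.random()
            vel[k] = (0.7 * vel[k]
                      + 1.5 * r1 * (best_pos[k] - pos[k])
                      + 1.5 * r2 * (g - pos[k]))
            pos[k] = min(hi, max(lo, pos[k] + vel[k]))   # keep the bid in range
            if payoff(pos[k]) > payoff(best_pos[k]):
                best_pos[k] = pos[k]
            if payoff(pos[k]) > payoff(g):
                g = pos[k]
    return g

def seller_payoff(price, cost=20.0, max_price=100.0):
    # Hypothetical payoff: margin times a linearly falling acceptance chance.
    return (price - cost) * (1.0 - price / max_price)

best_bid = pso_maximize(seller_payoff, 20.0, 100.0)
```

For this payoff the optimum can be found analytically at a price of 60, which gives a
simple check on the swarm's result.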

5.3.1.2 Learning-based approach

In this section, we discuss some of the learning techniques that have been proposed
for addressing the communication aspects of the smart-grid system. These techniques
are the reinforcement learning, neural network, and Bayesian learning approaches.

Reinforcement learning approach

In [13], Yu et al. studied smart generation control of a multi-agent, multi-area
distributed smart-grid system. The paper presents a novel scheme named correlated
equilibrium Q(λ) that produces an optimal equilibrium solution for load control in a
distributed power system network. A formulated equilibrium selection function is also
presented. This technique enhances the overall long-run performance of the system.
The results demonstrated a fast convergence rate.
In a smart-grid infrastructure, all nodes need to maintain the same frequency and
voltage to prevent failures. In [14], Giraldo et al. focused on frequency
synchronization. Their method enables the microgrid to remain synchronized even
under unknown changes. The study showed that the stability of the system is improved
with an energy storage system (ESS). This model works for linear time-invariant
systems; it does not address the nonlinear time-invariant case.

Artificial neural network

In [15], Saraiva et al. focused on the classification of load power in the smart grid.
This work presents a classification of nonlinear loads in the smart grid using
an artificial neural network (ANN) and MAS. Smart meter agents are used in the
classification process. The smart meter measures voltage and sends the result to other
agents at the substation. Experimental results showed that the system achieved an
accuracy of 98.7% at a lower overall cost. However, this approach is not robust to error.

Table 5.1 Literature summary of communication

Ref                   Technique                           Strength                    Limitations

Li [3]                Decentralized control scheme,       Reduced communication       Agents communicate only with
                      system dynamic modeling             complexity                  their neighbors
Palicot [5]           Cognitive radio network             Peak power reduced          The retailers are ignored
                      architecture, hierarchal framework
Larsen [6]            Sharing information about           Balance of power and        Only communication between
                      imbalance power, network topology   demand achieved             neighbors
Yan [7]               Machine-to-machine communication,   Maintain security of        Worse results in case of
                      zero correlation zone               physical system             small system
                      communication protocol
Dagdougui and         Optimization of cost and power      Low convergence rate        Does not handle stochastic
Sacile [9]            in microgrid, network theory                                    scenario
Nguyen and            Communication latency in smart      Restoration time is 3.9 s   Does not handle individual
Flueck [10]           grid, coalition formation                                       communication and
                                                                                      communication bandwidth
Zhang [11]            Cost estimation scheme, census                                  Worse results in the case of
                                                                                      large information loss
Wang [12]             Adaptive bidding communication      Reduced power cost 7%       User comfort also reduced
                      strategy, PSO
Giraldo [14]          Frequency synchronization in        Stability of the system     Does not handle nonlinear
                      smart grid, census                  increased                   time invariant
Saraiva [15]          Classification of load, ANN         98.7% accuracy              Not robust to error
Misra [16]            Energy trading with incomplete      Utility increased 40%       Does not control packet loss
                      information, Bayesian learning

Bayesian learning
Information about prices and demands may be lost during the communication process.
Such incomplete information affects the performance of the smart grid. In [16], Misra
et al. addressed this issue. They proposed an agent-based model that uses a Bayesian
learning approach. It consists of two types of agents: customer agents and grid agents.
Customer agents calculate the price given by the grid. Grid agents calculate the demand
given by customers based on the probability of their belief. Simulation results showed
that utility increased by 40%. However, this method ignores the control packet loss
rate. The literature on communication management is summarized in Table 5.1.
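
The belief update at the heart of a Bayesian learning approach can be sketched in a
few lines. This is only an illustration of Bayes' rule under incomplete information,
not the model of [16]: the two demand hypotheses, the likelihood values, and the rule
that a lost message leaves the belief unchanged are assumptions made here.

```python
def bayes_update(prior, likelihood, observation):
    """prior: {hypothesis: P(h)}; likelihood: {hypothesis: {obs: P(obs | h)}}."""
    unnorm = {h: p * likelihood[h][observation] for h, p in prior.items()}
    total = sum(unnorm.values())
    return {h: v / total for h, v in unnorm.items()}

# Hypothetical belief of a grid agent about a customer's demand level.
belief = {"low": 0.5, "high": 0.5}
likelihood = {
    "low":  {"small_request": 0.8, "large_request": 0.2},
    "high": {"small_request": 0.3, "large_request": 0.7},
}
for obs in ["large_request", None, "large_request"]:   # None marks a lost message
    if obs is not None:        # a lost message leaves the belief unchanged
        belief = bayes_update(belief, likelihood, obs)
```

After the two received observations, the probability assigned to high demand rises
from 0.5 to 0.245/0.265, about 0.92.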

5.3.2 Demand–response management

Demand–response management is the process of shifting consumer demand from
high-demand periods to low-demand periods by giving incentives to the users. Existing
work falls into two groups: the learning-based approach and the complex system
approach.

5.3.2.1 Learning-based approach

In the smart-grid literature, we found two types of learning-based approaches: neural
networks and reinforcement learning (RL). These techniques have been applied to the
demand–response problem. Next, we discuss these learning-based studies.

Artificial neural network

Hernández et al. [17] proposed a MAS for the virtual power plant. The virtual
power plant consists of small elements or a single unit. An ANN is applied for efficient
control and management of operations in the virtual power plant. The experimental
results showed a 1.5% error rate. However, this model works only at a small scale. It
also needs more information for predicting the future state of the system.

Reinforcement learning approach

An intelligent and accurate model is needed for predicting energy consumption
in a smart-grid environment. In [18], Mocanu et al. presented a model for energy
prediction based on RL. The RL technique works without using any historical data.
This model integrates RL with a deep belief network. It estimates the state space
and then finds the optimal policy by using RL. Experimental results showed an accuracy
of 91.42%. However, this model has not been implemented at the different levels of
the smart grid.
In [19], Lakić et al. presented an agent-based model using the SA-Q learning technique.
The agent learns how much system reserve to offer at different times. It also
increases the ratio between economic cost and its benefits. The results showed an
improvement in performance and in the economic outcome for users. However, this
method has not been applied to a multi-agent framework.
In [20], Dusparic et al. proposed a multi-agent scheme based on RL for the
demand–response problem in the smart grid. This method uses current load information
and predicts the load for the next day by using a load forecasting approach. The agents
learn how to fulfill user demands from the available energy. The results showed that
peak usage was reduced by 33% and off-peak usage increased by 50%. However, in this
method, there is no collaboration or communication among agents.
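The way an RL agent can learn to shift load away from expensive periods is sketched
below with tabular Q-learning. This is a toy single-agent example, not the scheme
of [20]: the four time slots, their prices, the penalty for missing the deadline, and
all function names are hypothetical, and no load forecasting is included.

```python
import random

WAIT, RUN = 0, 1

def learn_when_to_run(prices, miss_penalty=20.0, episodes=2000, alpha=0.5, seed=0):
    """Tabular Q-learning for one deferrable job with a deadline.

    State: current time slot. Reward: minus the price paid, or minus
    `miss_penalty` if the job is never run. No discounting is applied.
    """
    rng = random.Random(seed)
    horizon = len(prices)
    q = [[0.0, 0.0] for _ in range(horizon)]
    for _ in range(episodes):
        t = 0
        while True:
            action = rng.choice((WAIT, RUN))   # random exploration (off-policy)
            if action == RUN:
                target, done = -prices[t], True
            elif t == horizon - 1:
                target, done = -miss_penalty, True
            else:
                target, done = max(q[t + 1]), False   # zero reward for waiting
            q[t][action] += alpha * (target - q[t][action])
            if done:
                break
            t += 1
    return q

def greedy_slot(q):
    """First slot in which running is valued at least as high as waiting."""
    for t, (wait_value, run_value) in enumerate(q):
        if run_value >= wait_value:
            return t
    return len(q) - 1

prices = [5.0, 9.0, 2.0, 7.0]        # hypothetical price per slot
q_table = learn_when_to_run(prices)
chosen = greedy_slot(q_table)        # the learned policy waits for the cheapest slot
```

The agent is never told which slot is cheapest; it discovers this from the rewards it
receives, which is the property that makes RL attractive when no price model is known.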
Wen et al. [21] addressed the demand–response problem in the smart grid and proposed
a framework based on an RL technique. In this method, the demand–response problem
is decomposed over the individual devices. The technique handles self-initiated jobs
and many flexible requests. The complexity of the proposed algorithm is linear.
The results showed that it performs well over a broad range of trade-off parameters.
However, this technique focuses on the demand–response problem of a single unit or
building.
In [22], Ruelens et al. studied the demand–response problem using a batch RL
technique. Batch RL addresses the inefficient use of information in the standard RL
method: it uses a batch of experiences to find the optimal policy. In this work, two
agents, a water agent and a residential building agent, are used. A closed-loop policy
is followed to minimize cost under dynamic pricing, and an open-loop policy is followed
for day-ahead scheduling. The results showed a 19% reduction in energy cost and a 4%
increase in the consumption rate.

When economic dispatch and demand response are treated as separate, sequential
operations, energy efficiency decreases. Zhang et al. [23] presented an optimal
energy-management strategy that maximizes social welfare. This method
operates through the coordination of demand response and economic dispatch. Economic
dispatch is provided by the generators and demand response by the customers. This
method is also used to discover the mismatch between power demand and supply. The
simulation results showed convergence within 40 iterations.
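The coupling between economic dispatch and demand response can be made concrete with
a small centralized sketch. It is not the distributed strategy of [23]: it assumes
quadratic generator costs, a hypothetical linear price-responsive demand, and a simple
bisection search for the price at which the supply-demand mismatch vanishes. All
parameter values and function names are illustrative.

```python
def clamp(x, lo, hi):
    return max(lo, min(hi, x))

def supply(price, gens):
    # Each generator has cost a*P^2 + b*P, so its marginal cost is 2*a*P + b.
    # It produces until marginal cost equals the price, within its limits.
    return sum(clamp((price - b) / (2.0 * a), pmin, pmax) for a, b, pmin, pmax in gens)

def demand(price, base=120.0, slope=5.0):
    # Hypothetical price-responsive customers: demand falls linearly with price.
    return max(0.0, base - slope * price)

def clearing_price(gens, lo=0.0, hi=20.0, iters=60):
    # Bisection on the supply-demand mismatch, which increases with price.
    for _ in range(iters):
        mid = (lo + hi) / 2.0
        if supply(mid, gens) - demand(mid) < 0.0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0

gens = [(0.02, 2.0, 0.0, 100.0), (0.05, 1.0, 0.0, 100.0)]   # (a, b, pmin, pmax)
price = clearing_price(gens)
mismatch = supply(price, gens) - demand(price)
```

With these numbers the mismatch vanishes at a price of 4.5, where the two generators
produce 62.5 and 35 units and the customers consume 97.5 units. Treating dispatch and
demand response jointly means solving for generation and consumption together in this
way, rather than fixing demand first and dispatching afterward.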
Another approach to demand response was studied by O’Neill et al. [24], who
proposed a consumer automated energy system. This technique reduces residential
energy cost and usage. It uses online energy cost estimation and a user decision
policy. The approach is independent of the energy price and the system behavior.
In this method, users decide which device will use energy and how much. The results
showed a 40% cost reduction by using price-unaware energy scheduling.

5.3.2.2 Complex system

In this section, we present related work on demand–response management from a
complex system perspective. We group the existing techniques into the collaborative,
complex adaptive system, demand-side integration, particle swarm optimization, and
game theory approaches.

Collaborative approach
MASs are widely used for controlling and managing a smart grid. In [1], Manickavasagam
proposed and developed an intelligent energy control center (ECC) mechanism
for the smart grid. This technique consists of two layers: the distributed energy
resources (DERs) serve as clients, and the ECC serves as the server. The ECC is
controlled and monitored by a fuzzy logic controller (FLC). Communication and
negotiation between the clients and the server take place through the internet protocol.
The simulation results are stored in an Excel database that acts as a monitoring agent.
The ECC uses these results for decision-making in the DERs. However, communication
between the results and the FLC is not taken into account.
A mismatch between supply and demand reduces system performance. Parallel
Monte Carlo tree search (P-MCTS) can produce an optimal solution for power
balancing, but it has no coordination support. In [25], Golpayegani et al. extended
P-MCTS by introducing collaboration and coordination. Agents negotiate with each
other and present their proposals. This method resolves the problems of agent
conflict, load shifting, and charging capacity. The results showed that charge capacity
increased from 33% to 50%. However, this model does not deal with the prediction
of data.
In [26], Le Cadre and Bedo worked on uncertainty in a smart-grid environment
and presented a decentralized hierarchal model based on a learning game approach. It
is composed of supplier, generator, and consumer agents. Agents forecast the demand
and production of the grid in a collaborative manner. The model determines the price
that balances power and demand. The results showed that, in a shared information
network, cooperative learning achieves a faster convergence rate than individual
learning.

In [27], Huang et al. addressed the demand–response issue in a smart-grid
environment. This study proposed another approach in which an elastic economic
dispatch process is modeled. The flexible load cost is used as a control signal that
balances demand and response. In this method, Monte Carlo and interval mathematics
techniques are used for boundary calculation. The boundary estimates the uncertainty,
which is the difference between the present and the target value. Simulation results
showed that the interval mathematics technique is more efficient than Monte Carlo.
It also showed a lower convergence time (e.g., 1%) compared with the Monte Carlo
technique. However, this technique does not handle probability distributions.
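The difference between the two boundary-calculation techniques can be seen in a small
sketch. It is not the model of [27]: the generation and load ranges are hypothetical,
and the quantity bounded here is simply total generation minus total load. Interval
arithmetic propagates the lower and upper limits in a single pass, whereas Monte Carlo
sampling approaches the same limits from the inside and needs many samples.

```python
import random

def interval_sum(intervals):
    return (sum(lo for lo, _ in intervals), sum(hi for _, hi in intervals))

def interval_sub(a, b):
    # [a_lo, a_hi] - [b_lo, b_hi] = [a_lo - b_hi, a_hi - b_lo]
    return (a[0] - b[1], a[1] - b[0])

def monte_carlo_bounds(gen, load, samples=10000, seed=0):
    rng = random.Random(seed)
    values = []
    for _ in range(samples):
        g = sum(rng.uniform(lo, hi) for lo, hi in gen)
        d = sum(rng.uniform(lo, hi) for lo, hi in load)
        values.append(g - d)
    return min(values), max(values)

gen = [(40.0, 50.0), (20.0, 35.0)]     # hypothetical generation ranges (kW)
load = [(30.0, 45.0), (15.0, 25.0)]    # hypothetical flexible-load ranges (kW)
exact = interval_sub(interval_sum(gen), interval_sum(load))   # (-10.0, 40.0)
sampled = monte_carlo_bounds(gen, load)                        # lies inside `exact`
```

The sketch also shows the limitation noted above: the interval result carries no
information about how likely each value inside the bounds is.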

Complex adaptive system

In [28], Kremers et al. presented a bottom-up approach for the smart grid. It consists
of two layers: a physical layer for electrical power transmission and a logical layer for
communication. This model is able to integrate new devices into a smart-grid
environment. It provides dynamic load management as well as power and communication
control and monitoring. Experimental results showed a 40% reduction in energy
consumption. However, this model is not capable of handling high-load management.
In [29], Thimmapuram and Kim proposed an agent-based model that applies the
electricity market complex adaptive system to the smart-grid domain. This technique
handles elastic user demand and lowers cost. This method reduced peak load by
5%–8%. However, the cost of energy increased for some users.

Demand-side integration
Demand-side integration in the smart grid improves security, quality, and efficiency
and reduces cost. In [30], Mocci et al. proposed a MAS for the integration of demand
and electric vehicles (EVs). The load agents calculate power demand and act as master
agents. The master agents, together with cooperative agents, send power load and
global data to the demand side. The system achieved a demand–response rate of 85%
and also reduced the flow of data. However, this technique is not able to calculate the
state of the batteries of different storage units at different times.
In [31], Nunna et al. proposed a priority banking scheme that is concerned with
users’ demands. This method gives users a share of the available resources.
It monitors user demand and updates user priorities. This method reduced network
loss by 50% and also reduced dependency on the overall grid. However, this technique
provides fewer shares to users.

Particle swarm optimization

Advanced power grids are shifting from a vertical to a horizontal structure, which
requires an efficient management system. In [32], Hurtado et al. proposed a building
energy management system (BEMS). This technique provides interaction between
different environments. It uses a PSO approach to maximize comfort level and energy
efficiency. In this hierarchical infrastructure, lower level agents abstract information
and provide it to higher level agents. Performance is described by a weight factor: the
fair scenario resulted in a weight of 0.5, while the biased scenario resulted in a weight
of 0.3. However, this technique generates some unbalanced situations.

Game theory approach

Retailer and market price management in the smart grid has gained great attention
in research. In [33], Wei et al. focused on the energy price and dispatch problem in
a smart-grid environment. This study proposed a two-stage, two-level model. Customer
demand and price are considered in the first stage using the Stackelberg game
approach, while the operation of storage devices is considered in the second stage
as a linear max–min problem. The model is then translated into a mixed-integer linear
program (MILP). The results showed a 5% improvement in system performance and
an increase in retailer profit. However, this method is very sensitive, and the
knowledge-gathering process is very difficult.
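The leader-follower structure of a Stackelberg game can be illustrated with a toy
pricing example. It is not the two-stage model of [33] and involves no MILP: the
quadratic customer utility, the retailer's unit cost, and the grid search over prices
are assumptions made for illustration. The retailer (leader) chooses a price while
anticipating the customer's (follower's) best-response demand.

```python
def customer_demand(price, a=10.0, b=2.0):
    # Follower: maximizes a*d - 0.5*b*d^2 - price*d, so d = (a - price) / b.
    return max(0.0, (a - price) / b)

def retailer_profit(price, unit_cost=4.0):
    # Leader: anticipates the follower's best response when evaluating a price.
    return (price - unit_cost) * customer_demand(price)

def best_price(unit_cost=4.0, max_price=10.0, steps=600):
    # Simple grid search over candidate prices between unit cost and max price.
    candidates = [unit_cost + k * (max_price - unit_cost) / steps
                  for k in range(steps + 1)]
    return max(candidates, key=lambda p: retailer_profit(p, unit_cost))

price = best_price()
```

For these parameters the search returns a price of 7, at which the customer demands
1.5 units and the retailer earns a profit of 4.5, in agreement with the closed-form
optimum (a + unit_cost) / 2.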
In [34], Chai et al. addressed the demand–response issue and presented a two-level
game approach. This technique handles multiple utility companies and users. The
utility companies are modeled as a noncooperative game and the communication
between users as an evolutionary game. The results showed that the proposed technique
reduced the cost payment from 3,197.7 to 2,425.6 and increased energy demand from
1,224.9 to 1,478.4. However, this work does not impose constraints on power
consumption.
In [35], Song et al. proposed another framework for optimal nonstationary demand-side
management in a smart-grid environment. In this method, users select
their energy usage patterns according to their priorities and needs. The authors use a
repeated game approach that provides interaction among foresighted, price-anticipating
users. This method showed a 50% reduction in energy cost and robustness to error.
However, a higher threshold value results in a trade-off between cost and the
peak-to-average ratio.
In [36], Nunna and Doolla carried out research on the management of demand
response in multiple microgrid networks. In this work, customers participate in the
demand–response strategy. The study proposed a priority index approach through
which customers participate in the market. This method reduced peak demand. It was
found that customers with a high priority index get power at low cost.
In [37], O’Brien et al. focused on demand–response management in smart-grid
applications. In this work, demand response is modeled as a game-theoretic
environment, and the Shapley value (SV) is used for the payment distribution process.
An RL technique is used to estimate the SV. Simulation results showed that random
sampling with 1,000,000 samples takes 58.2 s of execution time, while sigmoid
sampling with 51,129 samples takes 6.5 s. The results also showed that uniform
sampling balances demand and response. However, this method is not suitable for the
distribution scheme, and direct estimation of the SV is difficult. The literature on
demand–response management is summarized in Table 5.2.
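
Why the Shapley value is usually estimated by sampling can be seen from a small
sketch. The exact value averages each participant's marginal contribution over all
orderings of the participants, which grows factorially with their number, whereas a
sampled estimate uses only a fixed number of random orderings. The code below is a
generic illustration with uniform random orderings, not the RL-based estimator or the
sampling strategies of [37]: the three participants, their load reductions, and the
capped coalition value are hypothetical.

```python
import itertools
import random

def marginals(order, value):
    """Marginal contribution of each player when players join in `order`."""
    joined, out = [], {}
    for p in order:
        before = value(joined)
        joined.append(p)
        out[p] = value(joined) - before
    return out

def shapley_exact(players, value):
    perms = list(itertools.permutations(players))
    phi = {p: 0.0 for p in players}
    for order in perms:
        for p, m in marginals(order, value).items():
            phi[p] += m
    return {p: total / len(perms) for p, total in phi.items()}

def shapley_sampled(players, value, samples=20000, seed=0):
    rng = random.Random(seed)
    phi = {p: 0.0 for p in players}
    for _ in range(samples):
        order = list(players)
        rng.shuffle(order)
        for p, m in marginals(order, value).items():
            phi[p] += m
    return {p: total / samples for p, total in phi.items()}

# Hypothetical load reductions (kW); the coalition is paid up to a 10 kW target.
reduction = {"A": 6.0, "B": 4.0, "C": 3.0}

def value(members):
    return min(sum(reduction[m] for m in members), 10.0)

players = list(reduction)
exact = shapley_exact(players, value)      # A: 5.0, B: 3.0, C: 2.0
approx = shapley_sampled(players, value)   # close to the exact values
```

In both versions the shares add up to the value of the full coalition, so the whole
payment is distributed among the participants.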

5.3.3 Fault monitoring

The fault-monitoring process involves the detection and prevention of any fault that
occurs in a smart-grid environment. Existing work is categorized into the
self-organizing approach and the algorithmic approach.

5.3.3.1 Self-organizing
Self-organization is an activity of the system in which all or some parts of the
system arrange themselves based on local interactions among the components of
the system. In this section, we discuss some of the self-organization approaches that
have been proposed for addressing the fault-monitoring problem.

Table 5.2 Literature summary of demand–response management

Ref                   Technique                           Strength                      Limitations

Hernández et al. [17] Energy forecasting in VPP, ANN      1.5% error rate               Works on single unit
Mocanu [18]           Energy prediction for building,     91.42% accuracy               Not implemented on
                      RL + belief neural network                                        different level
E. Lakić [19]         Learning how much system reserve    Ratio between cost and        Not implemented on MAS
                      to offer at different time,         benefit increased             framework
                      SA-Q learning
I. Dusparic [20]      Energy prediction of next           33% usage reduced             No communication
                      day-ahead for EVs, RL + load
                      forecasting
Wen [21]              Segmentation of each device, RL     Linear complexity             Single unit
Ruelens [22]          Thermostatic load controlling,      19% cost reduction            Single unit
                      batch RL
Zhang [23]            Finding mismatch between demand     Convergence rate 40           Only local communication
                      and supply, economic dispatch       iterations
Golpayegani [25]      Conflict management between EVs,    Battery capacity increased    No prediction
                      CP-MCTS                             17%
Kremers [28]          Agent-based model of simple         40% consumption reduced, and  Not handling high load
                      smart grid, CAS                     peak load lies in the range
                                                          of 5%–8%
Mocci [30]            Controlling of integrated demand    Response rate achieved 85%,   No calculation of battery
                      and EVs, DSI                        network loss rate reduced     state
                                                          50%
Hurtado [32]          Controlling of interoperation of    Performance in the form of    Unbalance situation
                      smart building, PSO                 weight factor, which is
                                                          achieved 0.5
Wei [33]              Management of energy price and      5% performance improved       Difficult to gather
                      dispatch problem, MILP                                            information
Chai [34]             Controlling multiple utility        Cost reduced from 3,197.0 to  No constraints on power
                      centers and end users,              2,425.6 RS                    consumption
                      two-level game
O’Brien [37]          Payment distribution process,       Converge in 58.2 s            Not suitable for energy
                      Shapley value distribution                                        distribution

Adaptive programming scheme

Some naturally occurring events lead to cascading failure and loss in a smart-grid
system. In [38], Babalola et al. proposed an adaptive MAS for the prevention of
cascading failure and loss in the smart grid. The proposed model searches for
overloaded transmission lines and then redistributes power: it decreases the power
transmitted on the overloaded lines and brings them back to a working state.
This process successfully halts the cascading failure without load shedding. However,
this approach has major hardware requirements and needs an efficient power dispatch
history. Additionally, the algorithm involves a large number of constraints.
In [39], Nassar and Salama introduced the concept of a dynamic microgrid with
flexible boundaries. With this feature, the size of the grid can be reduced or extended
according to need. The method uses the forward–backward sweep technique for power
flow. In an emergency situation, a self-healing feature is achieved. The results showed
good performance compared with a fixed-boundary system. However, the computation
time of this technique is very large (15.106/h).
In [40], Chen et al. worked on the restoration of power flow after a
natural disaster. They presented a multi-agent coordination control scheme based on a
mixed-integer linear program. The proposed system controls the on and off
status of switched devices. A local communication technique is used to discover
global information, which is then used for the optimal decision. The
results showed a computation time of 0.265 s. However, this work
does not focus on communication range, battery capacity, or the requirements for
global information discovery.
Multi-agent framework
Fault detection and diagnosis avoid the loss of synchronous operation in a power
system. In [41], Rahman et al. presented an intelligent agent-based model for system
protection in critical time. This model makes autonomous decisions for circuit
breakers and detects faults in critical time. Simulation results showed the
flexibility and stability of the system. However, this model cannot be implemented in
large and complex power systems.
Wolf-pack hunting
In [42], Xi et al. presented a multi-agent wolf-pack hunting approach for the
smart-grid system. The idea is derived from the group hunting of a wild wolf
pack, whose basic aim is to ensure survival in a harsh environment. This model can
handle the optimal management of power distribution and can operate under load
disturbance. Experimental results showed that the convergence rate is 51.37%–57.4%
and the error rate is 0.5%. The agents exchange information rapidly and calculate
the optimal policy. The approach increased utilization cost while reducing generation
cost.
5.3.3.2 Algorithmic approach
Studies based on algorithmic approaches, such as the fuzzy-rule, census,
sweep technique, and spanning tree approaches, have also been presented in the
smart-grid domain. Next, we discuss these studies.

Fuzzy-rule
In [43], Elmitwally et al. proposed a distributed system based on a fuzzy rule-based
multi-agent approach. The work mainly focuses on eliminating congestion of smart-grid
components, voltage violation, and cooperative operations. During the experiment,
congestion was eliminated in 2.17 s. Voltage is controlled by keeping the operation
within limits. This approach performs the voltage adjustment task in 28 s. However, its
performance decreases in the case of communication failure.

Census scheme
In [44], Teng et al. proposed a restoration framework for emergency situations in a
smart-grid environment. In this method, a dynamic leader agent operates in emergency
and disaster situations, and bus agents operate in normal situations. This
method reduced communication time and conserved communication bandwidth
during a disaster.

Sweep technique
In [45], Nguyen and Flueck proposed another decentralized, distributed agent-based
model for the power flow problem. It consists of multiple agents that are autonomous,
have a local view, and behave in a decentralized manner. Agents use the
backward/forward sweep iteration technique to solve the power flow. The results
showed a computation time of 81.96 s.
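
The backward/forward sweep iteration itself can be sketched compactly for a single
radial feeder. The version below is centralized and therefore not the agent-based
formulation of [45]: it assumes a chain of buses fed from one source, constant-power
loads, per-unit quantities, and hypothetical line impedances and load values.

```python
def sweep_power_flow(z, s, v_source=1.0 + 0j, tol=1e-10, max_iter=100):
    """Backward/forward sweep on a chain feeder.

    z[k]: impedance of the line feeding load bus k; s[k]: complex load at bus k.
    Returns the bus voltages, with index 0 being the source bus.
    """
    n = len(s)
    v = [v_source] * (n + 1)
    for _ in range(max_iter):
        # Load currents from the current voltage estimate: I = conj(S / V).
        i_load = [(s[k] / v[k + 1]).conjugate() for k in range(n)]
        # Backward sweep: accumulate branch currents from the feeder end.
        branch = [0j] * n
        acc = 0j
        for k in reversed(range(n)):
            acc += i_load[k]
            branch[k] = acc
        # Forward sweep: update voltages from the source outward.
        new_v = [v_source]
        for k in range(n):
            new_v.append(new_v[k] - z[k] * branch[k])
        change = max(abs(a - b) for a, b in zip(new_v, v))
        v = new_v
        if change < tol:
            break
    return v

z = [0.01 + 0.02j] * 3                             # hypothetical line impedances (pu)
s = [0.10 + 0.05j, 0.08 + 0.03j, 0.05 + 0.02j]     # hypothetical loads (pu)
voltages = sweep_power_flow(z, s)
```

In an agent-based setting, each bus would hold only its own voltage and load and
exchange branch currents and voltages with its upstream and downstream neighbors,
which matches the local view described above.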

Spanning tree approach

In [46], Eriksson et al. presented a multi-agent distributed algorithm for integrated
volt/var control in the smart grid. Agents collaboratively control and manage
voltage and capacitors. This method deals with the optimization of the voltage profile,
the reduction of system loss, and the switching of capacitors. Two types of agents are
used: a switching agent that detects and solves system faults and a volt/var control
agent that controls power flow. This technique keeps the voltage above the lower limit
but does not handle keeping the voltage below the upper limit. The results showed that
the average time for solving the power flow is 9.4405 s, which demonstrates an efficient
technique. However, the solution does not lead to the optimum. The literature on fault
control is summarized in Table 5.3.

Table 5.3 Literature summary of fault control

Ref | Technique | Strength | Limitations
Babalola [38] | Prevention of cascading failure, adaptive programming | | Needs large information about system states
Nassar [39] | Dynamic boundaries, forward and backward sweep techniques | Self-healing is achieved | Computation time increased
Chen [40] | Coordination control scheme, MILP | Convergence rate 0.265 s | Not focused on communication range
Rahman [41] | Fault location in critical time, MAF | System stability is achieved | Complex system not handled
Elmitwally [43] | Elimination of congestion, fuzzy-rule | Restoration time 2.17 s | Performance decreased in the case of communication failure

5.3.4 Power scheduling

The power-scheduling process involves setting power consumption and production for
a specific time period. The approaches are categorized as complex-system-based and
learning-based.

5.3.4.1 Complex system

A complex system comprises many interconnected objects that interact with each other
in a nonlinear manner. Next, we discuss different techniques that address the
power-scheduling problem from a complex-system perspective.

Self-organizing
The smart grid requires real-time monitoring to provide reliable services for end
users. In [47], Colson and Nehrir proposed a decentralized MAS for real-time power
management in the smart grid. The MAS controls the grid assets based on price,
resources, and users' demand. The experimental results show that a decentralized MAS
is reliable for real-time monitoring in the smart grid. They also show that, over
time, the performance of storage degrades due to discharging.

Hierarchal approach
In [48], Hu et al. proposed a hierarchal approach based on a MAS for smart-grid
operation. This approach integrates the EVs and addresses grid congestion and voltage
violation problems. The results showed good performance for power scheduling and
control. However, the communication between agents is too complex.
Several designs have been proposed for smart-grid architecture, but they still face
feasibility and economic problems. In [49], Chao and Hsiung proposed a fair energy
resource allocation algorithm for electricity trading among smart grids. This
technique prevents starvation and fatal problems, and it also reduces power cost. It
achieved a fairness index of 96.25% even in the worst case. However, this technique
does not take power transmission into account.
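
The text does not say which fairness index is reported in [49]. One widely used measure is Jain's fairness index, J(x) = (sum of x_i)^2 / (n * sum of x_i^2), which equals 1 when every participant receives the same share. The sketch below assumes that definition purely for illustration; the allocation values are hypothetical.

```python
def jain_fairness(allocations):
    """Jain's fairness index: 1.0 for equal shares, 1/n when one user gets everything."""
    n = len(allocations)
    total = sum(allocations)
    squares = sum(x * x for x in allocations)
    if n == 0 or squares == 0:
        raise ValueError("need at least one non-zero allocation")
    return total * total / (n * squares)

equal_shares = jain_fairness([5.0, 5.0, 5.0, 5.0])     # every buyer gets the same energy
skewed_shares = jain_fairness([10.0, 0.0, 0.0, 0.0])   # one buyer gets everything
```

Under this definition, an index of 96.25% would mean the allocation is close to, but not exactly, an equal split.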
Rahman et al. [50] proposed an agent-based model to address the voltage-stability
problem. In this model, agents manage their activities through online information
and power flow. They estimate voltage variation by using a distributed synchronous
compensator. Simulation experiments showed robust system performance. However,
although voltage stability improved, a communication time delay of 15 ms was
observed.
A smart-grid environment needs to achieve stability and a reduction in operation
cost. In [51], Radhakrishnan proposed a smart-grid framework based on a multi-agent
distributed energy management system. It performs optimal energy allocation and
management in the smart grid. The model consists of renewable energy sources,
storage devices, and generators. It controls power balance through the state of
charge of the batteries. Simulation results showed a reduction of total cost from
662.2 to 658.4. However, the performance of the proposed algorithm degrades under
some uncertain conditions.

Census-based approach
A centralized system is not able to handle flexible power loads to maintain the power
balance in a smart-grid environment. In [52], Li et al. proposed a look-ahead
scheduling model for flexible loads in a smart-grid environment. This model consists
of three layers: centralized, distributed, and cooperative control. Load agents
coordinate among themselves, and a cooperative control strategy is used as the
communication protocol. This model provides flexible strategies to handle large
flexible loads. However, it is not able to handle uncertainty.
In [53], Guo et al. proposed a scheme based on the projected gradient method for the
economic dispatch problem. It decomposes the centralized optimization into local
optimizations performed by agents, and it deals with a stochastic environment. The
scheme presents a finite-time average consensus algorithm in which agents iteratively
calculate the solution of the optimization problem. Communication among agents is
limited. This method achieves plug-and-play operation and does not require any
private information. It can handle quadratic and non-quadratic cost functions. The
results showed that the overall cost of the system was reduced.
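
Assuming the average "census" algorithm above refers to average consensus, its core idea is that each agent repeatedly nudges its value toward those of its neighbors until all agents agree on the network-wide average, without any agent seeing the others' private data directly. The sketch below shows the standard asymptotic averaging iteration, not the finite-time variant of [53]; the ring topology, step size, and initial values are hypothetical.

```python
def average_consensus(values, neighbors, step=0.3, rounds=200):
    """Each round: x_i <- x_i + step * sum over neighbors j of (x_j - x_i)."""
    x = list(values)
    for _ in range(rounds):
        x = [
            x[i] + step * sum(x[j] - x[i] for j in neighbors[i])
            for i in range(len(x))
        ]
    return x

# Four agents on a ring; each one only talks to its two neighbors.
ring = {0: [1, 3], 1: [0, 2], 2: [1, 3], 3: [2, 0]}
local_values = [10.0, 20.0, 30.0, 40.0]    # e.g. local marginal-cost estimates
agreed = average_consensus(local_values, ring)
```

The step size must be small enough for the topology (below 0.5 for this ring); otherwise the iteration oscillates instead of settling on the average.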
Kahrobaee et al. [54] presented the concept of a smart home within a smart-grid
environment. In this work, a home is considered an agent that can buy, sell, and
store energy and interact with the grid. The framework consists of home agents in a
distributed multi-agent network. Each home agent makes autonomous decisions to buy,
sell, and store energy based on maximum utility, and these decisions affect the
market price. The results showed that the home agents' decisions reduced their
energy cost, as they buy, sell, and generate energy at the same time. However, this
method is simple and does not address all issues related to demand and supply.
In [55], Samadi et al. addressed uncertainty issues in the smart grid and presented
an optimized algorithm based on a central unit. This technique only needs an
estimate of future demand, and it minimizes the energy cost for each user. The
results showed that a peak-to-average load of 25.5% was achieved. It also reduced
energy expenses. However, the complexity of the system increased.
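
For readers unfamiliar with the metric, the peak-to-average ratio of a load profile is the maximum load divided by the mean load, so a flatter profile gives a lower ratio. The short sketch below shows the calculation; the hourly profiles are hypothetical and are not data from [55].

```python
def peak_to_average_ratio(load_profile):
    """Maximum load divided by mean load over the scheduling horizon."""
    average = sum(load_profile) / len(load_profile)
    return max(load_profile) / average

unscheduled_kw = [2.0, 2.0, 4.0, 8.0]    # evening peak
scheduled_kw = [4.0, 4.0, 4.0, 4.0]      # same energy, shifted to flatten the peak
par_before = peak_to_average_ratio(unscheduled_kw)
par_after = peak_to_average_ratio(scheduled_kw)
```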
In [56], Gregoratti and Matamoros presented another approach for power flow in a
smart-grid environment. The proposed technique controls and manages power flow among
multiple microgrids. It focuses on protecting private local information and is based
on a sub-gradient cost minimization approach. The results showed a limited number of
iterations and a faster convergence rate. However, communication with the main grid
was not considered in this work.

Cognitive-based approach
In [57], Bu and Yu studied a green cognitive network in a smart-grid application. The
cognitive network monitors smart-grid operation and provides information to the
control unit. Power allocation is performed based on the collected information. Power
allocation, price, and efficiency are modeled as a three-stage Stackelberg game.
Results demonstrated a 31.09% cost reduction. However, this technique does not handle
the incomplete scenario.

5.3.4.2 Learning-based approach

In the past, the power-scheduling problem has been addressed by using two types of
learning techniques: reinforcement learning and neural networks.

Reinforcement learning
The RL technique is an essential tool for computing and estimating payoffs to achieve
game equilibrium in a smart-grid environment. Wang et al. [58] presented a scheme
based on RL for energy trading in the smart grid. This method chooses a random
strategy and maximizes the average utility and revenue. The proposed scheme is able
to achieve Nash equilibrium. It handles incomplete information and a stochastic
environment. Information is exchanged through the central unit, which protects
private information. However, implementing the finite-action learning algorithm is a
challenging task in a real-valued action environment.
In [59], Samadi et al. worked on load scheduling and power trading in a smart-grid
environment. The study considered a high penetration of renewable resources and
adopted a game-theoretic approach. In this method, users can sell their extra power
to their neighbors locally. This method handles the reverse power flow problem. It
increases the revenue and decreases the energy expenses of the users. The results
showed that the average energy imported was reduced from 1,360.9 kW to 820.2 kW, and
the energy cost was reduced from $60.91 to $40.37.
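
To compare these figures with the percentage reductions quoted for other studies, the before and after values can be converted into relative reductions. The helper below is a simple illustration; its name is hypothetical, and the inputs are the values reported above.

```python
def percent_reduction(before, after):
    """Relative reduction from `before` to `after`, in percent."""
    return 100.0 * (before - after) / before

import_reduction = percent_reduction(1360.9, 820.2)   # about 39.7%
cost_reduction = percent_reduction(60.91, 40.37)      # about 33.7%
```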
An energy hub provides interaction between energy carriers to supply the required
loads. In [60], Sheikhi et al. extended the energy hub system. This study proposed a
cloud-computing concept in which a utility provider and customers interact through
the cloud. The cloud takes the utility power as input and produces output to the
users. This model provides two-way communication between utility companies and the
energy hub. The results showed that energy cost is reduced by 33%. However, the
proposed system is unable to predict consumers' future demands.
In [61], Ghorbani et al. presented a MAS-based fault-detection technique for a
smart-grid environment. This technique combines centralized and decentralized
features in a hierarchal coordination scheme. It consists of zone agents, feeder
agents, and substation agents. Zone agents detect and locate the fault and help
feeder agents to restore services using the q-learning technique. This method needs
fewer messages for communication and reduces computation time. The results showed
that 16 messages are required for communication among 21 agents, while the
centralized and decentralized schemes required 20 and 38 messages, respectively.
However, the number of zone agents and feeder agents remains fixed regardless of
system size, which results in more burden and computational time in a complex
system.
Venayagamoorthy et al. [62] proposed an intelligent dynamic energy management
system (I-DEMS) based on a neural network and RL. They used the Bellman equation
for the optimal control signal and calculated the minimum and maximum cost-to-go
functions. They compared this technique with a DEMS based on the decision tree (DT)
method; DT is inefficient because it supplies energy based on available power. The
results show that I-DEMS is reliable and extends battery life, but this technique
does not predict the battery state.
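
The role of the Bellman equation in computing a cost-to-go function can be shown with plain value iteration on a toy battery model. This is only a sketch of the principle, J(s) = min over a of [cost(a) + gamma * J(s')], and not the neural-network approach of [62]; the three charge levels, the actions, and their costs are hypothetical.

```python
def cost_to_go(n_levels=3, gamma=0.95, tol=1e-9):
    """Value iteration for J(s) = min_a [cost(a) + gamma * J(next_state)]."""
    # action -> (change in charge level, immediate cost); all values are hypothetical
    actions = {"discharge": (-1, 0.0), "buy": (0, 1.0), "charge": (+1, 2.0)}
    j = [0.0] * n_levels
    while True:
        j_new = []
        for s in range(n_levels):
            candidates = [
                cost + gamma * j[s + delta]
                for delta, cost in actions.values()
                if 0 <= s + delta < n_levels      # skip moves outside the battery range
            ]
            j_new.append(min(candidates))
        if max(abs(a - b) for a, b in zip(j_new, j)) < tol:
            return j_new
        j = j_new

j = cost_to_go()    # j[s] is the minimum discounted cost starting from charge level s
```

In this toy model, a fuller battery has a lower cost-to-go because the load can be served from storage for longer before energy must be bought.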
In [63], Li and Jayaweera presented a hierarchical architecture for communication
between the utility company and customers. The proposed technique consists of two
stages: initial and real-time interaction. In the initial interaction, demand
response is controlled, which was shown to keep the load flat. However, it was also
shown that the performance of the studied system decreased as the training period
increased.
In [64], Rayati et al. proposed a smart energy hub concept for smart-grid
applications. The smart energy hub is used for multipurpose transmission of generator
energy, information, and user-demand scheduling. In this method, RL is used to find
the optimal solution, and the results demonstrate a 26% cost reduction.
There are some implementation challenges in the smart grid due to uncertainty and
dynamic prices. Kim et al. [65] focused on dynamic price management and proposed an
agent-based RL technique. In this framework, each customer is considered an agent
and learns a policy without any advance knowledge. The utility company monitors
customer behavior and schedules power on demand. This method increased system and
customer cost. However, errors in cost estimation affect system performance. The
method also does not handle multiple energy resources or bidirectional
communication.
In [66], Wang et al. presented a broker concept in a smart-grid environment. In this
framework, the broker is responsible for predicting user demand and then buys energy
from the utility company using an auction strategy. Customers are distributed in a
cluster network. The broker uses an MDP and RL to predict customer demand. This
technique balances energy supply–demand and achieved a 24.6% imbalance rate.
However, computation time increases because of the broker's involvement.
Load shedding is used to balance power supply and demand. A central controller
allocates power to users through a bidding process. Lim and Kim [67] presented a
bidding scheme based on the q-learning technique. It forms a policy when power is
less than demand. It starts the bidding process and identifies those who want to buy
power, who then submit their bid price and quantity. The results showed that power
balance is achieved through repeated interaction between agents. They also showed
that the exploration period increased as trivial interactions decreased.
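
The q-learning update that underlies such bidding schemes can be sketched with a table of action values and an epsilon-greedy agent. This is a generic tabular example, not the scheme of [67]; the single "shortage" state, the three bid levels, and their rewards are hypothetical.

```python
import random

def q_update(q, state, action, reward, next_state, alpha=0.1, gamma=0.9):
    """One tabular step: Q(s,a) += alpha * (r + gamma * max_a' Q(s',a') - Q(s,a))."""
    best_next = max(q[next_state].values())
    q[state][action] += alpha * (reward + gamma * best_next - q[state][action])

# Hypothetical toy: a low bid wins no power, a high bid wins power but costs more.
actions = ["low_bid", "mid_bid", "high_bid"]
rewards = {"low_bid": 0.0, "mid_bid": 1.0, "high_bid": 0.5}
q = {"shortage": {a: 0.0 for a in actions}}

rng = random.Random(0)
for _ in range(5000):
    if rng.random() < 0.1:                                   # explore
        action = rng.choice(actions)
    else:                                                    # exploit
        action = max(q["shortage"], key=q["shortage"].get)
    q_update(q, "shortage", action, rewards[action], "shortage")

best_action = max(q["shortage"], key=q["shortage"].get)
```

The exploration probability (0.1 here) controls how often the agent tries bids other than its current best, which is the trade-off behind the exploration period mentioned above.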
It is impractical to have complete information in advance about cost and demand for
energy in a smart-grid environment. Zhang et al. [68] focused on this issue and
proposed a price-dependent load-scheduling technique. This technique uses the
post-decision state (PDS) and a Markov decision process (MDP), and it can provide an
optimal solution in an unknown environment. In this method, consumers can buy and
store energy during peak hours. Energy cost and demand are treated as variable
entities. The load-scheduling process is modeled as an MDP and solved using RL with
the PDS technique. This method needs less information to converge to an optimal
solution. The results showed that the algorithm converges to the optimal solution in
1,122 time slots, giving 90% average utility. However, this method does not handle
load scheduling in a collaborative environment.
Wu and Liao [69] focused on the power-dispatch problem and presented a function
optimal RL scheme for power dispatch in the complex and multidimensional space of
smart-grid applications. This technique searches in sequence. The results showed a
32.31% reduction in computation time, and voltage stability also increased. However,
this technique showed a conflict between energy cost and voltage stability.

Table 5.4 Literature summary of power scheduling

Ref | Technique | Strength | Limitations
Colson and Nehrir [47] | Real-time power management, self-organizing | Reliable system | Performance of storage devices decreased
Chao and Hsiung [49] | Electricity trading, fair energy resource allocation | Reduced power cost | Ignored power transmission
Rahman [50] | Voltage stability in SG, hierarchal approach | Robust performance | Communication time increased
Li [52] | Look ahead scheduling, census | Handle large flexible load | Not handle uncertainty
Kahrobaee [54] | Smart home concept, census | Cost reduced | Not address all issues related to demand and supply
Gregoratti [56] | Power flow in multiple microgrid, census | Fast convergence rate | No communication with main grid
Sheikhi [60] | Cloud computing concept in E-hub, RL | Energy cost reduced by 33% | No prediction of energy demands
Kim [65] | Dynamic price management, RL | Monitor customer behaviors | Not handle multiple energy resources
Wu and Liao [69] | Power dispatch in complex scenario, RL | Computation time reduced by 32.31% | Conflict between energy cost and voltage stability

In [70], Shirzeh et al. worked on the management of renewable energy resources and
storage devices in a smart-grid environment. They proposed a MAS based on a
plug-and-play technique for managing and controlling resources in the smart grid.
The plug-and-play technique used an RL method based on a distributed value function
to adjust the power balance of demand and supply. Results showed an 81% reduction in
fluctuation by using plug-and-play. The number of iterations is also reduced.
However, this method does not deal with a stochastic environment.

Artificial neural network

Integrating wind energy resources with other distributed energy resources is a
challenging task. In [71], Motevasel and Seifi addressed this issue and proposed an
expert energy-management system. This technique finds optimal set points of energy
resources and storage devices. It controls the forecasting, optimizing, and storage
modules. An ANN is used for the forecasting process. Results showed that convergence
is achieved in 445 iterations. The literature on power scheduling is summarized in
Table 5.4.

5.3.5 Storage and voltage management

Storage and voltage management schemes handle storage devices and voltage variation.
These schemes are categorized into learning, monitoring, and search-based
approaches.

5.3.5.1 Learning
Storage and voltage-management problems are addressed by using RL and neural network
approaches. Next, we discuss these learning techniques and explain how different
studies addressed the storage and voltage problems in the smart-grid domain.

Reinforcement learning
In [72], Li et al. studied the implementation of the RL technique for the
load-balancing problem in the smart grid. The proposed scheme is based on a dynamic
hierarchal approach. It finds an optimal policy to balance power demand and supply.
It handles the curse-of-dimensionality problem. It is a fast-learning technique in
an unknown environment.
In [73], Salehizadeh and Soltaniyan proposed a fuzzy q-learning technique. It handles
multidimensional renewable power in fewer iterations. With this method, the number of
iterations decreased by 40% compared with other techniques. It models electricity in
a continuous range.
Wind energy is an uncertain and variable energy resource, which affects smart-grid
performance. In [74], de Montigny et al. addressed this issue and proposed a
multi-agent architecture. This method calculates import and export losses. It also
calculates a global-demand forecast using a minute-to-minute strategy. Additionally,
it estimates system performance from historical data. Results obtained through the
minute-to-minute strategy showed that the number of generating unit starts and stops
increased by 5%. However, the computational time of this method is very large.
Load frequency management and control is a hot research topic in a smart-grid
environment. A linear model is not capable of handling the dynamic behavior of the
system. In [75], Daneshfar et al. addressed this issue and proposed a multi-agent RL
technique that consists of two agents: an estimator and a controller. The estimator
agent finds the frequency error, and the controller agent uses genetic optimization
for frequency control. This technique showed that frequency variation falls to zero
with the optimal solution. However, a load disturbance is generated when the maximum
frequency is reached.
In [76], Wei et al. addressed battery-management issues in a smart-grid environment.
This study proposed a dual iterative q-learning technique based on adaptive dynamic
programming for managing and controlling storage devices. The method uses two
iterations: an internal iteration that minimizes power cost and an external iteration
that finds the Q function and converges to the optimum. The algorithm converged to an
optimal solution in 20 iterations. However, the proposed algorithm finds the optimal
solution indirectly. Initial interaction handles demand response at the customer
side, and the load is considered as a flat point. Real-time interaction is used for
decision-making. This technique used a hidden-mode MDP. Its performance improves as
the training period increases. However, a smart home was not considered in the
studied system.
Integrating different types of energy-storage devices in the smart grid produces
implementation challenges. In [77], Qiu et al. focused on controlling and managing
different types of energy-storage devices. This study proposed an RL-based scheme
to optimize the coordination of energy-storage devices. The results showed that the
system gradually learns with time and reaches an optimal solution. This study also
showed that system losses decreased. However, it required a large computational
time, and it does not support a power-sharing feature.
Integrating photovoltaic (PV) energy with the smart grid decreases fossil fuel
consumption as well as the electricity bill. In [78], Wang et al. proposed a
near-optimal control algorithm for the residential storage system that controls power
generation, predicts power consumption, and accounts for various loss components
during operation. They applied the RL technique to predict the amount of energy in
the energy storage system (ESS). This technique performs optimization on energy price
and energy demand price. Experimental results show that the proposed algorithm
achieves up to 72% enhancement in electricity-cost reduction compared with a baseline
storage control algorithm. A limitation of this system is that the PV generation
system only works in sunlight.
Battery management plays a key role in a smart-grid environment. In [79], Kuznetsova
et al. presented a two-step-ahead RL algorithm for battery scheduling within a
microgrid architecture. The microgrid is composed of local consumers, a generator,
and storage devices connected to the external grid. This technique forecasts power
demand and finds optimal actions for battery scheduling. Simulation results showed a
3.94% improvement in battery performance. However, the simulation running time is
very large.
In [80], Vandael et al. addressed the day-ahead power-scheduling problem for EVs in a
smart-grid environment. In this method, the charging process is performed by a
heuristic scheme that controls and manages each EV. The system collectively learns a
cost-effective scheduling strategy for EV charging through the RL technique. The
results showed that average cost increased by 10%. However, this method has some
overloading and over-constraint issues.
In [81], Guan et al. focused on minimizing energy cost in a smart-grid environment.
In this work, the RL technique is applied to find an optimal policy for storage
devices. This method does not require any future prediction of energy generation and
consumption and works in a partially observable environment. The TD-lambda algorithm
is used for convergence to the optimal solution in the non-Markovian environment.
Simulation results showed a 59.8% reduction in energy cost.
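
The TD-lambda idea, in which each temporal-difference error also updates recently visited states through decaying eligibility traces, can be sketched in tabular form. The example below is a generic prediction version on a small deterministic chain and does not reproduce the controller of [81]; the three-step episode and its costs are hypothetical.

```python
def td_lambda(episodes, n_states, alpha=0.1, gamma=1.0, lam=0.8):
    """Tabular TD(lambda) prediction with accumulating eligibility traces."""
    values = [0.0] * n_states
    for episode in episodes:
        traces = [0.0] * n_states
        for state, reward, next_state in episode:
            next_value = 0.0 if next_state is None else values[next_state]
            td_error = reward + gamma * next_value - values[state]
            traces[state] += 1.0                      # mark the visited state
            for s in range(n_states):
                values[s] += alpha * td_error * traces[s]
                traces[s] *= gamma * lam              # older visits get less credit
    return values

# Hypothetical episode: three steps, each costing one unit (reward -1), then it ends.
episode = [(0, -1.0, 1), (1, -1.0, 2), (2, -1.0, None)]
values = td_lambda([episode] * 500, n_states=3)
```

With lam set to 0 this reduces to one-step TD learning; larger values spread each error further back along the trajectory.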
Artificial neural network
Battery management plays a key role in the smart grid, and it is important to measure
the health of batteries during operation. In [82], Landi and Gross proposed two
different techniques for estimating battery health in smart-grid applications. The
first is based on fuzzy logic, and the second is based on a neural network. These
techniques use temperature, charging/discharging, and the number of cycles as
parameters. Results showed a 5% error rate.

5.3.5.2 Monitoring
In this section, we discuss different approaches presented for storage and voltage
monitoring.
Volt/Var control
In [83], Zhang et al. presented a multi-agent distributed algorithm for integrated
volt/var control in the smart grid. Agents collaboratively control voltage and
capacitors. This method deals with optimizing the voltage profile, reducing system
loss, and switching the capacitors. Two types of agents are used: switching agents
that detect and resolve system faults and volt/var control agents that control power
flow. This technique keeps voltage above the lower limit but does not handle keeping
voltage below the upper limit. The results showed that the average time for solving
the power flow is 9.4405 s, which demonstrates an efficient technique. However, the
solution does not lead to the optimum.
Census approach
Researchers are also interested in reducing high power consumption and demand to
reduce cost. In [84], Sharma et al. proposed an agent-based distributed control model
to address this issue. In this model, power-storage devices act as agents. The model
achieves convergence to an agreement on power consumption. It prevents overcharging
and over-discharging of batteries. Results showed 95% and 85% charging and
discharging efficiency, respectively. However, the communication between agents is
limited, and the model does not predict the state of the batteries, only their
maximum/minimum state.
State monitoring
For dynamic state estimation, Srivastava et al. [85] proposed a MAS for the
multi-area power system. This method divides the whole network into subsystems, and
the algorithm executes in parallel. It uses two units, a field unit and a phasor
unit, which run separately. Finally, a central controller integrates their results.
The algorithm follows the cubature Kalman filter. Results showed a voltage error of
2.4 × 10^-2. It was also shown that the extended Kalman filter is not feasible.
In [86], Teleke et al. focused on battery management and proposed a rule-based
control strategy. This technique monitors and controls the charge/discharge limits
and battery lifetime. It also utilizes 70% of the battery capacity. The results
showed a voltage deviation reduction from 24% to 4%. However, it requires
high-capacity batteries.
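
A rule-based charge/discharge strategy of this kind can be written as a handful of threshold rules. The sketch below is illustrative only and is not the controller of [86]; the state-of-charge window of 0.15 to 0.85 (one possible way to use 70% of capacity) and the 5 kW rate limit are assumptions.

```python
def battery_command(soc, surplus_kw, soc_min=0.15, soc_max=0.85, rate_limit_kw=5.0):
    """Return battery power in kW: positive charges, negative discharges, 0.0 idles.

    soc is the state of charge (0 to 1); surplus_kw is generation minus demand.
    """
    if surplus_kw > 0 and soc < soc_max:       # excess generation and room to charge
        return min(surplus_kw, rate_limit_kw)
    if surplus_kw < 0 and soc > soc_min:       # deficit and stored energy available
        return max(surplus_kw, -rate_limit_kw)
    return 0.0                                 # a limit is reached or supply is balanced

charge_kw = battery_command(soc=0.50, surplus_kw=8.0)     # limited to 5.0 by the rate cap
blocked_kw = battery_command(soc=0.10, surplus_kw=-2.0)   # 0.0: below the discharge limit
```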
Integrating solar energy in a smart grid makes it an active system that requires a
cyber-physical management system. In [87], the author presents a goal-based holonic
MAS. This technique uses a nested agent concept and controls the power strategy and
state estimation. The results showed an execution time of 93 s and an absolute error
of 0.038%. However, the nested agents increase the complexity of the system.
In [88], Klaimi and Merghem-Boulahia focused on the energy-management system and
proposed a multi-agent intelligent model for smart-grid applications. In this
technique, intelligent storage devices are used for storing surplus power. This
technique reduced energy cost and access to the grid. Results showed a 60% cost
reduction.

5.3.5.3 Searching
The searching techniques used for addressing storage and voltage problems include
self-organizing, normality analysis, hill-climbing, and swarm intelligence. Next,
these search-based techniques are discussed.
Self-organizing
The integration and monitoring of smart microgrids is at an initial stage and needs
more research. In [89], Vaccaro et al. proposed and developed a self-organized
standalone smart microgrid framework for solving and controlling smart microgrid
operations. They focused on synchronizing and controlling the smart applications
involved in the system. Results and experiments show that dynamic agents are useful
in power flow problems. They also help with state estimation in the smart microgrid.
However, this approach does not address the computation and estimation of the
semantic representation of data.
Integrating EVs with the smart grid is a challenging task for researchers. In [90],
Hu et al. presented a hierarchal control method that coordinates self-interconnected
nodes. Under operation constraints, market-based control is used. In this framework,
two levels are used: an upper level and a lower level. The upper level controls
power scheduling, and the lower level provides power to EVs. Implementation showed
that this method is feasible and that there is no power loss.
Normality analysis
In [91], Vallejo et al. extended previous work on intelligent monitoring of
substations in the smart grid. They used knowledge-based software agents for data
collection and decision-making. This model integrates new agents that can be used in
different environmental conditions. They used web services for monitoring results.
The experimental results showed 75% absolute normal voltage. This model improves
robustness and provides reconfiguration as well as replication of services. However,
this model requires more data and information for intensity control.
Hill climbing
In [92], Xi et al. presented a win-or-learn-fast policy hill-climbing (WoLF-PHC)
approach for optimal averaging policy learning for the MAS in the smart grid. This
model is applicable in stochastic non-Markov environments. This technique is an
independent self-play game and can achieve a fast convergence rate. Simulation
results showed a 68% faster learning rate than techniques previously used in the
literature, such as q-learning and q-lambda. However, this model faces some
challenges in real implementation, including security and stability problems.
Modeling the smart grid is a complex task due to its complex nature, multi-agent
behavior, and distributed resources. In [93], de Durana et al. presented a model for
a local multi-carrier energy network in the smart grid. This model allows different
types of energy to be transmitted in the smart-grid network. The simulation results
showed rebalancing between energy networks. This model only focuses on smart-grid
operation, not on energy generation and load management.
Swarm intelligence
In the smart grid, frequency needs to be managed and controlled around a reference
value in order to achieve secure and high-quality power flow. Unbalanced frequency
produces unbalanced power distribution to the consumers. In [94], Evora et al.
presented a swarm intelligence agent-based system for frequency management in the
smart grid. It uses decentralized independent agents that can exchange information
in a shared environment. It evaluates three policies: detection, communication, and
stability. However, increasing the microlevel results in undesired effects.
Table 5.5 Literature summary of storage and voltage control

Ref | Technique | Strength | Limitations
de Montigny [74] | Minute-to-minute forecasting, RL | System efficiency improved by 5% | Complexity increased
Bevrani [75] | Dynamic behavior of the system, RL | Zero frequency at optimal solution | Load disturbance at high frequency
Qiu [77] | Management of different types of storage, RL | Energy loss decreased | Complexity increased
Landi and Gross [82] | Battery health estimation, ANN | Error rate 5% | High error rate
Sharma [84] | Disturbance control model, census | Charging efficiency 95% | Only estimates maximum and minimum state of the storage
Pahwa [87] | Nested agent concept for state estimation, goal-based holonic MAS | 93 s execution time | Complexity increased
Vaccaro [89] | Controlling and monitoring microgrid, self-organizing | Estimates state of microgrid | Does not handle semantic representation of data
Vallejo [91] | Substation monitoring, normality analysis | 75% absolute normal voltage | Requires large data
Evora [94] | Frequency management, swarm intelligence | System stability is achieved | Increasing microlevel results in undesired effects on the system

Direct load control (DLC) is used for control and management of demand at the
consumer side. In [95], Hernandez et al. presented a DLC method based on
multi-objective particle swarm optimization for the smart-grid environment.
Appliances operate when constraints are satisfied at the system side. The operation
of appliances was obtained by distributing constraints among neighbor nodes. This
method reduced user energy demand by 20%. However, it was applied to only three
appliances: refrigerator, light, and freezer. Another drawback of this method is that
it must generate a result in a fraction of a second; otherwise, system stability
will be affected. The literature on storage control is summarized in Table 5.5.
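
The particle swarm search behind such a DLC method can be illustrated with a basic single-objective PSO, in which each particle is pulled toward its own best position and the swarm's best position. This sketch is deliberately simpler than the multi-objective variant of [95]; the objective function, bounds, and all parameter values are hypothetical.

```python
import random

def pso_minimize(cost, dim, bounds, particles=20, iterations=200, seed=0,
                 inertia=0.7, c_personal=1.5, c_global=1.5):
    """Basic particle swarm optimization; returns (best position, best cost)."""
    rng = random.Random(seed)
    low, high = bounds
    pos = [[rng.uniform(low, high) for _ in range(dim)] for _ in range(particles)]
    vel = [[0.0] * dim for _ in range(particles)]
    best_pos = [p[:] for p in pos]
    best_cost = [cost(p) for p in pos]
    g = min(range(particles), key=lambda i: best_cost[i])
    global_pos, global_cost = best_pos[g][:], best_cost[g]
    for _ in range(iterations):
        for i in range(particles):
            for d in range(dim):
                vel[i][d] = (inertia * vel[i][d]
                             + c_personal * rng.random() * (best_pos[i][d] - pos[i][d])
                             + c_global * rng.random() * (global_pos[d] - pos[i][d]))
                pos[i][d] = min(high, max(low, pos[i][d] + vel[i][d]))  # stay in bounds
            c = cost(pos[i])
            if c < best_cost[i]:
                best_pos[i], best_cost[i] = pos[i][:], c
                if c < global_cost:
                    global_pos, global_cost = pos[i][:], c
    return global_pos, global_cost

# Hypothetical objective: squared deviation of two appliance set points from targets.
targets = [2.0, -1.0]

def deviation(x):
    return sum((xi - ti) ** 2 for xi, ti in zip(x, targets))

best_x, best_value = pso_minimize(deviation, dim=2, bounds=(-5.0, 5.0))
```

A multi-objective version would keep a set of non-dominated solutions instead of a single global best, which is beyond this sketch.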

5.4 Open research problems and discussion

In this section, we discuss different methodologies and techniques for MAS in the
smart grid and their open research problems. The smart grid brings many benefits to
users, including energy efficiency, customer satisfaction, reduction in energy cost,
and load balancing. However, a number of challenges remain to be researched and
resolved. These challenges involve communication protocols, fault detection and
prevention, power scheduling, load balancing, and storage/voltage control.
How can communication among multiple agents in the smart grid be handled? To address
this question, a number of research efforts have been carried out in the smart-grid
domain. We reviewed two approaches, group communication and learning-based, that are
used to address communication challenges in the smart grid. The group communication
approach involves hierarchal, coalition formation, census, and PSO methods. In
[3–5], a hierarchal scheme is used to handle communication among multiple agents.
In [3], a layer-based framework was proposed, which reduced communication complexity
and voltage variation. In [4], a two-way communication scheme was proposed, and
energy cost decreased from 80 to 50. However, this scheme faces some design issues.
In [5], a cognitive-based scheme is used, which reduced peak power from 55,000 to
900 W. In [6], another scheme based on sharing information about imbalanced power is
proposed. However, in this scheme, agents only keep information about their own
status. In [7], zero correlation schemes were proposed for machine-to-machine
communication. This scheme maintains security and reduces traffic overhead. However,
this approach gives the worst results in small scenarios. In [8,9], the coalition
formation approach was used. In [8], a decentralized dispatch scheme was proposed in
which agents perform tasks as a group or team. However, this scheme showed a large
communication time. In [9], the proposed technique does not handle stochastic
scenarios. In [10], the author addressed the communication-latency problem, and the
results showed a restoration time of 3.983 s. However, this scheme does not handle
individual communication. A census-based scheme was proposed for cost estimation in
[11]; this scheme increased system vulnerability. A PSO scheme based on an adaptive
bidding technique was proposed in [12,96], which reduced buying cost. However, it
also reduced the customer comfort level. Learning-based approaches involve RL, ANN,
and Bayesian learning approaches [13–16]. The RL technique showed a fast convergence
rate. However, it only handles linear time-invariant systems. In [15], ANN learning
is used for power classification, and it showed 98.7% accuracy in system
performance. However, this technique is not robust to error. Bayesian learning is
used in [16] for price and demand calculation in the case of incomplete information.
It showed a 40% increase in total utility. However, this technique does not control
the packet loss rate.
How to handle demand response in the smart-grid environment? To address this
question, different approaches have also been applied in the smart-grid domain. We
reviewed two types of approaches adopted to handle demand response in smart-grid
applications: learning-based and complex system-based approaches. The learning-based
approach involves ANN and RL. ANN is applied to a virtual power plant in [17]; this
showed a 1.5% error rate. However, it needs enough information to predict future
demand. RL is applied in [18] for energy prediction and showed a 91.42% accuracy rate.
However, this method was not applied at different levels. In [19], SA-Q learning was
applied to learn system reservation in order to offer power at different times. This
increased the cost-benefit rate. In [20], demand response was addressed by predicting
the load for the next day. This reduced peak usage by 33%; however, there is no
collaboration among agents. In [21], demand–response management for a single unit
or building was proposed. The technique proposed in [22] maximizes social welfare
and converged in 40 iterations. In [24], online energy cost estimation is proposed, and
the result showed a 40% energy cost reduction.
The complex system category comprises collaborative, complex adaptive system,
demand-side integration, PSO, and game-theory approaches. The collaborative scheme
was discussed in [25–27]. This scheme has open issues regarding communication,
prediction, and probability distribution. The complex adaptive system approach was
applied in [28,29]. This reduced energy cost by 40% and also reduced peak load to the
8%–5% range. However, this approach does not handle high load, and the cost of energy
increased for some users. Demand-side integration was discussed in [30,31]. The open
research issues in this scheme are as follows: it does not estimate the state of
batteries, and it offers fewer energy shares to the users. In [32], the PSO technique
based on the BEMS framework has an unbalanced-situation issue. The game theory approach
was also applied to address the demand–response problem. This approach has open issues
related to sensitivity, information gathering, and the trade-off between cost and PAR.
It is also not suitable for a distribution scheme.
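The PAR mentioned above is, in its usual definition, the peak load divided by the
average load over the scheduling horizon. The following is a minimal sketch under that
assumption; the function name and the hourly load values are hypothetical and are not
taken from any of the surveyed works.

```python
def peak_to_average_ratio(load_kw):
    """Return max(load) / mean(load) for a non-empty load profile."""
    if not load_kw:
        raise ValueError("load profile must be non-empty")
    average = sum(load_kw) / len(load_kw)
    if average <= 0:
        raise ValueError("average load must be positive")
    return max(load_kw) / average


# Hypothetical hourly loads in kW, for illustration only.
baseline = [2.0, 2.0, 2.0, 10.0]   # one sharp peak
shifted = [3.0, 4.0, 4.0, 5.0]     # same total energy, peak spread out

par_baseline = peak_to_average_ratio(baseline)
par_shifted = peak_to_average_ratio(shifted)
```

Both profiles deliver the same total energy, so any difference in PAR comes only from
how the demand is distributed over time, which is the quantity a demand–response
scheme tries to improve.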
How to detect and prevent a fault in the system? To address this challenge, a
number of research efforts have been made and are cited in our review. We grouped
these studies into two categories, i.e., self-organizing and algorithmic approaches. The
self-organizing approach consists of adaptive programming, MAF, and WPH. These
approaches can perform the self-healing task efficiently. However, some open research
problems remain: these approaches require major hardware for implementation, cannot
address complex models, cannot address battery capacity, and provide no global
information discovery.
Algorithmic approaches consist of fuzzy-rule, census, sweep, and spanning tree
techniques. These studies successfully reduced congestion and communication time.
However, some open research problems still need to be addressed. In these schemes,
system performance degrades in the case of failure, and there is no guarantee of
an optimal solution.
How to perform power scheduling? We surveyed the research work and grouped
it into two categories, i.e., complex system and learning-based models.
The complex system category consists of self-organizing, hierarchical, census, and
cognitive-based approaches. The self-organizing approach is discussed in [47]. This
technique showed good performance in terms of monitoring; however, performance degrades
in discharging periods. The hierarchical scheme is discussed in [48–50]; this approach
can handle the starvation problem and achieved a 96.5% fairness index.
However, this scheme increased complexity and computational time. Census-based
approaches are also reviewed in this part for the power-scheduling task. In [52], a
flexibility concept is introduced that provides a flexible strategy for power
transmission. In [53], a central unit is introduced, which achieved a 25.5%
peak-to-average ratio. In [54], the subgradient concept was used for cost
minimization. This showed a fast convergence rate; however, there is no
communication with the main grid. A pruning strategy was discussed in [56] that prunes
those agents which are not participating in the communication. This reduced the search
space size; however, this method is unable to prune those agents which are close to
each other.
Learning-based approaches (RL and ANN) have been adopted to address the power
scheduling problem in the smart grid. With the adoption of RL-based approaches, private
information was protected from external users. It also provides a reverse power flow
facility, where the user can send extra power back to the main grid. The cloud
interaction concept was introduced in [60], where the user and the utility can interact
with each other through the cloud. This reduced energy cost to 33%. An ANN-based
approach is presented in [71], which integrates the wind energy resource with other
resources. However, learning-based approaches still face open research problems: there
is no collaborative learning, there is a conflict between cost and voltage, and there
is no procedure to predict the system state.
How to manage and handle storage devices and voltage? To address this problem,
a number of research works are discussed and reviewed in this part. We grouped
these works into three categories, i.e., learning-based, monitoring, and search-based
approaches. Regarding the learning-based approach, in [74], a minute-to-minute
forecasting strategy was applied. This increased the number of generating units. However,
the computational time also increased. Different types of energy storage devices
were integrated with the system in [77]; this decreased energy loss. In [81], a two-step
ahead forecasting strategy was applied, which showed a 3.94% improvement in battery
life. An ANN-based learning scheme was used in [83] for state estimation and showed
a 5% error rate.
Monitoring-based techniques consist of volt/var, census, and state monitoring.
These techniques control voltage and monitor the system state. Agent-based distributed
control (ABDC), based on the monitoring approach, prevents overcharging and discharging
of the battery. This method achieved 95% and 85% efficiency in charging and
discharging, respectively.
Search-based techniques consist of self-organizing, normality analysis, hill
climbing, and swarm intelligence. The self-organizing technique addressed the application
synchronization problem in [91]. However, this technique is unable to handle semantic
data. Normality analysis is used in [92], which integrates EVs. In [93], a knowledge-based
scheme was used, which provides integration of new agents, reconfiguration, and
replication services. However, this scheme requires large data for intensity control.

5.5 Conclusions
From a simulation and modeling perspective, MAS in the smart grid has recently been
attracting increasing attention from the research community. The growing domains
of interest for MAS in the smart grid are communication protocols, demand
response, self-healing, power scheduling, load balancing, and storage-device management.
A number of research works have been carried out that developed multi-agent
based models for the smart grid in the abovementioned domains.
In this part, we covered the different approaches adopted in MAS for smart-grid
modeling and proposed a classification of MAS models according to the techniques
used for their implementation. We then described each technique and its model.
We also highlighted the open research problems that exist in each solution.
The basic objective of MAS in smart-grid modeling is load balancing, that is, to bring
balance or equilibrium between user demand and generation capacity. In other
words, MAS in smart-grid modeling deals with the energy-optimization process. As far as
the authors are aware, this is the first article that clearly highlights open research
problems for MAS in the smart grid and covers a large number of different research
studies.
The aim of this survey was to allow a comprehensive understanding of the various
emerging developments in the field of the smart grid, the different approaches, and
their advantages and limitations. We hope it will be a good guideline and a starting
point for researchers coming to this field and wishing to increase their knowledge of
the smart-grid domain from the MAS perspective.

References
[1] Manickavasagam K. Intelligent energy control center for distributed gen-
erators using multi-agent system. IEEE Transactions on Power Systems.
2015;30(5):2442–2449.
[2] Siano P. Demand response and smart grids: A survey. Renewable and
Sustainable Energy Reviews. 2014;30:461–478.
[3] Li Q, Chen F, Chen M, et al. Agent-based decentralized control method for
islanded microgrids. IEEE Transactions on Smart Grid. 2016;7(2):637–649.
[4] Al-Agtash S. Electricity agents in smart grid markets. Computers in Industry.
2013;64(3):235–241.
[5] Palicot J, Moy C, Résimont B, et al. Application of hierarchical and distributed
cognitive architecture management for the smart grid. Ad Hoc Networks.
2016;41:86–98.
[6] Larsen GK, van Foreest ND, Scherpen JM. Power supply–demand balance in
a smart grid: An information sharing model for a market mechanism. Applied
Mathematical Modelling. 2014;38(13):3350–3360.
[7] Yan Y, Qian Y, Hu RQ. A secure and efficient scheme for machine-to-
machine communications in smart grid. In: Communications (ICC), 2012
IEEE International Conference on. IEEE; 2012. p. 167–172.
[8] Ye D, Zhang M, Sutanto D. Decentralised dispatch of distributed energy
resources in smart grids via multi-agent coalition formation. Journal of Parallel
and Distributed Computing. 2015;83:30–43.
[9] Dagdougui H, Sacile R. Decentralized control of the power flows in a net-
work of smart microgrids modeled as a team of cooperative agents. IEEE
Transactions on Control Systems Technology. 2014;22(2):510–519.
[10] Nguyen CP, Flueck AJ. Modeling of communication latency in smart grid.
In: Power and Energy Society General Meeting, 2011 IEEE. IEEE; 2011.
p. 1–7.
[11] Zhang Y, Rahbari-Asr N, Chow MY. A robust distributed system incremental
cost estimation algorithm for smart grid economic dispatch with communi-
cations information losses. Journal of Network and Computer Applications.
2016;59:315–324.
[12] Wang Z, Wang L. Adaptive negotiation agent for facilitating bi-directional
energy trading between smart building and utility grid. IEEE Transactions on
Smart Grid. 2013;4(2):702–710.
[13] Yu T, Wang H, Zhou B, et al. Multi-agent correlated equilibrium Q (λ) learning
for coordinated smart generation control of interconnected power grids. IEEE
Transactions on Power Systems. 2015;30(4):1669–1679.
[14] Giraldo J, Mojica-Nava E, Quijano N. Synchronization of isolated microgrids
with a communication infrastructure using energy storage systems.
International Journal of Electrical Power & Energy Systems. 2014;63:71–82.
[15] Saraiva FdO, Bernardes WM, Asada EN. A framework for classification of
non-linear loads in smart grids using artificial neural networks and multi-agent
systems. Neurocomputing. 2015;170:328–338.
[16] Misra S, Bera S, Ojha T, et al. ENTICE: Agent-based energy trading with
incomplete information in the smart grid. Journal of Network and Computer
Applications. 2015;55:202–212.
[17] Hernández L, Baladron C, Aguiar JM, et al. A multi-agent system architecture
for smart grid management and forecasting of energy demand in virtual power
plants. IEEE Communications Magazine. 2013;51(1):106–113.
[18] Mocanu E, Nguyen PH, Kling WL, et al. Unsupervised energy prediction
in a smart grid context using reinforcement cross-building transfer learning.
Energy and Buildings. 2016;116:646–655.
[19] Lakić E, Artač G, Gubina AF. Agent-based modeling of the demand-side
system reserve provision. Electric Power Systems Research. 2015;124:85–91.
[20] Dusparic I, Harris C, Marinescu A, et al. Multi-agent residential demand
response based on load forecasting. In: Technologies for Sustainability
(SusTech), 2013 1st IEEE Conference on. IEEE; 2013. p. 90–96.
[21] Wen Z, O’Neill D, Maei H. Optimal demand response using device-based
reinforcement learning. IEEE Transactions on Smart Grid. 2015;6(5):2312–
2324.
[22] Ruelens F, Claessens BJ, Vandael S, et al. Residential demand response of
thermostatically controlled loads using batch reinforcement learning. IEEE
Transactions on Smart Grid. 2017;8(5):2149–2159.
[23] Zhang W, Xu Y, Liu W, et al. Distributed online optimal energy management
for smart grids. IEEE Transactions on Industrial Informatics. 2015;11(3):
717–727.
[24] O’Neill D, Levorato M, Goldsmith A, et al. Residential demand response using
reinforcement learning. In: Smart Grid Communications (SmartGridComm),
2010 First IEEE International Conference on. IEEE; 2010. p. 409–414.
[25] Golpayegani F, Dusparic I, Taylor A, et al. Multi-agent collaboration for con-
flict management in residential demand response. Computer Communications.
2016;96:63–72.
[26] Le Cadre H, Bedo JS. Dealing with uncertainty in the smart grid: A learning
game approach. Computer Networks. 2016;103:15–32.
[27] Huang H, Li F, Mishra Y. Modeling dynamic demand response using Monte
Carlo simulation and interval mathematics for boundary estimation. IEEE
Transactions on Smart Grid. 2015;6(6):2704–2713.
[28] Kremers E, de Durana JG, Barambones O. Multi-agent modeling for the
simulation of a simple smart microgrid. Energy Conversion and Management.
2013;75:643–650.
[29] Thimmapuram PR, Kim J. Consumers’ price elasticity of demand modeling
with economic effects on electricity markets using an agent-based model. IEEE
Transactions on Smart Grid. 2013;4(1):390–397.
[30] Mocci S, Natale N, Pilo F, et al. Demand side integration in LV smart grids
with multi-agent control system. Electric Power Systems Research. 2015;125:
23–33.
[31] Nunna HK, Saklani AM, Sesetti A, et al. Multi-agent based demand response
management system for combined operation of smart microgrids. Sustainable
Energy, Grids and Networks. 2016;6:25–34.
[32] Hurtado L, Nguyen P, Kling W. Smart grid and smart building inter-operation
using agent-based particle swarm optimization. Sustainable Energy, Grids and
Networks. 2015;2:32–40.
[33] Wei W, Liu F, Mei S. Energy pricing and dispatch for smart grid retailers under
demand response and market price uncertainty. IEEE Transactions on Smart
Grid. 2015;6(3):1364–1374.
[34] Chai B, Chen J, Yang Z, et al. Demand response management with multiple
utility companies: A two-level game approach. IEEE Transactions on Smart
Grid. 2014;5(2):722–731.
[35] Song L, Xiao Y, Van Der Schaar M. Demand side management in smart
grids using a repeated game framework. IEEE Journal on Selected Areas in
Communications. 2014;32(7):1412–1424.
[36] Nunna HK, Doolla S. Demand response in smart distribution system with
multiple microgrids. IEEE Transactions on Smart Grid. 2012;3(4):1641–1649.
[37] O’Brien G, El Gamal A, Rajagopal R. Shapley value estimation for compen-
sation of participants in demand response programs. IEEE Transactions on
Smart Grid. 2015;6(6):2837–2844.
[38] Babalola A, Belkacemi R, Zarrabian S. Real-time cascading failures prevention
for multiple contingencies in smart grids through a multi-agent system. IEEE
Transactions on Smart Grid. 2016;9(1):373–385.
[39] Nassar ME, Salama MM. Adaptive self-adequate microgrids using dynamic
boundaries. IEEE Transactions on Smart Grid. 2016;7(1):105–113.
[40] Chen C, Wang J, Qiu F, et al. Resilient distribution system by micro-
grids formation after natural disasters. IEEE Transactions on Smart Grid.
2016;7(2):958–966.
[41] Rahman M, Mahmud M, Pota H, et al. A multi-agent approach for enhancing
transient stability of smart grids. International Journal of Electrical Power &
Energy Systems. 2015;67:488–500.
[42] Xi L, Zhang Z, Yang B, et al. Wolf pack hunting strategy for automatic gener-
ation control of an islanding smart distribution network. Energy Conversion
and Management. 2016;122:10–24.
[43] Elmitwally A, Elsaid M, Elgamal M, et al. A fuzzy-multiagent self-
healing scheme for a distribution system with distributed generations. IEEE
Transactions on Power Systems. 2015;30(5):2612–2622.
[44] Teng F, Sun Q, Xie X, et al. A disaster-triggered life-support load restora-
tion framework based on multi-agent consensus system. Neurocomputing.
2015;170:339–352.
[45] Nguyen CP, Flueck AJ. A novel agent-based distributed power flow solver for
smart grids. IEEE Transactions on Smart Grid. 2015;6(3):1261–1270.
[46] Eriksson M, Armendariz M, Vasilenko OO, et al. Multiagent-based distribution
automation solution for self-healing grids. IEEE Transactions on Industrial
Electronics. 2015;62(4):2620–2628.
[47] Colson CM, Nehrir MH. Comprehensive real-time microgrid power manage-
ment and control with distributed agents. IEEE Transactions on Smart Grid.
2013;4(1):617–627.
[48] Hu J, Morais H, Lind M, et al. Multi-agent based modeling for electric vehi-
cle integration in a distribution network operation. Electric Power Systems
Research. 2016;136:341–351.
[49] Chao HL, Hsiung PA. A fair energy resource allocation strategy for micro grid.
Microprocessors and Microsystems. 2016;42:235–244.
[50] Rahman M, Mahmud M, Oo A, et al. Agent-based reactive power management
of power distribution networks with distributed energy generation. Energy
Conversion and Management. 2016;120:120–134.
[51] Radhakrishnan BM, Srinivasan D. A multi-agent based distributed energy
management scheme for smart grid applications. Energy. 2016;103:
192–204.
[52] Li Y, Yong T, Cao J, et al. A consensus control strategy for dynamic power
system look-ahead scheduling. Neurocomputing. 2015;168:1085–1093.
[53] Guo F, Wen C, Mao J, et al. Distributed economic dispatch for smart grids
with random wind power. IEEE Transactions on Smart Grid. 2016;7(3):
1572–1583.
[54] Kahrobaee S, Rajabzadeh RA, Soh LK, et al. A multiagent modeling and inves-
tigation of smart homes with power generation, storage, and trading features.
IEEE Transactions on Smart Grid. 2013;4(2):659–668.
[55] Samadi P, Mohsenian-Rad H, Wong VW, et al. Tackling the load uncer-
tainty challenges for energy consumption scheduling in smart grid. IEEE
Transactions on Smart Grid. 2013;4(2):1007–1016.
[56] Gregoratti D, Matamoros J. Distributed energy trading: The multiple-microgrid
case. IEEE Transactions on Industrial Electronics. 2015;62(4):2551–2559.
[57] Bu S, Yu FR. Green cognitive mobile networks with small cells for multi-
media communications in the smart grid environment. IEEE Transactions on
Vehicular Technology. 2014;63(5):2115–2126.
[58] Wang H, Huang T, Liao X, et al. Reinforcement learning in energy trading
game among smart microgrids. IEEE Transactions on Industrial Electronics.
2016;63(8):5109–5119.
[59] Samadi P, Wong VW, Schober R. Load scheduling and power trading in systems
with high penetration of renewable energy resources. IEEE Transactions on
Smart Grid. 2016;7(4):1802–1812.
[60] Sheikhi A, Rayati M, Ranjbar A. Dynamic load management for a residential
customer: Reinforcement learning approach. Sustainable Cities and Society.
2016;24:42–51.
[61] Ghorbani MJ, Choudhry MA, Feliachi A. A multiagent design for power
distribution systems automation. IEEE Transactions on Smart Grid. 2016;7(1):
329–339.
[62] Venayagamoorthy GK, Sharma RK, Gautam PK, et al. Dynamic energy man-
agement system for a smart microgrid. IEEE Transactions on Neural Networks
and Learning Systems. 2016;27(8):1643–1656.
[63] Li D, Jayaweera SK. Reinforcement learning aided smart-home decision-
making in an interactive smart grid. In: Green Energy and Systems Conference
(IGESC), 2014 IEEE. IEEE; 2014. p. 1–6.
[64] Rayati M, Sheikhi A, Ranjbar AM. Applying reinforcement learning method
to optimize an Energy Hub operation in the smart grid. In: Innovative Smart
Grid Technologies Conference (ISGT), 2015 IEEE Power & Energy Society.
IEEE; 2015. p. 1–5.
[65] Kim BG, Zhang Y, van der Schaar M, et al. Dynamic pricing and energy
consumption scheduling with reinforcement learning. IEEE Transactions on
Smart Grid. 2016;7(5):2187–2198.
[66] Wang X, Zhang M, Ren F, et al. GongBroker: A broker model for power trading
in smart grid markets. In: Web Intelligence and Intelligent Agent Technology
(WI-IAT), 2015 IEEE/WIC/ACM International Conference on. vol. 2. IEEE;
2015. p. 21–24.
[67] Lim Y, Kim HM. Strategic bidding using reinforcement learning for load
shedding in microgrids. Computers & Electrical Engineering. 2014;40(5):
1439–1446.
[68] Zhang Y, van der Schaar M. Structure-aware stochastic load management in
smart grids. In: INFOCOM, 2014 Proceedings IEEE. IEEE; 2014. p. 2643–
2651.
[69] Liao H, Wu Q, Jiang L. Multi-objective optimization by reinforcement learning
for power system dispatch and voltage stability. In: Innovative Smart Grid
Technologies Conference Europe (ISGT Europe), 2010 IEEE PES. IEEE; 2010.
p. 1–8.
[70] Shirzeh H, Naghdy F, Ciufo P, et al. Balancing energy in the smart grid
using distributed value function (DVF). IEEE Transactions on Smart Grid.
2015;6(2):808–818.
[71] Motevasel M, Seifi AR. Expert energy management of a micro-grid con-
sidering wind energy uncertainty. Energy Conversion and Management.
2014;83:58–72.
[72] Li FD, Wu M, He Y, et al. Optimal control in microgrid using multi-agent
reinforcement learning. ISA Transactions. 2012;51(6):743–751.
[73] Salehizadeh MR, Soltaniyan S. Application of fuzzy Q-learning for electricity
market modeling by considering renewable power penetration. Renewable and
Sustainable Energy Reviews. 2016;56:1172–1181.
[74] de Montigny M, Heniche A, Kamwa I, et al. Multiagent stochastic simulation
of minute-to-minute grid operations and control to integrate wind generation
under AC power flow constraints. IEEE Transactions on Sustainable Energy.
2013;4(3):619–629.
[75] Daneshfar F, Bevrani H. Load-frequency control: A GA-based multi-agent
reinforcement learning. IET Generation, Transmission & Distribution.
2010;4(1):13–26.
[76] Wei Q, Liu D, Shi G. A novel dual iterative Q-learning method for optimal
battery management in smart residential environments. IEEE Transactions on
Industrial Electronics. 2015;62(4):2509–2518.
[77] Qiu X, Nguyen TA, Crow ML. Heterogeneous energy storage optimization for
microgrids. IEEE Transactions on Smart Grid. 2016;7(3):1453–1461.
[78] Wang Y, Lin X, Pedram M. A near-optimal model-based control algo-
rithm for households equipped with residential photovoltaic power generation
and energy storage systems. IEEE Transactions on Sustainable Energy.
2016;7(1):77–86.
[79] Kuznetsova E, Li YF, Ruiz C, et al. Reinforcement learning for microgrid
energy management. Energy. 2013;59:133–146.
[80] Vandael S, Claessens B, Ernst D, et al. Reinforcement learning of heuristic EV
fleet charging in a day-ahead electricity market. IEEE Transactions on Smart
Grid. 2015;6(4):1795–1805.
[81] Guan C, Wang Y, Lin X, et al. Reinforcement learning-based control of res-
idential energy storage systems for electric bill minimization. In: Consumer
Communications and Networking Conference (CCNC), 2015 12th Annual
IEEE. IEEE; 2015. p. 637–642.
[82] Landi M, Gross G. Measurement techniques for online battery state of
health estimation in vehicle-to-grid applications. IEEE Transactions on
Instrumentation and Measurement. 2014;63(5):1224–1234.
[83] Zhang X, Flueck AJ, Nguyen CP. Agent-based distributed volt/var control with
distributed power flow solver in smart grid. IEEE Transactions on Smart Grid.
2016;7(2):600–607.
[84] Sharma DD, Singh S, Lin J. Multi-agent based distributed control of distributed
energy storages using load data. Journal of Energy Storage. 2016;5:134–145.
[85] Sharma A, Srivastava SC, Chakrabarti S. Multi-agent-based dynamic state
estimator for multi-area power system. IET Generation, Transmission &
Distribution. 2016;10(1):131–141.
[86] Teleke S, Baran ME, Bhattacharya S, et al. Rule-based control of battery energy
storage for dispatching intermittent renewable sources. IEEE Transactions on
Sustainable Energy. 2010;1(3):117–124.
[87] Pahwa A, DeLoach SA, Natarajan B, et al. Goal-based holonic multiagent
system for operation of power distribution systems. IEEE Transactions on
Smart Grid. 2015;6(5):2510–2518.
[88] Klaimi J, Merghem-Boulahia L, Rahim-Amoud R, et al. An energy manage-
ment approach for smart-grids using intelligent storage systems. In: Digital
Information and Communication Technology and its Applications (DICTAP),
2015 Fifth International Conference on. IEEE; 2015. p. 26–31.
[89] Vaccaro A, Loia V, Formato G, et al. A self-organizing architecture for decen-
tralized smart microgrids synchronization, control, and monitoring. IEEE
Transactions on Industrial Informatics. 2015;11(1):289–298.
[90] Hu J, Saleem A, You S, et al. A multi-agent system for distribution grid
congestion management with electric vehicles. Engineering Applications of
Artificial Intelligence. 2015;38:45–58.
[91] Vallejo D, Albusac J, Glez-Morcillo C, et al. A multi-agent approach to intelligent
monitoring in smart grids. International Journal of Systems Science.
2014;45(4):756–777.
[92] Xi L, Yu T, Yang B, et al. A novel multi-agent decentralized win or learn fast pol-
icy hill-climbing with eligibility trace algorithm for smart generation control
of interconnected complex power grids. Energy Conversion and Management.
2015;103:82–93.
[93] de Durana JMG, Barambones O, Kremers E, et al. Agent based modeling of
energy networks. Energy Conversion and Management. 2014;82:308–319.
[94] Evora J, Hernandez JJ, Hernandez M, et al. Swarm intelligence for frequency
management in smart grids. Informatica. 2015;26(3):419–434.
[95] Evora J, Hernandez JJ, Hernandez M. A MOPSO method for direct load control
in smart grid. Expert Systems with Applications. 2015;42(21):7456–7465.
[96] Cheng X, Cao R, Yang L. Relay-aided amplify-and-forward powerline
communications. IEEE Transactions on Smart Grid. 2013;4(1):265–272.
Chapter 6
Shortest path models for scale-free network
topologies: literature review and cross
comparisons
Agnese V. Ventrella¹,², Giuseppe Piro¹,²,
and Luigi Alfredo Grieco¹,²

The term Internet refers to the global network infrastructure, connecting more than
15 billion devices around the world. At the time of this writing, it supports a
massive distribution of information, which reaches around 1.5 ZB per year. These
estimates, however, are continuously growing: by 2021, the annual global traffic will
grow to 3.3 ZB per year [1,2]. Therefore, the Internet appears as a complex system
that is continuously evolving over time.
Knowledge of the Internet topology has always been considered an important
aspect for researchers, industries, and service providers. In fact, it is extremely
useful to evaluate network resilience [3], analyze topological properties and their
evolution [4], predict and improve the performance of communication protocols and
the effectiveness of routing algorithms [5], solve specific problems involving a
particular topological structure (e.g., how to distribute storage across routers in order
to obtain an optimal caching allocation) [6], and so on. Thus, analytical models showing
Internet characteristics (like average shortest path and shortest path distribution) and
simulation tools able to reproduce Internet-like topologies are key instruments for
most of the research activities in this context.
Nevertheless, the complexity and the dynamism of the overall Internet architecture
make the study of the Internet topology one of the hottest and hardest research
topics [7]. First of all, it is important to have a clear definition of topology.
According to the Open System Interconnection (OSI) model, a topology represents a
simplified way to depict the interconnections among communication entities [8]. But
what communication entities refers to is not completely clear a priori. The scientific
literature, for instance, considers three main levels of granularity, namely interface
level, router level, and Autonomous System (AS) level [9–11]. Thus, a network topology
may expose different information according to the level of granularity taken into
account. Second, differently from other large networks, like the public switched
telephone network, the Internet did not grow according to a topological design developed
by some central authority or administration [12]. Hence, huge dimension, rapid change,
and lack of publicly available information inevitably make it hard to capture a complete
snapshot of the overall network infrastructure [13].

¹ Department of Electrical and Information Engineering (DEI), Politecnico di Bari, Bari, Italy
² CNIT, Consorzio Nazionale Interuniversitario per le Telecomunicazioni, Italy

To solve this issue, several methodologies were introduced to infer topology
information, based on both active and passive approaches. These mechanisms must
be properly configured and adapted when applied to the interface, router, and AS levels
of granularity. At the same time, however, it is also important to consider the set of
limitations they introduce, so as to better estimate the level of accuracy of the
retrieved data [10,14–17].
Starting from inferred data, it is possible to formulate mathematical models able
to capture statistical characteristics of the Internet. Graph theory is widely used to
reach this goal [18]. In fact, many models have already been developed, which refer to
regular, random, small-world, and the more recent power-law and scale-free graphs
[11,19–22]. Among them, however, the scale-free graph is widely accepted as the
model best able to represent Internet-like topologies. A number of network simulators
already implement these models and are able to reproduce Internet-like topologies
that can be used in a variety of research activities.
Another important step forward in the study of the Internet topology is the modeling
of the shortest path connecting any pair of peers attached to the communication system.
The scientific literature already provides models for both the average shortest path and
the distribution of the shortest path length [23–26].
Based on these premises, this chapter provides an overview
of Internet-like topologies, covering a broad set of aspects, including the level of
granularity, methodologies useful to retrieve topology information, simulation tools,
and analytical models. Then, the accuracy of reference models for the distribution
of the shortest path length (i.e., Gamma, Lognormal, and Weibull distributions) is
evaluated through a massive simulation campaign, carried out by using the Boston
university Representative Internet Topology gEnerator (BRITE) tool [27]. On the one
hand, the obtained results demonstrate that the available models are able to capture the
average value and the distribution of the shortest path over a very broad set of conditions.
On the other hand, they also highlight an unresolved issue: the models require
case-by-case tuning of their parameters.
The rest of this chapter is organized as follows. Section 6.1 presents the
main levels of granularity of the Internet topology and reviews active and passive
methodologies useful to collect data. Section 6.2 discusses Internet topology models
based on the graph theory and provides an overview of topology generator tools.
Section 6.3 investigates, through computer simulations, the accuracy of analytical
models developed for scale-free networks and identifies useful applications of the
shortest path distribution. Finally, Section 6.4 draws the conclusions.

6.1 Mapping the Internet topology

The scientific literature generally describes the Internet topology through different
levels of granularity. In all cases, however, graph theory is widely adopted
as a key instrument that captures well the required details of the overall network
architecture [10,28]. Such a consideration is also valid for Internet-like topologies,
such as a restricted portion of the Internet handled by a single Internet Service Provider
(ISP). In fact, at the time of this writing, it is common to represent the Internet topology
as an undirected graph, G. More specifically, this graph is characterized by
the ordered pair G = (N, E), where N is a set of vertices (also called nodes or
points) connected by a set E of edges (also called arcs or lines) [29–31]. Without loss
of generality, it is possible to assume that devices belonging to the global network
infrastructure establish a bidirectional relationship. Therefore, the graph is considered
undirected because edges do not have any orientation.
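As a minimal sketch of the undirected graph G = (N, E) described above, the following Python fragment stores each edge in both directions of an adjacency map. The function name and the four-node toy topology are hypothetical and serve only as an illustration.

```python
from collections import defaultdict


def build_undirected_graph(edges):
    """Return an adjacency map {node: set of neighbors} for an undirected graph."""
    adjacency = defaultdict(set)
    for u, v in edges:
        # An undirected edge has no orientation, so it is recorded both ways.
        adjacency[u].add(v)
        adjacency[v].add(u)
    return dict(adjacency)


# Hypothetical toy topology with four nodes and four edges.
toy_edges = [("a", "b"), ("b", "c"), ("c", "a"), ("c", "d")]
toy_graph = build_undirected_graph(toy_edges)
```

The same adjacency-map layout can represent any of the three levels of granularity; only the meaning of a node changes.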
The roles played by nodes and edges in an Internet-like topology
strictly depend on the level of granularity selected to model the network itself.
Conventional approaches include interface level, router level, and Autonomous System
(AS) level [9–11] (see the preliminary overview depicted in Figure 6.1). It
is important to note that details about network topology, routing policies, peer-
ing relationships, and resilience are commercially sensitive, could expose potential
vulnerability to attackers, and reveal resilience planning. Accordingly, they are not
publicly available. At the same time, the network is dynamic and constantly evolv-
ing because of failures, maintenance, and upgrades. For these reasons, information
regarding both global structure and local properties of the Internet cannot be retrieved
in an easy way. Nevertheless, dedicated approaches can be used to partially solve this
problem. They can be divided into two kinds of methodologies, namely, passive and
active [32]. The passive method learns the presence of nodes and their interactions by
simply collecting the information that flows over a link and is generated by other
communication protocols (which operate for different purposes). The active method, instead,
sends dedicated packets (i.e., probe messages) to target devices in the
network and collects the related responses.
The following paragraphs describe the three levels of granularity introduced
above. At the same time, they also present the most important passive and active
methodologies used to retrieve and study Internet or Internet-like network topologies.
For each single strategy, pros and cons are evaluated too (see the summary reported
in Table 6.1). Finally, they also provide an overview regarding geographic network
topologies.

6.1.1 Interface level

The current Internet is based on the host-centric communication paradigm, and the
data exchange is handled through the well-known Transmission Control Protocol
(TCP)/Internet Protocol (IP) stack. In particular, the IP protocol implements net-
working functionalities. Among its other specifications, it identifies any network
interface of hosts, servers, and routers, through an IP address. Today, for instance,
two versions of the IP protocol can be used: IPv4 and IPv6. The former uses a 32-bit
address scheme. The latter adopts a 128-bit address scheme [33].
The interface level of granularity depicts the Internet topology by paying attention
to network interfaces having an IP address, as well as to their peer-to-peer connections
(see Figure 6.1(a)). In this way, a node of the graph describing the considered
topology maps a given network interface and edges refer to direct connections between
nodes [10]. Routers with multiple configured network interfaces are mapped to
multiple logical nodes. Thus, the resulting interface-level topology embraces a number
of nodes equal to the number of active network interfaces with an IP address
and a number of edges equal to the number of direct connections established at
the network layer.

Figure 6.1 Internet topology at three main levels of granularity: (a) interface level,
(b) router level, and (c) AS level

Table 6.1 Methodologies used to retrieve and study Internet-like topologies and
their related issues

Level of granularity | Learning technique                  | Related issues
---------------------+-------------------------------------+------------------------------------------------------------------------------
Interface level      | Active: Traceroute                  | Absence of ICMP-enabled routers; presence of load-balancing strategies
                     | Active: IP options                  | Absence of routers supporting IP options
                     | Active: Subnet discovery            | Possibility to have incomplete data
Router level         | Active: Alias resolution            | Possibility to have incomplete data
                     | Passive: Recursive router discovery | Limited access to the database; absence of routers supporting IPv4 multicast
AS level             | Passive: BGP                        | Limited capabilities of monitors
                     | Passive: Internet Routing Registry  | Stale or incomplete data
                     | Active: Traceroute                  | Absence of ICMP-enabled routers; presence of load-balancing strategies

Interface-level topologies are generally learned through active methodologies
based on the traceroute tool, the usage of IP options, and subnet discovery.

6.1.1.1 Active methodology based on traceroute

At the time of this writing, traceroute is one of the most popular tools adopted to acquire
topology details [34]. It was originally written to detect communication problems
within a network, such as routing loops and black holes, as well as to locate
where those failures occur. Subsequently, it has been used for other purposes,
including the active discovery of Internet-like topologies. This tool is available for
most operating systems, including Apple macOS, Unix systems, and Microsoft
Windows. In the latter case, however, it is generally known by a different name, that
is, tracert. By default, traceroute works with IPv4. But an updated version for IPv6 is
also available: traceroute6 for Apple macOS and Unix systems; tracert6 for Microsoft
Windows [33].
From a technical point of view, traceroute relies on the Internet Control Message
Protocol (ICMP) [35,36], a messaging protocol that works alongside
the IP protocol and offers support for routing operations, network diagnostics,
and error notification. With traceroute, ICMP is used to discover the forwarding
path and to measure the communication delay between a source node (i.e., the one that runs the tool)
and a target network interface (i.e., an interface belonging to the studied network).
To this end, a train of probe messages is delivered through the User Datagram Protocol
(UDP) with a variable value of the time-to-live (TTL) field of the IP header. For
readers less familiar with the IP protocol, it is worth recalling that the
TTL field of the IP header is used to limit the lifetime of an IP datagram within the
network. For example, if the TTL value is equal to x, the corresponding
IP datagram can pass, at most, through x consecutive routers before being discarded.
This is because every intermediate router decrements the TTL value by 1 before
triggering the forwarding process. Therefore, as soon as the TTL value reaches
0, the corresponding IP datagram is no longer forwarded toward the destination
interface, and an ICMP Time Exceeded message is sent back to the source node for
notification purposes.
Starting from these premises, traceroute works as follows. At the beginning, the
device that runs the tool issues a group of probe messages whose TTL value is set
to 1. Note that more than one message is sent at each step because the procedure
intends to collect statistical information related to communication delays (such as
minimum, maximum, and average value of the round trip time, generally expressed
in milliseconds). These initial packets reach only the node directly connected to the
sender, before being discarded. The ICMP Time Exceeded messages generated by
this node are used by the sender to infer details about the first network interface of
the forwarding path. Then, a new set of probe messages is sent with a TTL value set
to 2. In line with the process described above, the sender can now learn information
about the second hop of the forwarding path toward the destination. This process is
repeated until the destination node is reached. At the end, the sender collects some
details of the network topology, on a hop-by-hop basis [11].
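The hop-by-hop procedure described above can be sketched as a small simulation. It does not send real packets: the forwarding path and the set of interfaces that answer probes are hypothetical inputs, and the function name is an assumption made here for illustration only.

```python
def simulate_traceroute(path, responding, max_ttl=30):
    """Simulate TTL-limited probing along a fixed forwarding path.

    path       -- ordered list of interface addresses; the last one is the target
    responding -- set of interfaces that send an ICMP reply when a probe expires
    Returns a list of (ttl, address) pairs; address is None for silent hops.
    """
    hops = []
    for ttl in range(1, min(max_ttl, len(path)) + 1):
        # A probe sent with this TTL expires at the ttl-th hop of the path.
        hop = path[ttl - 1]
        hops.append((ttl, hop if hop in responding else None))
    return hops


# Hypothetical four-hop path in which the second router never replies.
example_path = ["10.0.0.1", "10.0.1.1", "10.0.2.1", "192.0.2.7"]
example_hops = simulate_traceroute(
    example_path, responding={"10.0.0.1", "10.0.2.1", "192.0.2.7"}
)
```

The silent second hop shows up as a missing entry in the learned path, which is the kind of gap that affects topologies built from such measurements.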
It is important to remark that two main limitations affect traceroute [14]. First, if
some routers do not implement ICMP, the acquired forwarding path will miss
some of the intermediate network interfaces. Second, in the event that an intermediate
router implements a load-balancing strategy, traceroute will generate results referring
to multiple paths through which packets are sent. Thus, the acquired forwarding path
will include additional network interfaces, and the learned network topology may
not exactly reflect reality.

6.1.1.2 IP options and subnet discovery

Two further measurement techniques exploit IP packet options field and subnet
discovery [10].
The options available within the IP header could be useful to support additional
functionalities, such as the packet routing toward a path that is different from the
usual one or the registration of specific information related to the network topology.
For instance, the source routing option allows the discovery of new paths. When
it is enabled, in fact, the sender can choose at most nine routers that the packet
is supposed to go through before reaching the destination. Additionally, the record
route option can be used to allow routers involved in the forwarding process to store
their IP addresses within a dedicated list available in another option field of the IP
header. This information is used by the destination node to learn a multi-hop path
connecting it to the sender. A drawback is that these options are not supported by all
routers.
The subnet discovery technique is based on the subnetting concept. A subnet is
a logical subdivision of an IP network, where all the devices are addressed with a
common group of most significant bits (i.e., the IP prefix). This technique exploits the IP
prefix to detect the subnet boundaries and reveals the pingable IP addresses available
in the subnet. The achieved results are then used to build the network topology.
This technique can suffer from incomplete data because of relationship policies and
routing preferences that make probe packets observe only some paths and miss
others.
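The role of the IP prefix in subnet discovery can be illustrated with Python's standard ipaddress module. This is only a sketch under stated assumptions: it enumerates the candidate addresses that a prober could ping and checks whether two addresses share a prefix, without sending any packet. The helper names and the example prefix (taken from a documentation address range) are hypothetical.

```python
import ipaddress


def candidate_addresses(prefix):
    """List the usable host addresses inside a prefix (candidates for probing)."""
    network = ipaddress.ip_network(prefix)
    return [str(host) for host in network.hosts()]


def same_subnet(addr_a, addr_b, prefix_len):
    """Return True when both addresses share the same prefix of the given length."""
    # strict=False lets a host address be used to derive its enclosing network.
    network = ipaddress.ip_network(f"{addr_a}/{prefix_len}", strict=False)
    return ipaddress.ip_address(addr_b) in network


candidates = candidate_addresses("192.0.2.0/30")
```

In an actual measurement, each candidate would then be probed, and only the responding addresses would be added to the topology.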

6.1.2 Router level

Unlike the interface level of granularity, when the router-level approach is
chosen, each router is mapped to only one node of the graph, without considering the
possibility that several network interfaces with different IP addresses can coexist in the
same device (see Figure 6.1(b)). In other words, a node is viewed as an aggregation
of network interfaces that belong to a single device. Therefore, the router level of
granularity describes how routers are connected to each other within an Internet-like
topology. In the resulting graph, nodes represent routers and edges indicate networking
connectivity among them [10,11].
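The aggregation of interfaces into routers can be sketched as follows. The mapping from interface addresses to router identifiers is assumed to be already known (in practice it is the output of the alias resolution step), and all names and addresses below are hypothetical.

```python
def collapse_to_router_level(interface_edges, interface_to_router):
    """Turn interface-level edges into router-level edges.

    interface_edges     -- iterable of (interface_a, interface_b) pairs
    interface_to_router -- dict mapping each interface to its router identifier
    Returns a set of frozensets, one per undirected router-to-router link.
    """
    router_edges = set()
    for a, b in interface_edges:
        router_a = interface_to_router[a]
        router_b = interface_to_router[b]
        # Links between two interfaces of the same router disappear at this level.
        if router_a != router_b:
            router_edges.add(frozenset((router_a, router_b)))
    return router_edges


# Hypothetical example: router R2 owns two interfaces.
aliases = {"10.0.0.1": "R1", "10.0.0.2": "R2", "10.0.1.1": "R2", "10.0.1.2": "R3"}
links = collapse_to_router_level(
    [("10.0.0.1", "10.0.0.2"), ("10.0.1.1", "10.0.1.2")], aliases
)
```

Four interface-level nodes collapse into three router-level nodes, with R2 acting as the common neighbor of R1 and R3.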
The details about router-level topologies can be obtained by means of alias
resolution and recursive router discovery techniques.

6.1.2.1 Alias resolution techniques

Alias resolution is an active method, still based on the traceroute tool. While traceroute
is used to infer the forwarding path on a hop-by-hop basis (as previously discussed),
additional methods are implemented for properly mapping network interfaces to the
right nodes of the topology.
One possibility is based on the fingerprint technique [17]. Here, a device
interested in building a router-level representation of a network focuses on a
remote network interface with a known IP address. Such a device issues fake UDP
or TCP packets to that IP address, setting the destination port number to an unused
value. As expected, the remote network interface replies with an ICMP Port Unreachable
error message. If the received error message contains an IP source
address that is different from the contacted one, the device performing the alias
resolution technique recognizes that these IP addresses refer to network interfaces
configured in the same router. Therefore, according to the router level of granularity,
these interfaces will be aggregated into the resulting topology representation.
Another approach is referred to as the IP-identification fingerprint method. During
this learning procedure, a device identifies two IP addresses that are potential aliases.
Then, it sends a UDP probe packet to both interfaces, setting the destination port
number to a high (and unused) value. The two destination interfaces
reply with an ICMP Port Unreachable error message. The device collects the IDs
of the received messages, that is, $id_1$ and $id_2$. A third fake packet is
sent to the destination IP address from which it has received the first ICMP Port
Unreachable error message. Again, the remote destination interface answers with
a new ICMP Port Unreachable error message with ID equal to $id_3$. Now, if
$id_1 < id_2 < id_3$ and the difference $id_3 - id_1$ is below a given threshold,
the IP-identification fingerprint method assumes that the contacted IP addresses are
aliases. Thus, their related network interfaces are aggregated in the resulting topology
representation [37].
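The decision rule of the IP-identification fingerprint method can be written as a one-line test. The sketch below assumes that the three IDs have already been collected; the function name and the default threshold are hypothetical, and the wrap-around of the 16-bit IP identification counter is deliberately ignored.

```python
def likely_aliases(id1, id2, id3, threshold=200):
    """Apply the ordering and closeness test on three collected IP IDs.

    threshold is an illustrative value, not one prescribed by the method.
    Counter wrap-around is not handled in this sketch.
    """
    return id1 < id2 < id3 and (id3 - id1) < threshold
```

For instance, the triple (1000, 1003, 1007) passes the test, whereas (1000, 40000, 1007) fails it because the IDs are not in increasing order, which suggests two independent counters and therefore two distinct routers.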
Finally, analytical techniques could also be introduced for further solving alias
resolution. Here, the common IP address assignment scheme is used to infer IP aliases that
belong to two opposite paths. After having identified the subnets, aliases are
inferred by analyzing path segments [11].
It is possible to conclude that alias resolution techniques are generally considered
accurate. Sometimes, however, the retrieved data could be incomplete. The reason is that
traceroute can fail when nodes are disconnected, turned off, or configured not to
respond to probe packets [17].

6.1.2.2 Recursive router discovery

Recursive router discovery is a passive method that exploits the capability of routers
to be queried in order to retrieve information about their neighbors.
In Local Area Networks (LANs), the Simple Network Management Protocol
(SNMP) is frequently used for handling network monitoring and for storing collected
data in a dedicated Management Information Base (MIB). In particular, SNMP-
enabled routers store the list of neighbor interfaces within the ipRoute table of the
MIB. Information collected through SNMP could be useful to build a router-level
description of an Internet-like topology. On the one hand, this approach may provide
accurate data. On the other hand, its usage is highly restricted because the MIB
is accessible only by network administrators [10].
Another solution implementing the recursive router discovery approach is
MRINFO, which is based on the Internet Group Management Protocol (IGMP). IGMP
was initially standardized to allow hosts and adjacent routers to establish multicast
group memberships. With MRINFO, an IGMP Ask Neighbors message is issued in
order to receive the list of all the router’s interfaces and their related neighbors. The
answer is reported in the IGMP Neighbors Reply message [16]. Unfortunately, this
technique can only be used with IPv4 multicast enabled routers.

6.1.3 AS level

Before introducing the last level of granularity useful to describe Internet-like
topologies, it is important to remark that the global network infrastructure appears as
an interconnection of several autonomous systems (ASs). Each AS is made up of a group
of routers deployed by one or more network operators, on behalf of a single
administrative entity [38]. For instance, an AS can refer to the network of a large company,
a university, a network service provider, and so on. Typically, individual users, small
enterprise networks, and ASs located at the edge of the Internet can join the global
network through other ASs, namely, ISPs. In turn, ISPs may obtain the same service
from one or more upstream ISPs. Each AS is uniquely identified by an AS number
(ASN). Originally, it was defined as a 16-bit integer (admitting a maximum of
65,536 assignments). Then, a 32-bit ASN was introduced in order to uniquely
identify a higher number of ASs [39]. In addition, ASs are divided into two categories:
transit and stub. A transit AS is part of the core network and usually carries traffic
between isolated domains, managed by different administrative entities. A stub AS,
instead, provides Internet connectivity to end users. Thus, on one side, it is
connected to end users. On the other side, it is connected to the rest of the Internet
through one or more transit ASs. Sometimes, the administrator of a given AS can
change its own traffic relationship with other providers, thus modifying the overall
network architecture and making the resulting topology evolve constantly.
The AS level of granularity, also known as inter-domain description, depicts the
Internet architecture as a group of interconnected ASs. Accordingly, it leads to an
undirected graph where each node identifies one AS and edges represent the logical
peering relationship between two adjacent ASs (see Figure 6.1(c)). Despite its
coarse level of detail, the AS level of granularity is frequently leveraged to study,
control, optimize, and implement inter-domain routing, mechanisms for the
provisioning of the quality of service, and customer-provider and peering relationships
between ISPs.
Also in this case, both passive and active mechanisms can be used to infer infor-
mation related to the AS level topology. The first mechanism basically collects data
generated by the Border Gateway Protocol (BGP) [40] or provided by the Inter-
net Routing Registry [41]. The second one investigates forwarding paths through
traceroute.

6.1.3.1 Passive methodology based on BGP and Internet Routing Registry

BGP is the current de facto standard for inter-domain routing. It allows the exchange
of routing information between ASs without revealing detailed and internal
information about their own networks. In particular, a BGP-compliant router obtains
information about existing routes from its BGP neighbors. The obtained routes are
processed and shared with other BGP-compliant routers according to specific routing
policies. Route selection generally preserves system scalability. At the end of
the process, if the protocol converges, a stable routing solution is found. However, the
identified routes are generally far from the shortest path. BGP data are gathered by route
monitors or collectors, which are specific devices deployed around the globe by
international projects such as University of Oregon’s Route-Views [42] or RIPE Routing
Information Service [43]. These projects were originally used by ISPs to debug
and optimize their networks [44].
It is important to note, however, that BGP cannot provide a complete view of ASs
because significant information could be missed [8]. This is due to the following
reasons: (1) monitors can only see what the connected routers choose to send;
(2) monitors are not present in each location; (3) the location of these monitors is not
randomly distributed across the Internet; (4) the connections between BGP monitors
and routers are not completely reliable because of session resets, collector downtime,
and missing updates [15].
Another passive method to infer an AS level topology refers to the lookup of the
Routing Assets Database provided by the Internet Routing Registry, stored in dedicated
File Transfer Protocol servers. This database includes information about routing
policies, regulation, and peering provided by the ASs themselves. Specifically, the
whois command can be used to retrieve this information [11,15]. The main limitation
of this approach is that the stored information can be
stale or incomplete.

6.1.3.2 Active methodology based on traceroute

Traceroute can also be used at the AS level of granularity to retrieve details about
forwarding paths. Data retrieved through traceroute, however, must be further processed
in order to map network interfaces to the corresponding ASs. In fact, consecutive IP
addresses that belong to two different adjacent ASs reveal the connectivity between
ASs. The issues associated with traceroute and discussed in the previous paragraphs are
valid also in this context.
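The post-processing step that maps traceroute hops to ASs can be sketched with a longest-prefix match over a prefix-to-ASN table. The table below is purely hypothetical (documentation prefixes and documentation ASNs); in practice such a table would be derived from BGP data. Function names are assumptions made for illustration.

```python
import ipaddress


def ip_to_asn(address, prefix_table):
    """Return the ASN of the longest matching prefix, or None if nothing matches."""
    ip = ipaddress.ip_address(address)
    best = None
    for prefix, asn in prefix_table.items():
        network = ipaddress.ip_network(prefix)
        if ip in network and (best is None or network.prefixlen > best[0].prefixlen):
            best = (network, asn)
    return best[1] if best else None


def as_links(hops, prefix_table):
    """Infer AS-level links from consecutive hops that belong to different ASs."""
    asns = [ip_to_asn(hop, prefix_table) for hop in hops]
    links = set()
    for a, b in zip(asns, asns[1:]):
        # Hops that cannot be mapped are skipped rather than guessed.
        if a is not None and b is not None and a != b:
            links.add(frozenset((a, b)))
    return links


# Hypothetical prefix-to-ASN table and traceroute output.
table = {"192.0.2.0/24": 64500, "198.51.100.0/24": 64501, "203.0.113.0/24": 64502}
path = ["192.0.2.1", "192.0.2.254", "198.51.100.1", "203.0.113.9"]
inferred = as_links(path, table)
```

Two consecutive hops inside the same AS produce no link, so the four hops above yield two AS-level edges.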

6.1.4 Geographic network topologies

None of the methodologies discussed above provides any reference to the physical
location of nodes on the map. Such information can be added
to the network topologies, obtained at any level of granularity, in order to
increase their usability. For instance, geographical information allows one to (1)
simplify network troubleshooting and the detection of attacks and congestion,
(2) guarantee resilience of interconnections in disaster scenarios, (3) provide
location information to Internet services that require it, and (4) provide a visual
representation of the Internet [11,45].
Of course, the definition of a geographic network topology depends on the selected
level of granularity. Network interfaces and routers can be immediately mapped to a
precise location on a map, i.e., to a pair of coordinates. Instead, nodes belonging to
the AS level topology do not refer to a single entity and to a unique location, because
an AS gathers routers under a common administrator. Therefore, when geographical
information is assigned to an AS, it is just used to coarsely identify the geographic
region covered by the AS.
To achieve this further level of detail, active or passive measurement methods
can be exploited. Active IP geolocation techniques are typically based on delay mea-
surements that offer good levels of accuracy. But their drawbacks include scalability,
high measurement overhead, and very high response time. Passive approaches, such
as database-driven geolocation, are faster. They usually consist of a database-engine,
e.g., Structured Query Language (SQL)/MySQL, containing records for a range of
IP addresses. Nevertheless, this database is difficult to manage and update, and its
accuracy is not so high because of the lack of information about the operations used
to build it [46].

6.2 Internet models based on the graph theory

Nowadays all the techniques described in the previous section are continuously used
to experimentally collect useful data related to Internet or Internet-like networks,
as well as to study their characteristics. Some results are publicly available. For
instance, reference datasets are provided by Centre for Applied Internet Data Analysis
(CAIDA) [47] and Internet Topology Zoo [48]. Starting from these data, it is possible
to formulate mathematical models based on graph theory that capture the main facets
of Internet-like topologies (on the one hand) and allow users to reproduce them through
computer simulations (on the other hand).
The following paragraphs introduce fundamental notions inherited from graph
theory that are at the basis of the aforementioned models. Then, they present the
most important analytical models describing the Internet or restricted portions of it.
Finally, they provide an overview of topology generators that are able to reproduce
Internet-like topologies based on the aforementioned models.

6.2.1 Fundamental notions from the graph theory

As already anticipated in Section 6.1, the Internet topology can be described
through an undirected graph, G = (N, E), where N is a set of vertices connected
by a set E of edges. Nevertheless, such a graph can be further characterized by
additional parameters that include [29,49] the following:
● Node degree, k: It represents the number of edges incident to a vertex. This
parameter allows one to capture the connectivity characteristics of the topology.
In particular, networks with a higher average k are better connected, which
results in a higher robustness to failures.
● Shortest path, d(u, v): It identifies the shortest distance between two vertices u
and v. More details about shortest path characteristics will be provided in the
following sections.
● Diameter, δ: It defines the longest shortest path between any node pair u and v,
and it is expressed as
\[ \delta = \sup_{(u,v)} d(u,v) \tag{6.1} \]
● Clustering coefficient: It is a measure of the degree to which nodes in a graph tend
to cluster together. Two measurements of clustering coefficient can be considered:
global and local. Moreover, two definitions of global clustering coefficient are
possible. The first one is based on triplets of nodes: three connected nodes form
a triplet and three triplets form a triangle. Therefore, the global clustering coeffi-
cient is the number of closed triplets over the total number of triplets (both open
and closed). The alternative definition evaluates the global clustering coefficient
as the mean of the local clustering coefficients related to all the vertices. The local
clustering coefficient is defined as the ratio between the number of existing edges
within the neighborhood of a vertex and the number of possible edges within the
neighborhood of the same vertex. The higher the local clustering of a node, the
more interconnected are its neighbors.
● Betweenness: It is a measure of the node centrality, calculated as the fraction
of shortest paths between node pairs that pass through the node of interest. It is
inversely related to the robustness of the graph when a node is removed. In fact,
the higher the number of paths that pass through a node, the higher is the damage
that will be done when that node is removed. Moreover, it can provide a measure
of the traffic load that a node must handle or the influence that an individual node
has in the spread of information within the network.
● Spectrum: It represents the set of eigenvalues of the adjacency matrix of the
graph. It allows a user to measure the overall characteristics of the network and
its robustness.
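Some of the parameters listed above can be computed on a small example. The sketch below is a plain Python illustration, assuming an unweighted, connected, undirected graph stored as an adjacency map; the function names and the four-node graph are hypothetical.

```python
from collections import deque
from itertools import combinations


def shortest_path_lengths(adj, source):
    """Breadth-first search: hop distance d(source, v) for every reachable v."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    return dist


def diameter(adj):
    """Longest shortest path over all node pairs (the graph must be connected)."""
    return max(max(shortest_path_lengths(adj, u).values()) for u in adj)


def local_clustering(adj, node):
    """Existing edges among the neighbors of node over the possible ones."""
    neighbors = adj[node]
    k = len(neighbors)
    if k < 2:
        return 0.0
    existing = sum(1 for a, b in combinations(neighbors, 2) if b in adj[a])
    return existing / (k * (k - 1) / 2)


# Hypothetical graph: a triangle (a, b, c) plus a pendant node d attached to c.
adj = {"a": {"b", "c"}, "b": {"a", "c"}, "c": {"a", "b", "d"}, "d": {"c"}}
degree_c = len(adj["c"])
```

In this example node c has degree 3, node d is two hops away from a and b, and only one of the three possible links among the neighbors of c exists.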

6.2.2 Topology models

The most important Internet topology models proposed in the literature include reg-
ular and well-known topologies, random and small-world topologies, power-law and
scale-free topologies, and hierarchical topologies.

6.2.2.1 Regular and well-known topology models

According to [11], regular and well-known topologies represent the simplest models
used to describe a network. They cannot be applied to the overall Internet architecture;
they only serve to investigate a restricted part of a network (as described below).
The term regular topology refers to elementary network architectures, including
meshes, rings, trees, stars, and lattices. Therefore, the resulting models only support the
simulation of a very limited portion of the Internet (like a LAN) or other
basic infrastructures. Thanks to their simplicity, these topologies do not require
complex generator tools.
The term well-known topology refers to specific real networks, such as GÉANT
or the National Science Foundation Network (NSFnet) backbone. In particular, GÉANT
interconnects the European National Research and Education Networks and provides
research data communication across the continent [50]. The NSFnet backbone
interconnected six supercomputer sites, several regional networks, and ARPANET [51].

6.2.2.2 Random and small-world topology model

Looking at the overall Internet, it was originally described through the random graph
theory developed by Erdős and Rényi [19]. Let N, E, and k be the number of nodes,
the number of edges, and the average degree of a given network, respectively. In the
random topology model, the average degree k is equal to
\[ k = \frac{2E}{N} \tag{6.2} \]
It is kept constant, and it is assumed that every pair of nodes is connected with a
probability p equal to
\[ p = \frac{k}{N} \tag{6.3} \]
The resulting model generates network topologies having a small average shortest
path length and a small clustering coefficient. Moreover, it does not capture all the
characteristics of a real Internet-like topology, such as the presence of hubs.
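The random topology model can be sketched in a few lines of Python. The generator below connects every node pair independently with probability p = k/N, as in (6.3); the function name, the network size, the target degree, and the seed are illustrative assumptions.

```python
import random


def erdos_renyi_graph(n, avg_degree, seed=None):
    """Connect every pair of nodes independently with probability p = k / N."""
    rng = random.Random(seed)
    p = avg_degree / n
    edges = set()
    for u in range(n):
        for v in range(u + 1, n):
            if rng.random() < p:
                edges.add((u, v))
    return edges


# Illustrative run: 500 nodes and a target average degree of 6.
random_edges = erdos_renyi_graph(n=500, avg_degree=6, seed=1)
measured_avg_degree = 2 * len(random_edges) / 500  # relation (6.2)
```

With these settings the measured average degree stays close to the target, while no node is expected to act as a hub, which is the limitation mentioned above.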
The limitations of the Erdős and Rényi model were overcome by the Watts and Strogatz
model [20]. It still allows the generation of network models with a small average
shortest path length. But, unlike the previous one, it exhibits a large clustering
coefficient. Among its limitations, the Watts and Strogatz model generates
network topologies with an unrealistic degree distribution. It is possible to generate
a Watts–Strogatz network by starting from a regular ring lattice with N nodes, where each
node is connected to its 2m nearest neighbors. Then, each edge
is removed with a uniform and independent probability p and
rewired in order to connect a pair of nodes chosen uniformly at random.
There is also the Newman–Watts variant of the Watts–Strogatz network, which does not
include the removal of the edges from the underlying lattice in the building process.
In this model, edges are only added between pairs of nodes in the same way as in a
Watts–Strogatz network [52].
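The construction described above can be sketched as follows. The code follows the textual description (a rewired edge connects a node pair chosen uniformly at random), which differs slightly from implementations that keep one endpoint fixed; the function name and parameter values are hypothetical, and N > 2m is assumed.

```python
import random


def watts_strogatz_graph(n, m, p, seed=None):
    """Ring lattice with 2m neighbors per node; each edge is rewired with probability p."""
    rng = random.Random(seed)
    edges = set()
    for u in range(n):
        for offset in range(1, m + 1):
            # Connect u to its m clockwise neighbors; the ring closes the other m.
            edges.add(frozenset((u, (u + offset) % n)))
    for edge in list(edges):
        if rng.random() < p:
            # Draw a new node pair, rejecting pairs that are already connected.
            while True:
                candidate = frozenset(rng.sample(range(n), 2))
                if candidate not in edges:
                    break
            edges.remove(edge)
            edges.add(candidate)
    return edges


lattice = watts_strogatz_graph(n=20, m=2, p=0.0, seed=7)
rewired = watts_strogatz_graph(n=20, m=2, p=0.2, seed=7)
```

Rewiring keeps the number of edges constant (N times m), so the average degree is unchanged while a few long-range shortcuts reduce the average shortest path length. Skipping the removal step and only adding the new pairs would give the Newman–Watts variant.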

6.2.2.3 Power-law topology models

The work presented in [21] demonstrated for the first time a new set of properties of
the Internet. Specifically, the work considered three different snapshots of the Internet
referring to an AS level representation of 1997, an AS level representation of 1998,
and a router level representation of 1995. All the topology details were inferred from
RouteViews BGP tables. The conducted study identified three specific power laws:
● Rank exponent: the out-degree of a node is proportional to its rank to the power
of a constant. Let $k_v$ and $r_v$ be the out-degree and the rank of a node v; then the
following relation exists: $k_v \propto r_v^{R}$. The exponent R is obtained by performing a
linear regression on $k_v$ when plotted against $r_v$ on a log–log graph.
● Out-degree exponent: the frequency of an out-degree is proportional to the
out-degree to the power of a constant. Let f(k) be the fraction of nodes with out-degree
k; then $f(k) \propto k^{O}$. The exponent O is obtained by performing a linear regression
on f(k) when plotted on a log–log graph.
● Eigen-exponent: the eigenvalues $\lambda_i$ of the adjacency matrix of the graph, sorted in
decreasing order, are proportional to the order i to the power of a constant, according to the
relation $\lambda_i \propto i^{E}$. The exponent E is obtained by performing a linear regression on
$\lambda_i$ when plotted on a log–log graph.
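Each of the three exponents is the slope of a straight line fitted on a log–log plot. The fragment below is a minimal sketch of that regression using ordinary least squares; the helper name `loglog_slope` and the synthetic data (an exact power law with exponent −2.2) are assumptions made only for illustration and are not measurements from [21].

```python
import math

def loglog_slope(xs, ys):
    """Least-squares slope of log(y) against log(x)."""
    lx = [math.log(x) for x in xs]
    ly = [math.log(y) for y in ys]
    n = len(lx)
    mean_x, mean_y = sum(lx) / n, sum(ly) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(lx, ly))
    sxx = sum((a - mean_x) ** 2 for a in lx)
    return sxy / sxx

# Synthetic out-degree frequencies that follow f(k) = C * k**(-2.2) exactly.
degrees = list(range(1, 51))
freqs = [0.5 * k ** -2.2 for k in degrees]
print("estimated out-degree exponent:", loglog_slope(degrees, freqs))
```

On measured data the points scatter around the line, so the quality of the fit should be checked alongside the slope.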

Moreover, [21] studied the size of the neighborhood within a given distance. This
relation also follows a power law, although it was considered only an approximation
because of the small number of samples. In particular, let P(h) be the total number
of pairs of nodes within h hops. P(h) is proportional to the number of hops to the
power of a constant H, according to the relation $P(h) \propto c\,h^{H}$ when $h \ll \delta$, where δ is
the diameter of the network and c = N + 2E.
After [21], several researchers supported these findings and tried to further
understand the origin of the power law [53,54]. A very important contribution was provided
by the Barabási–Albert model [22]. At the same time, the literature also proposes
opposing theories. For instance, Chen et al. [55] argued that an AS level topology
does not include all the Internet connectivity. In fact, at least 20%–50% of the physi-
cal links are missing. Therefore, the node degree distribution does not follow a strict
power-law relationship.

6.2.2.4 Scale-free topology model

Barabási and Albert formulated the scale-free model, which has three main
properties [22]. First, the network is not static: it evolves over time. Second, any
newly added node connects to an existing vertex with a probability that depends
on the connectivity of that vertex (specifically, the higher the connectivity of the
vertex, the higher the chance that it is chosen as the attachment point by
the joining node). This mechanism is known as preferential attachment or the
rich-get-richer phenomenon [56,57]. Finally, the node degree distribution asymptotically
settles to a power law.
This means that the node degree has a heavy-tailed distribution. The coexistence
in the same network of nodes with widely different degrees is expressed by the term
scale-free that suggests the lack of an internal scale. This feature distinguishes scale-
free networks from lattices, in which all nodes have the same degree, or from random
networks, whose degrees vary in a narrow range.
Another important characteristic of scale-free networks concerns the
average shortest path between two vertices of the topology. Its value is small, as
in small-world models, and it is discussed in the following section.
The process to generate a scale-free topology entails an evolving network over a
discrete time domain: at every timestep, a new vertex is added with m ≤ m0 edges,
where m0 is the initial small number of vertices deployed in the system.
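The growth process can be sketched in a few lines of Python. The fragment is illustrative only: it assumes m0 = m seed vertices, to which the first new vertex attaches, and it realizes preferential attachment by sampling from a list in which every vertex appears once per incident edge. The function name is hypothetical.

```python
import random
from collections import Counter

def barabasi_albert(n, m, seed=None):
    """Grow a graph to n vertices; each new vertex brings m edges (assumes m0 = m)."""
    rng = random.Random(seed)
    edges = []
    targets = list(range(m))   # the m0 = m seed vertices
    endpoints = []             # every vertex listed once per incident edge
    for new in range(m, n):
        edges.extend((new, t) for t in targets)
        endpoints.extend(targets)
        endpoints.extend([new] * m)
        # Drawing from `endpoints` selects a vertex with probability
        # proportional to its current degree (preferential attachment).
        chosen = set()
        while len(chosen) < m:
            chosen.add(rng.choice(endpoints))
        targets = list(chosen)
    return edges

edges = barabasi_albert(n=2000, m=2, seed=5)
degree = Counter(v for edge in edges for v in edge)
print("edges:", len(edges), "max degree:", max(degree.values()))
```

The maximum degree is typically far above the average degree of about 2m, which is the heavy tail discussed above.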
Note that power-law random graphs (PLRGs) and scale-free networks are not
synonymous: the former are static, whereas the latter evolve over time. Moreover,
a PLRG has a given number of nodes and edges that follow a power-law
degree sequence. In the Barabási–Albert scale-free network, instead, nodes and
edges self-organize so as to asymptotically reach the power-law degree
distribution [58].

6.2.2.5 Hierarchical methods

The N -level hierarchical method envisages the generation of the Internet topology by
iteratively expanding individual nodes into other graphs [11]. First of all, a connected
graph is generated, then each node is substituted by a connected graph. The edges
belonging to the original graph are connected to the nodes of the new graphs. This
process continues N times. The scale of the final graph is the product of the scales of
the individual levels.

6.2.3 Topology generator tools

Many tools implement the aforementioned models. They can be used to reproduce
Internet-like topologies for computer simulations. They are extremely important
because research activities often cannot be carried out on real networks,
owing to size, control, and permission issues. Topology generators should
have the following characteristics [11]:

● Representativeness: the input arguments should produce accurate topologies.

● Inclusiveness: the generator should include different methods and models because
of the lack of a universally accepted model.
● Flexibility: topologies should not have a limitation on the size (i.e., maximum
number of nodes).
● Extensibility: users should be able to extend the tool with additional features.
● Interoperability: generated topologies should be in a format that other
simulation tools can process.
● Efficiency: the tool should be able to generate large topologies by preserving the
required statistical characteristics and by using a reasonable CPU and memory
consumption.
● User friendliness: the usage of the tool should be easy to learn.

6.2.3.1 Random topology generator tools

Waxman developed one of the first topology generators [59]. It implements an
extended version of the Erdős and Rényi random model, in which nodes are randomly
placed on the Cartesian plane and each pair of nodes is connected with a
probability that is a function of the Euclidean distance separating them in
the plane.
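A distance-dependent connection probability of this kind can be sketched as follows. The exponential form P(u, v) = α·exp(−d(u, v)/(βL)), with L the maximum possible distance, is the one usually associated with the Waxman model; it is stated here as an assumption because the text above does not give the formula, and the roles of α and β are swapped in some references. Parameter values are illustrative.

```python
import math
import random

def distance(a, b):
    """Euclidean distance between two points of the plane."""
    return math.hypot(a[0] - b[0], a[1] - b[1])

def waxman(n, alpha=0.15, beta=0.2, seed=None):
    """Place n nodes uniformly in the unit square and link pairs by distance."""
    rng = random.Random(seed)
    pos = [(rng.random(), rng.random()) for _ in range(n)]
    max_dist = math.sqrt(2.0)  # diagonal of the unit square
    edges = []
    for u in range(n):
        for v in range(u + 1, n):
            p = alpha * math.exp(-distance(pos[u], pos[v]) / (beta * max_dist))
            if rng.random() < p:
                edges.append((u, v))
    return pos, edges

pos, edges = waxman(n=200, seed=2)
mean_edge = sum(distance(pos[u], pos[v]) for u, v in edges) / len(edges)
print(f"{len(edges)} edges, mean edge length {mean_edge:.3f}")
```

Because short links are favored, the mean edge length stays below the mean distance between arbitrary node pairs.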

6.2.3.2 Power-law topology generator tools

Inet [60] and PLRG [61] are two generators that produce power-law topologies. Both
start the generation process by assigning each node a degree drawn from a power-law
distribution. Then, they interconnect the nodes by using different approaches [54]. Inet
first creates a spanning tree with the nodes that have a degree greater than one. Then, it
connects the remaining nodes with degree one to the spanning tree according to a linear
preference, i.e., preferentially to nodes that have a higher degree. PLRG, instead, first
clones each node as many times as the degree assigned to it. Then, it interconnects
all the clones uniformly at random. Note
that graphs generated by PLRG can be disconnected and can contain self-loops and
duplicate links. Therefore, the actual graph used for research purposes is obtained by
extracting the giant connected component (which is always present according to [61]),
eliminating self-loops, and merging duplicate links.
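The PLRG procedure described above (clone, match at random, clean up, keep the giant component) can be sketched as follows. This is not the code of the tool in [61]: the degree sampler, the cut-off `k_max`, the exponent, and all function names are assumptions chosen for illustration.

```python
import random
from collections import defaultdict

def power_law_degrees(n, gamma, k_max, rng):
    """Draw n degrees with P(k) proportional to k**(-gamma), 1 <= k <= k_max."""
    ks = range(1, k_max + 1)
    return rng.choices(ks, weights=[k ** -gamma for k in ks], k=n)

def plrg(degrees, rng):
    """Clone each node `degree` times, pair the clones at random, clean the result."""
    clones = [v for v, k in enumerate(degrees) for _ in range(k)]
    rng.shuffle(clones)
    if len(clones) % 2:
        clones.pop()                     # an odd clone cannot be matched
    edges = set()                        # a set merges duplicate links
    for a, b in zip(clones[::2], clones[1::2]):
        if a != b:                       # drop self-loops
            edges.add((min(a, b), max(a, b)))
    return edges

def giant_component(edges):
    """Edges of the largest connected component."""
    adj = defaultdict(set)
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    seen, best = set(), set()
    for start in adj:
        if start in seen:
            continue
        comp, stack = {start}, [start]
        while stack:
            u = stack.pop()
            for w in adj[u] - comp:
                comp.add(w)
                stack.append(w)
        seen |= comp
        if len(comp) > len(best):
            best = comp
    return {(u, v) for u, v in edges if u in best}

rng = random.Random(11)
degrees = power_law_degrees(n=1000, gamma=2.2, k_max=100, rng=rng)
raw = plrg(degrees, rng)
giant = giant_component(raw)
print(len(raw), "links before extraction,", len(giant), "in the giant component")
```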

6.2.3.3 Scale-free topology generator tools

BRITE is one of the most widespread Internet topology generators [27,62]. It jointly
supports the Barabási–Albert, Waxman, and hierarchical topology models. With
reference to the Barabási–Albert topology model, BRITE reproduces the incremental
growth and preferential connectivity that characterize the scale-free approach. In
particular, it allows the user to choose the parameter m, i.e., the number of neighbors
of each new node that is added to the graph during the topology generation
process. Higher values of m produce denser topologies. The new node v connects to a
potential neighbor node i with probability $k_i / \sum_{j \in C} k_j$, where $k_i$ is the current out-degree
of node i and C is the set of candidate neighbor nodes. This means that a new node
added to the network selects with higher probability those nodes that have a higher
number of connections.
Moreover, BRITE places nodes on the plane in a random or heavy-tailed way. In
the first case, nodes are simply randomly distributed on the plane. When the heavy-
tailed distribution is used, the plane is divided into HS×HS high-level squares. Then,
each square is further subdivided into smaller LS×LS low-level squares. A number
of nodes, drawn from a heavy-tailed distribution (bounded Pareto distribution), is
attributed to each high-level square; then, these nodes are randomly located in each
low-level square. BRITE also provides a bandwidth value to each link according to
four distributions:
● Constant: All links have the same value.
● Uniform: Bandwidth values are assigned according to a uniform distribution
between two input values.
● Exponential: Bandwidth values are assigned according to an exponential distri-
bution with mean equal to an input value.
● Heavy-tailed: Bandwidth values are assigned according to a heavy-tailed distri-
bution (Pareto with shape 1.2) with minimum and maximum values equal to two
input values.
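The four assignment rules can be illustrated with the following sketch. It is not BRITE code: the function names, the string labels, and the choice of passing the mean of the exponential distribution through `bw_min` are assumptions. The heavy-tailed case samples a Pareto distribution with shape 1.2 truncated to the given bounds by inverting its cumulative distribution function.

```python
import random

def bounded_pareto(rng, low, high, shape=1.2):
    """Inverse-CDF sample from a Pareto distribution truncated to [low, high]."""
    u = rng.random()
    return low * (1.0 - u * (1.0 - (low / high) ** shape)) ** (-1.0 / shape)

def sample_bandwidth(kind, bw_min, bw_max, rng):
    """Draw one link bandwidth according to the selected rule."""
    if kind == "constant":
        return bw_min
    if kind == "uniform":
        return rng.uniform(bw_min, bw_max)
    if kind == "exponential":
        return rng.expovariate(1.0 / bw_min)   # mean equal to bw_min (assumption)
    if kind == "heavy_tailed":
        return bounded_pareto(rng, bw_min, bw_max)
    raise ValueError(f"unknown distribution: {kind}")

rng = random.Random(3)
for kind in ("constant", "uniform", "exponential", "heavy_tailed"):
    values = [sample_bandwidth(kind, 10.0, 1024.0, rng) for _ in range(5)]
    print(kind, [round(v, 1) for v in values])
```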

Finally, BRITE assigns geographical coordinates to each node.

6.2.3.4 Hierarchical topology generator tools

The Georgia Tech Internetwork Topology Model (GT-ITM) topology generator can
be used to produce a two-level hierarchical topology, also known as the transit-stub model [63,64].
GT-ITM first creates a connected random graph by using the Waxman method or
a variant of it. Each generated node represents a transit domain. Then, these nodes
are expanded in order to form another connected random graph that represents the
backbone topology of the transit domain. For each node of the transit domain, stub
domains are attached by generating a certain number of random graphs.

6.3 Shortest path models

Topology models are highly relevant to Internet performance evaluation. In this
context, it is essential to characterize the distribution of shortest paths in order to gain
valuable insights into the network behavior. In reality, the communication between
two peers does not necessarily follow the shortest path offered by the network topology:
actual multi-hop paths are generally longer than the shortest one. This phenomenon is
known as path inflation. It can be due to routing policies and to traffic engineering
techniques that spread the load among more links in the topology [65,66]. Nevertheless,
because of the lack of more accurate models able to provide multi-hop communication paths, the
shortest path model is still considered a reference approach for estimating network
performance [67].
This section proposes a cross-comparison of the shortest path models currently
available in the literature, focusing on scale-free networks. The study
highlights the pros and cons of these models, which emerge when they are applied to
different networks.

6.3.1 Parameters definition

The distribution of shortest path lengths gives, for each length, the number of node
pairs whose shortest path has that length. It allows a statistical characterization of the
network through the average path length and the graph diameter. For scale-free networks,
the average shortest path length, d̄, is approximately equal to

\[ \bar{d} \approx \log N, \tag{6.4} \]

where N represents the number of nodes in the topology. In particular, this formula
refers to scale-free networks that are built by connecting each new vertex to m other
nodes with m = 1. Otherwise, if m > 1, the average shortest path, d̄, is asymptotically
equal to

\[ \bar{d} \sim \frac{\log N}{\log \log N}. \tag{6.5} \]
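The quantities defined above can be computed with a breadth-first search from every node. The sketch below is illustrative: it assumes an undirected, unweighted, connected graph stored as an adjacency dictionary with integer node labels, and it evaluates Eqs. (6.4) and (6.5) with natural logarithms, which is an assumption because the base is not stated. The six-node ring is a toy input.

```python
import math
from collections import Counter, deque

def path_length_distribution(adj):
    """Number of node pairs at each shortest-path distance."""
    counts = Counter()
    for src in adj:
        dist = {src: 0}
        queue = deque([src])
        while queue:
            u = queue.popleft()
            for w in adj[u]:
                if w not in dist:
                    dist[w] = dist[u] + 1
                    queue.append(w)
        for node, d in dist.items():
            if node > src:              # count each unordered pair once
                counts[d] += 1
    return counts

def average_and_diameter(counts):
    """Average shortest path length and diameter from the distribution."""
    pairs = sum(counts.values())
    return sum(d * c for d, c in counts.items()) / pairs, max(counts)

def theoretical_average(n, m):
    """Eq. (6.4) for m = 1, Eq. (6.5) for m > 1 (natural logarithm assumed)."""
    if m == 1:
        return math.log(n)
    return math.log(n) / math.log(math.log(n))

ring = {i: [(i - 1) % 6, (i + 1) % 6] for i in range(6)}
counts = path_length_distribution(ring)
print(dict(counts), average_and_diameter(counts))
print(theoretical_average(5000, 1), theoretical_average(5000, 2))
```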

6.3.2 Shortest path models

The definition of analytical models able to describe the shortest path lengths
is still an open issue. Models proposed in the literature include the Gamma distribution,
the Weibull distribution, and the Lognormal distribution.

6.3.2.1 Gamma distribution

According to [24–26], the shortest path distribution can be modeled through a
Gamma distribution. The probability density function that describes the shortest
path distribution according to the Gamma distribution model is

\[ f(x; \theta, \eta) = \frac{1}{\theta^{\eta}\,\Gamma(\eta)}\, x^{\eta-1} e^{-x/\theta}, \tag{6.6} \]

where Γ(·) is the gamma function and θ > 0 and η > 0 are the scale and shape parameters,
respectively. This model indicates that the distance distribution of all nodes consists
of two regimes: the former is characterized by a rapid growth, the latter by an
exponential decay.

6.3.2.2 Weibull distribution

The work in [23] applies extreme value theory [68] to find the most appropriate model
for the shortest path distribution. The Fisher–Tippett–Gnedenko theorem yields three
candidate distributions: Gumbel, Fréchet, and Weibull [69]. Among
them, the Weibull distribution emerged as the most suitable one because the sampled
distribution must have a finite lower limit, as is the case for path lengths, whose
lower bound is zero. The probability density function describing the shortest
path distribution according to the Weibull model is

\[ f(x; \lambda, \kappa) = \frac{\kappa}{\lambda^{\kappa}}\, x^{\kappa-1} e^{-(x/\lambda)^{\kappa}}, \tag{6.7} \]

where λ > 0 and κ > 0 are the scale and shape parameters, respectively.
6.3.2.3 Lognormal distribution

The Lognormal distribution model was presented in [23]. It uses the probability density
function

\[ f(x; \mu, \sigma) = \frac{1}{x \sigma \sqrt{2\pi}}\, e^{-(\log x - \mu)^{2} / (2\sigma^{2})}, \tag{6.8} \]

where −∞ < μ < +∞ is the mean of log x and σ > 0 is the standard deviation
of log x.
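The three densities of Eqs. (6.6)–(6.8) translate directly into code, written here with the negative exponents of the standard forms of these distributions. The sketch below is a plain-Python illustration; the parameter values and the `integrate` helper are hypothetical and serve only to check numerically that each density integrates to approximately one.

```python
import math

def gamma_pdf(x, theta, eta):
    """Gamma density with scale theta and shape eta, Eq. (6.6)."""
    return x ** (eta - 1) * math.exp(-x / theta) / (theta ** eta * math.gamma(eta))

def weibull_pdf(x, lam, kappa):
    """Weibull density with scale lam and shape kappa, Eq. (6.7)."""
    return (kappa / lam ** kappa) * x ** (kappa - 1) * math.exp(-((x / lam) ** kappa))

def lognormal_pdf(x, mu, sigma):
    """Lognormal density; mu and sigma are the mean and std of log x, Eq. (6.8)."""
    z = (math.log(x) - mu) ** 2 / (2 * sigma ** 2)
    return math.exp(-z) / (x * sigma * math.sqrt(2 * math.pi))

def integrate(pdf, lo=1e-9, hi=80.0, steps=200_000):
    """Midpoint-rule integral of pdf over [lo, hi]."""
    h = (hi - lo) / steps
    return sum(pdf(lo + (i + 0.5) * h) for i in range(steps)) * h

# Illustrative parameters only; in practice they come from curve fitting.
print(integrate(lambda x: gamma_pdf(x, theta=0.8, eta=6.0)))
print(integrate(lambda x: weibull_pdf(x, lam=5.0, kappa=4.0)))
print(integrate(lambda x: lognormal_pdf(x, mu=1.5, sigma=0.3)))
```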

6.3.3 Cross-comparison among shortest path models

The accuracy of the reference models for the distribution of the shortest path length
(i.e., Gamma, Lognormal, and Weibull distributions) is evaluated through a massive
simulation campaign carried out with the BRITE tool [27]. The study focuses
on scale-free network topologies with different numbers of nodes and different values
of the node degree. Specifically, the number of nodes N is set to 5,000, 10,000, and
20,000, and the node degree m is set to 1, 2, and 3. For each set of parameters, 30 different
topology realizations are generated and evaluated to produce the final results.
For each distribution, the related parameters are estimated through curve fitting. The
resulting probability density function (pdf) and cumulative distribution function (cdf)
are shown in Figures 6.2–6.4. The results show that the average shortest
path and the network diameter increase with the number of nodes of the topology.
This behavior reflects the theoretical formulation reported in Eqs. (6.4) and (6.5).
When m increases, the average shortest path and the diameter decrease. In this case, in
fact, each node added to the topology is connected to a higher number of existing
nodes, so the overall path lengths are reduced. Moreover, the probability
associated with the average shortest path becomes higher. These behaviors are further confirmed by
the results reported in Table 6.2, which shows the theoretical and simulated average shortest
path and the average diameter obtained through computer simulations.
More generally, all the curves appear to describe the simulation results well.
To better study and compare the accuracy of the considered
models, the Kolmogorov–Smirnov test is used [70]. It evaluates the maximum
absolute difference between the cumulative distribution function obtained through
simulations and the cumulative distribution function of each theoretical model. The
results reported in Table 6.3 show that all the models provide a good
fit. In particular, the Gamma distribution shows the lowest error when m = 1, and
the Weibull distribution reaches the best results when m = 2 and m = 3. In all cases,
the Lognormal distribution registers the worst behavior.
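The Kolmogorov–Smirnov distance used in this comparison can be sketched as follows. The function compares the empirical cumulative distribution of a sample with a model cdf and returns the largest absolute gap. The sample of path lengths and the Weibull parameters below are made-up illustrations, not values from Table 6.3.

```python
import math

def ks_statistic(samples, model_cdf):
    """Maximum absolute difference between the empirical cdf and model_cdf."""
    xs = sorted(samples)
    n = len(xs)
    gap = 0.0
    for i, x in enumerate(xs):
        f = model_cdf(x)
        # The empirical cdf jumps from i/n to (i + 1)/n at x; check both sides.
        gap = max(gap, abs(f - i / n), abs((i + 1) / n - f))
    return gap

def weibull_cdf(x, lam, kappa):
    """Cumulative distribution function of the Weibull model."""
    return 1.0 - math.exp(-((x / lam) ** kappa))

# Hypothetical sample of shortest path lengths and hypothetical fitted parameters.
lengths = [3, 4, 4, 5, 5, 5, 6, 6, 7]
print(ks_statistic(lengths, lambda x: weibull_cdf(x, lam=5.5, kappa=4.5)))
```

A smaller value indicates a closer match between the model and the simulated distribution.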
In conclusion, the conducted study demonstrates that the
available models are able to capture the average value and the distribution of the shortest
path length over a very broad set of conditions. Unfortunately, the parameters of
each distribution must be properly set through curve fitting. Accordingly, the models
require a case-by-case tuning of parameters.