Paper 1
Paper 1
Abstract—The increasing demand for electricity, coupled with [10], [11]. Furthermore, a comprehensive scheduling program
the rise in greenhouse gas emissions, necessitates the integration is required to increase customers’ monetary profit, making
of Renewable Energy Sources (RESs) into power grids. However, RESs more affordable. Finally, security is also a concern that
the fluctuating nature of RESs introduces new challenges in
energy management. The Internet of Energy (IoE) framework needs to be handled since the growth of advanced information
provides a solution by enabling real-time monitoring, dynamic and communication technologies makes the network more
scheduling, and enhanced energy routing. This paper proposes vulnerable to malicious physical and cyber-attacks [12], [13].
a comprehensive approach to optimizing energy management In a nutshell, the newly emerged challenges need to be dealt
in smart grids using Deep Reinforcement Learning (DRL) and with through novel techniques that are precise and high-speed
Convolutional Neural Networks (CNN). The research focuses on
three main objectives: optimizing operation scheduling, improv- besides self-improvement capability for the upcoming modi-
ing energy routing, and enhancing cyber-physical security. A fications. Conventional Energy Management Systems (EMSs)
DRL-based scheduling algorithm is developed to manage energy are not practical henceforward due to transformation in the
components effectively, while an optimized energy routing algo- network’s topology and load curve, caused by penetration of
rithm ensures efficient electricity flow. Additionally, a security new resources, besides existing massive data generated by
framework utilizing Long Short-Term Memory (LSTM) and
CNN is proposed to detect False Data Injection (FDI) attacks and smart tools. As a complex cyber-physical system, the Internet
electricity theft. The proposed methods aim to improve energy of Energy (IoE) can provide an energy management framework
efficiency, reduce costs, and ensure the security of IoE-enabled that facilitates the accommodation and coordination of RESs,
power systems. This research bridges existing gaps by addressing providing real-time monitoring via embedded bidirectional
the dynamic and complex nature of modern energy networks. The communication systems. IoE is a cloud-based technology that
integration of these advanced technologies promises significant
advancements in the reliability and efficiency of smart grids. integrates the power system with embedded metering, commu-
Ultimately, this work contributes to the development of a sus- nication, sensing, and networking capabilities [14], delivering
tainable and secure energy future. a bidirectional information structure. This is vital in an Energy
Index Terms—Smart Grids, Attack Surface, Physical Interface, Management System (EMS) to distribute electricity safely and
AI, ML, RL. efficiently from RESs to the system or end-users [11], [15].
Operational Scheduling (OS) is one of the main objectives
of the EMSs, which can be fulfilled through IoE. Scheduling
I. I NTRODUCTION the operation, maintenance, and transmission planning plays
Electricity demand is steadily rising since industrial and a leading role in efficient EMS. Optimized OS transforms
economic development heavily relies on electrical energy [1], consumers into prosumers who can participate in the energy
[2]. During the past ten years, electricity demand is increased market by selling their surplus electricity while maximizing
uninterruptedly by an average of 3.1% annually, which resulted the users’ economic profit. It also accelerates the utilization
in growing greenhouse gas (GHG) emissions and increases of RESs, which leads to numerous technical and economic
the need for new energy resources [3], [4]. According to advantages [16], [17]. In addition to facilitating OS, IoE
the Energy Information Administration (EIA), 62% of elec- also offers the opportunity for energy routing optimization
tricity production in the United States supplied by fossil to improve stability and reliability and to increase profit in
energies, and 20% average global growth has happened for the power network. Due to the RESs, multiple energy hubs
these resources. Since fossil fuels are expensive and pollute, are connected via energy routers to build an energy network,
Renewable Energy Sources (RESs) are utilized dramatically which results in the genesis of Virtual Power Plants (VPPs),
[5]–[7]. High penetration of RESs brings up new challenges one of the central concepts for improving energy efficiency
due to supply fluctuation, uncertainties imposed by the nature [8]. Energy router is a fundamental component in EMS that
of RESs and decentralized topology that originates from the regulates the direction and amount of electricity and optimizes
wide geographical distribution of the energy resources. [8], [9]. the energy flow among all devices [9]. Therefore, optimizing
Also, real-time monitoring of energy flow and dealing with big energy routing in IoE needs to be considered as a crucial
data generated by information infrastructures are vital to en- challenge in energy management. Aside from numerous advan-
hance the network’s energy efficiency, reliability, and stability tages of employing IoE in EMS, this technology is vulnerable
2
to malicious attacks [18]–[20]. The distributed pattern of IoE, hybrid electric vehicles (PHEVs) is introduced. Unfortunately,
which allows users to interact and exchange information and the RESs are ignored in this study, and the utilized Support
energy without central control, leads to many security and Vector Machine (SVM) algorithm can assign a binary function
privacy challenges [21], [22]. Security concerns are not limited to the devices. Furthermore, the centralized decision-making
to cyber layers since the system may be physically manipu- structure that is employed may result in decreasing efficiency
lated/damaged for electricity theft or sabotages. In summary, and arduous recovery after disasters or attacks. A two-stage
improving energy efficiency in the power network via the device scheduling with dynamic programming is proposed
IoE concept is facing three main challenges that should be by [32]. In this study, only major renewable energy stations
addressed: optimal OS, optimal energy routing in the different are considered, with no scheduling at the home level. None
layers, and cyber/physical security concerns. Consequently, of the previous works included comprehensive scheduling
three main goals are defined for this research proposal. First, considering both energy units and home appliances to find
an operation scheduling algorithm is optimized using Deep surplus energy. Almost all of the previous works consider both
Reinforcement Learning (DRL). Next, the same method is consumers and prosumers with the same role in the market as
applied for routing energy at three different layers, including price takers, resulting in a non-transparent market since a large
smart home, Home Area Network (HAN), and grid layers. share of the network belongs to the household users. An intel-
Finally, an attack detection framework is proposed to detect ligent scheduling program would make users prone to act as a
False Data Injection (FDI) attacks and electricity theft using price maker in the electricity market, leading to a competitive
Long Short-Term Memory (LSTM) and Convolutional Neural marketplace. Dealing with the uncertainties of RESs, handling
Network (CNN) [23], [24]. sudden changes in the consuming or generating patterns, and
computation complexity also need to be addressed. After
developing a scheduling algorithm that satisfies all technical
II. P ROBLEM S TATEMENT AND M OTIVATION
and economic goals at the residential level, the next step is
The origination of IoE emanates from the high penetration optimizing the energy route among all devices in a smart home
of RESs across the power grids and can be used for energy and the HANs. In [33] a hierarchical optimization method
management enhancement. IoE provides a decentralized struc- for ERs is proposed to reduce the complexity of centralized
ture that facilitates accommodation and integration of RESs optimal dispatch on large-scale systems. The consideration
besides increasing demand-supply reliability. Real-time mon- level is higher than the sub-distribution network, where there is
itoring that is enabled through bidirectional communication no routing algorithm for the residential section. Also, storage
is another significant advantage of this cloud-based network. and Electric Vehicles (EVs) effects have been ignored. An
Finally, automation controls implemented in an IoE-based energy router design with an optimized algorithm that covers
power network improve energy efficiency, leading to profit smart homes, HANs, and grid levels at the same time is
maximization [19], [25]. The residential sector, including both still absent. Moreover, the desired algorithm that can perform
consumers and prosumers, is known as the most important in online and offline mode has not been studied yet, and
sector in the IoE networks and economic goals are one of islanding mode identification should also be investigated. IoE
the main motivations for this sector [26]. IoE facilitates han- relies on two-way communication with a fully interconnected
dling challenges associated with OS optimization to improve Advanced Metering Infrastructures (AMIs) network, which
energy efficiency, profit maximization, and decreasing GHGs makes the system vulnerable to cyber and physical malicious
emissions. Besides that, IoE can be used to enhance energy activities. For instance, a cyber-attack on the Supervisory
routing optimization. As the core of IoE, an Energy Router Control and Data Acquisition (SCADA) system of a regional
(ER) adjusts the energy route dynamically among the devices electricity distribution company in Ukraine led to a power
considering technical and economic constraints to minimize outage for 225,000 customers, which caused massive technical
energy loss and maximize profit [27], [28]. Consequently, opti- and economic damages [34], [35]. Also, physical tampering
mizing electricity routing is vital, either made for a single user and bypassing are responsible for 20% of the electricity lost
or at the Home Area Network (HAN) level. Despite the above- in India [36], [37]. Therefore, guaranteeing security comes
mentioned advantages, IoEs are vulnerable to cyber-attacks with high priority in IoE-enabled power systems. In [38], a
due to the broad range of bidirectional interconnection among defence mechanism based on an interval state predictor is
installed smart devices. Therefore, to take advantage of this proposed, which can mitigate the effect of malicious attacks.
technology, assuring security against malicious cyber/physical However, the proposed method does not consider the storage,
attacks is required [29]. Several investigations have been EVs and DC busses in the ERs. The security challenges of
conducted on the application of IoE during the last decade, integrating fog computing into the IoE are studied in [22],
but the challenges mentioned above have not been addressed focusing on collision attacks. Still, FDI or physical attacks
well. [30], [31] proposed a real-time power scheduling method like AMIs manipulation and bypassing are not considered.
based on the structural design of the IoE. The suggested Therefore, developing a comprehensive algorithm for cyber
approach has a high computational burden and highly relies and physical attack detection in different layers of IoE is
on regional zones. Also, there is no specific planning for the necessary.
home appliances operation, which is necessary to discover
surplus electricity for market trading and profit maximization.
In [16], a scheduling method for smart homes and plug-in
3
contributions of this research are multifaceted and significant. framework represents a significant advancement in energy
By developing optimized scheduling and routing algorithms, management. It not only enhances the efficiency of energy
the research aims to enhance the efficiency and reliability use but also empowers consumers to become active partic-
of energy systems. The introduction of sophisticated attack ipants in the energy market. By optimizing the scheduling
detection mechanisms, using cutting-edge machine learning of energy resources and shifting loads to off-peak hours, the
techniques, will ensure the security and integrity of the data, framework reduces costs, increases the utilization of renewable
safeguarding IoE networks from both cyber and physical energy, and supports the development of a more resilient
threats. This comprehensive approach will not only improve and sustainable energy system. This research aims to provide
the performance of energy systems but also contribute to the a robust solution that addresses the current challenges in
development of more resilient and secure IoE infrastructures, energy management and contributes to the evolution of smart,
capable of meeting the evolving challenges of the modern en- integrated energy networks.
ergy landscape. The research contribution can be summarized
as follows: A. Developing an energy routing algorithm
via DRL considering technical and economic constraints
IV. D EVELOPING A DRL- BASED SCHEDULING Routing optimization extremely relies on OS since the amount
FRAMEWORK FOR TECHNO - ECONOMIC IMPROVEMENT IN and direction of energy that needs to be sent or received
THE POWER NETWORK among devices is planned based on the scheduling algorithm’s
As the first major contribution, this research proposal in- results. Also, the transmission constraints that are notified
troduces a Q-Learning based framework, an off-policy Re- by utilities must be satisfied. As the second contribution, a
inforcement Learning (RL) algorithm, to optimize operation DRL based algorithm is proposed to deal with the complexity
schedules within diverse environments and under various con- of the problem and minimize the negative impact of power
straints. These environments include home appliances, energy generation and consumption uncertainty at both user-level
resources (such as generation and storage units), and the tech- and grid levels. Furthermore, the proposed DRL-based OS
nical, economic, and communication layers that interconnect algorithm is capable of handling islanding mode challenges
them. The proposed framework aims to create an optimal besides offline working ability.
scheduling program that transforms traditional consumers into
prosumers by shifting the operation of flexible loads to off- B. Cyber-physical attack detection guarantying information
peak hours. This shift not only reduces energy costs for correctness
households but also fosters a competitive market where even To ensure the security of the IoE, an LSTM algorithm is
those without their own energy generation units can participate proposed to detect FDI attacks in time series data, combined
effectively. The scheduling of Renewable Energy Sources with a CNN for feature extraction and correlation identification
(RESs), fixed storage systems, and Electric Vehicles (EVs) among different types of data to deal with uncertainties.
is contingent on their specific limitations and the overall con- Additionally, an Ensemble Deep Convolutional Neural Net-
ditions of the interconnected environments. These conditions work (EDCNN) algorithm is proposed for Electricity Theft
include the dynamic variations in energy supply and demand, Detection (ETD) caused by AMIs’ physical manipulation. As
as well as the technical and economic constraints that can fluc- the first layer of the model, a random under bagging technique
tuate over time. By leveraging Deep Reinforcement Learning is applied to deal with the imbalance data, and then deep
(DRL), the proposed framework enables consumers to act as CNNs are utilized on each subset. Finally, a voting system
prosumers. This is achieved by strategically shifting flexible is embedded in the last part.
loads to off-peak periods, thereby lowering energy bills and
allowing for the trading of surplus energy. In practical terms,
the Q-Learning based framework optimizes the scheduling C. Significance of the Study
process by continuously learning and adapting to changes Electrical energy plays a crucial role in this era and as a
in the environment. It considers the real-time status and high demand type of energy. Utilizing RESs is vital due to
capabilities of home appliances, energy generation and storage global warming caused by GHGs emissions originated from
units, and the overall energy market conditions. By doing so, it using fossil fuels to generate electricity. High penetration of
identifies the most cost-effective and efficient times to operate RESs requires modern energy management systems to deal
various devices and systems, ensuring that energy consumption with technical and economic constraints. IoE is the enabler
is aligned with periods of lower demand and higher availability for modern energy management, which provides novel energy
of renewable energy. Moreover, the framework’s adaptability trading features besides real-time monitoring and dynamic
to varying technical and economic constraints ensures that scheduling. After reviewing all the related studies, this pro-
it remains effective even as external conditions change. This posal determined the research gaps of application of IoE for
adaptability is crucial in a dynamic energy landscape where energy management, including optimal OS, optimal energy
supply and demand can be highly variable. The use of DRL routing, security, and energy trading. Consequently, a com-
techniques allows the system to handle these complexities prehensive scheduling program is proposed to optimize the
and make informed decisions that maximize both individual energy components’ operation scheduling considering techno-
and collective benefits. The Q-Learning based optimization economic constraints besides achieving the best energy routing
5
plan. Also, a reliable platform is required to ensure the off-peak hours, households can reduce energy costs and par-
correctness of the information used in the scheduling and ticipate in energy trading markets, even without their own
routing algorithms. Therefore, an attack detection algorithm energy generation units. This shift not only benefits individual
is developed detecting FDI attacks and electricity theft [19], households by lowering bills but also contributes to a more
[39]. balanced and efficient energy grid. The IoE concept, supported
by RL approaches, offers a powerful framework for improving
energy efficiency through optimized scheduling and real-time
V. P ROPSOED M ETHODOLOGY
management. By leveraging the strengths of RL algorithms,
After reviewing the various studies and reports, along this research aims to enhance the operational efficiency of
with industrial and market demand, the problem statements smart homes and integrate them more effectively into the
have been recognized. A comprehensive literature review then broader energy system. This approach promises to deliver
demonstrated the research gaps and challenges that have led significant economic and environmental benefits, paving the
to defining research objectives and contributions. Next, the way for a more sustainable and resilient energy future.
best solution has been suggested for every goal where several
datasets have been collected to train, evaluate, and test the
proposed algorithms. The proposed solutions are as follows:
the interconnected environments. These conditions include of the problem and minimize the negative impact of power
dynamic variations in energy supply and demand, as well as generation and consumption uncertainty at both user-level
the technical and economic constraints that can fluctuate over and grid levels. Furthermore, the proposed DRL-based OS
time. By leveraging Deep Reinforcement Learning (DRL), the algorithm is capable of handling islanding mode challenges
proposed framework enables consumers to act as prosumers. besides offline working ability.
This is achieved by strategically shifting flexible loads to
off-peak periods, thereby lowering energy bills and allowing
for the trading of surplus energy. In practical terms, the Q- C. Cyber-physical attack detection guarantying information
Learning based framework optimizes the scheduling process correctness
by continuously learning and adapting to changes in the
environment. It considers the real-time status and capabilities To ensure the security of the IoE, an LSTM algorithm is
of home appliances, energy generation and storage units, and proposed to detect FDI attacks in time series data, combined
the overall energy market conditions. By doing so, it identifies with a CNN for feature extraction and correlation identification
the most cost-effective and efficient times to operate various among different types of data to deal with uncertainties.
devices and systems, ensuring that energy consumption is Additionally, an Ensemble Deep Convolutional Neural Net-
aligned with periods of lower demand and higher availability work (EDCNN) algorithm is proposed for Electricity Theft
of renewable energy. Moreover, the framework’s adaptability Detection (ETD) caused by AMIs’ physical manipulation. As
to varying technical and economic constraints ensures that the first layer of the model, a random under bagging technique
it remains effective even as external conditions change. This is applied to deal with the imbalance data, and then deep
adaptability is crucial in a dynamic energy landscape where CNNs are utilized on each subset. Finally, a voting system
supply and demand can be highly variable. The use of DRL is embedded in the last part.
techniques allows the system to handle these complexities
and make informed decisions that maximize both individual
and collective benefits. The Q-Learning based optimization VII. C ONCLUSION
framework represents a significant advancement in energy
management. It not only enhances the efficiency of energy In conclusion, this research addresses the critical challenges
use but also empowers consumers to become active partic- of energy management in IoE-enabled smart grids by devel-
ipants in the energy market. By optimizing the scheduling oping advanced algorithms for operation scheduling, energy
of energy resources and shifting loads to off-peak hours, the routing, and cyber-physical security. The DRL-based schedul-
framework reduces costs, increases the utilization of renewable ing algorithm optimizes the use of RESs, reducing costs and
energy, and supports the development of a more resilient emissions while maximizing economic benefits for prosumers.
and sustainable energy system. This research aims to provide The energy routing algorithm ensures efficient electricity flow,
a robust solution that addresses the current challenges in adapting to technical and economic constraints. Furthermore,
energy management and contributes to the evolution of smart, the proposed security framework enhances the resilience of
integrated energy networks. IoE networks against cyber and physical attacks, ensuring
[27]. reliable operation. The contributions of this research provide
a robust foundation for future developments in smart grid
technology, promoting the integration of renewable energy
B. Developing an energy routing algorithm and enhancing overall energy management efficiency. These
via DRL considering technical and economic constraints advancements are essential for meeting the growing energy
Routing optimization extremely relies on OS since the amount demands in an environmentally sustainable manner. By lever-
and direction of energy that needs to be sent or received aging cutting-edge AI technologies, this research paves the
among devices is planned based on the scheduling algorithm’s way for smarter, more resilient power systems. The findings
results. Also, the transmission constraints that are notified highlight the potential for IoE to revolutionize energy man-
by utilities must be satisfied. As the second contribution, a agement, making it a cornerstone for future innovations in the
DRL based algorithm is proposed to deal with the complexity field.
8