The Fusion of Deep Reinforcement Learning and Edge Computing for Real-Time Monitoring and Control Optimization in IoT Environments
1 Northern Arizona University, 1900 S Knoles Dr, Flagstaff, Arizona, USA
2 University of Electronic Science and Technology of China, Chengdu, China
3 Trine University, Phoenix, Arizona, USA
4 Yantai University, Tokyo, Japan
5 Northwestern University, Atlanta, Georgia, USA
Abstract: In response to the demand for real-time performance and control quality in industrial Internet of Things (IoT) environments, this paper proposes an optimization control system based on deep reinforcement learning and edge computing. The system leverages cloud-edge collaboration, deploys lightweight policy networks at the edge, predicts system states, and outputs controls at a high frequency, enabling monitoring and optimization of industrial objectives. Additionally, a dynamic resource allocation mechanism is designed to ensure rational scheduling of edge computing resources, achieving global optimization. Results demonstrate that this approach reduces cloud-edge communication latency, accelerates response to abnormal situations, reduces system failure rates, extends average equipment operating time, and saves costs for manual maintenance and replacement. This ensures real-time and stable control.

Keywords: Deep reinforcement learning; Edge computing; Industrial Internet of Things; Lightweight policy network; Dynamic resource allocation

I. INTRODUCTION

With the rapid development of the industrial Internet of Things, there is a growing demand for real-time monitoring and control of systems. However, relying on cloud computing centers for computation and decision-making often fails to meet the constraints of real-time responsiveness [1]. In this regard, this study proposes a novel industrial system control architecture that actively senses the environment and makes rapid decisions through the organic combination of deep reinforcement learning and edge computing [2]. This approach deploys lightweight policy networks at the network edge to predict and control local states at a high frequency. Simultaneously, multiple edge nodes collaborate with the cloud center, enhancing control real-time performance at the edge while the cloud center tracks strategies and performs global optimization. This paper provides a detailed construction of the system's overall architecture and functional modules, and designs a lightweight policy network structure and a dynamic resource allocation mechanism for edge computing. Experimental validation demonstrates the effectiveness of this approach, which significantly reduces control latency and improves control quality and cost-effectiveness compared to architectures relying solely on the cloud center.

II. OPTIMIZATION CONTROL SYSTEM BASED ON DEEP REINFORCEMENT LEARNING AND EDGE COMPUTING
A. Overall System Architecture

This system employs a layered architecture comprising a sensor acquisition layer, an edge computing layer, and a cloud computing layer [7]. The sensor layer collects environmental and system data like temperature and machine status. This data is sent to the edge computing layer, where edge servers perform real-time analysis and local decision-making. Deep reinforcement learning models in this layer predict and control system behavior, creating a digital twin to forecast and optimize operations. The cloud computing layer oversees the entire system, providing powerful computing and storage to refine control strategies and system logic. The architecture is service-oriented and modular, with components like data acquisition, storage, deep learning, control, and scheduling modules connected via a message bus. This design enhances flexibility and scalability, allowing the system to automate and intelligently control operations, adapt to various scenarios, and efficiently manage complex tasks [8].

Fig. 1. Overall System Architecture
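As a rough illustration of the service-oriented, message-bus design described above, the sketch below wires hypothetical acquisition and control modules through a minimal publish/subscribe bus. The topic and module names are assumptions for illustration, not the paper's implementation.

```python
# Minimal publish/subscribe message bus sketch; module and topic names are
# illustrative assumptions, not the paper's actual implementation.
from collections import defaultdict
from typing import Callable, Dict, List


class MessageBus:
    """Routes messages between loosely coupled system modules."""

    def __init__(self) -> None:
        self._subscribers: Dict[str, List[Callable[[dict], None]]] = defaultdict(list)

    def subscribe(self, topic: str, handler: Callable[[dict], None]) -> None:
        self._subscribers[topic].append(handler)

    def publish(self, topic: str, message: dict) -> None:
        for handler in self._subscribers[topic]:
            handler(message)


bus = MessageBus()

# Edge-layer control module reacts to sensor readings published by the
# acquisition module (hypothetical topics "sensor/state" and "control/command").
def control_module(reading: dict) -> None:
    command = {"machine_id": reading["machine_id"], "action": "adjust_speed"}
    bus.publish("control/command", command)

bus.subscribe("sensor/state", control_module)
bus.subscribe("control/command", lambda cmd: print("dispatch:", cmd))

# Sensor acquisition layer pushes a reading onto the bus.
bus.publish("sensor/state", {"machine_id": "m-01", "temperature": 71.3})
```

Because modules only exchange messages through topics, new acquisition, storage, or scheduling components can be attached without modifying existing ones, which matches the flexibility and scalability goals stated above.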
B. System Functional Modules

The functional modules of this system mainly consist of the Deep Reinforcement Learning module and the Edge Computing module, as shown in Figure 2. The Deep Reinforcement Learning module is responsible for learning and optimizing system control strategies and is primarily divided into a modeling unit, a policy network, and a decision unit [9]. Among them, the modeling unit constructs an environment model to predict system states; the policy network represents and approximates control strategies using neural networks; and the decision unit provides control decisions based on the network outputs. The Edge Computing module primarily offers local data storage, preprocessing, caching, and other functions to assist in the training of the Deep Reinforcement Learning module. Additionally, it includes a task distributor that dynamically allocates edge computing tasks and a data collector that aggregates data from edge nodes.

Fig. 2. System Functional Modules
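One possible reading of this module decomposition is sketched below as three cooperating units; all class and method names are assumed for illustration and are not taken from the paper.

```python
# Skeleton of the three units of the Deep Reinforcement Learning module as
# described above; class names, method names, and dynamics are assumptions.
import numpy as np


class ModelingUnit:
    """Environment model: predicts the next system state from state and action."""

    def predict_next_state(self, state: np.ndarray, action: int) -> np.ndarray:
        # Placeholder dynamics; a learned model would be used in practice.
        return state + 0.01 * action


class PolicyNetwork:
    """Represents and approximates the control strategy with a neural network."""

    def action_values(self, state: np.ndarray) -> np.ndarray:
        # Placeholder scores; the lightweight perceptron is described in Sec. III-A.
        return np.array([-np.abs(state).sum(), 0.0, np.abs(state).sum()])


class DecisionUnit:
    """Turns network outputs into a concrete control decision."""

    def decide(self, q_values: np.ndarray) -> int:
        return int(np.argmax(q_values))


state = np.array([0.2, -0.1])
policy, decider = PolicyNetwork(), DecisionUnit()
action = decider.decide(policy.action_values(state))
next_state = ModelingUnit().predict_next_state(state, action)
```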
III. KEY TECHNOLOGIES AND ALGORITHMS

A. Lightweight Deep Reinforcement Learning for the Edge

To accommodate the limited computational and storage resources of edge computing nodes, the system employs a customized, lightweight deep reinforcement learning algorithm. This algorithm uses simpler network structures, such as a three-layer perceptron, instead of complex deep convolutional neural networks, reducing the number of parameters and the model's space footprint. The experience replay buffer size is also limited to around 5000 transition samples to manage capacity. During training, the batch size is set to 16, matching the parallel computing capabilities of edge servers. The lightweight model contains about 1 million parameters and occupies less than 400 MB, making it suitable for deployment on less powerful edge computing nodes. It can deliver near real-time control strategy outputs, even on inexpensive hardware. In practical applications like monitoring machine tools in smart manufacturing, the model can re-plan and issue control strategy instructions every 5 seconds, optimizing the machine's dynamic performance and extending its lifespan. Compared to traditional cloud-based models, which may have an average delay of over 1 minute in computing and issuing control commands, this system significantly reduces control loop latency, demonstrating more efficient and responsive control in real-time scenarios.
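To make the description above concrete, the following is a minimal sketch of a lightweight policy network with a bounded replay buffer in TensorFlow/Keras (the framework named in Section IV). Only the buffer size (5000), batch size (16), and learning rate (0.01, from Table 1) come from the paper; the state dimension, action count, and layer widths are assumptions and do not reproduce the stated parameter count.

```python
# Sketch of a lightweight policy network with a bounded replay buffer.
# Buffer size (5000), batch size (16), and learning rate (0.01) follow the
# paper; layer widths, state dimension, and action count are assumptions.
import random
from collections import deque

import numpy as np
import tensorflow as tf

STATE_DIM, NUM_ACTIONS = 16, 4
BUFFER_SIZE, BATCH_SIZE = 5000, 16

# Three-layer perceptron instead of a deep convolutional network.
inputs = tf.keras.Input(shape=(STATE_DIM,))
x = tf.keras.layers.Dense(128, activation="relu")(inputs)
x = tf.keras.layers.Dense(128, activation="relu")(x)
outputs = tf.keras.layers.Dense(NUM_ACTIONS)(x)  # one value per control action
policy_net = tf.keras.Model(inputs, outputs)
policy_net.compile(optimizer=tf.keras.optimizers.Adam(learning_rate=0.01),
                   loss="mse")

# Experience replay buffer limited to ~5000 transitions.
replay_buffer = deque(maxlen=BUFFER_SIZE)


def store(state, action, reward, next_state):
    replay_buffer.append((state, action, reward, next_state))


def sample_batch():
    batch = random.sample(replay_buffer, BATCH_SIZE)
    states, actions, rewards, next_states = map(np.array, zip(*batch))
    return states, actions, rewards, next_states


# Example: select a control action for the current state.
state = np.random.rand(1, STATE_DIM).astype(np.float32)
action = int(np.argmax(policy_net.predict(state, verbose=0)))
```

Using a deque with a fixed maxlen discards the oldest transitions automatically, which keeps the buffer within the 5000-sample budget on memory-constrained edge nodes.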
B. Dynamic Collaborative Distributed Optimization Algorithm

To achieve global optimal control through cloud-edge collaboration, this system designs a dynamic optimization allocation distributed algorithm. The algorithm is coordinated by the cloud-side master node, which can request or release edge servers' computational resources on demand and maintains a list of available resources while monitoring the load on each edge server. Based on the current system state and control requirements, it runs an environmental monitoring program that dynamically selects a group of edge servers with the best combined indicators, such as bandwidth and computing capacity. It also allocates critical control modules with higher computational intensity to servers with stronger computing power. By aggregating intermediate states and control results from various edge nodes, it collaboratively optimizes and obtains a global control strategy. The optimization can be expressed using the following formulas:

Maximize \sum_{i=1}^{n} \sum_{j=1}^{m} A(c_j, r_i)    (1)

where r_i represents the resources of the i-th server, and c_j represents the j-th control module.

\sum_{i=1}^{n} x_{ij} \le 1, \quad \forall j    (2)

Each control module can be assigned to at most one resource.

Server resource and load constraints:

l_i + \sum_{j=1}^{m} x_{ij} \cdot \mathrm{Load}(c_j) \le \mathrm{Capacity}(r_i), \quad \forall i    (3)

where x_{ij} is a binary decision variable indicating whether control module c_j is assigned to resource r_i (1 if assigned, 0 otherwise), \mathrm{Load}(c_j) represents the computational density of control module c_j, \mathrm{Capacity}(r_i) is the maximum capacity of resource r_i, and l_i denotes the load already on server r_i.
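Equations (1)–(3) describe an assignment problem. The sketch below solves it with a simple greedy heuristic (heavier, critical modules first, each placed on the feasible server with the highest score); this is an illustrative approximation rather than the paper's solver, and the affinity function A(c_j, r_i) is assumed.

```python
# Greedy heuristic for the assignment problem in Eqs. (1)-(3); an illustrative
# approximation, not the paper's solver. The affinity score A(c, r) is assumed.
from dataclasses import dataclass
from typing import List


@dataclass
class Server:               # resource r_i
    name: str
    capacity: float         # Capacity(r_i)
    load: float             # current load l_i
    bandwidth: float


@dataclass
class ControlModule:        # control module c_j
    name: str
    demand: float           # Load(c_j), computational density
    critical: bool = False


def affinity(c: ControlModule, r: Server) -> float:
    """Assumed score A(c_j, r_i): prefer strong, lightly loaded servers for critical modules."""
    headroom = r.capacity - r.load
    return headroom + r.bandwidth + (headroom if c.critical else 0.0)


def greedy_assign(modules: List[ControlModule], servers: List[Server]) -> dict:
    assignment = {}
    # Place critical / heavier modules first.
    for c in sorted(modules, key=lambda m: (m.critical, m.demand), reverse=True):
        feasible = [r for r in servers if r.load + c.demand <= r.capacity]  # Eq. (3)
        if not feasible:
            assignment[c.name] = None   # each module gets at most one server, Eq. (2)
            continue
        best = max(feasible, key=lambda r: affinity(c, r))
        best.load += c.demand
        assignment[c.name] = best.name
    return assignment


servers = [Server("edge-1", capacity=10, load=2, bandwidth=5),
           Server("edge-2", capacity=6, load=1, bandwidth=8)]
modules = [ControlModule("spindle-control", demand=4, critical=True),
           ControlModule("logging", demand=1)]
print(greedy_assign(modules, servers))
```

An exact solution could instead be obtained with an integer-programming solver on the cloud-side master node, at higher computational cost.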
IV. EXPERIMENT AND RESULTS ANALYSIS

A. Experimental Environment and Dataset

The experimental environment for this research is based on the TensorFlow framework and utilizes an NVIDIA Tesla V100 GPU for constructing, training, and testing deep neural networks. To thoroughly validate the effectiveness of the proposed method, the experiments use an open-source IoT simulation environment, IoTSim, as the dataset. IoTSim includes data from sensors, edge-layer resource configurations, and network parameters. The dataset encompasses readings from various heterogeneous sensors, such as temperature, humidity, and voltage, spanning a month with a sampling frequency of one sample per minute. Considering the real-time control requirements of the system, the research sets up an environment where the system reports its state and outputs control commands every 5 seconds, and the dataset is resampled accordingly.
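A minimal sketch of this resampling step is shown below, assuming a pandas time series and time-based interpolation when upsampling the one-minute readings to the 5-second control period; the column name and interpolation choice are assumptions, since the paper only states that the dataset is resampled.

```python
# Sketch of resampling 1-minute sensor readings to the 5-second control period.
# Column name and interpolation choice are assumptions; the paper only states
# that the dataset is "resampled accordingly."
import numpy as np
import pandas as pd

# One month of per-minute readings from a hypothetical temperature sensor.
idx = pd.date_range("2023-01-01", periods=30 * 24 * 60, freq="1min")
df = pd.DataFrame({"temperature": 20 + np.random.randn(len(idx)).cumsum() * 0.01},
                  index=idx)

# Upsample to a 5-second grid and interpolate between the minute samples.
df_5s = df.resample("5s").interpolate(method="time")

print(df_5s.head())
```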
B. Model Hyperparameter Settings

In deep reinforcement learning, the choice of hyperparameters plays a crucial role in the model's performance and convergence speed. Hyperparameters are those parameters that need to be manually set when training deep reinforcement learning models, and they can influence the training process and the model's final performance. These hyperparameters are listed in Table 1.

TABLE I. MODEL HYPERPARAMETERS

Hyperparameter | Value | Description
Experience Pool Size | 5000 | A larger experience pool provides more training data, aiding in model convergence.
Learning Rate | 0.01 | Adjustment of the learning rate can affect the model's convergence speed and stability.
Discount Factor (γ) | 0.95 | A smaller γ value focuses more on short-term rewards, while a larger γ value emphasizes long-term rewards.
Target Network Update Frequency (τ) | 100 | The choice of update frequency (τ) can impact the model's stability and learning speed.
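To show where the hyperparameters in Table 1 enter training, the outline below applies the discount factor γ = 0.95 in the one-step TD target and syncs a target network every τ = 100 steps; it is a simplified, assumed sketch of a DQN-style update, not the paper's training code.

```python
# Where the hyperparameters in Table 1 enter a DQN-style update (simplified,
# illustrative outline; not the paper's training code).
import numpy as np

HYPERPARAMS = {
    "experience_pool_size": 5000,
    "learning_rate": 0.01,
    "discount_factor": 0.95,          # gamma
    "target_update_frequency": 100,   # tau, in training steps
}


def td_target(reward: np.ndarray, next_q_max: np.ndarray) -> np.ndarray:
    """One-step discounted target: r + gamma * max_a' Q_target(s', a')."""
    return reward + HYPERPARAMS["discount_factor"] * next_q_max


def should_sync_target(step: int) -> bool:
    """Copy policy-network weights to the target network every tau steps."""
    return step % HYPERPARAMS["target_update_frequency"] == 0


# Example: targets for a sampled batch and a weight-sync check at step 200.
targets = td_target(np.array([1.0, 0.0]), np.array([0.5, 0.2]))
print(targets, should_sync_target(200))
```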
The cloud-edge collaborative deep reinforcement learning framework greatly improved real-time control effectiveness and quality. Future enhancements aim to increase the adaptability of edge agents.