Experience-Driven Computational Resource Allocation of Federated Learning by Deep Reinforcement Learning
Motivation
Deep learning techniques have emerged as the most promising approach for training models for tasks such as object detection,
classification, and anomaly detection. However, a large share of big data is generated by resource-constrained user equipments
(UEs) such as smartphones and IoT devices, where it is impractical to upload all of this data to one centralized server for training.
Centralized model training is a cumbersome process that faces many hindrances, such as network quality restrictions, privacy and
ownership concerns, and lack of collaboration. Federated learning was therefore introduced to bring distributed learning under one
umbrella, enhancing collaboration among devices without exposing private data. Federated learning works in iterations: UEs train
their local models and upload the model parameters, instead of the data, to the centralized server, where a global model is
synthesized and then distributed back to the UEs. UEs, however, are heterogeneous in computation and communication capabilities
due to their varying underlying hardware, so there is a tradeoff between these two costs during federated learning, and active
research has been conducted on its optimization. There has also been a push for designing learning algorithms with faster
convergence, but the main issue addressed here is energy efficiency, which is the crucial motivation for the work in the paper. A
tradeoff between energy efficiency and learning time arises from the UEs' heterogeneous nature combined with the synchronization
among training nodes after each iteration, all under network quality that is unpredictable due to mobility or environmental factors.
Instead of combining network quality prediction with optimization algorithms, the paper turns to machine learning to solve this
federated learning problem.
Problem Statement
Federated learning over wireless networks poses the optimization problem of computational resource allocation on mobile devices:
the allocation should capture the tradeoff between communication and computation costs and improve energy efficiency (by
trading idle time for power savings) without slowing down training. This is a significant issue for mobile devices in heterogeneous
environments of computation and communication capabilities, combined with varying physical specifications and battery
exhaustion limitations. Another factor is that previous papers unrealistically assume stable network connectivity among the
connected devices.
Contributions
The paper contributes a new computational resource allocation algorithm for federated learning that considers both the
convergence time and the mobile devices' energy consumption. The proposed algorithm is experience-driven, i.e., it can learn the
best resource allocation strategies from previous actions (using an actor-critic model), and it is evaluated both on a small-scale
testbed and in large-scale simulations, where it outperforms traditional state-of-the-art solutions by a clear margin. The DRL agent,
based on an actor-critic network, forms the core of the federated learning system and predicts the most suitable CPU-cycle
frequency for each mobile device at the beginning of every iteration. The DRL agent interacts with the federated learning system,
which defines the rules, restrictions, and reward mechanism; it observes the system's state and determines the action based on
previous experience, which is what makes the algorithm experience-driven. It learns through states, actions, and rewards to find
the best policy mapping a state to an action that maximizes the discounted cumulative reward. Improving the energy efficiency of
federated learning by carefully controlling the CPU-cycle frequency is the key contribution of the paper. Because of the hardness
of the control problem and the lack of advance knowledge of network quality, machine learning methods were applied and an
experience-driven method based on DRL was devised to solve the control problem. The DRL agent is trained on real-world
network datasets, and the final trace-driven experiments further demonstrate the superiority of the DRL-based approach over
state-of-the-art solutions.
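To make the actor-critic structure concrete, the following is a minimal PyTorch sketch of how such an agent could be organized, assuming the state is each device's recent bandwidth history and the action is one (normalized) CPU-cycle frequency per device; the layer sizes, the Gaussian policy parameterization, and the names (Actor, Critic, num_devices, history) are illustrative assumptions rather than the paper's exact architecture.

    import torch
    import torch.nn as nn

    class Actor(nn.Module):
        """Policy network: maps the observed state to a distribution over
        per-device CPU-cycle frequencies, normalized to (0, 1)."""
        def __init__(self, state_dim, num_devices, hidden=128):
            super().__init__()
            self.backbone = nn.Sequential(
                nn.Linear(state_dim, hidden), nn.ReLU(),
                nn.Linear(hidden, hidden), nn.ReLU(),
            )
            self.mu = nn.Linear(hidden, num_devices)
            self.log_std = nn.Parameter(torch.zeros(num_devices))

        def forward(self, state):
            h = self.backbone(state)
            mean = torch.sigmoid(self.mu(h))  # in (0, 1); rescale to each device's [f_min, f_max]
            return torch.distributions.Normal(mean, self.log_std.exp())

    class Critic(nn.Module):
        """Value network: estimates the expected discounted return of a state."""
        def __init__(self, state_dim, hidden=128):
            super().__init__()
            self.v = nn.Sequential(
                nn.Linear(state_dim, hidden), nn.ReLU(),
                nn.Linear(hidden, hidden), nn.ReLU(),
                nn.Linear(hidden, 1),
            )

        def forward(self, state):
            return self.v(state)

    # Assumed dimensions: 10 devices, each contributing its last 8 bandwidth measurements.
    num_devices, history = 10, 8
    actor = Actor(state_dim=num_devices * history, num_devices=num_devices)
    critic = Critic(state_dim=num_devices * history)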
Proposed Approach
The proposed approach is broadly divided into two parts: the federated learning system and the DRL agent. Considering a
practical scenario with dynamic network bandwidth, the authors build the state in the DRL formulation from historical bandwidth
information, since future network bandwidth is related to past bandwidth. The action in the mth iteration is defined as the set of
CPU-cycle frequencies of all connected mobile devices in that iteration. Hence, in one iteration, the mobile devices complete their
federated learning updates and upload the new parameters to the parameter server, and once every mobile device has completed its
upload, the DRL agent obtains the system cost of the current iteration. The PPO algorithm is chosen for policy optimization
because of its ease of implementation and tuning, its sample efficiency, and its guarantee of low deviation from the previous
policy. The DRL agent maintains an experience replay buffer, a policy (the actor), and an estimate of the value function (the
critic). Since the parameter server can access the mobile devices' information, the DRL agent can be trained offline.
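As one illustration of how the per-iteration system cost could combine energy and time, the sketch below assumes a standard dynamic-energy model (energy proportional to cycles times frequency squared) and a synchronous round whose duration is set by the slowest device; the function name, the constant kappa, and the weight beta are assumptions for illustration, not quantities taken from the paper.

    def system_cost(cpu_freqs, workloads, upload_times, kappa=1e-27, beta=0.5):
        """Illustrative per-iteration system cost; the reward is its negative.

        cpu_freqs[i]    : CPU-cycle frequency chosen for device i (Hz)
        workloads[i]    : CPU cycles device i needs for its local update
        upload_times[i] : time for device i to upload its parameters (s)
        kappa, beta     : assumed energy coefficient and time/energy weight
        """
        # Assumed dynamic energy model: E_i = kappa * cycles_i * f_i^2
        energies = [kappa * c * f ** 2 for c, f in zip(workloads, cpu_freqs)]
        # Each device computes, then uploads; the synchronous round ends when the slowest device finishes
        round_time = max(c / f + u for c, f, u in zip(workloads, cpu_freqs, upload_times))
        return sum(energies) + beta * round_time

    # reward_k = -system_cost(...) once all devices have uploaded in iteration k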
The training procedure begins with random initialization of the actor and critic network parameters. The real-world network
dataset and the mobile devices' information are pre-loaded, constructing a simulated training environment for the federated
learning system. To train the DRL agent efficiently, a separate (old) policy is used to sample the federated learning environment,
so that the agent can reuse the experience sampled by the old policy multiple times. The federated learning system randomly
selects a start time, and the DRL agent constructs the initial state from each mobile device's bandwidth history. The DRL agent
then starts to execute CPU-cycle frequency control: at the beginning of the kth iteration of federated learning, it feeds the current
state into the policy network and derives the corresponding action. After the mobile devices receive the action from the DRL
agent, they train the deep learning model at the CPU-cycle frequencies specified by that action. The kth iteration ends when the
parameter server has received all the updates from the mobile devices. The DRL agent can then calculate the reward obtained in
the kth iteration, and the federated learning system moves to the next state. At the same time, the experience from the kth iteration
is stored in the experience replay buffer. When the experience replay buffer is full, the DRL agent is updated with the experience
in the buffer, with the actor network updated by the PPO approach. After the DRL agent has learned from the experience in the
buffer, the new actor-network parameters are assigned to the old policy for the next round of sampling, as sketched below.
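Under those steps, the offline training loop could look roughly like the following, continuing the Actor/Critic sketch from the Contributions section; the env object (a stand-in for the simulated environment built from the pre-loaded bandwidth traces), the hyperparameters, and the plain discounted-return advantage estimate are assumptions made for illustration, and the actual implementation may differ (for example by using GAE or minibatch updates).

    import copy
    import torch

    # Assumed hyperparameters and training budget
    gamma, clip_eps, ppo_epochs, buffer_size = 0.99, 0.2, 4, 256
    num_rounds = 10_000
    actor_opt = torch.optim.Adam(actor.parameters(), lr=3e-4)
    critic_opt = torch.optim.Adam(critic.parameters(), lr=1e-3)
    old_actor = copy.deepcopy(actor)   # the "old" policy used only for sampling
    buffer = []                        # experience replay buffer: (state, action, log_prob, reward)

    state = env.reset()                # env: hypothetical simulated FL environment from the traces
    for _ in range(num_rounds):
        with torch.no_grad():
            dist = old_actor(state)
            action = dist.sample()                      # CPU-cycle frequencies for this iteration
            log_prob = dist.log_prob(action).sum(-1)
        next_state, reward, done = env.step(action)     # run one federated learning iteration
        buffer.append((state, action, log_prob, torch.tensor(float(reward))))
        state = env.reset() if done else next_state

        if len(buffer) == buffer_size:
            states, actions, old_lp, rewards = (torch.stack(x) for x in zip(*buffer))
            # Discounted returns, computed backwards over the sampled experience
            returns, running = [], 0.0
            for r in reversed(rewards.tolist()):
                running = r + gamma * running
                returns.insert(0, running)
            returns = torch.tensor(returns)
            advantages = (returns - critic(states).squeeze(-1)).detach()

            for _ in range(ppo_epochs):                 # reuse the old policy's samples several times
                dist = actor(states)
                ratio = (dist.log_prob(actions).sum(-1) - old_lp).exp()
                # Clipped PPO surrogate keeps the new policy close to the sampling policy
                actor_loss = -torch.min(ratio * advantages,
                                        ratio.clamp(1 - clip_eps, 1 + clip_eps) * advantages).mean()
                actor_opt.zero_grad(); actor_loss.backward(); actor_opt.step()

                critic_loss = (critic(states).squeeze(-1) - returns).pow(2).mean()
                critic_opt.zero_grad(); critic_loss.backward(); critic_opt.step()

            # Sync: assign the new actor parameters to the old policy before sampling again
            old_actor.load_state_dict(actor.state_dict())
            buffer.clear()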
Limitations
The question for federated learning implementations has always been efficiency: federated learning requires small delays and high
reliability from mobile devices. Although the paper tries to address this issue, the algorithm is tested only in simulation rather than
in a real-time experiment. The dataset itself is old and does not reflect modern interconnection bandwidth, traffic, and other
complications. The unpredictability of network bandwidth and QoS remains a limitation in this area and is still an open issue.