Auction Based Clustered Federated Learning in Mobile Edge Computing System
arXiv:2103.07150v1 [cs.LG] 12 Mar 2021

Abstract—In recent years, mobile clients’ computing ability and storage capacity have greatly improved, allowing them to handle some applications locally. Federated learning is a promising distributed machine learning solution that uses local computing and local data to train the Artificial Intelligence (AI) model. Combining local computing and federated learning can train a powerful AI model under the premise of ensuring local data privacy while making full use of mobile clients’ resources. However, the heterogeneity of local data, that is, non-independent and identically distributed (Non-IID) data and imbalance of local data size, may become a bottleneck hindering the application of federated learning in mobile edge computing (MEC) systems. Inspired by this, we propose a cluster-based client selection method that can generate a federated virtual dataset that satisfies the global distribution to offset the impact of data heterogeneity, and we prove that the proposed scheme converges to an approximately optimal solution. Based on the clustering method, we propose an auction-based client selection scheme within each cluster that fully considers the system’s energy heterogeneity, and we give the Nash equilibrium solution of the proposed scheme for balancing energy consumption and improving the convergence rate. The simulation results show that our proposed selection methods and auction-based federated learning can achieve better performance with the Convolutional Neural Network (CNN) model under different data distributions.

Index Terms—Federated learning, Auction mechanism, Cluster.

I. INTRODUCTION

[...] has become one of the main bottlenecks hindering artificial intelligence development. Federated learning [1] was proposed as a promising distributed learning approach to alleviate the privacy leakage problem of machine learning. Unlike traditional machine learning and distributed machine learning, there is no need to centralize user data for AI model training. In the federated learning system, clients only need to transmit the model parameters or gradients trained on their local data to the aggregation server, thereby protecting data privacy.

Moreover, data heterogeneity brings new challenges to federated learning development. Wang et al. [2] proposed a reinforcement learning solution for federated learning in Non-IID scenarios. Since this scheme requires multiple rounds of reinforcement learning model training in advance for different scenarios, its generalization ability is weak. There are also some clustering solutions to this challenge [3], [4], [5], [6], [7]. For example, Sattler et al. [3] divided the clients into several groups based on the similarity of their local models and then trained within each group to improve the average accuracy. The above solutions have shown some effectiveness in dealing with Non-IID scenarios, but they ignore the imbalance of local data. That is, the size of the data generated by different clients is inconsistent. Cai et al. [8] proposed a scheme that uses dynamic samples to solve the problem of data imbalance without considering the impact of Non-IID.

Also, especially in wireless edge networks, the energy
[Fig. 1: overview of auction based clustered federated learning. Recoverable labels: Server, Global model, Cluster, Group 2 ... Group J, Local data, Random data, Test data, Auction, Train, Local update 1 ... Local update J, Iterate.]
[...] al. [2] observed that the clustering scheme could speed up the convergence of the global model compared with randomly selecting user local models on Non-IID data. They also verified the effectiveness of the clustering algorithm through experiments. However, in their experimental settings, each user has the same number of data samples, which is unrealistic in real systems. In practice, different users have different data sizes, which means the scale of data owned by edge clients is imbalanced. Therefore, in our research, we fully consider the two aspects of data heterogeneity: Non-IID and imbalance of local data. Besides, they did not give a theoretical analysis. On the other hand, only a subset of edge clients is selected for training in each iteration. Appropriate edge client selection can effectively improve the convergence rate of the global model. However, the energy of mobile edge clients is limited. Therefore, we propose an energy-balanced selection mechanism in this paper.

B. Contribution

Fig. 1 gives an overview of auction based clustered federated learning, and the main contributions of our research are as follows:
• We propose a client selection scheme based on initial gradient clustering, which mainly includes the following improvements: 1) We introduce the concept of federated virtual datasets, whose goal is to transform the problem of heterogeneity in distributed local data into that of heterogeneity in virtual datasets. 2) To alleviate the impact of local data imbalance and ensure client clustering accuracy, we propose a sample window mechanism before clustering. 3) We give a theoretical analysis of the proposed scheme and prove that it can converge to an approximately optimal solution under the stochastic gradient descent algorithm.
• Given the uneven resource consumption caused by randomly selecting clients in the cluster, we propose an intra-cluster client selection scheme based on the auction mechanism, which fully considers the data heterogeneity and each client’s remaining energy. At the same time, we give the optimal solution for clients’ bidding, which satisfies the Nash equilibrium.
• We evaluate the performance of our scheme through simulation in a variety of Non-IID scenarios. Furthermore, we introduce the metric of energy consumption balance in the federated learning scenario for the first time. The simulation results show that our scheme performs well in convergence rate and energy consumption balance.

II. RELATED WORKS

In recent years, federated learning [1], as a special distributed machine learning approach, has been widely studied by researchers. On the one hand, the original intention of federated learning is to train the AI model while ensuring data privacy. The works [11], [12], [13], [14], [15] study federated learning from the perspective of protecting clients’ data privacy and AI models. On the other hand, different from the traditional distributed machine learning system, the federated learning system’s communication environment is more complex and uncertain. Therefore, reducing communication overhead and improving communication efficiency is another core challenge of federated learning. The works [16], [17], [18], [19] are devoted to reducing the communication cost of federated learning or the communication rounds required for training. Recently, the heterogeneity of federated learning systems has become the main bottleneck of its development. FL heterogeneity is divided into data heterogeneity and structural heterogeneity [20].

Our research mainly addresses the challenge of data heterogeneity in the training process of federated learning models. In terms of training data samples, unlike conventional distributed machine learning, the training data samples of federated learning are generally Non-IID. McMahan et al. [1] proposed the Federated Averaging (FedAvg) algorithm, which is a deep network federated learning method based on iterative model averaging. They also pointed out that the FedAvg algorithm is still applicable when the data of clients is Non-
[...] samples to each client, and each client can have at least 100 data samples and a maximum of 1200 data samples.

[...] and computing services from the bidders. After bidding, there will be K_j (K_j ≥ 1) winners in cluster j, whose bids are the lowest within the cluster.
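The winner rule stated above (in cluster j, the K_j lowest bids win) can be sketched as follows. This is only a minimal illustration under stated assumptions, not the paper's selection algorithm: the function name `select_winners`, the dictionary layout of bids, and the tie-break by client identifier are all hypothetical, and how each client computes its bid is not modeled here.

```python
from typing import Dict, List


def select_winners(
    bids: Dict[int, Dict[str, float]],
    k: Dict[int, int],
) -> Dict[int, List[str]]:
    """Return, for each cluster j, the K_j clients with the lowest bids.

    bids: cluster id -> {client id -> bid value} (hypothetical layout).
    k:    cluster id -> K_j, the number of winners in that cluster.
    """
    winners: Dict[int, List[str]] = {}
    for j, cluster_bids in bids.items():
        k_j = k[j]
        # The text requires K_j >= 1; K_j cannot exceed the cluster size.
        if k_j < 1 or k_j > len(cluster_bids):
            raise ValueError(f"invalid K_j={k_j} for cluster {j}")
        # Sort by bid, breaking ties by client id (an assumption made
        # here so that the result is deterministic).
        ranked = sorted(cluster_bids.items(), key=lambda item: (item[1], item[0]))
        winners[j] = [client for client, _ in ranked[:k_j]]
    return winners


if __name__ == "__main__":
    example_bids = {
        1: {"a": 3.0, "b": 1.0, "c": 2.0},
        2: {"d": 5.0, "e": 4.0},
    }
    print(select_winners(example_bids, {1: 2, 2: 1}))
```

Sorting each cluster independently keeps the selection local to the cluster, which matches the per-cluster auction described in the text.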
[Figure residue: two panels titled "FMNIST"; legend entries Gradients_cluster_random, Weights_cluster_random, FedAvg_random; only axis tick values survive, axis names and caption are not recoverable.]

Our auction-based federated learning system mainly consists of four parts: energy consumption, cost function, reward model, and auction-based edge clients selection algo-