0% found this document useful (0 votes)

33 views15 pages

An Efficient Privacy-Enhancing Cross-Silo Federated Learning and Applications For False Data Injection Attack Detection in Smart Grids

Uploaded by

Hareesh Talari

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

33 views15 pages

An Efficient Privacy-Enhancing Cross-Silo Federated Learning and Applications For False Data Injection Attack Detection in Smart Grids

Uploaded by

Hareesh Talari

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 15

2538 IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, VOL.

18, 2023

An Efficient Privacy-Enhancing Cross-Silo

Federated Learning and Applications
for False Data Injection Attack
Detection in Smart Grids
Hong-Yen Tran , Jiankun Hu , Senior Member, IEEE, Xuefei Yin , and Hemanshu R. Pota

Abstract— Federated Learning is a prominent machine learn- for jointly training a global model. Due to this characteristic,
ing paradigm which helps tackle data privacy issues by allowing federated learning offers significant privacy improvements
clients to store their raw data locally and transfer only their over centralizing all the training data. However, federated
local model parameters to an aggregator server to collaboratively
train a shared global model. However, federated learning is learning is vulnerable to inference attacks from dishonest
vulnerable to inference attacks from dishonest aggregators who aggregators who can infer information about clients’ training
can infer information about clients’ training data from their data from their model parameters (weights, gradients) [2],
model parameters. To deal with this issue, most of the proposed [3], [4], [5], [6], [7]. For example, [4] employed generative
schemes in literature either require a non-colluded server setting, adversarial networks to infer the private data of a target client
a trusted third-party to compute master secret keys or a secure
multiparty computation protocol which is still inefficient over from its shared model parameters. This means that even if
multiple iterations of computing an aggregation model. In this the model is trained in federated learning, data privacy still
work, we propose an efficient cross-silo federated learning scheme cannot be rigorously guaranteed. Information can be extracted
with strong privacy preservation. By designing a double-layer from global model parameters, but this information cannot be
encryption scheme which has no requirement to compute discrete linked to a specific single client because the data samples are
logarithm, utilizing secret sharing only at the establishment phase
and in the iterations when parties rejoin, and accelerating the anonymized among multiple clients. However, this is not the
computation performance via parallel computing, we achieve an case if the information is inferred from local model parameters
efficient privacy-preserving federated learning protocol, which by a corrupted aggregator. Thus, clients’ model parameters
also allows clients to dropout and rejoin during the training should be protected from the access of a corrupted aggregator
process. The proposed scheme is demonstrated theoretically and to prohibit these potential inference attacks.
empirically to provide provable privacy against an honest-but-
curious aggregator server and simultaneously achieve desirable To address this problem, existing approaches focus on two
model utilities. The scheme is applied to false data injection attack main techniques, which are differential privacy-based and
detection (FDIA) in smart grids. This is a more secure cross- secure aggregation-based. The former adds noise directly to
silo FDIA federated learning resilient to the local private data the client’s models over a numerous number of iterations;
inference attacks than the existing works. thus, it has the drawbacks of sacrificing the global model
Index Terms— Privacy-preserving, federated learning, encryp- accuracy to make a trade-off of privacy-utility. The latter
tion, secret sharing, false data injection attack detection. utilizes techniques in cryptography such as secure multiparty
computation and homomorphic encryption to securely aggre-
I. I NTRODUCTION gate the clients’ models without knowing their specific values.

F EDERATED learning [1] is an emerging machine learning

paradigm which addresses critical data privacy issues by
enabling clients to store their raw data locally and transfer only
However, most of these existing approaches rely on a trusted
third party to generate the master key for aggregation or a
setting with multiple non-colluding servers. Besides, many
their updated local model parameters to an aggregator server proposed schemes are still inefficient and impractical due to
the expensive overhead of computation and communication
Manuscript received 15 April 2022; revised 22 February 2023; accepted
11 April 2023. Date of publication 17 April 2023; date of current version among multiple clients over multiple rounds of training.
26 April 2023. This work was supported in part by ARC Discovery Grant False data injection attack (FDIA) detection [8], [9] is a
DP190103660 and Grant DP200103207, and in part by ARC Linkage Grant critical security operation in a smart grid control system. and
LP180100663. The associate editor coordinating the review of this manuscript
and approving it for publication was Prof. Chia-Mu Yu. (Corresponding has been solved by data-driven machine learning methods.
author: Jiankun Hu.) The data-driven machine learning methods require a huge
Hong-Yen Tran, Jiankun Hu, and Hemanshu R. Pota are with the amount of measurement data which are distributed over an
School of Engineering and Information Technology, The University of
New South Wales Canberra at ADFA, Canberra, ACT 2602, Australia (e-mail: interconnected grid. In such an interconnected grid, each sub-
[email protected]; [email protected]; [email protected]). grid is possessed and managed by an independent transmission
Xuefei Yin is with the School of Information and Communication grid company (TGC) regarding power industry deregulation
Technology, Griffith University, Gold Coast, QLD 4222, Australia (e-mail:
[email protected]). [10], [11]. To build a high-accuracy model for false data injec-
Digital Object Identifier 10.1109/TIFS.2023.3267892 tion detection, measurement data from all involved sub-grids
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/
TRAN et al.: EFFICIENT PRIVACY-ENHANCING CROSS-SILO FEDERATED LEARNING AND APPLICATIONS 2539

should be shared. However, transmitting such huge measure- differential privacy [14], which adds appropriate noise to
ment data over the network for a centralized detection machine shared parameters according to the desired privacy level.
learning algorithm is expensive and also leads to security For example, [15] added Laplace noise to the gradients and
and privacy issues including competitive privacy [12]. The selectively shared the perturbed gradients, [16], [17] presented
question is how to coordinate these TGCs to detect FDI attacks a client-sided differential privacy federated learning scheme to
while preserving their competitive privacy. This remains a hide clients’ model contributions during training. To protect
challenging problem which has been attracting recent studies local models, the added noise to each local model must be
with federated learning-based solutions. In federated learning, big enough, resulting in the aggregate noise corresponding to
a cross-silo setting is often established where a number of the aggregate model being too large, which would completely
companies or organizations have a common incentive to train destroy the utility of this model.
a model based on all of their data, but do not share their data The other technique is secure multiparty computation and
directly due to confidentiality/privacy or legal constraints [13]. homomorphic encryption for secure aggregation. The scheme
To enhance the privacy of power companies when they in [18] was based on Elgamal homomorphic encryption. This
contribute their local training models, an efficient privacy- scheme requires a trusted dealer to provide each participant
preserving cross-silo federated learning for FDIA detection with
Pk a secret key ski and the aggregator sk0 such that
over multi-area transmission grids should be designed. i=0 ski = 0. Their private secure aggregation is aggregator
In view of the above issues, we propose an efficient cross- oblivious in the encrypt-once random oracle model where each
silo federated learning with strong privacy preservation which participant only encrypts once in each time period. To decrypt
can be applicable to the smart grid domain. By designing a the sum, it ends up computing the discrete logarithm which
double-layer encryption scheme over multiple federated learn- can be implemented through a brute-force
√ search or Pollard’s
ing rounds and utilizing Shamir secret sharing, we achieve an lambda method which requires O( k1), where k is the num-
efficient privacy-preserving federated learning protocol, which ber of parties and 1 is the maximum value of any party’s input.
also allows some clients to drop out and rejoin dynamically To overcome the limitations of solving discrete logarithm
during the training process. Specifically, we summarize the problems, [19] presented a scheme in the encrypt-once random
main contributions as follows: oracle model with fast encryption and decryption based on
• A general privacy-enhancing cross-silo federated learning Decisional Composite Residuosity Assumption which removes
with a secure weighted aggregation scheme is designed the discrete logarithm computation. However, this scheme also
based on lightweight double-layer encryption and Shamir requires a trusted dealer to generate and distribute the secret
secret sharing. The scheme removes the requirement of keys to participants and an aggregator. Besides, both of the
computing discrete logarithms which is the limitation of approaches in [18] and [19] only deal with secure aggregation
some related works. No multiple non-colluding server of scalars over periods of time (not the secure weighted aggre-
settings are required. Besides, clients’ secret keys of two gation of model vectors over multiple iterations of federated
encryption layers are generated in a decentralized manner learning) and does not deal with dropouts/rejoining problems.
which helps increase privacy. Addressing the drawbacks of [18] and [19], the work in [20]
• The proposed scheme is demonstrated theoretically and proposed a secure aggregation scheme where the input is a
empirically to provide provable privacy against an honest- vector and can deal with dropouts. The scheme is based on
but-curious aggregator server and simultaneously achieve pairwise additive stream ciphers and Shamir secret sharing to
desirable model utility. tackle client failures. Diffie-Hellman key exchange is adopted
• The proposed scheme is efficient in communica- to share common pair-wise seeds of a pseudorandom gen-
tion/computation and robust against dropouts/rejoining erator. Double-masking is introduced to prevent leakage if
during training iterations. there is any delay in transmission. Nevertheless, this approach
• An efficient privacy-enhancing cross-silo federated learn- requires at least four communication rounds between each
ing resilient to the local training data inference attacks client and the aggregator in each iteration and a repetition
for FDIA detection in the smart grid domain is proposed of Shamir secret sharing for each iteration. Thus, it suffers
and empirically evaluated. from communication and computation inefficiency considering
This paper consists of eight sections. Following this Intro- the huge number of iterations of federated learning. Utilizing
duction section are the Related Works and Preliminaries the technique of secure data aggregation in [20], the work
sections. The proposed privacy-enhancing cross-silo feder- in [21] proposed a general privacy-enhanced federated learning
ated learning without any trusted third parties is given in scheme with secure weighted aggregation, which can deal
Section IV, followed by the analysis of the scheme in with both the data significance evaluation and secure data
Section V. A concrete scenario of enhancing privacy in cross- aggregation. This scheme still inherits the same drawbacks
silo federated learning for FDIA detection in smart grids with as [20]. Besides, this scheme only resolved a weak security
empirical evaluation is given in Section VI and Section VII. model where no collusion between the server and the clients
Finally, Section VIII is for the discussion and conclusions. participating in the federated learning. The paper [22] pre-
sented Prio, a privacy-preserving system for the collection of
II. R ELATED W ORKS aggregate statistics. With a similar approach, [23] introduced
Existing works on enhancing privacy for federated learning SAFELearn, a generic design for efficient private federated
mainly employ two types of techniques. One technique is learning systems that protect against inference attacks using
2540 IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, VOL. 18, 2023

secure aggregation. However, these designs rely on multiple do not consider the security problem of local data privacy
non-colluded server settings. Dong et. al. in [24] designed leakage from local models as in [32] and our proposed scheme.
two secure ternary federated learning protocols against semi- The scheme in [32] enhanced privacy by utilising Pallier-
honest adversaries based on threshold secret sharing and based homomorphic encryption for secure model aggregation,
homomorphic encryption respectively. In the first protocol, but only resolved a weak security model where no collusion
threshold secret sharing is used to share all local gradient among the server and the clients participating in the federated
vectors in all iterations, which causes expensive computation learning. All clients have to share a common pair of public key
and communication overhead. Besides, the limitation of their and secret key for encryption/decryption and a trusted party is
second protocol is that all clients use the same secret key and required to generate this key pair.
if the server colludes with a client then it can obtain all client’s A privacy-preserving federated learning approach needs to
models. In [25], Fang et. al. modified the traditional ElGamal be efficient in computation and communication while provid-
protocol into a double-key encryption version to design a ing strong privacy preservation and desirable model utility.
new scheme for federated learning with privacy preservation Most of the related works focus on the basic problem of secure
in cloud computing. Nevertheless, the scheme has to solve aggregation with the main approaches based on secure multi-
the discrete logarithm problem as [18]. The study in [26] party computation, homomorphic encryption, and differential
combined additively homomorphic encryption with differential privacy. In spite of some achievements in secure aggregation
privacy but cannot tolerate client dropouts. Their system and privacy-preserving federated learning, there are still draw-
creates significant run-time overheads which makes it imprac- backs. The majority of proposed schemes in literature either
tical for real-world federated learning applications. Functional require a trusted third party to compute master secret keys or
encryption and differential privacy is utilized in [27] to design all local parties share a common secret key or non-colluded
the HybridAlpha scheme. However, HybridAlpha relies on a server settings. This means these works guarantee privacy in
trusted party that holds the master keys. The proposed scheme weaker security models (e.g., no collusion).
in [28] replaced the complete communication graph in [20] The proposed scheme does not require a trusted dealer to
with a k-regular graph of the logarithmic degree to reduce the provide each participant with a secret key as the scheme
communication cost while maintaining the security guarantees; in [18], [19], [27], and [32]. While the schemes in [18]
however, each client shares its secret across only a subset of and [25] require computing the discrete logarithm, our scheme
parties, and thus the dropout-resilience is downgraded. removes that complexity by utilizing the encryption-decryption
Considering the integrity of the global model besides the based on the Decisional Composite Residuosity assumption.
privacy preservation of the local data and models, the proposed Moreover, both of the approaches in [18] and [19] only deal
approach in [29] combined the Paillier additive homomorphic with secure aggregation of scalars over periods of time, not the
and verifiable computation primitives. The scheme in [29] can secure weighted aggregation of model vectors over multiple
verify the correctness of the aggregated model given the fact iterations of federated learning. The dropout and rejoining
that every client provides their genuine local models. From problems were not investigated in these works too. Although
the perspective of privacy preservation, the scheme can only eliminating the drawbacks in [18] and [19], the schemes in
tolerate a weaker threat model. No collusion among the server [20] and [28] suffer higher computation overhead than the
and clients participating in the federated learning protocol was proposed approach and do not address federated learning with
assumed as the keys (sk, pk) necessary for the homomorphic secure weighted aggregation. Other systems in [22] and [23]
encryption and the signatures are generated by one of the depend on multiple non-colluded server settings, which is not
clients and shared among all clients. In the work [17], to deal required with our scheme. The systems in [21], [24], [29],
with the problem of collusion in [29], adding Gaussian noise and [32] cannot tolerate the risk of revealing all clients’ models
to the local models before homomorphically encryption was when there is a collusion between the server and a client as
proposed. However, the standard variation of the additive our protocol. The study in [26] cannot resolve client dropouts.
Gaussian noise must be small to not destroy the genuine local Their system creates significant run-time overheads, making
models, resulting in the fact that the adding noise protection it impractical for real-world federated learning applications.
is not able to provide a high level of differential privacy (ε is Our scheme is resilient to dropouts and provides efficient
not small, i.e., less than 1). performance for real applications, such as privacy-preserving
The power grid scenario of false data injection attack federated learning false data injection detection.
detection based on federated learning in smart grids has been To summarize, Table I gives a comparison of our scheme
studied in [30], [31], and [32]. The investigated power grid with related works regarding the application scenario of FDIA
scenario is similar in these papers and in the proposed scheme. federated learning with secure weighted aggregation (A1, A2)
For example, in [30] an independent power system state owner and different security/privacy properties (A3-A8). Only three
(PSSO) and a detection service provider (DSP) correspond recent works [30], [31], [32] studied the FDIA federated
to an independent transmission grid company (TGC) and a learning. Most of the related works do not provide all security
system operator (SO) in the proposed scheme. The power grid properties A3-A8. Only the studies in [20] and [28] filtered
scenario fits with the investigated cross-silo federated learning from Table I satisfy all security/properties as the proposed
setting (e.g., the number of parties (PSSOs/TGCs) is small and approach. Table II compares the computation and communi-
each party is facilitated with high-performance computing). cation complexity between these two studies [20], [28] and the
However, [30] and [31] only apply federated learning and proposed scheme. From Table I and Table II, it can be seen that
TRAN et al.: EFFICIENT PRIVACY-ENHANCING CROSS-SILO FEDERATED LEARNING AND APPLICATIONS 2541

TABLE I federated learning is the federated learning setting when clients

C OMPARISON OF O UR S CHEME W ITH R ELATED W ORKS . A1. FDIA are different organizations or geo-distributed data centres that
F EDERATED L EARNING , A2. S ECURE W EIGHTED M ODEL
AGGREGATION , A3. N O T RUSTED D EALER TO G ENERATE
have the incentive to train a shared model on the union of
AND D ISTRIBUTE S ECRET K EYS , A4. N O N ON -C OLLUDED their siloed data. [13]
S ERVER S ETTING , A5. C OLLUSION R ESISTANCE , Several algorithms have been proposed for federated learn-
A6. D ROPOUTS /R EJOINS H ANDLING , A7. N O
D ISCRETE L OGARITHM P ROBLEM S OLVING ,
ing. In this work, we utilize FedAvg [1], which is the original
A8. P RIVACY-S ECURITY T RADE -O FF federated learning aggregation mechanism and is commonly
applied in related works. In FedAvg, the global model param-
eters are updated Pkby nsumming the weighted local model
parameters w = i=1 n
i
· w i .

C. Shamir Secret Sharing

(t, n) Shamir secret sharing scheme [33] creates k shares
{s (1) , . . . , s (n) } of a secret s such that s can be efficiently
reconstructed by any combination of t data pieces but cannot
by any set of less than t data pieces.
s, s (1) , . . . , s (n) are the elements in a finite field Z p for some
large prime p where 0 < t ≤ n < p. The scheme works as
follows:
• Setup: The secret holder randomly chooses a1 , . . . , at−1
from Z p and a0 = f (0) = s to define a polynomial of
degree t ≤ 1:

f (x) = a0 + a1 x + a2 x 2 + · · · + at≤1 x t≤1 mod p

• Sharing: The secret holder computes s (i) = f (i) for

i ∈ {1, 2, . . . , n}, and sends (i, s (i) ) to the corresponding
participants i.
• Reconstructing: Given any t of (i, s (i) ) pairs, an user is
able to reconstruct the secret
t t
X
( j)
Y xm
s = a0 = s · mod p
xm − x j
j=1 m=0,m̸ = j
the proposed scheme guarantees privacy in a stronger security
model and at a lower computational overhead than the related
works. D. Decisional Composite Residuosity Assumption
Let N = p · q for two large primes p and q. The Decisional
III. P RELIMINARIES Composite Residuosity (DCR) assumption [34] states that the
A. Notations and Definitions advantage of a distinguisher D, defined as the distance:
Column vectors are denoted by lower-case bold letters, AdvDCR := |Pr[D(y, N ) = 1| y = x N mod N 2 , x ←$ Z∗N ]
D
like v. The i-th entry of the vector v is vi . v T is the transpose
− Pr[D(y, N ) = 1| y ←$ Z∗N 2 ]|
of the column vector v. The zero-vector is represented by
0. Given a set S, x ←$ S indicates that x is sampled where probabilities are taken over all coin tosses, is a negli-
uniformly at random from S. The notion [k] represents the gible function
set {0, 2, . . . , k − 1}. The computational indistinguishability
of two distributions H0 and H1 , is denoted by H0 ∼ = H1 .
Table III lists the notions used in this paper. E. False Data Injection Attacks
False data injection attacks (FDIAs) are designed by manip-
B. Federated Learning ulating some measurements to circumvent the residual-based
Federated learning is a machine learning scheme where bad data detection in a power management system [8],
multiple clients collaborate in generating a shared machine [9], [35]. Various algorithms have been designed to detect
learning model, under the coordination of a central server. these attacks using new techniques instead of the residual-
Each client’s raw data is stored locally and not transmitted; based bad data detection mechanism. One example is the deep
instead, their local model parameters are sent to the server learning network to model the spatial-temporal relationship
for aggregation to achieve the learning objective. Cross-silo between bus and line measurements in [36].
2542 IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, VOL. 18, 2023

TABLE II
C OMPARISON OF S ECURE AGGREGATION A MONG [20], [28] AND O URS . k I S THE N UMBER OF L OCAL PARTIES /C LIENTS . L I S THE
L ENGTH OF THE C LIENTS ’ M ODEL V ECTOR . τ I S THE S HAMIR S ECRET S HARING T HRESHOLD

TABLE III risk of inference attacks performed by an honest-but-curious

L IST OF N OTATIONS aggregator on each local model to extract information about
the corresponding party’s local data used for training. Hence,
ordinary federated learning needs to be integrated with privacy
protection techniques to prohibit access to individual model
updates. The system should be designed in a way to hide
local models from the aggregator to counter the inference
attacks while still enabling efficient and accurate federated
learning.
The following section introduces and explains the main
techniques in the proposed privacy-enhancing cross-silo fed-
erated learning scheme.

B. High-Level Technical Overview

1) Protecting Local Models: The encryption scheme in [19]
based on the DCR assumption in the random oracle model is
utilized to obtain the global model vector as the weighted
average function of a set of local model vectors given their
(t) (t)
(t)
encryptions ci, j = (1 + N1 )xi, j · H1 ( j)ski mod N12 . Here,
(t)
xi, j is the j-th element of the i-th party’s model vector
encoded in a non-negative integer form at the t-th iteration,
(t)
ski is the secret encryption key of i-th party at the t-th
IV. P ROPOSED P RIVACY-E NHANCING C ROSS -S ILO iteration. The main benefit of this construction is that the
F EDERATED L EARNING weighted average global model vector can be retrieved without
computing the discrete logarithm as the other approaches in
A. System Model and Overview of the Proposed literature [18], [25]. In [19], only the secure aggregation
Privacy-Enhancing Cross-Silo Federated Learning is considered and it is assumed that there exists a trusted
dealer generatesPencryption key ski , i = 1 · · · k and the master
Consider a system with k local parties and an aggregator k
key sk0 = − i=1 ski . In our proposed scheme, the secure
server. Each local party owns its private dataset Di , i ∈
weighted aggregation is investigated, each party creates its
{1, . . . , k} with n i = |Di | samples. All local participants agree (t)
own secret key ski and the master key is computed from
on the same learning network structure N . The global learning
clients’ secret keys in a secure computation manner. To enable
network model at t-th iteration consists of L weight parame-
(t) (t) (t) (t) the secure weighted aggregation of local models which was
ters, denoted as w G = {w0 , w1 , . . . , w L−1 }. The aim is to
not considered in [19], the number of each party’s training
learn a global network model from all local datasets without (t)
samples is also encrypted by the corresponding ski at each
exposing participants’ data privacy under the coordination of
iteration. The master key P to decrypt the global model vector
the aggregator. (t)
is calculated as msk (t) = i∈Uat ski , where Uat is the set of
The adversary is the honest-but-curious aggregator server
alive parties who contribute their encrypted local models for
which is assumed to follow the protocol honestly, but attempts
aggregation. This master key should be computed in a secure
to infer sensitive information about participants’ training data
(t) way to increase privacy level. This is achieved by designing
from their model updates wi . It is also assumed that there
the second layer of the basic encryption scheme to encrypt
are private and authenticated peer-to-peer channels between (t)
the secret encryption keys ski of the first layer, which is
parties so that the data transferred cannot be eavesdropped (t) (t) (t)
on or modified. This can be enforced in practice with the βi = (1 + N2 )ski · H2 (t)vi mod N22 . The secret encryption
(t) (t)
appropriate use of Digital Signatures and Certificate Authori- key of this second layer is vi . The requirement for vi is that
(t)
it is privately generated by each party such that i∈U vi = 0,
P
ties. To implement federated learning which utilizes the union
of local datasets, for each iteration, each party contributes where U is the set of all parties. Different from the secret keys
(t)
its local model vector wi . Unfortunately, this raises the sk of the first layer which are generated at each iteration, the
TRAN et al.: EFFICIENT PRIVACY-ENHANCING CROSS-SILO FEDERATED LEARNING AND APPLICATIONS 2543

secret encryption keys v (0) which are created at the initial keys v of the second layer can be used for multiple iterations.
sub-protocol π0 of the establishment phase basically can be Shamir’s secret sharing for the secrets s is only implemented
used for multiple iterations (v (t) = v (t−1) = · · · = v (0) ). The at the establishment phase and in the iterations when parties
generation of v (t) is based on the correlated antiparticles
P using rejoin. Besides, only rejoining parties Pr generate new key
(t) (t) (t) (t) (t)
common pair-wise secrets, vi = γ i> j γ j,i ,
P
i< j i, j − pairs and transmit their new public keys pkr = g sr .
(t)
where γi, j is the common initial pair-wise secrets between
party i and party j created by adopting the Diffie-Hellman
Key exchange protocol. C. Description of the Proposed Protocol
2) Handling Dropouts: Shamir’s τ -out-of-k secret sharing Algorithm 1 describes the overall steps of the proposed
is utilized to allow a user to split a secret into k shares, such privacy-enhancing cross-silo federated learning from the client
that any τ shares can be used to reconstruct the secret, but side and the server side.
any set of at most τ ≤ 1 shares gives no information. Each
(t)
party creates k shares of its secret si , keeps one share and Algorithm 1 Proposed Privacy-Enhancing Cross-Silo
sends each share to each different party from k − 1 remaining Federated Learning Algorithm
shares. At each iteration t, after receiving the ciphertexts, Input:
the aggregator broadcasts the set of alive parties Uat , the T : Maximum number of rounds, k: the number of
set of the dropped parties Udt = U \ Uat . If Uat = U then clients selected in each round, Nepoch : the number of
(t)
i∈Uat vi = 0; but, if Uat ⊂ U then the sum local epochs, and η: the local learning rate, pp: public
P
we have
(t) parameters
i∈U t vi
P
needs to be recovered. Alive parties send their
a
(ti)
Output:
shares sd of a dropped party Pd to the aggregator. Thanks Global model w G
to the τ -out-of-k Shamir threshold secret sharing scheme, the Processing:
(t)
share sd can be recovered by the aggregator as long as the [Server-side]
(ti) (t)
aggregator receives at least τ secret shares sd . Having sd , 1: Initialize w 0G
(t) (t)
the aggregator can compute vd and obtain i∈U t vi
2: for each round t from 1 to T do
P
as
d
the master key of the second encryption layer to obtain the 3: Ut contains k clients
sum
P (t)
. Because the sum
P (t) 4: for each client i ∈ Ut in parallel do
j∈Uat sk j∈U sk j is the master
t (t) (t)
j a 5: Ci ← LocalTraining(i, w G , t)
key of
P the first encryption layer; thus, it helps to get the 6: end for
(t) (t)
sum t x
j∈Ua j . 7: y(t+1) , n (t) ← Dec( pp, {Ci }i∈Uat )
(t−1) (t+1) 1
3) Handling Rejoining: Assume that the secret sd of a 8: wG = (t) · Decode(y(t+1) )
dropped party Pd was revealed to the aggregator. If Pd rejoins n
9: end for
the current iteration, which is the t-th iteration, Pd has to
(t) [Client-side: Party Pi ]
create a new secret sd . For this case, the party Pd needs
(t) LocalTraining(i, w, t):
to send its updated public key pkd = g sd to the aggregator, (t)
10: Divide local dataset Di for round t into batches;
then creates and shares Shamir’s shares of its updated secret (t)
(t) Bi denotes the set of the batches.
sd . The aggregator broadcasts the updated set of public keys 11: for each epoch j from 1 to Nepoch do
and the set of rejoining parties. Rejoining parties Pr update (t)
(t)
12: for each batch b ∈ Bi do
(t) (t)
the seeds sr,i = ( pki )sr shared with all other parties and 13:
(t) (t)
wi ← wi − η∇ L(wi ; b)
(t)
(t)
compute their updated secret vr . Other parties Pi update the 14: end for
(t) (t) si(t)
seeds si,r = ( pkr ) shared with the rejoining parties and 15: end for
(t) (t) (t)
(t) 16: z i ← n i · w i
also calculate their updated secret vi . (t) (t)
4) Reducing Communication and Computation Overhead: 17: x i ← Encode(z i )
(t) (t) (t)
To overcome the problem of communication and computation 18: Ci ← Enc( pp, xi , n i , t)
(t)
overhead in federated learning with multiple iterations, the 19: return Ci
proposed solution is threefold. The first one is to utilize
a lightweight encryption/decryption scheme which has no
requirement to compute discrete logarithms. The second one 1) Establishment: All the parties agree on the public
is to accelerate the computation performance via parallel com- parameters pp = (N1 , N2 , H1 , H2 , G, T ) where: N1 is
puting of Single Instruction Multiple Data (SIMD) of crypto- the modulus of encryption layer√ 1, N2 is the modulus of
graphic operations over model vectors and pre-computed hash encryption layer 2 and N2 > k · 2l1 where l1 is the bit-
functions. The third one is to limit the number of times of length of N1 and k is the number of local parties; H1 :
(t) Z → Z∗ 2 , H2 : Z → Z∗ 2 are two hash functions, G is the
creating and transmitting the secrets si in the Shamir secret N1 N2
sharing scheme. This is effectively performed by designing a learning network and T is the number of federated learning
double-layer encryption scheme where the secret keys sk of iterations. The sub-protocol π0 generates the secrets v(0) as
the first layer are used for only one iteration and the secret follows:
2544 IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, VOL. 18, 2023

Fig. 1. Secure weighted model aggregation procedure for one epoch.

a) Sub-protocol π0 : each training epoch, where a step in square brackets (e.g. [2])
1. The aggregator chooses and publishes a λ-bit prime, p, indicates that this step is included if dropout/rejoining happens.
where λ is the security parameter, and g is the generator At each iteration t, each Pi owns a L-length local vector
(t)
of Z∗p model wi . The following describes in detail the steps of
(0) secure weighted aggregation at the iteration t ∈ [T ]
2. Each Pi uniformly samples si ←$ Z∗p and sends
(t) (t) (t)
(0) (0) 1. Pi .Encode(wi , n i ) → x i : Pi encodes the weighted
pki = g si to the aggregator who then broadcasts the (t)
set of all public keys to all parties model to get the non-negative integer vector x i accord-
3. Each pair of clients (Pi , P j ) computes a common pairwise ing to the method in [37]:
(0) (t) (t) (t) (t) (t)
seed γi, j : z i = {n i · wi, j | j ∈ [L]} ; n i = |Di | (4)
(0) (0) (t) (t)
(0) (0)
γ j,i = ( pki )s j = ( pk j )si
(0) (0)
= γi, j (1) x i = Encode(z i ) (5)
(t) (t)
zi = Decode(x i ) (6)
4. Each Pi computes:
(0)
X (0)
X (0) [2]. If Pi rejoins this iteration, this party runs Pi .GenKey()
vi = γi, j − γi, j (2)
to generate a new pair of its secret and public key, and
i< j i> j
Pi .CreateShares() to create k shares of the updated
5. Each Pi runs the Shamir-secret sharing algorithm (t)
(0) (0)
secret si :
SS(si , τ, k) to create k shares of its secret key si and (t)
(0 j)
sends each triple (i, j, si ) to each other party P j , where si ←$ Z∗p (7)
(0 j) (0) (t) (t)
si
si is the share of si corresponding to the party P j : pki =g (8)
(0, j) (0) (t, j) (t)
{(i, j, si )} j∈[k] ← SS(si , τ, k) (3) {(i, j, si )} j∈[k] ← SS(si , τ, k) (9)
(t)
2) Secure Weighted Aggregation: This section describes Then Pi sends the updated public key to the pki
the proposed secure weighted aggregation happening at each aggregator.
federated learning iteration to evaluate the global model as the 3. Based on the receiving updated public keys, the aggrega-
weighted aggregation of the encrypted local models. Fig. 1 tor creates the set of rejoining parties of this iteration,
illustrates the main steps and computations carried out during which is Urt . If Urt = ∅ then v (t) = v (t−1) , else the
TRAN et al.: EFFICIENT PRIVACY-ENHANCING CROSS-SILO FEDERATED LEARNING AND APPLICATIONS 2545

aggregator broadcasts the updated set of the public keys – If Uat ⊂ U:

{ pk} and Urt . (t)
vi
P
(t) i∈Udt
[4]. Upon receiving Urt and { pk}, a rejoining party checks if ( i∈Uat βi ) · H2 (t) − 1 mod N22
Q
its updated public key is in the set and then continues the msk (t) =
N2
protocol; if not then leaves the protocol (early dropout).
(18)
If Urt ̸ = ∅ then Pi .UpdateSeedsSecret().
(t) (t)
If Pi rejoins, then Pi updates the seeds γi, j shared 11. A.Eval({αi }i∈Uat , msk) → n (t) ,
(t) (t)
with all other parties A.Eval({ci, j }i∈Uat , msk) → y j
(t) (t) (t) Having msk (t) , the aggregator can compute the global
γi, j = ( pk j )si ; j ∈ U \ {i} (10)
model:
(t) (t) (t)
else Pi updates the seeds γi,r shared with rejoining ( i∈Uat αi ) · H1 (L)−msk − 1 mod N12
Q
(t)
parties n =
N 1
(t) (t) (19)
γi,r = ( pkr(t) )si ; j∈ Urt \ {i} (11)
(t) (t)
( i∈Uat ci, j ) · H1 ( j)−msk − 1 mod N12
Q
(t) (t+1)
Then Pi updates their secret vi yj = (20)
N1
(t)
X (t) X (t) 1
vi = γi, j − γi, j (t+1) (t)
(12) wj = · Decode(y j ), ( j ∈ [L]) (21)
i< j i> j n (t)
Then, the aggregator sends the global model
(t) (t) (t) (t) (t) (t) (t+1)
5. Pi .Encrypt(x i , n i ) → Ci = {αi , βi , {ci, j } j∈[L] }: w (t+1) = {w j } j∈[L] to all local parties for the
(t)
Pi encrypts x i , which includes the following main steps: next epoch t + 1.
(t)
– Sample ski ←$ ±{0, 1}2l1
– Compute V. A NALYSIS OF THE P ROPOSED S CHEME
(t) (t) A. Correctness
(t)
ci, j = (1 + N1 )xi, j · H1 ( j)ski mod N12 • If Uat = U:
where j = 0 · · · L − 1 (13) From (25):
(t) (t) (t) Y (t) (t) (t)
αi = (1 + N1 )n i · H1 (L)ski mod N12 (1 + N2 )ski · H2 (t)vi mod N22
Y
(14) βi =
(t) (t) (t)
vi
βi = (1 + N2 ) ski
· H2 (t) mod N22 (15) i∈Uat i∈Uat
(t) (t)
vi
P P
ski
(t) (t) (t) (t) (t) = (1 + N2 ) i∈Uat · H2 (t) i∈Uat mod N22
– Return Ci = {αi , βi , ci = {ci, j } j∈[L] }
(t) (22)
Then Pi sends Ci to the aggregator
(t) Besides, from (2, 12), we have:
6. Receiving Ci from the alive parties, the aggregator
creates the set Uat of the alive parties and Udt = U \ Uat X (t)
vi = 0 (23)
of the dropped parties
i∈U
[7]. If Uat ⊂ U then the aggregator broadcasts Udt
(t,i) Thus,
[8]. Pi sends to the aggregator the value sd which is the
(t) (t)
share of the secret sd of a dropped party Pd in the set Udt . (t)
P
Y ski
(t) (t)
βi = (1 + N2 ) i∈Uat mod N22 (24)
[9]. A.ReconstructSecrets(Udt ) → {sd , vd }: Having the i∈Uat
Shamir’s secret shares from the alive parties, the aggre-
(t) From (17):
gator reconstructs the secret keys sd of dropped parties
(t) (t)
and then computes the secret vd of every dropped party βi − 1 mod N22
Q
(t) i∈Uat
in the set Udt from the recovered secrets. msk =
N2
(t)
X (t) X (t) P (t)
vd = γd,i − γd,i , d ∈ Udt (1 + N2 ) i∈Uat ski
mod N22 − 1 mod N22
d<i d>i =
(t)
N2
(t) s (t)
where γd,i = pki d (1 + i∈Uat ski · N2 ) mod N22 − 1 mod N22
P
(16)
=
(t) (t) N2
10. A. ComputeMSK({βi }i∈Uat , {v j } j∈U t ) → msk: The X (t)
d
= ski mod N22 (25)
aggregator computes the master key msk (t) :
i∈Uat
– If Uat = U:
(t)
• If Uat ⊂ U
β − 1 mod N22
Q
(t) Based on the Shamir threshold secret sharing scheme, the
msk = i∈U i (17) (t)
N2 aggregator can reconstruct all the secrets sd of dropped
2546 IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, VOL. 18, 2023

parties as long as the aggregator receives at least τ shares Similarly, substitute (28, 30) into (20), we have:
(t) (t)
of each sd . From that, the aggregator can recover vi of (t) (t)
( i∈Uat ci, j ) · H1 ( j)−msk − 1 mod N12
Q
(t)
dropped parties and obtains i∈U t vi .
P
(t+1)
d yj =
N1
(t) (t)
(1 + i∈Uat xi, j · N1 ) mod N12 − 1 mod N12
Y
βi
P

i∈Uat
=
N1
(t) (t) X (t)
vi
P P
ski 2
= (1 + N2 ) i∈Uat · H2 (t) i∈Uat mod N22 = xi, j mod N1 (32)
i∈Uat
(26)
(t+1) 1 (t+1)
(t)
wj = · Decode(y j )
i∈Uat vi
n (t)
P
Substitute (26) into (18), and note that +
(t) (t) 1
xi, j (t) )
X
i∈U t vi = i∈U vi = 0, we have:
P P
= (t) · Decode(
d n t
i∈Ua
(t)
vi
P
(t) 1 X
z i, j (t) (from (6))
i∈Udt
( i∈Uat βi ) · H2 (t) − 1 mod N22
Q
= (t) ·
msk (t) = n t i∈Ua
N2
(t) 1 X (t) (t)
P
(1 + N2 ) i∈Uat ski
· H2 (t)0 − 1 mod N22 = · n i · wi, j (from (4)) (33)
= n (t) t
N2 i∈Ua
(t)
(1 + i∈Uat ski · N2 ) mod N22 − 1 mod N22
P
This proves that the aggregator can compute the global model
= as the weighted average of all local models even if the
N2
X (t) aggregator does not know the true value of each local model.
2
= ski mod N2 (27)
i∈Uat
B. Security Analysis
Hence, in P
both cases, we successfully compute the master key
(t) In this section, we prove that the proposed protocol is
msk (t) = i∈Uat ski mod N22
√ (t) secure multiparty computation against an honest-but-curious
From N2 > k · 2l1 and ski < 22l1 , we have: adversary who controls the aggregator server and a set C of
X (t) colluded parties where |C| < τ . The aggregator is always
N22 > k · 22l1 > ski online while participants Pi may drop out and rejoin at any
i∈Uat iteration.
The security guarantee of the proposed scheme is based on
Then
Shamir’s secret sharing scheme, and the aggregator oblivious-
(t) ness security provided by the encryption construction in [19]
msk (t) =
X
ski (28)
under DCR assumption in the random oracle model. Secu-
i∈Uat
rity is against a computationally-bounded honest-but-curious
Next, we prove that with this master key, the global model aggregator server.
can be correctly computed. In fact, from (13, 14) we have: We will consider the executions of the proposed protocol
where an honest-but-curious aggregator server interacts with a
(t) (t)
(t) set of parties, the underlying encryption construction is based
P P
Y ni ski
αi = (1 + N1 ) i∈Uat · H1 (L) i∈Uat mod N12
on DCR assumption, and the Shamir secret sharing’s threshold
i∈Uat
is set to τ . In such executions, users might drop and rejoin
(29) at any iteration. The following proves the indistinguishability
(t) (t)
(t) of the distribution of the random variable representing the
P P
i∈Uat xi, j i∈Uat ski
Y
ci, j = (1 + N1 ) · H1 ( j) mod N12
adversary view in a real execution of the proposed protocol
i∈Uat
and the distribution of the random variable representing the
(30) adversary view in a secure-by-definition “ideal world” using
a simulation-based proof, which is a standard for security
Substitute (28, 29) into (19), we have: analysis of multiparty computation protocol [38]. The security
(t) (t) analysis of the protocol indicates that what the adversary
( i∈Uat αi ) · H1 (L)−msk − 1 mod N12
Q
n (t)
= learns from the real protocol execution is no more than
N1 what she can learn from the ideal protocol execution which
(t)
(1 + · N1 ) mod N12 − 1 mod N12 provides security/privacy. This also means the protocol in real
P
i∈Uat n i
= execution is secure against an honest-but-curious adversarial
N1
X (t)
model. To be more specific, the joint view of the server and
= n i mod N12 (31) any set of less than τ clients does not leak any information
i∈Uat about the other clients’ inputs (i.e., locally trained models/local
TRAN et al.: EFFICIENT PRIVACY-ENHANCING CROSS-SILO FEDERATED LEARNING AND APPLICATIONS 2547

training data) besides what can be inferred from the output of TABLE IV
the protocol computation (i.e., the aggregate model). C OMPUTATION OVERHEAD OF E ACH L OCAL PARTY AND THE
,τ,λ AGGREGATOR AT THE E STABLISHMENT P HASE AND E ACH
Let REALU C ∩A be a random variable representing the view I TERATION . T HE E XPRESSIONS IN [] A RE I NCLUDED
of the adversary in a real execution of the proposed protocol. IN THE C ASE OF D ROPOUT /R EJOINING H APPENS

Let SCU∩,τ,λ
A be the view of the adversary generated by a
simulator in a secure-by-definition “ideal world”. It is going
,τ,λ U ,τ,λ
to be proved that the distributions of REALU C ∩A and SC ∩A
are indistinguishable.
{REALU ,τ,λ } ∼
= {S
C ∩A
U ,τ,λ
}
C ∩A
We use the hybrid argument technique to prove this. First, federated learning where there is kr (0 ≤ kr < k) rejoined
we define a series of hybrid random variables H0 , H1 , · · · to parties (with or without Pi ) and kd (0 ≤ kd < k) dropped
construct the simulator S in an “ideal world” by the subsequent parties. The computation and communication overheads are
modifications such that any two subsequent random variables summarized in Table IV and Table V, respectively. Denote
Hi and Hi+1 are computationally indistinguishable, starting l pk , lss , li , le1 , le2 , l p are the sizes in bits of a public key,
,τ,λ
from H0 which is the same as REALU C ∩A . The final result of a secret share, an integer, a first-layered ciphertext, a second-
U ,τ,λ
subsequent modification is SC ∩A . layered ciphertext, and a plaintext, respectively. The cost in the
• H0 : This random variable is distributed exactly as square brackets ([]) is included in the case of dropouts/rejoins
,τ,λ ∼
{REALU C ∩A } = {H0 }
happens.
• H1 : This hybrid is distributed exactly as H0 , but shares 1) Computation Cost:
of 0 (using a different sharing of 0 for every honest party) a) Computation cost of a local party: The computation
(t)
substitute for all shares of si generated by honest parties cost of each party Pi at the establishment phase includes the
and given to the corrupted parties. Since the adversaries in main parts: 1- generating its public key, 2- performing each
(t)
C ∩ A do not receive any additional shares of si from an pair-wise secret agreement with each of other k − 1 parties,
honest party, the combined view of adversaries has only which takes O(k −1), and 3- creating τ -out-of-k Shamir secret
(t) (t)
|C| < τ shares of each secret si . The security properties shares of si which is O(τ · k). Thus, the computation cost
of Shamir’s secret sharing guarantee that the distribution of each party Pi at the establishment phase is O(τ · k).
of any shares of 0 is identical to the distribution of Pi ’s computation cost at each iteration is the cost of
(t) (t) (t)
an equivalent number of shares of any given secret creating the ciphertexts ci, j , αi , βi which takes O(L).
(t)
si , making this hybrid identically distributed to H0 , If Pi rejoins, then there is extra computation cost as the
{H0 } ∼= {H1 } cost of Pi in the establishment phase, which is O(τ · k).
• H2 : In this hybrid, compared to H1 , for each honest party Thus the total computation of each party in an iteration is
(t) (t)
Pi , the ciphertexts ci, j , t ∈ [T ] of xi, j is replaced by O(L + [τ · k]).
the cipher text of a dummy vector 0, the ciphertexts b) Computation cost of the aggregator: The aggregator’s
(t) (t)
αi , t ∈ [T ] of n i is replaced by the ciphertext of a computation cost can be divided into the main operations:
dummy value 0; hash function H1 is substituted with a 1- reconstructing Shamir secrets (one for each dropped party)
truly random function O1 . The aggregator obliviousness whenever dropouts happen, which takes the total time O(k 2 ),
security in the random-oracle model under the DCR and 2- obtaining wt by carrying decryption O(L) times. Thus
assumption of the construction in [19] guarantees that the total computation cost of the aggregator at an iteration is
this hybrid is indistinguishable from the previous one, O(L + [k 2 ]).
{H1 } ∼= {H2 } 2) Communication Cost:
• H3 : In this hybrid, compared to H2 , for each honest party, a) Communication cost of a local party: The communi-
(t) (t) (t)
vi is replaced by random yi subject to i∈U \C yi =
P
cation cost of each party Pi at the establishment phase includes
(t) the main parts: sending its public key to the aggregator,
− j∈C v j ; and hash function H2 is substituted with a
P
truly random function O2 . The aggregator obliviousness sending k − 1 secret shares to other k − 1 parties (each secret
security in the random-oracle model under the DCR share to each party), resulting l pk +(k −1)·lss , which is O(k).
assumption of the construction in [19] guarantees that The communication cost of each party Pi at an iteration
this hybrid is indistinguishable from the previous one, can be partitioned into the main parts: 1- receiving k updated
{H2 } ∼= {H3 } public keys from the aggregator, which takes k ·l pk , 2- sending
(t)
Defining such a simulator S as described in the last hybrid, the k − 1 secret shares of its updated secret si when it rejoins
view generated by S is computationally indistinguishable from which takes (k − 1) · lss , 3- sending its secret shares of
,τ,λ ∼
that of the real execution: {REALU ∼
C ∩A } = {H0 } = {H1 } =
∼ kd dropped parties’ secrets which is kd · lss , 4- sending an
(t) (t) (t) (t) (t)
{H2 } ∼
= {H3 } ∼
U ,τ,λ
= {SC ∩A }. encryption message Ci = {αi , βi , ci = {ci, j } j∈[L] } to
the aggregator at every iteration t, which accounts for (le1 +
C. Communication and Computation Analysis le2 + L · le1 ), and 5- receiving the aggregate model, which is
Communication and computation overheads are analyzed L ·l p . Thus, communication cost of Pi at an iteration includes:
according to the establishment phase and each iteration of download cost (i.e., receiving messages) is [k · l pk ] + L · l p or
2548 IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, VOL. 18, 2023

TABLE V
C OMMUNICATION OVERHEAD OF E ACH L OCAL PARTY AND THE AGGREGATOR AT THE E STABLISHMENT P HASE AND E ACH
I TERATION . T HE E XPRESSIONS IN [] I S I NCLUDED IN THE C ASE OF D ROPOUTS /R EJOINS H APPENS

O(L+[k]), upload cost (i.e, sending messages) is [(k−1)·lss ]+ TABLE VI

[kd ·lss ]+(le1 +le2 + L ·le1 ) < [(2k −1)·λ]+(le1 +le2 + L ·le1 ) PARAMETERS FOR L AYERS OF THE N ETWORK IN F IG . 2
(l pk = lss = λ), or O(L + [k]).
b) Communication cost of the aggregator: The commu-
nication cost of the aggregator at establishment phase includes
receiving k public keys of k parties, resulting k · l pk , which
is O(k).
The communication cost of the aggregator at an iteration
can be broken into the main parts: 1- receiving kr updated
public keys which is kr · l pk , 2- sending the updated set of
public keys to k parties which is k · k · l pk , 3- receiving secret
shares of the dropped parties from the alive parties, which
causes maximum (k−kd )·kd ·lss , 4- receiving k−kd encryption
message which is (k −kd )·(le1 +le2 +L ·le1 ), and 5- sending the
TABLE VII
aggregate model to each local party, which is k·L ·l p . Thus, the
PARAMETERS FOR LSTM L AYERS OF THE N ETWORK IN F IG . 2
communication cost of the aggregator at an iteration includes:
upload cost is [k 2 ] + k · L ·l p or O(L · k + [k 2 ]), download cost
is [kr ·l pk ]+[(k −kd )·kd ·lss ]+k ·(le1 +le2 + L ·le1 )−[kd ·(le1 +
le2 + L · le1 )] < [k · l pk ] + [k 2 /4 · lss ] + k · (le1 + le2 + L · le1 ) =
2
[( k4 +k)·λ]+k ·(le1 +le2 + L ·le1 ), resulting in O(L ·k +[k 2 ]).

VI. P RIVACY-E NHANCING C ROSS -S ILO F EDERATED

supporting security operations and power management is a
L EARNING FDIA D ETECTION IN S MART G RIDS
common interest of all parties, thus it is reasonable that they
Consider a multi-area grid of k non-overlapping areas man- are incentivised to follow the protocol to achieve the best
aged by k independent transmission grid companies (TGCs). output. However, some parties might be motivated to conspire
There is a system operator (SO) who takes care of the inter- with each other to infer private training data samples of a
connection areas and coordinates operations. Each Pi TGC target party for some business benefits. In the context of the
owns a private local dataset Di , i ∈ {1, . . . , k} with n i = |Di | above-proposed system model, a semi-honest adversary is an
samples and has communication lines with the SO and other adversary that controls SO and a set of colluded TGCs.
TGCs. To model the spatial-temporal relationship between bus and
For FDIA detection in smart grids, the federated learning line measurements, a network architecture modified from the
approach is superior to the centralised in terms of data privacy method [36] is trained for the FDIA detection, as shown
protection and communication overhead. From a data privacy in Fig. 2. The model in Fig. 2 is utilized to detect false
protection viewpoint, the private data of each local party data injection attacks in transmission power grids. In the
are not transmitted outside of federated learning, while for training stage, the model is securely trained by the pro-
a centralized approach, all these data have to be uploaded posed privacy-enhancing cross-silo federated learning frame-
to a central server, which is a risk to more security threats. work. The trained global model is then distributed to each
From a communication overhead perspective, in federated participant/sub-grid. In the test stage, each sub-grid utilizes the
learning, local models are transmitted to the centre instead of trained global model to detect FDIAs individually. Time-series
raw measurement data. This also helps reduce communication bus measurements Zbti and transmission line measurements Zlti
overhead due to the fact that the size of models is often much are fed into the model, which is utilized to model the spatial-
smaller than raw measurement data. temporal relationship between bus and line measurements. The
An honest-but-curious adversarial model is considered. model will output the likelihood of FDIAs in the current sub-
Adversaries are assumed to be honest but curious in the grid. The details of network parameters are summarised in
sense that they follow the protocol but can obtain available Table VI and Table VII.
transcripts to learn extra information that should remain pri- With the above training network architecture, the training
vate. A good result of detecting false data injection attacks network model for FDIA detection has 132743 parameters.
TRAN et al.: EFFICIENT PRIVACY-ENHANCING CROSS-SILO FEDERATED LEARNING AND APPLICATIONS 2549

Fig. 2. The network architecture for FDIA detection in AC-model transmission power grids.

TABLE VIII • sub-grid S3 contains bus 92, 22, 84, 14, 20, 76, 132, 18,
D ETAILS A BOUT THE T RANSMISSION P OWER 114, 4, 80, 90, 42, 40, 82, 28, and
G RID ‘1-HV- MIXED -0-N O S W ’
• sub-grid S4 contains bus 32, 2, 36, 70, 72, 108, 46, 78,
16, 86, 118, 24, 58, 88, 60, 30.
For each sub-grid, the bus measurements consist of
active/reactive power injections and bus voltage magnitude,
denoted by z btk ∈ Rn b ×3 at time tk ; and the line measurements
consist of active/reactive power flows and line electrical cur-
rent, denoted by zltk ∈ Rnl ×3 at time tk .
2) Normal and FDIA Measurement Data: The power grid
‘1-HV-mixed-0-no sw’ contains 35136 demand profiles, with
one profile per 15 minutes for one year. To generate the
datasets which include the normal measurement and the FDIA
measurement, the commercial software PowerFactory 2017
SP4,1 the open source software Pandapower,2 and the bench-
mark SimBench 3 were utilised. The normal measurements
were obtained by calculating the power flow using the com-
mercial software PowerFactory 2017 SP4. The attacks were
launched on a target bus by modifying either its voltage angle
The proposed privacy-enhancing cross-silo FDIA detection or voltage magnitude. All of these FDIA measurement samples
is based on the classical federated learning framework have bypassed the residual-based data detection function of
FedAvg [1] with the privacy protection part on top. PowerFactory 2017 SP4.

VII. E MPIRICAL E VALUATION B. Training and Testing Setting

This section demonstrates the desirable utility and effi- There are 35136 normal measurement samples and 35136
ciency of the proposed cross-silo privacy-enhancing federated FDIA measurement samples, with normal measurement sam-
learning. In the following, we provide the description of the ples labelled 0 and FDIA samples labelled 1. In the training
measurement dataset and the transmission power grid system stage, 29952 normal samples and 29952 FDIA samples for the
which includes several subgrids controlled by local TGCs first 312 days are grouped as the training dataset; the other
and a SO who coordinates the federated learning process. 5184 normal and FDIA samples for the remaining 54 days
Following that is the training/testing setting and the discussion are used as the test dataset. In the federated learning training,
of the performance in terms of accuracy, training time and the number of global epochs was set to 200, the number
inference time. of local epochs was set to 5, the number of local batches
was set to 48, and the sequence number for LSTM layers is
set to 96. In each federated learning training round, 3 local
A. Description of Datasets sub-grids were randomly selected to collaboratively train the
1) Transmission Power Grid Test Set: A transmission power global model. The federated learning source code 4 and the
grid, ‘1-HV-mixed-0-no sw’, from the benchmark dataset popular deep learning framework Pytorch-1.9.05 were used
SimBench [39] was used to evaluate the FDIA detection. This to implement the proposed FDIA federated learning detection
power grid contains 64 buses, 58 loads, and 355 measure- framework for the model training and testing.
ments, with more details shown in Table VIII. This power grid Three commonly used metrics were applied to evaluate the
is divided into four sub-grids, with each sub-grid containing accuracy of the FDIA detection, namely precision, recall, and
16 buses, summarised as follows:
1 https://www.digsilent.de/en/powerfactory.html
• sub-grid S1 contains bus 62, 26, 130, 44, 94, 104, 74, 48, 2 https://www.pandapower.org/
106, 50, 12, 54, 52, 0, 56, 34, 3 https://simbench.readthedocs.io/en/stable/about/installation.html
• sub-grid S2 contains bus 64, 100, 128, 68, 110, 112, 126, 4 https://github.com/AshwinRJ/Federated-Learning-PyTorch
66, 122, 98, 124, 102, 38, 96, 116, 120, 5 https://pytorch.org/docs/1.9.0/
2550 IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, VOL. 18, 2023

TABLE IX TABLE XI
C ENTRALIZED T RAINED FDIA D ETECTION ACCURACY AVERAGE C OMPUTATIONAL T IME IN S ECONDS PER O NE G LOBAL
E POCH IN A S INGLE -P ROCESSING M ANNER

TABLE X
F EDAVG FDIA D ETECTION ACCURACY
TABLE XII
AVERAGE C OMPUTATIONAL T IME IN S ECONDS PER O NE G LOBAL
E POCH IN A M ULTI -P ROCESSING M ANNER W ITH 4 CPU S

F1 score, expressed by
encrypted aggregation model and decrypting it. In Table XI
 Nt p we provide the average computational time in seconds per

 Pr ecision = ,


 N tp + N f p one global epoch (one federated learning round) of our pro-
 Nt p posed privacy-enhancing FDIA detection federated learning
Recall = ,

 N tp + N f n in a single-processing manner. The local model training part
 F1 = 2 × Pr ecision × Recall ,


 without privacy protection consumes around 233 seconds. The
Pr ecision + Recall average extra time for the privacy protection part comprises
where N f p indicates the number of false positive, Nt p indi- 1- the time for the initial setting of the protection scheme
cates the number of true positive, N f n indicates the number of which is 16.41 seconds on average, 2- the computation time of
false negative, and Ntn indicates the number of true negative. local model protection which happens at the client side at every
federated learning round which is 12.35 seconds in average
per client per round, 3- the computation time of obtaining the
C. FDIA Detection Accuracy and Time Overhead
encrypted aggregation model and decrypting it which happens
We have compared the performance of the proposed solution at the server side at every federated learning round which is
(i.e. the federated learning trained model on encrypted local 12.14 seconds in average per round.
models from each local dataset) with the centralized trained To test the ability to accelerate the computation time, the
model on the whole plain dataset. The same model was trained, multiprocessing technique is implemented to partition the
without the proposed encryption scheme, in the centralized Singular Instruction Multiple Data (SIMD) computations of
way using the same hyperparameters in Section VII-B. The cryptography operations over model vectors onto 4 CPUs.
results of the centralized trained model on the whole plain Table XII illustrates the possibility of accelerating the speed by
dataset are summarized in Table IX. Table X is for the FDIA multiprocessing utilizing 4 CPUs. The computation overhead
detection accuracy of FedAvg FDIA detection algorithm on of local model protection in each federated learning round with
the test dataset. As can be seen from Table IX and Table X, security on top only incurs 5.56 seconds, i.e., 2.38% compared
there is no big difference in the accuracy. to 233 seconds of the underlying model without security.
The privacy-enhancing FedAvg FDIA detection version has The total extra time of the privacy protection component
the same accuracy as the original FedAvg FDIA detection running over 200 epochs of federated learning training in a
version. However, the average training time for each sub-grid single-processing manner is around 83 minutes, while in a
as well as for the whole system to get the weighted global multi-processing manner with 4 CPUs is around 36 minutes.
model is longer due to the complexity of privacy protection The implementation of our proposed scheme is well-suited
added for secure weighted aggregation. The average training for parallel computation. Thus, the extra computational time
time is collected by evaluating the framework in a Linux overhead that occurred from our privacy-protection component
system with each sub-grid using one Nvidia Tesla Volta V100- could be significantly reduced by using more CPUs that local
SXM2-32GB GPU. transmission grid operators are facilitated or from the cloud at
Encryption parameters are set as: λ = 2048 (modulus p in the very low price.6
the sub-protocol π0 is a 2048-bit prime), l1 = 256 (modulus From the communication analysis in Section V-C.2, with
N1 of the first encryption layer is 256-bit length integer), the above encryption parameter setting for the experiment and
l2 = 512 (modulus N2 of the second encryption layer is a the size of model vector is L = 132743, the download cost of
512-bit length), l p = 64. a client is less than k · λ + L · l p = 4 · 2048 + 132743 · 64 =
For each federated learning round, each TGC timed its 8503744 bits ≈ 8.5 Mbits = 1 Mbyte, the upload cost of
own part including the local model training part and the
privacy protection part; SO timed the section of obtaining the 6 https://aws.amazon.com/ec2/pricing/on-demand/
TRAN et al.: EFFICIENT PRIVACY-ENHANCING CROSS-SILO FEDERATED LEARNING AND APPLICATIONS 2551

a client is less than [(2k − 1) · λ] + (le1 + le2 + L · le1 ) = [3] F. Tramèr, F. Zhang, A. Juels, M. K. Reiter, and T. Ristenpart, “Stealing
(2 · 4 − 1) · 2048 + (512 + 1024 + 132743 · 512) ≈ 68 Mbits = machine learning models via prediction APIs,” in Proc. 25th USENIX
Secur. Symp., 2016, pp. 601–618.
8.5 Mbytes; [4] B. Hitaj, G. Ateniese, and F. Perez-Cruz, “Deep models under the GAN:
The model training is not a real-time process, thus we Information leakage from collaborative deep learning,” in Proc. ACM
can afford more time for transmission leading to a lower SIGSAC Conf. Comput. Commun. Secur., Oct. 2017, pp. 603–618.
bandwidth. If 1 second per iteration is used for uploading data [5] Z. He, T. Zhang, and R. B. Lee, “Model inversion attacks against
collaborative inference,” in Proc. 35th Annu. Comput. Secur. Appl. Conf.,
from a local party to the aggregator (resulting in 0.05 hours Dec. 2019, pp. 148–162.
of uploading data from a local party to the aggregator in [6] L. Melis, C. Song, E. De Cristofaro, and V. Shmatikov, “Exploiting
the whole training process with 200 epochs used in the unintended feature leakage in collaborative learning,” in Proc. IEEE
Symp. Secur. Privacy (SP), May 2019, pp. 691–706.
experiment), then the upload bandwidth requirement would
[7] N. Carlini, C. Liu, U. Erlingsson, J. Kos, and D. Song, “The secret
be 68Mbps. The network bandwidth for our campus office is sharer: Evaluating and testing unintended memorization in neural net-
900Mbps. works,” in Proc. 28th USENIX Secur. Symp., 2019, pp. 267–284.
In the inference stage, each sub-grid utilizes the trained [8] G. Hug and J. A. Giampapa, “Vulnerability assessment of AC state
global model to detect FDIAs individually. Time-series bus estimation with respect to false data injection cyber-attacks,” IEEE
Trans. Smart Grid, vol. 3, no. 3, pp. 1362–1370, Sep. 2012.
measurements Zbti and transmission line measurements Zlti are [9] G. Liang, J. Zhao, F. Luo, S. R. Weller, and Z. Y. Dong, “A review of
fed into the model, which is utilized to model the spatial- false data injection attacks against modern power systems,” IEEE Trans.
temporal relationship between bus and line measurements. The Smart Grid, vol. 8, no. 4, pp. 1630–1638, Jul. 2017.
model will output the likelihood of FDIAs in the current sub- [10] R. D. Christie, B. F. Wollenberg, and I. Wangensteen, “Transmission
management in the deregulated environment,” Proc. IEEE, vol. 88, no. 2,
grid. Detecting FDIA given a trained model (i.e., inference) in pp. 170–195, Feb. 2000.
the proposed scheme is 6.7 milliseconds on average, which is [11] F. Karmel, “Deregulation and reform of the electricity industry in
fast for relevant smart grid operations, e.g., state estimation. Australia,” Aust. Government-Dept. Foreign Affairs Trade, Barton,
ACT, Australia, Aust.-Jpn. Found. Grant 2017-18, 2018. [Online].
Available: https://www.dfat.gov.au/sites/default/files/deregulation-of-the-
VIII. C ONCLUSION energy-industry-australian-experience.pdf
In this paper, we propose a cross-silo privacy-enhancing [12] L. Sankar, “Competitive privacy: Distributed computation with privacy
guarantees,” in Proc. IEEE Global Conf. Signal Inf. Process., Dec. 2013,
federated learning which is secure in the honest-but-curious pp. 325–328.
adversarial model. With the main techniques of secure multi- [13] P. Kairouz et al., “Advances and open problems in federated learning,”
party computation based on double-layer encryption and secret Found. Trends Mach. Learn., vol. 14, nos. 1–2, pp. 1–210, Jun. 2021.
sharing, the scheme is efficient in communication and com- [14] C. Dwork and A. Roth, “The algorithmic foundations of differential pri-
vacy,” Found. Trends Theor. Comput. Sci., vol. 9, nos. 3–4, pp. 211–487,
putation overhead and robust against dropouts and rejoining. 2013.
The scheme removes the requirement of computing discrete [15] R. Shokri and V. Shmatikov, “Privacy-preserving deep learning,” in
logarithms or multiple non-colluding server settings which are Proc. 22nd ACM SIGSAC Conf. Comput. Commun. Secur., 2015,
pp. 1310–1321.
the limitations of some related works. In addition, the client’s
[16] R. C. Geyer, T. Klein, and M. Nabi, “Differentially private federated
secret keys of two encryption layers are generated by each learning: A client level perspective,” 2017, arXiv:1712.07557.
party in a decentralized manner which helps increase the level [17] A. G. Sébert, R. Sirdey, O. Stan, and C. Gouy-Pailler, “Protecting data
of privacy guarantee. We also firstly design and empirically from all parties: Combining FHE and DP in federated learning,” 2022,
arXiv:2205.04330.
evaluate a practical and efficient privacy-enhancing cross-silo
[18] E. Shi, T. H. Chan, E. Rieffel, R. Chow, and D. Song, “Privacy-
federated learning resilient to the local private data inference preserving aggregation of time-series data,” in Proc. NDSS, vol. 2, 2011,
attacks for FDIA detection in the smart grid domain. The pp. 1–17.
proposed scheme provides a framework which can be adapted [19] M. Joye and B. Libert, “A scalable scheme for privacy-preserving
aggregation of time-series data,” in Proc. Int. Conf. Financial Cryptogr.
to other domains. The analysis of security and the empirical Data Secur. Cham, Switzerland: Springer, 2013, pp. 111–125.
evaluation proves that the proposed scheme achieves prov- [20] K. Bonawitz et al., “Practical secure aggregation for privacy-preserving
able privacy against an honest-but-curious aggregator server machine learning,” in Proc. ACM SIGSAC Conf. Comput. Commun.
colluding with some clients while providing desirable model Secur., Oct. 2017, pp. 1175–1191.
[21] J. Guo, Z. Liu, K.-Y. Lam, J. Zhao, and Y. Chen, “Privacy-enhanced
utility in an efficient manner. In future works, we are going federated learning with weighted aggregation,” in Proc. Int. Symp. Secur.
to investigate more different adversarial models in various Privacy Social Netw. Big Data. Cham, Switzerland: Springer, 2021,
federated learning settings which is applicable for security in pp. 93–109.
cyber-physical systems. [22] H. Corrigan-Gibbs and D. Boneh, “Prio: Private, robust, and scalable
computation of aggregate statistics,” in Proc. 14th USENIX Symp.
Networked Syst. Design Implement., 2017, pp. 259–282.
R EFERENCES [23] H. Fereidooni et al., “SAFELearn: Secure aggregation for private
[1] B. McMahan, E. Moore, D. Ramage, S. Hampson, and B. A. y. Arcas, federated learning,” in Proc. IEEE Secur. Privacy Workshops (SPW),
“Communication-efficient learning of deep networks from decentral- May 2021, pp. 56–62.
ized data,” in Proc. 20th Int. Conf. Artif. Intell. Statistics. (AIS- [24] Y. Dong, X. Chen, L. Shen, and D. Wang, “EaSTFLy: Efficient and
TATS), Fort Lauderdale, FL, USA, in Proceedings of Machine secure ternary federated learning,” Comput. Secur., vol. 94, Jul. 2020,
Learning Research, vol. 54, A. Singh and J. Zhu, Eds. PMLR, Art. no. 101824.
Apr. 2017, pp. 1273–1282. [Online]. Available: http://proceedings.mlr. [25] C. Fang, Y. Guo, N. Wang, and A. Ju, “Highly efficient federated
press/v54/mcmahan17a?ref=https://githubhelp.com. learning with strong privacy preservation in cloud computing,” Comput.
[2] M. Fredrikson, S. Jha, and T. Ristenpart, “Model inversion attacks Secur., vol. 96, Sep. 2020, Art. no. 101889.
that exploit confidence information and basic countermeasures,” in [26] S. Truex et al., “A hybrid approach to privacy-preserving federated
Proc. 22nd ACM SIGSAC Conf. Comput. Commun. Secur., Oct. 2015, learning,” in Proc. 12th ACM Workshop Artif. Intell. Secur., Nov. 2019,
pp. 1322–1333. pp. 1–11.
2552 IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, VOL. 18, 2023

[27] R. Xu, N. Baracaldo, Y. Zhou, A. Anwar, and H. Ludwig, “Hybridalpha: Jiankun Hu (Senior Member, IEEE) is currently
An efficient approach for privacy-preserving federated learning,” in Proc. a Professor with the School of Engineering and
12th ACM Workshop Artif. Intell. Secur., Nov. 2019, pp. 13–23. IT, The University of New South Wales Can-
[28] J. H. Bell, K. A. Bonawitz, A. Gascón, T. Lepoint, and M. Raykova, berra at ADFA, Canberra, Australia. He is also
“Secure single-server aggregation with (poly)logarithmic overhead,” an invited Expert of Australia Attorney-General’s
in Proc. ACM SIGSAC Conf. Comput. Commun. Secur., Oct. 2020, Office, assisting the draft of Australia National
pp. 1253–1269. Identity Management Policy. He has received nine
[29] A. Madi, O. Stan, A. Mayoue, A. Grivet-Sebert, C. Gouy-Pailler, and Australian Research Council (ARC) Grants and has
R. Sirdey, “A secure federated learning framework using homomorphic served at the Panel on Mathematics, Information,
encryption and verifiable computing,” in Proc. Reconciling Data Anal., and Computing Sciences, Australian Research Coun-
Automat., Privacy, Secur., Big Data Challenge (RDAAPS), May 2021, cil ERA—The Excellence in Research for Australia
pp. 1–8. Evaluation Committee in 2012. His research interests are in the field of cyber
[30] W.-T. Lin, G. Chen, and Y. Huang, “Incentive edge-based federated security covering intrusion detection, sensor key management, and biometrics
learning for false data injection attack detection on power grid state esti- authentication. He has many publications in top venues, including the IEEE
mation: A novel mechanism design approach,” Appl. Energy, vol. 314, T RANSACTIONS ON PATTERN A NALYSIS AND M ACHINE I NTELLIGENCE,
May 2022, Art. no. 118828. the IEEE T RANSACTIONS ON C OMPUTERS, the IEEE T RANSACTIONS ON
[31] L. Zhao, J. Li, Q. Li, and F. Li, “A federated learning framework for PARALLEL AND D ISTRIBUTED S YSTEMS, the IEEE T RANSACTIONS ON
detecting false data injection attacks in solar farms,” IEEE Trans. Power I NFORMATION F ORENSICS AND S ECURITY, Pattern Recognition, and the
Electron., vol. 37, no. 3, pp. 2496–2501, Mar. 2022. IEEE T RANSACTIONS ON I NDUSTRIAL I NFORMATICS. He is a Senior Area
[32] Y. Li, X. Wei, Y. Li, Z. Dong, and M. Shahidehpour, “Detection of false Editor of the IEEE T RANSACTIONS ON I NFORMATION F ORENSICS AND
data injection attacks in smart grid: A secure federated deep learning S ECURITY.
approach,” IEEE Trans. Smart Grid, vol. 13, no. 6, pp. 4862–4872,
Nov. 2022.
[33] A. Shamir, “How to share a secret,” Commun. ACM, vol. 22, no. 11,
pp. 612–613, Nov. 1979.
[34] P. Paillier, “Public-key cryptosystems based on composite degree resid-
uosity classes,” in Proc. Int. Conf. Theory Appl. Cryptograph. Techn. Xuefei Yin received the B.S. degree from Liaoning
Cham, Switzerland: Springer, 1999, pp. 223–238. University, Liaoning, China, the M.E. degree from
[35] R. Deng, G. Xiao, R. Lu, H. Liang, and A. V. Vasilakos, “False Tianjin University, Tianjin, China, and the Ph.D.
data injection on state estimation in power systems—Attacks, impacts, degree from The University of New South Wales
and defense: A survey,” IEEE Trans. Ind. Informat., vol. 13, no. 2, Canberra at ADFA, Canberra, Australia. He is cur-
pp. 411–423, Apr. 2017. rently with the School of Information and Com-
[36] X. Yin, Y. Zhu, and J. Hu, “A subgrid-oriented privacy-preserving munication Technology, Griffith University, Gold
microservice framework based on deep neural network for false data Coast, QLD, Australia. He has published articles
injection attack detection in smart grids,” IEEE Trans. Ind. Informat., in top journals, including IEEE T RANSACTIONS
vol. 18, no. 3, pp. 1957–1967, Mar. 2022. ON PATTERN A NALYSIS AND M ACHINE I NTELLI -
[37] M. De Cock, R. Dowsley, A. C. A. Nascimento, D. Railsback, J. Shen, GENCE , IEEE T RANSACTIONS ON I NFORMATION
and A. Todoki, “High performance logistic regression for privacy- F ORENSICS AND S ECURITY, ACM Computing Surveys, IEEE T RANSAC -
preserving genome analysis,” BMC Med. Genomics, vol. 14, no. 1, TIONS ON I NDUSTRIAL I NFORMATICS , and IEEE I NTERNET OF T HINGS
pp. 1–18, Dec. 2021. J OURNAL. His research interests include biometrics, pattern recognition,
[38] Y. Lindell, “How to simulate it—A tutorial on the simulation privacy-preserving, and intrusion detection.
proof technique,” in Tutorials on the Foundations of Cryptog-
raphy (Information Security and Cryptography), Y. Lindell, Ed.
Cham, Switzerland: Springer, 2017. [Online]. Available: https://link.
springer.com/chapter/10.1007/978-3-319-57048-8_6.
[39] S. Meinecke et al., “SimBench—A benchmark dataset of electric power
systems to compare innovative solutions based on power flow analysis,” Hemanshu R. Pota received the B.E. degree from
Energies, vol. 13, no. 12, p. 3290, Jun. 2020. the Sardar Vallabhbhai Regional College of Engi-
neering and Technology, Surat, India, in 1979, the
M.E. degree from the Indian Institute of Science,
Bengaluru, India, in 1981, and the Ph.D. degree
Hong-Yen Tran is currently pursuing the Ph.D. from The University of Newcastle, NSW, Australia,
degree with the School of Engineering and IT, in 1985, all in electrical engineering. He is cur-
The University of New South Wales Canberra at rently an Associate Professor with The University
ADFA, Canberra, Australia. Her research interests of New South Wales Canberra at ADFA, Canberra,
are in the field of secure and verifiable computation, Australia. He has held visiting appointments with
applied cryptography in cyber-physical systems, and Columbia University, New York City, NY, USA; the
bio-cryptography. University of California at Los Angeles, Los Angeles; the University of
Delaware; Iowa State University; Kansas State University; Old Dominion
University; the University of California at San Diego, San Diego; and the
Centre for AI and Robotics, Bengaluru.

SAP BDEx Config Guide
100% (2)
SAP BDEx Config Guide
101 pages
AZURE AZ 500 STUDY GUIDE-1: Microsoft Certified Associate Azure Security Engineer: Exam-AZ 500
From Everand
AZURE AZ 500 STUDY GUIDE-1: Microsoft Certified Associate Azure Security Engineer: Exam-AZ 500
Mamta Devi
No ratings yet
An Efficient Privacy-Enhancing Cross-Silo Federated Learning and Applications For False Data Injection Attack Detection in Smart Grids
No ratings yet
An Efficient Privacy-Enhancing Cross-Silo Federated Learning and Applications For False Data Injection Attack Detection in Smart Grids
15 pages
Network Coding and Signcryption for Cloud Data Integrity
From Everand
Network Coding and Signcryption for Cloud Data Integrity
Noah Joan
No ratings yet
Intrusion Detection Based On Privacy-Preserving Federated Learning For The Industrial IoT
No ratings yet
Intrusion Detection Based On Privacy-Preserving Federated Learning For The Industrial IoT
10 pages
100percent Updated 1
No ratings yet
100percent Updated 1
13 pages
Cybersecurity in Cloud Computing
From Everand
Cybersecurity in Cloud Computing
Akula Achari
No ratings yet
A Detailed Survey On Federated Learning Attacks and Defenses
No ratings yet
A Detailed Survey On Federated Learning Attacks and Defenses
18 pages
IGNOU MCS 227 Cloud Computing and IoT Previous Years Solved Papers
From Everand
IGNOU MCS 227 Cloud Computing and IoT Previous Years Solved Papers
Manish Soni
No ratings yet
Federated Learning With Differential Privacy Algorithms and Performance Analysis
No ratings yet
Federated Learning With Differential Privacy Algorithms and Performance Analysis
16 pages
MDPI Article Template
No ratings yet
MDPI Article Template
19 pages
Sparse Malicious False Data Injection Attacks and Defense Mechanisms in Smart Grids
No ratings yet
Sparse Malicious False Data Injection Attacks and Defense Mechanisms in Smart Grids
12 pages
NIT Research Proposal1
No ratings yet
NIT Research Proposal1
2 pages
Khaliq Ur Rehman - 21109103 PSSG
No ratings yet
Khaliq Ur Rehman - 21109103 PSSG
1 page
Cloud Computing: The Untold Origins of Cloud Computing (Manipulation, Configuring and Accessing the Applications Online)
From Everand
Cloud Computing: The Untold Origins of Cloud Computing (Manipulation, Configuring and Accessing the Applications Online)
William Cormier
No ratings yet
Enterprise Data Protection with Rubrik: Definitive Reference for Developers and Engineers
From Everand
Enterprise Data Protection with Rubrik: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Privacy
No ratings yet
Privacy
14 pages
Quality Inference in Federated Learning With Secure Aggregation
No ratings yet
Quality Inference in Federated Learning With Secure Aggregation
8 pages
1differentially Private Federated Learning With An Adaptive Noise Mechanism
No ratings yet
1differentially Private Federated Learning With An Adaptive Noise Mechanism
14 pages
Sensors 23 01683
No ratings yet
Sensors 23 01683
21 pages
File - BB 6 15
No ratings yet
File - BB 6 15
10 pages
Scope On Detecting Constrained Backdoor Attacks in Federated Learning
No ratings yet
Scope On Detecting Constrained Backdoor Attacks in Federated Learning
14 pages
Comptia Cloud+ CV0 - 004: 715 Questions and Explanation
From Everand
Comptia Cloud+ CV0 - 004: 715 Questions and Explanation
Arabella Kushner
No ratings yet
Machine Learning Methods For Attack Detection in The Smart Grid Final
No ratings yet
Machine Learning Methods For Attack Detection in The Smart Grid Final
66 pages
SAFELearn Secure Aggregation For Private FEderated Learning
No ratings yet
SAFELearn Secure Aggregation For Private FEderated Learning
7 pages
COM3030 Week 10 Slides
No ratings yet
COM3030 Week 10 Slides
63 pages
Electronics 3103323 Peer Review v1
No ratings yet
Electronics 3103323 Peer Review v1
18 pages
Intrusion Detection Method Based On CNN-GRU-FL in A Smart Grid Environment
No ratings yet
Intrusion Detection Method Based On CNN-GRU-FL in A Smart Grid Environment
18 pages
A Novel Data Poisoning Attack in Federated Learning Based On
No ratings yet
A Novel Data Poisoning Attack in Federated Learning Based On
8 pages
8 A Privacy Preserving Federated Learning Scheme Using Homomorphic
No ratings yet
8 A Privacy Preserving Federated Learning Scheme Using Homomorphic
15 pages
Jsan 12 00013
No ratings yet
Jsan 12 00013
18 pages
Cloud-Based Multi-Modal Information Analytics
From Everand
Cloud-Based Multi-Modal Information Analytics
Tanushri Kaniyar
No ratings yet
Enhancing IoT Security: A Novel Approach With Federated Learning and Differential Privacy Integration
No ratings yet
Enhancing IoT Security: A Novel Approach With Federated Learning and Differential Privacy Integration
17 pages
Cloud: Get All The Support And Guidance You Need To Be A Success At Using The CLOUD
From Everand
Cloud: Get All The Support And Guidance You Need To Be A Success At Using The CLOUD
John Hawkins
No ratings yet
Privacy-Preserving Federated Learning Based On Differential Privacy and Momentum
No ratings yet
Privacy-Preserving Federated Learning Based On Differential Privacy and Momentum
6 pages
Journal Pone 0316536
No ratings yet
Journal Pone 0316536
39 pages
KFRNN An Effective False Data Injection Attack Detection in Smart Grid Based On Kalman Filter and Recurrent Neural Network
No ratings yet
KFRNN An Effective False Data Injection Attack Detection in Smart Grid Based On Kalman Filter and Recurrent Neural Network
12 pages
Split-Fed Learning A Deep Dive Into Methods Innova
No ratings yet
Split-Fed Learning A Deep Dive Into Methods Innova
24 pages
Bharati Et Al 2022 Federated Learning Applications Challenges and Future Directions
No ratings yet
Bharati Et Al 2022 Federated Learning Applications Challenges and Future Directions
17 pages
Cloud computing: Moving IT out of the office
From Everand
Cloud computing: Moving IT out of the office
BCS, The Chartered Institute for IT
No ratings yet
A Hybrid Approach To Privacy-Preserving Federated Learning
No ratings yet
A Hybrid Approach To Privacy-Preserving Federated Learning
11 pages
Federated Learning Attacks and Defenses: A Survey
No ratings yet
Federated Learning Attacks and Defenses: A Survey
10 pages
MS-FL A Federated Learning Framework Based On Mult
No ratings yet
MS-FL A Federated Learning Framework Based On Mult
12 pages
Robust Aggregation For Federated Learning
No ratings yet
Robust Aggregation For Federated Learning
13 pages
Architecting Distributed Applications with Macrometa: The Complete Guide for Developers and Engineers
From Everand
Architecting Distributed Applications with Macrometa: The Complete Guide for Developers and Engineers
William Smith
No ratings yet
(24.07) Combining Federated Learning and Control A Survey
No ratings yet
(24.07) Combining Federated Learning and Control A Survey
19 pages
A Survey of Security Threats in Federated Learning
100% (1)
A Survey of Security Threats in Federated Learning
26 pages
Ieee Melecon 2024
No ratings yet
Ieee Melecon 2024
6 pages
A Survey On Security and Privacy of Federated Learning
No ratings yet
A Survey On Security and Privacy of Federated Learning
61 pages
Enterprise Data Protection with Veritas Technologies: Definitive Reference for Developers and Engineers
From Everand
Enterprise Data Protection with Veritas Technologies: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Hybridalpha: An Efficient Approach For Privacy-Preserving Federated Learning
No ratings yet
Hybridalpha: An Efficient Approach For Privacy-Preserving Federated Learning
11 pages
Denver
No ratings yet
Denver
83 pages
A Survey On Privacy-Preserving Federated Learning Against Poisoning Attacks
No ratings yet
A Survey On Privacy-Preserving Federated Learning Against Poisoning Attacks
18 pages
A Robust Privacy-Preserving Federated Learning Model Against Model Poisoning Attacks
No ratings yet
A Robust Privacy-Preserving Federated Learning Model Against Model Poisoning Attacks
16 pages
1 s2.0 S0950705121000381 Main
No ratings yet
1 s2.0 S0950705121000381 Main
11 pages
InfoSec Project Report
No ratings yet
InfoSec Project Report
7 pages
The Ultimate Guide to Unlocking the Full Potential of Cloud Services: Tips, Recommendations, and Strategies for Success
From Everand
The Ultimate Guide to Unlocking the Full Potential of Cloud Services: Tips, Recommendations, and Strategies for Success
Rick Spair
No ratings yet
Bus 14
No ratings yet
Bus 14
185 pages
Futureinternet 13 00073 v2
No ratings yet
Futureinternet 13 00073 v2
14 pages
Detecting False Data Injection Attacks in Smart Grids A Semi-Supervised Deep Learning Approach
No ratings yet
Detecting False Data Injection Attacks in Smart Grids A Semi-Supervised Deep Learning Approach
12 pages
Curiefense for Cloud-Native Application Security: Definitive Reference for Developers and Engineers
From Everand
Curiefense for Cloud-Native Application Security: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Iot Unit - 4 - PM
No ratings yet
Iot Unit - 4 - PM
6 pages
Iot Unit - 3 - PM
No ratings yet
Iot Unit - 3 - PM
6 pages
Campus Abnormal Behavior Recognition With Temporal Segment Transformers
No ratings yet
Campus Abnormal Behavior Recognition With Temporal Segment Transformers
14 pages
A Multi Perspective Fraud Detection Method For Multi Participant E Commerce Transactions
No ratings yet
A Multi Perspective Fraud Detection Method For Multi Participant E Commerce Transactions
6 pages
IJNRD2404218
No ratings yet
IJNRD2404218
5 pages
File Converter - Requirements
No ratings yet
File Converter - Requirements
6 pages
HikCentral Access Control Brochure
No ratings yet
HikCentral Access Control Brochure
2 pages
COMP-533 4 Use Cases Exercises Solution
No ratings yet
COMP-533 4 Use Cases Exercises Solution
18 pages
Message
No ratings yet
Message
2 pages
Id-11659 Scrapping Web
No ratings yet
Id-11659 Scrapping Web
295 pages
Stock Analysis Spreadsheet (10YR, 2024) (Vers 4.2) PUBLIC
No ratings yet
Stock Analysis Spreadsheet (10YR, 2024) (Vers 4.2) PUBLIC
17 pages
Unit 5 Exception Handling
No ratings yet
Unit 5 Exception Handling
36 pages
Ijmet 08 10 013
No ratings yet
Ijmet 08 10 013
7 pages
SQL Class-12
No ratings yet
SQL Class-12
12 pages
CCNA 200-301 Chapter 27 Analyzing Cisco Wireless Architectures
No ratings yet
CCNA 200-301 Chapter 27 Analyzing Cisco Wireless Architectures
17 pages
Accomplishment Report Ict
100% (2)
Accomplishment Report Ict
2 pages
RAJU
No ratings yet
RAJU
25 pages
DSE 3 Unit 3
No ratings yet
DSE 3 Unit 3
4 pages
22A Oracle Responsys Express - Release Overview - External
No ratings yet
22A Oracle Responsys Express - Release Overview - External
22 pages
BBP Template
No ratings yet
BBP Template
8 pages
A Study On Windows Mobile 6.5 Operation System
No ratings yet
A Study On Windows Mobile 6.5 Operation System
13 pages
Executor
No ratings yet
Executor
2 pages
Practical Unit Testing With JUnit and Mockito (Kaczanowski, Tomek) (Z-Library)
No ratings yet
Practical Unit Testing With JUnit and Mockito (Kaczanowski, Tomek) (Z-Library)
299 pages
Getting Started
No ratings yet
Getting Started
14 pages
IRS Imp
No ratings yet
IRS Imp
76 pages
Operating Manual Programming and Diagnostic Tool
No ratings yet
Operating Manual Programming and Diagnostic Tool
40 pages
Parrot Linux Command
100% (2)
Parrot Linux Command
3 pages
Omori Headspace Vinyl Figure Collection - Omocat
No ratings yet
Omori Headspace Vinyl Figure Collection - Omocat
1 page
AFF A400 Systems: Installation and Setup Instructions
No ratings yet
AFF A400 Systems: Installation and Setup Instructions
5 pages
Executive Assistant Syllabus Recruitment of Officers and Junior Executive Officers 2025 26
No ratings yet
Executive Assistant Syllabus Recruitment of Officers and Junior Executive Officers 2025 26
1 page
I Want A Research Paper On Gasless Transactions Using The Gas Station Network and Multi Chain Access
No ratings yet
I Want A Research Paper On Gasless Transactions Using The Gas Station Network and Multi Chain Access
9 pages
DM Trends
No ratings yet
DM Trends
49 pages
SAP Module 2
No ratings yet
SAP Module 2
27 pages
HollySys - Introduction V1.2 - 2021
No ratings yet
HollySys - Introduction V1.2 - 2021
42 pages

An Efficient Privacy-Enhancing Cross-Silo Federated Learning and Applications For False Data Injection Attack Detection in Smart Grids

Uploaded by

An Efficient Privacy-Enhancing Cross-Silo Federated Learning and Applications For False Data Injection Attack Detection in Smart Grids

Uploaded by

2538 IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, VOL.

An Efficient Privacy-Enhancing Cross-Silo

F EDERATED learning [1] is an emerging machine learning

TABLE I federated learning is the federated learning setting when clients

C. Shamir Secret Sharing

f (x) = a0 + a1 x + a2 x 2 + · · · + at≤1 x t≤1 mod p

• Sharing: The secret holder computes s (i) = f (i) for

TABLE III risk of inference attacks performed by an honest-but-curious

B. High-Level Technical Overview

Fig. 1. Secure weighted model aggregation procedure for one epoch.

aggregator broadcasts the updated set of the public keys – If Uat ⊂ U:

O(L+[k]), upload cost (i.e, sending messages) is [(k−1)·lss ]+ TABLE VI

VI. P RIVACY-E NHANCING C ROSS -S ILO F EDERATED

VII. E MPIRICAL E VALUATION B. Training and Testing Setting

You might also like