Uncertainty Propagation in Complex Networks (Chaos, 2020)
Uncertainty Propagation in Complex Networks (Chaos, 2020)
COLLECTIONS
Paper published as part of the special topic on Symmetry and Optimization in the Synchronization and Collective
Behavior of Complex Systems
Note: This paper is part of the Focus Issue on Symmetry and Optimization in the Synchronization and Collective
Behavior of Complex Systems.
© 2020 Author(s).
Chaos ARTICLE scitation.org/journal/cha
AFFILIATIONS
1
Departament d’Enginyeria Informàtica i Matemàtiques, Universitat Rovira i Virgili, 43007 Tarragona, Catalonia, Spain
2
Center for Complexity Science, University of Warwick, Coventry CV4 7AL, United Kingdom
3
The Alan Turing Institute, London NW1 2DB, United Kingdom
Note: This paper is part of the Focus Issue on Symmetry and Optimization in the Synchronization and Collective Behavior of
Complex Systems.
a)
Author to whom correspondence should be addressed: [email protected]
ABSTRACT
Many complex networks are built up from empirical data prone to experimental error. Thus, the determination of the specific weights of
the links is a noisy measure. Noise propagates to those macroscopic variables researchers are interested in, such as the critical threshold
for synchronization of coupled oscillators or for the spreading of a disease. Here, we apply error propagation to estimate the macroscopic
uncertainty in the critical threshold for some dynamical processes in networks with noisy links. We obtain closed form expressions for the
mean and standard deviation of the critical threshold depending on the properties of the noise and the moments of the degree distribution
of the network. The analysis provides confidence intervals for critical predictions when dealing with uncertain measurements or intrinsic
fluctuations in empirical networked systems. Furthermore, our results unveil a nonmonotonous behavior of the uncertainty of the critical
threshold that depends on the specific network structure.
Published under license by AIP Publishing. https://doi.org/10.1063/1.5129630
The critical point (or threshold) refers to the minimum value of a I. INTRODUCTION
tuning parameter that triggers a phase transition in a dynamical The study of critical phenomena has been, and still is, a
process. This value becomes an uncertain quantity when noise is fruitful area of research in network science.1 Critical phenom-
present in the underlying network of interactions. For the class of ena in networks include a wide set of aspects, from structural
processes where the threshold is determined by the inverse of the changes in networks, or percolation phenomena,2 to epidemic3 or
largest eigenvalue of the adjacency matrix, we apply error prop- synchronization4 thresholds and many other phase transitions in
agation to estimate the critical uncertainty induced by quenched dynamical processes defined on networks.1,5 The estimation of the
noise on the network weights. We obtain closed form expressions critical threshold is of upmost importance to predict the onset of the
for the mean and standard deviation of the threshold depending phase transition, and hence a major concern in several applications,
on the properties of the noise and the moments of the network such as the containment of an infectious disease6 or the control of
degree distribution. Our results can be used in practical situa- synchronization in the power grid.7,8 However, an accurate estima-
tions, to provide confidence intervals for the predictions of the tion of the threshold is often elusive and costly because it depends on
critical threshold when dealing with uncertain measurements or the particular details of the whole network structure, usually through
intrinsic fluctuations in a networked system. Furthermore, the its eigenvalues.
results unveil several noise-amplifying properties of the networks, As network science becomes more and more extended, its
including a nonmonotonous behavior of the uncertainty of the potential applications grow fueled by the necessity of analyzing data
threshold depending on the heterogeneity and density of the net- produced in diverse fields of research, such as sociology, biology,
work. Accordingly, our analysis predicts the existence of particu- experimental physics, etc. However, the data collected in any of
lar structures that maximize the uncertainty of the threshold only the former fields are not free from experimental error, induced,
due to small noise in the weights, without altering the underlying for example, by sampling biases, device accuracy, or mistakes in
structure of the links. data entry. Nevertheless, the literature on network science usually
dismisses these error sources and produces results that are only valid from random matrix theory15,16 and spectral graph theory.17,18 How-
if data is error free. Some authors have concentrated their attention ever, it becomes very challenging to study noisy sparse networks
on the inference of missing data in networks;9–12 however, to the best with arbitrary degree distributions in these frameworks. Here, we
of our knowledge, no similar attention has been paid to the propaga- use an alternative approach, based on applying error propagation to
tion of uncertainty from the structure to the properties of dynamical the mean-field approximation (MFA) of Eq. (1). This approxima-
processes running on it. tion obviously restricts the validity range of the analysis; however,
The lack of works devoted to the analysis of error propagation the results are found to be very accurate in some scenarios, and,
in networks is probably due to the fact that many studies consider more importantly, they provide clear analytical insight on how the
networks unweighted, where a link is a binary variable denoting uncertainty in the structure affects the determination of the critical
its existence or not. However, the vast majority of networks are threshold.
weighted, i.e., the existence or not is valued by its intensity. The Our derivation starts assuming a mean-field approach. For
accurate determination of the weight is unlikely, and, therefore, simplicity, we restrict to the case of undirected (symmetric) net-
the error in their numerical values will influence any particular works. Under the aforementioned conditions, the critical threshold
measurement of the network properties. in Eq. (1) can be approximated19–22 by
Here, we present a study of error propagation in networks
hsi
where links are subject to uncertainty in their weights and wonder Kc = , (2)
about the effect that this uncertainty will have in the determination hs2 i
of the critical threshold. In particular, we focus on those dynamical where hsn i is the n-moment of the strength distribution (the strength
processes in which the critical point is known to be inversely propor- of a node is the sum of in-coming/out-going weights). Equation (2)
tional to the largest eigenvalue of the connectivity matrix. In Sec. II, can also be obtained directly from the equations of motion of
we present the particularities of our analysis and derive our main the dynamical process (for instance, in the Kuramoto model20) by
results; in Sec. III, we study the range of uncertainty in the critical assuming that the local field in a node is proportional to the global
point for different network structures; and, finally, in Sec. IV, we field weighted by the in-strength of the node.4 For the rest of the
discuss the implications and limitations of the current study, paving article, we will refer to Eq. (2) as the mean-field approximation
the way for new analysis to come. (MFA).
First, we test the accuracy of the critical threshold in the MFA,
II. PROPAGATION OF UNCERTAINTY IN THE CRITICAL
Eq. (2), by comparing it to Eq. (1), in Erdös–Rényi networks with
THRESHOLD uncertainty in the weights. In Fig. 1, we plot the threshold distribu-
We consider a dynamical process running on top of a complex tion for two different values of the intensity of the uncertainty σ . We
network with N units. We restrict the study to the class of dynam- observe that the MFA accurately determines the distribution, and
ical models in which a phase transition occurs at a critical value of that the values of the expected critical threshold Kc and its variance
the coupling intensity (the threshold), and where this value is given are clearly dependent on σ . In general, we expect our results to be
in terms of the largest eigenvalue λmax of the network connectiv- accurate in the cases in which the approximation of Eq. (2) remains
ity matrix C whose values represent the weighted structure of the valid.
network.13 Explicitly, we assume that Using Eq. (2), we can express Kc in terms of the moments of
K0 the degree distribution and noise parameters. The detailed calcula-
Kc = , (1) tions are shown in Appendix A. We obtain
λmax (C)
PN
where K0 is a constant that depends on the specific details of µi ki
Kc = PN 2 2 i=1 PN , (3)
the particular process. Without loss of generality, we fix K0 = 1. i=1 µi (ki − ki ) + i=1 hw ii ki
2
Equation (1) estimates the threshold for a wide variety of dynam-
ical processes, including the synchronization of heterogeneous where µi is the average weight of node i, and hw2 ii the average
phase-oscillators,4 the onset of endemicity of a disease in epidemic second moment of the weight distribution for node i. In random
models,3,14 and the phase transition in the Ising model in networks, homogeneous networks, for sufficiently large degree (ki 1), we
to name a few.1,2,5 The aim of this work is to understand how small can approximate µi = µ, and hw2 ii = σ 2 + µ2 in Eq. (3). This
noise in the entries of C affects the statistical properties of the macro- approximation allows to write down a simple relation between the
scopic threshold given by Eq. (1), without looking into the details of mean of the critical threshold and the uncertainty of the network as
a specific dynamical model. For the sake of simplicity, we assume µhki
that the quenched noise in the entries is gaussian and uncorrelated, hKc i ≈ . (4)
µ2 hk2 i + σ 2 hki
where each weight is independently drawn from a normal distribu-
tion N (µ, σ 2 ), being µ > 0 the average weight and σ 2 its variance. Interestingly, the naïve approximation in Eq. (4) already informs
Nevertheless, the proposed analysis can be extended to other dis- that the critical threshold decreases as the noise intensity σ
tributions of noise, either theoretical or obtained through empirical increases. This can be understood because the noise increases the
measurements, if the covariance matrix of the noise is known (see structural heterogeneity of the network, and heterogeneity tends to
Appendix B for details). make the critical threshold to vanish.3,4,22 Note that for µ = 1 and
To study the exact statistics of Kc in Eq. (1) induced by the σ = 0, we recover the usual threshold for unweighted, undirected
presence of noise, one could use in principle the available tools networks21 and for σ 1, hKc i ≈ hki/µhk2 i.
FIG. 2. Numerics [Eq. (1)] vs theory [Eqs. (4) and (6)]: mean and standard devi-
ation of the threshold Kc depending on the noise intensity σ for an Erdös–Rényi
network with N = 200, p = 0.3, µ = 1, and 5000 independent realizations for
each value of the noise intensity σ .
and for the same noise parameters µ and σ . After some algebra, the
condition for a given network to display higher uncertainty in Kc
than a random regular network reads
hk2 i2 hk2 i2
hk3 i > 1+ . (11)
2hki hki4
Now, we can use Eq. (11) to evaluate the role of heterogene-
ity. Let us consider a power-law distribution p(k) ≈ k−γ , where
the exponent γ controls the tail of the distribution. For the
FIG. 3. Numerics vs theory: standard deviation of the critical threshold δKc value γ = 3, one recovers the well-known scale-free network that
depending on the noise intensity σ with µ = 1 for a (left) fixed Erdös–Rényi net- emerges from preferential attachment.26 For lower (higher) values
work (N = 200, hki = 60, p = 0.3) and (right) the empirical network of airports of γ , the network becomes more (less) heterogeneous. For a finite
(N = 3154, hki ≈ 12) for 2000 independent realizations for each value of the power-law network, the moments of the degree distribution are
noise. Results have been rescaled by N.
given by
n−γ +1 n−γ +1
(−γ + 1)(kmax − kmin )
hkn i = γ +1 γ +1
. (12)
the theory for particular networks, and we refer the reader to the (n − γ + 1)(kmax − kmin )
literature19,21,22,25 for details on the validity of the MFA. Moreover, it
is important to remark that even if the MFA holds, the method of By fixing the value of kmin , we can explore the space of networks with
error propagation (at any order) can only be applied in our problem a given (γ , kmax ), thus revealing the effect of heterogeneity and size.
when the mean of the signal µ is sufficiently large compared to the To simplify the visualization, we define
noise.
3
2hkihk i
q = log 2 2
. (13)
III. THE ROLE OF THE STRUCTURE IN THE ERROR hk2 i2 1 + hkhkii4
PROPAGATION
This way, when q = 0, the uncertainty of the critical threshold of a
The network structure plays an important role in the uncer- network is the same than that of the regular one, and for positive
tainty range of Kc . After the finding of Eq. (6), some interesting (negative) values of q, we are measuring an increase (decrease) of
questions arise: does the heterogeneity induce an increase of the δK with respect to the homogeneous network. In Fig. 4, we show
critical fluctuations with respect to a homogeneous network? Is the the theoretical results obtained for the q value of networks in the
behavior of (δKc ) monotonous with the moments of the degree dis- space (γ , kmax ). We note that the three horizontal lines correspond
tribution of the network? If not, is there any particular structure that to the cases where the network has an integer exponent of 2, 3,
maximizes the uncertainty of the critical point induced by noise in or 4. In these cases, the first, second, or third moments diverge.
the weights?
To answer these questions, we consider the regime where net-
works are sufficiently large and σ µ. Then, we can approximate
Eq. (6) by its leading term, neglecting terms in σ larger than O(σ 2 )
2hkihk3 i − hk2 i2
(δKc )2 ≈ 2σ 2 hKc i4 . (9)
Nhki3
Note that δKc increases linearly with the noise intensity and scales
with hKc i2 . We know that hKc i2 is reduced by the heterogeneity
of the degree distribution,3,4,22 and, therefore, one would expect
δKc to follow the same trend. However, the nonlinear depen-
dence on the moments of the degree distribution could change this
intuition.
To understand this effect, we choose first as a reference the
most homogeneous network we can consider, a regular network, i.e.,
ki = k, ∀i. We compute Kc and δKc for a regular network, obtaining
1
hKc ireg ≈ ,
µk
(10)
2σ 2
(δKc )2reg ≈ .
Nµ4 k3 FIG. 4. Colormap showing the theoretical dependence of q on the exponent γ
2 and the maximum degree of the network kmax . The value of kmin is fixed to kmin = 5
The role of the heterogeneity will be detected by comparing (δKc ) and the resolution of the map is 100 × 100.
with (δKc )2reg for networks with the same size and average degree,
γ ≈ 3 (γ ≈ 2) emerge as the structures that maximize (minimize) APPENDIX B: CALCULATION OF THE VARIANCE
the uncertainty of the threshold, due to an interplay between the The propagation of uncertainty of a nonlinear function of
second and third moment of the degree distribution. the random variables as Eq. (3) requires to use a truncated Tay-
lor expansion.24 Up to second-order, and in the notation used
ACKNOWLEDGMENTS in the main text, the approximate variance of the function is
L.A.-F. and A.A. acknowledge the Spanish MINECO (Grant given by
No. PGC2018-094754-B-C2). A.A. also acknowledges the Generali- 1
tat de Catalunya (Grant No. 2017SGR-896), the Universitat Rovira i (δKc )2 ≈ JT0 VJ0 + Tr[(H0 V)2 ], (B1)
2
Virgili (Grant No. 2017PFR-URV-B2-41), and the ICREA Academia
and the James S. McDonnell Foundation (Grant No. 220020325). where the Jacobian vector and the Hessian matrix are evaluated at
G.M.-D. acknowledges funding from the Engineering and Phys- the mean values of the random variables µ E and hwE2 i. The Jacobian
ical Sciences Research Council (EPSRC) and MRC Centre for of the system in Eq. (3) is
Doctoral Training in Mathematics for Real World Systems (No.
EP/L015374/1) and The Alan Turing Institute under the EPSRC ∂Kc ∂Kc ∂Kc ∂Kc
J= ,..., , ,..., . (B2)
Grant (No. EP/N510129/1). B.S. acknowledges financial support ∂µ1 ∂µN ∂hw2 i1 ∂hw2 iN
from the European Union’s Horizon 2020 research and innovation
programme under the Marie Skodowska-Curie Grant Agreement First, we compute the partial derivatives in Eq. (B2) explicitly from
No. 713679 and from the Universitat Rovira i Virgili (URV). Eq. (3), obtaining
∂ 2 Kc 1
N N N
1 X X X ≈ 2 3 [−Qki kj + 4µ2 hkikj (k2i − ki )], (B5)
= aij w2ij + aij aik wij wik . (A2) ∂µi ∂hw ij
2 NQ
N i=1 j=1
j6=k
and for the last term, we have
aij aik = k2i − ki , we obtain
P
Noticing that j6=k
∂Kc 2µki kj hki
" N N
# ≈ . (B6)
2 1 X 2 2 X
2
∂hw2 ii ∂hw2 ij N2 Q3
hs i = µ (k − ki ) + hw ii ki , (A3)
N i=1 i i i=1 For the covariance matrix, we can obtain explicit expression for
where hw2 ii is the average second moment of the i-node. Plugging the entries (V)ij when the noise in the weights is assumed gaus-
Eqs. (A1) and (A3) into Eq. (2) in the main text, we obtain sian and uncorrelated. By assumption, the network is symmetric and
PN so it will be the covariance matrix, which can be written in block
µ i ki form as
Kc = PN 2 2 i=1 PN , (A4)
i=1 µi (ki − ki ) + i=1 hw ii ki
2
vµ 2
vµ,hw2 i
V= 2 ,
which correspond to Eq. (3) in the main text. vµ,hw2 i vhw2 i
32 34
D. R. Chialvo, “Emergent complex neural dynamics,” Nat. Phys. 6, 744–750 S. Jalan and J. Bandyopdhyay, “Random matrix analysis of complex networks,”
(2010). Phys. Rev. E 76, 046107 (2007).
33 35
L. Arola-Fernández, A. Díaz-Guilera, and A. Arenas, “Synchronization invari- C. Sarkar and S. Jalan, “Spectral properties of complex networks,” Chaos
ance under network structural transformations,” Phys. Rev. E 97, 060301 (2018). 28, 102101 (2018).