Abstract
We consider an extension of a binary decision model in which nodes make decisions based on influence-biased averages of their neighbors’ states, similar to Ising spin glasses with on-site random fields. In the limit where these influences become very heavy-tailed, the behavior of the model dramatically changes. On complete graphs, or graphs where nodes with large influence have large degree, this model is characterized by a new ‘phase’ with an unpredictable number of macroscopic shocks, with no associated critical phenomena. On random graphs where the degree of the most influential nodes is small compared to population size, a predictable ‘glassy’ phase without phase transitions emerges. Analytic results about both of these new phases are obtainable in limiting cases. We use numerical simulations to explore the model for more general scenarios. The phases associated with very influential decision makers are easily distinguishable experimentally from a homogeneous influence phase in many circumstances, in the context of our simple model.
1. Introduction
Qualitative understanding of the behavior of many social systems has been afforded by statistical physics and the study of systems on complex networks [4, 9, 29]. In particular, one of the major hopes of sociophysics and econophysics is to observe a ‘thermodynamic’ limit in markets and social dynamics as the population size gets large [7], similarly to how a thermodynamic limit arises in a typical disordered system such as a spin glass [14], which may allow for quantitative predictions about the behavior of social systems. One of the most interesting applications of statistical physics in this field is to the study of market crashes and other phase transitions in decision making processes. These are phenomena which are challenging to describe using classical economics, yet easily understood—at least conceptually—from the perspective of disordered systems [7].
Recently, [22] introduced a binary decision model which admitted a very simple solution on typical random graphs. The decision model proposed was a variant of the random field Ising model which has been used frequently to model decision making processes [8, 18, 17] due to disorder induced hysteresis and avalanche dynamics [30]. Other models of cascades and global avalanches [13, 31] such as the fiber bundle model [27] have also been employed, and share some qualitative features as well. Although many of these models have focused on the avalanche dynamics near or at a phase transition, [22] emphasized in addition a seemingly universal ‘glassy’ phase to these models, which was remarkably simple to understand on locally tree-like graphs.
The aim of this paper is to describe a modification of the model in [22] to allow for influential decision making, where a given node does not weight its neighbors equally when determining its own state. Similar models using the random field Ising model were studied in the past [19], although not beyond the level of mean field theory. Influential decision making has long been assumed in consumer research—for example, most prevalent is the so-called ‘two-step flow’ model, which assumes that information first passes through influential decision makers, who then filter it to a cluster of followers [12, 20, 28]. However, the presence of influential decision makers is not required for shocks and large cascades, as pointed out by the physics community [32]. Since the question of just how important influential individuals are in decision making processes is still open (for some recent experimental studies, see [2, 10, 33]), it is worth exploring more carefully the role that influence could play in decision making processes.
In this paper, we provide a conceptual framework for understanding the effects of influence in decision making. We show that interplay between network structure and the influence of given nodes allows for a variety of possible outcomes—three of which exist as solvable limits, and which we detail in this paper. The first possible outcome is that influence is a negligible effect, which gives us the phases of [22]. The second possible outcome is unpredictable decision making, in which shocks occur without critical phenomena, and the decision making process is entirely controlled by a few nodes. This limit occurs when there are nodes which are both heavily connected and have a large fraction of the global influence. The third limit corresponds to the existence of a single, very ‘glassy’ phase, independent of the network structure, which corresponds to non-interacting clusters following single influential nodes. This limit occurs when there are nodes that carry heavy influence but do not have large degree.
The outline of this paper is as follows. In section 2 we describe our model, and in section 3 we review basic information about heavy-tailed distributions of random variables. Section 4 describes the unpredictable decision making phase in the simple case where the network is a complete graph, which dramatically simplifies the analysis. Section 5 describes the rich variety of scenarios which are possible on random graphs, including the limit when influence becomes weak. We conclude by discussing the relevance of our results to possible experiments.
2. Binary decision making with influential nodes
Let us recall the binary decision model presented in [22], which is similar to the random field Ising model. For more details and figures one can refer to this reference; here we will discuss the highlights and not include derivations of any results.
We consider an undirected graph with nodes labeled by v. We will denote the degree of v, which is the number of edges entering v, with kv. On each vertex, we place a bit xv∈{0,1} for each vertex v. The phrase binary decision making comes from the fact that there only are two possible states, and no intermediate ‘wavering’ state. The rules are that

where Θ(x) is the step function which is 1 if x ≥ 0 and 0 if x < 0, p is a global ‘magnetic field’ and Pv correspond to quenched random variables (the ‘internal magnetic fields’). h is a monotonically increasing function which we take to be identical for all nodes. In this paper, we always take

for A > 0, for simplicity. Finally, Qv corresponds to the fraction of the neighbors u of the node v which are in state 1:

The intuition for equation (1) is simple: each node compares the global field p to some internal field Pvh(Qv): social interactions imply that the internal field it sees is dependent on the actions of its neighbors. If the global field is smaller than this internal field, it will adopt state 1; otherwise it will adopt state 0. As a simple example, we can imagine modeling someone choosing to buy a good (xv = 1) if the price p is below the perceived value of that good Pvh(Qv), and not buying (xv = 0) otherwise. This model has non-trivial behavior because not only is Pv different for each node, but the perceived values are dependent on the states of other nodes.
On a complete graph, or in mean field approximation, the solution to this model is quite simple: it is straightforward to see that if

then the mean field states, characterized by an overall fraction q of states in state 1, are determined by the solutions to the equation

Numerically, equation (1) can be solved from an arbitrary initial condition by simply iterating through every single node, and flipping its state whenever equation (1) is violated. This will always converge to an equilibrium state; though as we will discuss shortly, there are generically many equilibria that can be reached. A drastic simplification occurs whenever we can start from an equilibrium state: in this case, if we increase p, we know that the only nodes which will flip are nodes that will flip from 1 to 0, and oppositely if we decrease p. In this paper our simulations will exploit this simplification.
It is often the case that equation (5) admits multiple solutions. A typical scenario (referred to henceforth as bistable) is that there are three solutions: q1(p) < q2(p) < q3(p), with q1 and q3 stable equilibria, and q2 an unstable equilibrium. These solutions certainly depend on the value p, as the notation suggests. This bistability is analogous to the way that the simple Ising model in an external magnetic field can admit multiple stable equilibria. Furthermore, as we increase the value of p, we usually find that eventually, the solutions q2 and q3 approach each other, and simultaneously disappear. If one was sitting at the stable equilibrium q3 as p increased, past a critical point one would suddenly find that q1 < q3 is the only stable equilibrium—this corresponds to a discontinuous phase transition, or in economic language, a market crash. This is analogous to the behavior of the standard Ising model, and is not surprising. One can study the stability of solutions to equation (5) by studying the parameter

If α < 1, the equilibrium is stable, and if α > 1 it is unstable. In fact, there is a microscopic understanding of this result: one can show that the probability that flipping node u’s state will cause one of its neighbors to flip to the state is equal to α/〈k〉, where 〈k〉 is the expected degree (average number of edges) of a node in the graph. From here, one finds under typical circumstances that the expected size of an avalanche—the cumulative number of nodes who have flipped their state, due to the flip of a single node—is 1/(1 − α), so long as α < 1. Because this is a finite effect, the equilibrium is thus stable when α < 1. In such a regime, the effect of such an avalanche can be modeled under linear response, which is simply a way of saying that all macroscopic perturbations to q need only be kept track of to lowest order, so long as the number of nodes in the graph, which we call N, is large.
More exciting within the economic context is the emergence of complexity: one finds that on any graph with finite 〈k〉, so long as α > 0, the number of stable equilibria is exponential in the size of the graph. Due to the ‘glassy’ nature of the random field Ising model, this is not particularly strange, although we find especially elegant formulas describing this complexity in the context of this simple model. The way to understand the emergence of complexity in this model is as follows: consider a pair of neighboring states which are both very close to the point where Pvh(Qv) = p, and which can flip if any of their neighbors flips: this happens with probability α/〈k〉. Since there are 〈k〉 edges per node, the probability that any one node can simultaneously flip with one of its neighbors is then roughly 〈k〉(α/〈k〉)2, and there is a stable equilibrium where the pair of nodes is 0, and an equilibrium where the pair is 1. In particular, one can show that there is a macroscopic spectrum of allowed values of q associated with any single stable phase, at mean field level, whose width is

so long as 〈k〉 is not too small, by accounting for the possibility of avalanches due to node flips. This relation follows more rigorously from Thouless–Anderson–Palmer (TAP) equations, and requires that 〈k〉 be large enough so that the linear response regime is valid. A formal proof of whether or not this phase is truly a ‘glass’ is subtle, and we will comment more on this later.
In this paper, we will provide a new twist on this model: we will alter equation (3) to be a weighted average over the neighbors of v, in the following sense. To each node, we assign an influence parameter Jv, and we let

The sums in equation (8) are over each node that is connected to node v. It is straightforward to see that within mean field theory, where we replace all xv in the equations above by 〈xv〉 ≡ q, that we find that Q = q (using equation (8)), which implies that the mean field equations are exactly the same as they are in the homogeneous influence case. However, it is possible to find distributions on Jv which make mean field theory a bad approximation, even on complete graphs. We will show below that if the probability distribution on Jv is scale free with a small enough exponent, then in fact a few nodes dominate Q, and the resulting model behaves dramatically different from mean field theory. This will be the subject of much of this paper.
Before describing the results for the binary decision model under very heavy influence distributions, let us return to the question of whether or not this model is a true glass. A mathematical approach to this problem is to look for divergent spin glass susceptibility. Here we simply quote a theorem: for any model which admits an energy functional of the form

one can rigorously show that so long as each Juv ≥ 0, in a thermal ensemble where the probability of drawing a given state in equilibrium is proportional to e−H/T, for some temperature T, that no spin glass phase exists [21]. In the special case of h(Q) = 1 + AQ we consider in the numerical simulations of this paper, there does exist an energy functional with the property that its minimization corresponds to the solutions of equation (1). To see this, divide through equation (1) by Pv/JneighborsJv, where Jneighbors corresponds to the sum of the influence of each of the neighbors of v. One finds that equation (1) becomes

which can be seen easily to correspond exactly to minimizing the energy functional

So by one definition, this model for h(Q) = 1 + AQ does not describe a glass. Note that for other choices of h(Q), such an energy functional may not exist.
However, the definition of glass above is too restrictive in our context. To see this, note that the previous paragraph referred to the Gibbs distribution at finite temperature T > 0, implying that the noisy ‘thermal’ dynamics of the system admit a stationary Gibbs distribution. This is a non-trivial and overly restrictive statement in the context of sociophysics. In particular, at T = 0 (the non-dynamical limit in which we are interested), we generically expect most of the stable equilibria described above to have vanishing weight in the Gibbs distribution since they are not necessarily states of minimal H. However, from the equations of motion, which is how we originally defined the model, there is no intuitive reason why these equilibria must be suppressed; one can construct a Markov chain whose ‘zero temperature’ limit does not suppress so many stable equilibria. One would therefore like a definition of glasses, appropriate for sociophysics, which is more general: a heuristic but physically motivated definition would be a system with a complexity paradigm that admits dynamics on arbitrarily large time scales (perhaps breaking ergodicity). Our model, in many phases, has a complex landscape of exponentially many solutions in many situations. It is reasonable to suspect that large distributions in the sizes of ‘glassy clusters’ whose flips provide the exponential numbers of equilibria lead to mixing dynamics over many time scales, when one studies the dynamics of this model. For this reason, we will drop the quotes around the word ‘glass’ and refer to phases of this model with exponential numbers of equilibria as glassy phases. One should note however that these states do not necessarily obey one conservative definition of a (spin) glass.
As a final note, for the remainder of this paper, we will make the approximation that the sums in equation (8) are over the neighbors u of the node v, as well as the node v itself. The addition of v to this sum makes some of the analytic calculations of this paper simpler, but does not lead to any qualitatively new physics, as we have checked numerically.
3. Lévy flights
In this section, we will focus on influence statistics when the influence is distributed according to a power law distribution with a very heavy tail—in this paper, the word ‘very’ will quantify distributions with infinite mean. Since there are many experimental studies suggesting that reasonable measures of social influence have power law tails [1, 11, 15, 16, 25, 33], this is not an unreasonable toy model. In particular, in some circumstances [33] found very heavy-tailed ‘influence’ distributions (out-degrees of nodes in a network) in social recommender systems in social media.
The extreme value statistics of random variables with infinite mean, described in terms of the well-known ‘Lévy flights’, have found a variety of interesting applications in physics [5], primarily as descriptions of statistical systems which cannot reach an equilibrium distribution [3, 6]. We will review some of the basic calculations as we wish to derive some less commonly known results as well. Then we will describe how these Lévy flights are manifested in our binary decision model as two alternative solvable phases of our binary decision model. Some of the calculations below are described in an alternate fashion in [3]—we repeat them here for simplicity.
3.1. Influence statistics
Let us consider the following simple problem. Suppose we have N independently and identically distributed (iid) random variables Ji (i = 1,…,N) distributed according to the distribution

for 0 < ν < 1. For the remainder of this section, we will assume that x ≥ 1, to simplify notation. Without loss of generality, we can re-label these variables as Ji, with the re-labeling chosen so that J1 > J2 > ⋯ > JN.1 The cumulative distribution function for Jk is easy to find:


The interpretation is simple: the first term corresponds to the probability that Jk−1 are all bigger than x (the multiplicative factor corresponds to the number of possible permutations of Jk−1 from the original Jα). However, there is also the probability that Jk−1 is also smaller than x, etc, and accounting for all of these possibilities we obtain the total sum. When N is large and x ≫ 1, we can make the approximation

From equation (15) we see that the scale for J to be large is N1/ν: in particular, the probability distribution on J1 is rapidly suppressed (as an exponential in N) for x ≪ N1/ν. The probability that J1 < N1/ν is 1 − 1/e. The probability that there are exactly k Jis which are larger than N1/ν is given by

It is straightforward to argue that the scale of the typical Ji < N1/ν is

with standard deviation

These manipulations suggest that only a few influence variables dominate the sum J1 + ⋯ + JN. It therefore seems reasonable to focus only on the largest few random variables in this sum. In particular, we will focus on the random variables

in this section, as these are the weights that enter into equation (8) for Q in the binary decision model on complete graphs. Of course, it really is no different on any other graph either, as although the denominator only contains kv + 1 terms for node v, the sum will still be dominated by only a few terms. More importantly, while Ji have divergent and badly behaved distributions, the distributions on Ki are well-behaved. In general, it is very difficult to compute the distributions of Ki. However, we notice that if ν is close to 0, equations (17) and (18) suggest that all of the small J added together are still significantly smaller than the largest few J.
Let us begin by looking at the distribution on the ratios Jk/J1. The cumulative joint distribution on Jk and J1 is given by

To very good approximation when N is large, we have




From here we can compute

This asymptotes to

3.2. The thermodynamic limit
Now, let us study what happens in the large N limit. In particular, we want to understand the distribution on K1. As we will confirm in the next sections, whenever the probability that K1 > 0 is finite, even as N → ∞, the mean field description of the binary decision model breaks down. This is because in this case, the macroscopic dynamics of the model will depend on the microscopic details: in particular, the realization of disorder P1 for the most influential node. Therefore, in this section, we will determine P(K1 > 0) (after the N → ∞ limit).
Let us begin with the case 0 < ν < 1. In this case, we can prove that P(K1 > 0) = 1 by showing that 〈1/K1〉 is finite in the limit N → ∞:

We thus see that almost surely there is no mean field limit in this case.
Next, let us look at the case ν = 1. In this case, we see that J1 is almost surely of order N.2 The sum of all of the smaller random variables is of the order

with standard deviation

Thus in the strict limit N → ∞ we see that there is a mean field limit almost surely. However, this is a very slow limiting process, and we will return to this point shortly.
The case ν > 1 also has a mean field limit, as it should. To explicitly check this, we see that N1/ν ∼ J1 ≪ Jtot ∼ N, and

When ν > 2, of course, the standard central limit theorem holds and Jtot is essentially a Gaussian distribution. Regardless, we see that for ν > 1, fluctuations are subleading just as we expect, so the mean field limit exists.
We should also note that even if there is formally a mean field limit, this mean field limit may require N to be so large that it is impractical for ‘realistic’ situations. To that end, let us alternatively say that the mean field limit occurs when there are almost surely (away from any possible phase transitions) no avalanches of size larger than λ. In any linear response regime, we have shocks no larger than Δq ∼ K1/(1 − α), which simply follows from the fact that having many influential nodes all having very similar P values is very unlikely. Note that in the case where there is no influence, K1 = 1/N, and the total number of nodes that flips on average reduces to 1/(1 − α), as was discussed in section 2. We conclude that when K1 ≪ λ, we have a mean field limit, and when K1 ≫ λ, we do not3.
Now, we have already seen that K1 is almost surely positive when ν < 1, so there can be no mean field limit here, so long as λ is chosen to be small enough. In the case ν = 1, we have K1 ∼ 1/logN, which means that if we are looking at scale λ, the size of N required to have a mean field limit is N ∼ exp[1/λ], which is likely much larger than any practical graph for any reasonable resolution λ. When ν > 1, we require N1/ν−1 ≪ λ, or N ≫ (1/λ)ν/(ν−1), which may be ‘surprisingly’ large. For example, in the case ν = 2 which is believed to be experimentally relevant in many systems, we actually need to have N ∼ 1/λ2 to have a mean field limit, and if our resolution λ is small, this limit may be beyond our reach with any practical social system. We should finally note that the limit ν → ∞ results in N ∼ 1/λ, which is trivial since the number of nodes which flip in any avalanche must be an integer multiple of 1/N. This final case where N ∼ 1/λ will hold, up to minor corrections, in most systems where J1 does not have a power law tail distribution: for example, in the case of a Gaussian distribution on the J, we have , which is not much different from N−1.
4. The complete graph
In this section, we begin by studying the binary decision model with very heterogeneous influence (ν < 1) on the complete graph. The result will be an unpredictable phase characterized by abrupt shocks without critical phenomena, and with no thermodynamic limit with large N. This is a ‘solvable’ model in the sense that the dynamical behavior may be completely explained, and certain quantities can, in certain limits, be computed exactly, as we describe in this section.
4.1. A picture gallery
As we demonstrated in section 4, there can be no mean field limit in the case where ν < 1. The consequences of the mean field limit being non-existent are dramatic: the macroscopic dynamics of the system are sensitive to the realization of Pv, which is microscopic data. This should make sense, since when ν < 1, a finite number of nodes can carry a finite fraction of the total influence in the graph in the limit N → ∞. So we begin by showing in figure 1 some typical trajectories of q(p), found by increasing p starting from q(0) = 1. Since we are working on a complete graph, we can write very simple code by using the fact that we can order the nodes in order of their values of Pv, and sequentially look through them to determine whether or not they will be in state 0 or 1 at a given value of p. Exploiting the ‘graph structure’ here allows us to work with orders of magnitude more nodes with fast run times. Note that despite N ranging over four orders of magnitude, there is no noticeable mean field limit, just as we predicted earlier. Also note, particularly on the uniform distribution, the possibility of multiple large shocks.
Figure 1. We show typical q(p) for unpredictable decision making on complete graphs. Plots (a) and (c) have ν = 0.9, and (b) and (d) have ν = 0.5. Plots (a) and (b) have Pv uniformly distributed on [0,1], and (c) and (d) have Pv normally distributed with mean 1 and standard deviation 0.25. Dotted lines correspond to graphs with N = 103; dashed lines to N = 105; solid lines to N = 107. To avoid clutter, we do not display the lower branch of the bistable decision making.
Download figure:
Standard image High-resolution imageThere have been previous models which admit shocks without critical phenomena—admitting lower cutoffs in the distribution of Pv can result in sudden shocks, as was demonstrated in [22, 26]. However, the key difference with these models is that in the former models, the location of this shock is predictable given the microscopic dynamics. Furthermore, the shock in these models is typically of the form q(p) = Θ(pc − p), which means that the entire market changes its state in one global avalanche. Although this possibility should not be ruled out, since some economic markets (e.g., housing) will never undergo such radical binary shifts, alternative mechanisms for unpredictability should be explored. One mechanism which is slightly simpler mathematically than heavy-tailed influence distributions is to allow the function F(P) distribution to have discontinuous first derivative (or even be discontinuous) at points where 0 < F(P) < 1. However, this mechanism seems rather non-robust and would require the model to be fine-tuned.
4.2. Analytic results
Now, let us return to unpredictable decision making, and focus on some analytic computations which help illuminate the underlying behavior of these systems. The picture gallery makes clear that the various phases of this model, encapsulated in the double-valued q(p), is itself a random variable in unpredictable decision making. However, this does not imply that the model cannot be solved in any limit; instead, what we mean by ‘solvability’ must be slightly re-defined. This is precisely analogous to the way in which the fair coin flip problem is exactly solved—there is an equal probability for the coin to give heads or tails, although on a given realization the result will be unknown. Similarly, for this problem, the easily solvable limit corresponds to ν → 0, in which only one node should dominate the overall influence, and therefore the macroscopic behavior. Higher order corrections will become more and more relevant for larger ν, in which the behavior of two, three, etc nodes must be accounted for.
In this limit, it is very easy to understand the model, to first order. The first order approximation for this computation is that exactly one node contributes to Q: i.e.

Here x1 refers to the state of the most influential node. The decision making will be bistable and will correspond to solutions of the pair of equations


From these equations it is straightforward to derive that q(p) is a multivalued function with upper branch q+(p) and lower branch q−(p):


insofar as the first order approximation that only one node has influence is relevant. The locations of the discontinuity on q+ and on q−, as well as the size of the discontinuity, is determined by the random variable P1, the intrinsic value (internal magnetic field) as determined by the most influential node in the graph. Nonetheless, P1 is microscopic data about the state of the system, so to an outside observer who cannot determine this, the behavior will appear unpredictable.
In this section, our main numerical result will be to compute the probability density function of s1, which is the size of the shock during which the most influential node flips from 1 to 0. The size of the shock is easily shown to be

and therefore the distribution on s1 can be found straightforwardly by studying the preimage of the function of the right-hand side.
In general it is quite ugly to invert equation (36). In the case where P1 is distributed uniformly on [0,1] and h = 1 + AQ, however, it is straightforward to compute

This implies that s1 is uniformly distributed on [0,A/(A + 1)].
From the picture gallery, it is clear that in general a first order approximation is not enough. This is because there are multiple jumps (shocks) in many of the curves (note that the number of large jumps is random as well), and whenever there are two or more shocks, that implies that at least two influential nodes were relevant.4 We will see that numerical data demonstrates this is true as well for the distribution of s1. Let us therefore also describe the second order approximation, which already becomes quite cumbersome. We can tackle this most straightforwardly by first breaking the problem into two problems, each of which happens with half probability: P1 < P2 and P2 < P1. Since K1 = 1 − K2, P1 and P2 are each independent, we will assume that K1 is fixed and average over its distribution at the very end. To begin, let us denote P− = min(P1,P2) and P+ = max(P1,P2). We can write the probability density function on the shock size s1 by computing the contributions from the probability that P1 < P2 and both nodes are in different shocks, then the same shock; and repeating for P2 < P1:

In the case of P1 and P2 uniformly distributed on [0,1] with h(Q) = 1 + AQ, this integral can be explicitly evaluated:

At this point, we simply perform the average over K1 numerically, by using equation (24). Figure 2 shows that for a small value of ν, there is excellent agreement between the theoretical prediction and numerical data, suggesting that in this limit, only the first two nodes are relevant. Note that the first order approximation would not have sufficed.
Figure 2. We show the theoretical estimate for the PDF p(s1) (solid line) given by equation (39) and the numerical estimate (circles) for the specific case ν = 0.3. The numerical estimate was obtained from 104 trials, binning data points into 50 bins, on graphs with N = 50 000 nodes: (a) shows the case A = 2; (b) shows the case A = 1.
Download figure:
Standard image High-resolution image5. Random graphs
Now let us turn to the behavior of this model on random graphs. In [22] it was shown that in the case of no influence, the presence of a network turned the mean field model into a glassy model with exponential numbers of equilibria. In this section, we will see that models which are unpredictable on complete graphs become predictable again on random graphs with finite degree, but enter a new glassy phase distinct from the mean field glassy phases.
5.1. Glassy behavior
Glassy behavior comes about from the possibility that a cluster of nodes could simultaneously be all in state 1 or all in state 0, as discussed in section 2. However, this picture breaks down in the very heterogeneous influence model, because most nodes are only following a single node. This must remain true on random graphs, because the Lévy flights associated with the total influence that each node sees are dominated by a few terms regardless of the number of terms in the sum.
Let us assume that the random graph is sparse in that the degree of any node is much smaller than the total number of nodes. Let us also assume that ν is close to 0—we will return to what happens at larger values of ν later. Although the most influential node cannot influence the entire graph as it can on the complete graph, it does still entirely determine the state of each of its neighbors. Now, let us imagine removing the influential node and all of its neighbors from the graph. We can perform the same procedure again, and remove more nodes which are mostly entirely following a different node. Continuing this process until the graph has no remaining nodes, we see that the graph decomposes into local clusters which are effectively described by the unpredictable model on a complete graph. Since we have many of these clusters, each of which have independent thresholds, we expect a well-behaved and predictable thermodynamic limit to emerge.
Let us be more quantitative about this resulting limit. To do this, let us first focus on a single cluster. Within a single cluster, the state is entirely determined by P1, the internal field of the most influential node. Each cluster is characterized by a configuration with a regime of bistability at intermediate p, which can be described by specifying the upper and lower branches of the multivalued function q(p), which is precisely the result we found earlier in equation (35). Now, let us average over the value of P1. This average is actually quite simple to take:


So we conclude that if F(p) is a continuous function, this model is in fact characterized by a single, glassy phase with equilibrium spectrum width

We emphasize that this glassy phase not only is independent of N, as it must be to be a well-behaved thermodynamic limit, but it is entirely independent of the graph, so long as the degree of each node is much smaller than N.
Figure 3 demonstrates that qualitatively, the theory is very accurate at small values of ν, and becomes less so at larger values of ν. We will return to this point in a later section. Note that the glassy phase described above is remarkably independent of the underlying network structure, just as we predicted. Figure 4 shows typical samples of the total glassy phase at finite values of 〈k〉, when compared to the approximately 〈k〉-independent result. One can see that the larger the value of 〈k〉, the larger the typical shocks, since the clusters which are flipping contain more nodes.
Figure 3. We compare the theoretical glassy phase to the numerical one on various graphs and values of ν. In the legend, ER stands for Erdös–Rényi graphs, and SF stands for scale-free graphs with degree distribution ρk ∼ k−3. Plot (a) has a uniform distribution for Pv and A = 2, and plot (b) has a Gaussian distribution with μ = 1, σ = 0.25, and A = 1. All graphs have N = 2000 nodes. We averaged over 200 trials for each data point.
Download figure:
Standard image High-resolution imageFigure 4. We compare samples of influential decision making on graphs to the average behavior. We use Erdös–Rényi graphs with N = 2000 nodes, and influence distributions with ν = 0.15. Plots (a) has a uniform distribution for Pv and A = 2, and plot (b) has a Gaussian distribution with μ = 1, σ = 0.25, and A = 1. The thick solid line corresponds to the average case; thin solid lines refer to graphs with 〈k〉 = 200, and dashed lines to graphs with 〈k〉 = 20.
Download figure:
Standard image High-resolution image5.2. Finite N fluctuations
At a finite value of N, there will of course be fluctuations in mean field theory. For each cluster, the variance is given by




For the total graph, we just need to divide by the number of clusters, which is of order N/〈k〉, as each cluster behaves independently, so we expect

Surprisingly, for many regions of parameter space this relation is obeyed fairly quantitatively (to within a factor of 2), despite our crude estimate of the number of clusters, as shown in figure 5. The scaling in 〈k〉 is always obeyed although sometimes the effective prefactor appears to differ—depending on what the internal distribution F(P) is.
Figure 5. We compare the prediction of equation (47) to numerical results on Erdös–Rényi graphs with N = 2000 nodes, and influence distributions with ν = 0.15. Plots (a) and (b) have uniform distributions for Pv, and plot (c) has a Gaussian distribution with μ = 1, σ = 0.25. Plot (a) has A = 2, (b) has A = 4, (c) has A = 1. In each case, we have multiplied the numerical variance by N/〈k〉 to stress that after correcting for this graph dependent prefactor, the qualitative results are independent of graph. Variances were obtained by averaging over 200 trajectories q+(p) for each instance.
Download figure:
Standard image High-resolution image5.3. Phase transitions suppressed by influence
Let us now compare the glassy phase of the very heterogeneous influence model with the other case—where influence is not ‘heavy-tailed’. Making precise what we mean by ‘heavy-tailed’ is likely quite cumbersome, and for this paper we will simply continue to refer to J distributions with ν < 1 as heavy-tailed.
To understand what happens when the influence distribution is not heavy-tailed, consider the opposite limiting case where J ≈ 1, with small amounts of disorder in influence. Influence in this limit has almost no effect on the equilibria. We have seen this already at mean field level, but it also holds to higher orders in the cavity expansion. We say ‘almost’ here because there is a very small effect, which is analogous to the effect of small degrees in the homogeneous influence model. To write the cavity equation, let us denote the state of the cavity node with x0, and one of its neighbors with x1. Let K10 denote the relative influence of the cavity on its neighbor. Averaging over the neighbors of the cavity, we find analogously to the computation in [22] that the TAP equations become

The first term in this equation is the mean field result, and the second term is the cavity’s effect on its neighbors—for a more careful derivation, one should consult the reference above. From the x0-dependent piece of this equation, we can extract that the probability that both a 0 and 1 are allowed solutions:

where we have averaged over P0 and J0, about the mean field solution. But 〈K10〉 = 1/〈k〉, a fact which is easy to see by the following logic:

We have used the tower property of conditional expectation values here, and used that averaging over all influences, the influence of any one neighbor is equally likely to that of any other, and therefore the influence must be the inverse of the degree. We conclude, using this logic, that the first order cavity correction to the mean field equations is therefore independent of influence. Of course, we showed with a heuristic argument earlier that very heterogeneous influence models do alter the glassy physics, and confirmed this numerically. The flaw in this argument is simply that we cannot linearize around a solution in the very heterogeneous influence model, since the behavior of x0 is entirely dominated by the most influential node.
So we conclude that for ν ≪ 1, we should have the highly glassy phase described in earlier sections, and for ν ≫ 1 we should return to the glassy phase of [22], given by equation (7), which has small 1/〈k〉 deviations from mean field theory (with the possible suppression of discontinuities in phase transitions). Figure 6 shows the transition between the two glassy phases, as ν is varied, for two values of 〈k〉.5 It is likely that for ν ∼ 1 there is a complicated interplay between finite degree effects in the network and multiple influential nodes being relevant, and so we do not attempt a further theoretical analysis of this case. We do make a final note, however: we see numerical evidence that influence may suppress a phase transition even when ν > 1. In the simulations with 〈k〉 = 40, there is a genuine phase transition at ν = 5; however, at ν = 1.5, we see a ‘tail’ in q(p). It is reasonable to suspect that a wide enough distribution in influence may thus have a benign effect on the global decision making process by helping to smooth out shocks and other unfortunate behavior.
Figure 6. We show the maximal and minimal values of q(p) for various values of ν and 〈k〉: (a) shows Erdös–Rényi graphs with 〈k〉 = 40, and (b) shows 〈k〉 = 10. All runs had uniform distributions on Pv with A = 2 and N = 2000 nodes. 100 trials were averaged to obtain each data point. We have also checked that these results are N-independent.
Download figure:
Standard image High-resolution image5.4. Correlated degree and influence
In many cases, such as models of the type presented in [33], it does not make sense to ignore correlations between degree and influence. In this section, we will consider what happens when we consider binary decision making with influence which translates into graph structure according to some relation kv = k(Jv) with k(J) a monotonically increasing function. For this paper, we will focus on the simple function

with J distributed as usual according to equation (12) and 0 < θ some power. We will choose to construct our graphs as random graphs made with the Molloy–Reed algorithm [23, 24]. For large k, the degree distribution of the network will be given by

Depending on the values of ν and θ, we thus expect a variety of possible behaviors to occur.
If ν ≤ θ, we see that the distribution on ρk is heavy-tailed enough that some nodes in the graph should touch a finite fraction of the nodes. This implies that very influential nodes have many connections. If ν < 1, we also conclude that there is no mean field limit and therefore the decision making is unpredictable. If ν > 1, then unless 〈k〉 is very small, we expect that we quickly transition into a homogeneous-like glass phase. To see this, we note that all of the most influential nodes are connected to each node, and since for ν > 1 the very influential nodes will count roughly equally, the independence of Pv from Jv implies that each node will see a crude ‘mean field’ collection of nodes. If ν > θ, then the distribution on ρk is not heavy-tailed enough, and the most connected node will most likely not reach a finite fraction of nodes in the graph. This implies that there is a mean field limit. If ν < 1, the mean field limit will be qualitatively similar to the universal glassy phase described in section 5.3, and if ν > 1, the mean field limit will be the homogeneous influence limit.
Figure 7 shows that these qualitative arguments hold in practice in our simulations. As in section 5.3, ‘far’ from limiting cases these arguments do not hold perfectly, but it is clear what the basic mechanisms at play are. When ν < 1, the average q(p) is qualitatively very similar to the Erdös–Rényi case described in the previous sections, with minor quantitative differences due to θ > 0. As θ becomes larger than ν, the size of the shocks becomes increasingly large, although due to numerical limitations we could not observe a ‘sharp’ transition between behaviors at ν = θ. The ν > 1 case depends very little on the value of θ, and only on 〈k〉. In general, we see substantial robustness of the model to precise details of the graph structure, at least for the random graph ensembles we have studied.
Figure 7. We show the maximal and minimal values of q(p) for graphs where influence is correlated to degree. (a) Scale-free graphs with ν = 0.4 and k0 = 10; the solid line shows an average curve, and the marked lines show single instances of the model (recall that the average q is identical for both phases in this limit, something which is true numerically as well); (b) shows ν = 1.5—here, we averaged over runs, and each run is on graphs with N = 2000. All runs had uniform distributions on Pv with A = 2; 200 trials were averaged to obtain each data point, with the exception of the solo runs in (a). We have also checked that these results are N-independent. We collected more data than are displayed—for clarity we have only displayed a small subset, but the qualitative trends hold for all of the data.
Download figure:
Standard image High-resolution image6. Conclusion
In this paper, we have discussed the consequences of adding heterogeneous influence to the binary decision model of [22]. Interestingly, we found that only in the limit where a few nodes have finite fractions of the total influence does the behavior dramatically differ from the case where there is no influence at all. Although we have found that less-heavy-tailed distributions can also qualitatively alter the behavior of the model near phase transitions, this is analogous to how network structure can suppress a phase transition when 〈k〉 is small.
When a few nodes dominate the overall influence of the graph, two new phases emerge: one with unpredictable shocks, and one with a universal and highly ‘glassy’ phase of non-interacting clusters. Although we have not explicitly checked whether the observed phenomena hold in more general models such as random field Ising models (with more standard choices of couplings), it seems to reasonable to postulate that these phenomena are fairly generic, at least at the qualitative level. We also stress that the results we have found for very influential decision making are robust against changes to the probability distribution on J, so long as they preserve the heavy-tailed nature of the distribution.
Most importantly, we comment on the possible relevance of this work to experiments. As we have seen, the presence of very influential decision makers (ν < 1) in a graph has dramatic effects. If the model we have presented is a qualitatively good description of real decision making, then these dramatic effects have experimentally observable consequences. If the influential decision makers also are connected to a finite fraction of nodes in the graph, the resulting shocks do not have associated critical phenomena. If an experimentalist has good resolution in p in observations of q(p), this would be a very easy effect to observe in practice. If the influential decision makers are connected only to small clusters of the graph, then the dynamics effectively splits into non-interacting clusters, and the macroscopic system does not undergo any phase transitions—thus, the observation of a phase transition or shock would easily invalidate this scenario. Note that determining whether a model without phase transitions contains very influential individuals or not is still rather subtle, at least purely from q(p)—instead, it is likely that microscopic data would be required. If the avalanche distribution contains an anomalously large number of avalanches of order 〈k〉, that would suggest that large clusters are switching states simultaneously, which is a strong indication of influential decision making.
There are a few possible drawbacks to looking for experimental evidence of the present model. Firstly, this paper has assumed that Jv and Pv are uncorrelated for each node—possible correlations may lead to new phenomena, such as influential elites crashing a market even when a large majority of typical nodes would be willing to participate in the absence of such influence. Secondly, this paper has assumed that the decision rules are the same for all nodes, but this may be a bad assumption (do influential nodes interact more strongly, e.g. through a larger value of A?). The assumption that all the neighbors of node v perceive the same influence Jv may be incorrect. Further study of generalized models which relax these assumptions is worthwhile.
Acknowledgments
AL is supported by the Purcell Fellowship at Harvard. He would like to thank the anonymous referees for helpful suggestions.
Footnotes
- 1
With probability 1, since this is a continuous distribution, all Ji will be distinct.
- 2
Rigorously, we mean that P(J1/N > r) goes to 0 for r → ∞, and 1 as r → 0, after we have taken the limit N → ∞.
- 3
The α scaling has been dropped for convenience, since our main interest here is the scaling with system size N.
- 4
Note that there can be two influential nodes which contribute to the same shock, as well, so the number of shocks does not correspond to the number of ‘very influential’ nodes.
- 5
Due to the fact that we have allowed nodes to count their own opinion in Qv, the spectrum Δq of the glassy phase in the ν ≫ 1 limit picks up a factor of 4, when compared with the result of [22]: Δq ≈ 4α2/(1 − α)〈k〉. This comes about from the fact that each node in a glassy pair changes Qv by 2α/〈k〉.