Network Analysis and Modeling, CSCI 5352    Prof. Aaron Clauset
Lecture 11: the configuration model    10 October 2013
For some vertex ℓ to be a common neighbor of a pair i and j, both the (i, ℓ) edge and the (j, ℓ) edge
must exist. As with the multi-edge calculation above, the correct calculation must account for the
reduction in the number of available stubs for the (j, ℓ) edge once we condition on the (i, ℓ) edge
existing. Thus, the probability that ℓ is a common neighbor is the product of the probability that
ℓ is a neighbor of i, which is given by Eq. (1), and the probability that ℓ is a neighbor of j given
that the edge (i, ℓ) exists, which is also given by Eq. (1) except that we must decrement the stub
count on ℓ.
\[
n_{ij} = \sum_{\ell} \frac{k_i k_\ell}{2m} \, \frac{k_j (k_\ell - 1)}{2m}
       = \frac{k_i k_j}{2m \, \langle k \rangle \, n} \sum_{\ell} k_\ell (k_\ell - 1)
       = p_{ij} \, \frac{\langle k^2 \rangle - \langle k \rangle}{\langle k \rangle} \; . \tag{2}
\]
Thus, the probability that i and j have a common neighbor is proportional to the probability that
they themselves are connected (where the constant of proportionality again depends on the first
and second moments of the degree sequence).
randomly chosen neighbor of a randomly chosen vertex, excluding the edge followed to get there.
This distribution also shows us something slightly counterintuitive about configuration model networks.
Let pk be the fraction of vertices in the network with degree k, and suppose that following the edge
brings us to a vertex of degree k. What is the probability of that event? To have arrived at a vertex
with degree k, we must have followed an edge attached to one of the n pk vertices of degree k in the
network. Because edges are a random matching conditioned on the vertices' degrees, the end point
of every edge in the network has the same probability k/2m (in the limit of large m) of connecting
to one of the stubs attached to our vertex. Summing over the n pk vertices of degree k, and using
2m = n⟨k⟩, gives
\[
\Pr(\text{neighbor has degree } k) = n p_k \cdot \frac{k}{2m} = \frac{k \, p_k}{\langle k \rangle} \; . \tag{3}
\]
Although the excess degree distribution is closely related to Eq. (3), there are a few interesting
things this formula implies that are worth describing.
From this expression, we can calculate the average degree of such a neighbor,
\[
\langle k_{\text{neighbor}} \rangle = \sum_k k \, \Pr(\text{neighbor has degree } k) = \sum_k \frac{k^2 p_k}{\langle k \rangle} = \frac{\langle k^2 \rangle}{\langle k \rangle} \, ,
\]
which is strictly greater than the mean degree ⟨k⟩ itself (do you see
why?). Counterintuitively, this means that your neighbors in the network tend to have a greater
degree than you do. This happens because high-degree vertices have more edges attached to them,
and each edge provides a chance that the random step will choose them.
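This "friendship paradox" effect is easy to demonstrate numerically: sampling a uniformly random stub is the same as sampling a vertex with probability proportional to its degree. The sketch below uses a hypothetical right-skewed degree sequence.

```python
import random

import numpy as np

random.seed(0)
# Hypothetical right-skewed degree sequence (even sum, 2m = 32).
degrees = [8, 4, 4, 2, 2, 2, 2, 1, 1, 1, 1, 1, 1, 1, 1]
k = np.array(degrees, dtype=float)
print(k.mean())                    # <k> ~ 2.13
print((k ** 2).mean() / k.mean())  # <k^2>/<k> = 3.75, the mean neighbor degree

# Follow a uniformly random stub: this picks a vertex in proportion to its
# degree, so the sampled degrees average to <k^2>/<k>, not <k>.
stubs = [v for v, d in enumerate(degrees) for _ in range(d)]
samples = [degrees[random.choice(stubs)] for _ in range(100000)]
sample_mean = sum(samples) / len(samples)
print(sample_mean)  # close to 3.75, noticeably larger than <k>
```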
Returning to the excess degree distribution, note that because we followed an edge to get to our
final destination, its degree must be at least 1, as there are no edges we could follow to arrive at a
vertex with degree 0. The excess degree distribution gives the probability of the number of other edges
attached to our destination, and thus we substitute k + 1 for k in our expression for the probability
of a degree k. This yields
\[
q_k = \frac{(k+1) \, p_{k+1}}{\langle k \rangle} \; . \tag{4}
\]
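Eq. (4) can be applied directly to any degree distribution; note that q is automatically normalized because the numerators sum to ⟨k⟩. A small sketch with a hypothetical distribution p_k:

```python
import numpy as np

# Hypothetical degree distribution p_k for k = 0, ..., 4 (sums to 1).
p = np.array([0.10, 0.40, 0.30, 0.15, 0.05])
mean_k = np.dot(np.arange(p.size), p)

# Eq. (4): q_k = (k + 1) p_{k+1} / <k>, for k = 0 .. k_max - 1.
k = np.arange(p.size - 1)
q = (k + 1) * p[k + 1] / mean_k
print(q)        # the excess degree distribution
print(q.sum())  # normalized: sums to 1 by construction
```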
exactly the excess degree distribution, because we chose a random vertex v and followed a randomly
chosen edge.
The probability that i and j are themselves connected is ki kj /2m, and the clustering coefficient is
given by this probability multiplied by the probability that i has excess degree ki and that j has
excess degree kj , and summed over all choices of ki and kj :
\[
\begin{aligned}
C &= \sum_{k_i=0}^{\infty} \sum_{k_j=0}^{\infty} q_{k_i} q_{k_j} \, \frac{k_i k_j}{2m} \\
  &= \frac{1}{2m} \left[ \sum_{k=0}^{\infty} k \, q_k \right]^2 \\
  &= \frac{1}{2m \langle k \rangle^2} \left[ \sum_{k=0}^{\infty} k (k+1) \, p_{k+1} \right]^2 \\
  &= \frac{1}{2m \langle k \rangle^2} \left[ \sum_{k=0}^{\infty} k (k-1) \, p_k \right]^2 \\
  &= \frac{1}{2m \langle k \rangle^2} \left[ \sum_{k=0}^{\infty} k^2 p_k - \sum_{k=0}^{\infty} k \, p_k \right]^2 \\
  &= \frac{1}{n} \cdot \frac{\left( \langle k^2 \rangle - \langle k \rangle \right)^2}{\langle k \rangle^3} \; ,
\end{aligned} \tag{5}
\]
where we have used 2m = n⟨k⟩ and the definition of the moments of a distribution to reduce the summations.
Like the expression we derived for the expected number of multi-edges, the expected clustering coefficient is a vanishing fraction O(1/n) in the limit of large networks, so long as the second moment of the degree distribution is finite.
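Eq. (5) is simple enough to evaluate directly from a degree sequence. The sketch below does so for hypothetical k-regular sequences of growing size, making the O(1/n) decay visible.

```python
import numpy as np

def expected_clustering(degrees):
    """Expected clustering coefficient of the configuration model, Eq. (5):
    C = (1/n) * (<k^2> - <k>)^2 / <k>^3."""
    k = np.asarray(degrees, dtype=float)
    return ((k ** 2).mean() - k.mean()) ** 2 / (len(k) * k.mean() ** 3)

# With the degree distribution held fixed, C vanishes as O(1/n).
for n in (100, 1000, 10000):
    degrees = [3] * n  # 3-regular: <k> = 3, <k^2> = 9, so C = 4/(3n)
    print(n, expected_clustering(degrees))
```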
A corollary of the existence of the giant component in this model is that, when a giant component
exists, the diameter of the network grows logarithmically with n. As with
G(n, p), the configuration model is locally tree-like (which is consistent with the vanishingly small
clustering coefficient derived above), implying that the number of vertices within a distance ℓ of
some vertex v grows exponentially with ℓ, where the rate of this growth again depends on the first
two moments of the degree distribution (which are themselves related to the number of first- and
second-neighbors of v).
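A back-of-the-envelope sketch of this growth rate: under the locally tree-like approximation, each step of a breadth-first expansion multiplies the frontier by the mean excess degree b = (⟨k²⟩ − ⟨k⟩)/⟨k⟩, so typical distances scale like ln n / ln b. (This is a rough scaling estimate, not an exact diameter formula.)

```python
import math

import numpy as np

def mean_excess(degrees):
    """Mean excess degree, (<k^2> - <k>)/<k>: the expected branching
    factor at each step of the local tree."""
    k = np.asarray(degrees, dtype=float)
    return ((k ** 2).mean() - k.mean()) / k.mean()

degrees = [4] * 1000  # hypothetical 4-regular sequence: b = (16 - 4)/4 = 3
b = mean_excess(degrees)
print(b)
print(math.log(len(degrees)) / math.log(b))  # rough distance scale, ~6.3
```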
Constructing directed random graphs using the configuration model is also analogous, but with one
small variation. Now, instead of maintaining a single array v containing the names of the stubs,
we must maintain two arrays, vin and vout , each of length m, which contain the in- and out-stubs
respectively. The uniformly random matching we choose is then between these arrays, with the
beginning of an edge chosen from vout and the ending of an edge chosen from vin .
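The directed construction just described can be sketched in a few lines: shuffling one stub array and zipping it against the other yields a uniformly random matching between out-stubs and in-stubs. (The function name and example degree sequences here are illustrative, not from the lecture.)

```python
import random

def directed_configuration_model(in_deg, out_deg, seed=None):
    """Directed configuration model: match out-stubs to in-stubs uniformly
    at random. Requires sum(in_deg) == sum(out_deg) == m. Returns a list of
    (u, v) edges; multi-edges and self-loops may occur, as in the
    undirected case."""
    assert sum(in_deg) == sum(out_deg)
    rng = random.Random(seed)
    v_in = [v for v, d in enumerate(in_deg) for _ in range(d)]
    v_out = [v for v, d in enumerate(out_deg) for _ in range(d)]
    rng.shuffle(v_in)  # shuffling one array gives a uniform matching
    return list(zip(v_out, v_in))

edges = directed_configuration_model([1, 2, 0, 1], [2, 0, 1, 1], seed=1)
print(edges)  # 4 directed edges respecting the in- and out-degree sequences
```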
directly to the empirical values. Each of the mini-experiments below used 1000 instances of the
configuration model, in which multi-edges were collapsed and self-loops discarded.
The degree distribution (shown below as both pdf and ccdf) is very similar, but with a few notable
differences. In particular, the highest-degree vertices in the model have slightly lower degree
values than observed empirically. This reflects the fact that both multi-edges and self-loops have
a higher probability of occurring if ki is large, and thus converting the generated network into
a simple network tends to remove edges attached to these high-degree vertices. Otherwise, the
generated degree distribution is very close to the empirical one, as we expect.1
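This mini-experiment can be sketched with networkx (a sketch, not the original analysis code; the instance count is reduced from 1000 to 200 for speed):

```python
import networkx as nx

# Draw configuration-model instances with the karate club's degree
# sequence, collapse multi-edges, and discard self-loops.
G_emp = nx.karate_club_graph()
degrees = [d for _, d in G_emp.degree()]

max_degrees = []
for seed in range(200):  # the lecture used 1000 instances
    MG = nx.configuration_model(degrees, seed=seed)
    G = nx.Graph(MG)  # collapse parallel edges
    G.remove_edges_from(nx.selfloop_edges(G))
    max_degrees.append(max(d for _, d in G.degree()))

# Simplifying preferentially removes edges at high-degree vertices, so the
# model's maximum degree tends to sit at or below the empirical maximum.
print(max(degrees), sum(max_degrees) / len(max_degrees))
```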
[Figure: degree distribution of the karate club (empirical) and the configuration model, shown as a pdf, Pr(k) versus degree k, and as a ccdf, Pr(K ≥ k) versus degree k.]
1
It is possible to change the configuration model slightly in order to eliminate self-loops and multi-edges, by
flipping a coin for each pair i, j (where i ≠ j) with bias exactly ki kj /2m. In this model, the degree we
generate is distributed as k̂i ∼ Poisson(ki ), which takes the specified value in expectation.
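The coin-flipping variant described in the footnote can be sketched as follows (function name and degree sequence are illustrative; the sketch assumes ki kj /2m < 1 for every pair):

```python
import random

def coin_flip_model(degrees, seed=None):
    """Footnote variant: include each pair (i, j), i != j, independently
    with probability k_i * k_j / 2m (assumed < 1 for all pairs). Degrees
    are then matched only in expectation, but self-loops and multi-edges
    cannot occur."""
    rng = random.Random(seed)
    two_m = sum(degrees)
    n = len(degrees)
    edges = []
    for i in range(n):
        for j in range(i + 1, n):
            if rng.random() < degrees[i] * degrees[j] / two_m:
                edges.append((i, j))
    return edges

degs = [3, 2, 2, 2, 1, 1, 1]  # 2m = 12; all pair biases are below 1
edges = coin_flip_model(degs, seed=3)
print(edges)  # a simple graph: no self-loops or repeated pairs
```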
Both the distribution of pairwise geodesic distances and the network's diameter are accurately
reproduced under the configuration model, indicating that neither of these measures of the network
is particularly interesting as a pattern itself. That is, they are about what we would expect
for a random graph with the same degree distribution. One nice feature of the configuration
model's pairwise distance distribution is that it both follows and extends the empirical pattern out
to geodesic distances beyond those observed in the network itself.
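The empirical side of this comparison can be sketched with networkx (the helper function here is illustrative):

```python
from collections import Counter

import networkx as nx

def geodesic_distribution(G):
    """Distribution of pairwise geodesic distances in a connected graph G,
    along with the diameter (the largest geodesic distance)."""
    counts = Counter()
    for u, lengths in nx.all_pairs_shortest_path_length(G):
        for v, d in lengths.items():
            if u < v:  # count each unordered pair once (skips d = 0)
                counts[d] += 1
    total = sum(counts.values())
    return {d: c / total for d, c in sorted(counts.items())}, max(counts)

G = nx.karate_club_graph()
dist, diameter = geodesic_distribution(G)
print(dist)
print(diameter)
```

Running the same computation over many configuration-model instances yields the model's distance and diameter distributions for the comparison above.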
[Figure: left, the distribution of pairwise geodesic distances, Pr(d) versus geodesic distance d, for the karate club and the configuration model; right, the distribution of diameters, Pr(diameter), under the configuration model.]
We may also examine vertex-level measures, such as measures of centrality. From the geodesic
distances used in the previous figures, we may also estimate the mean harmonic centrality of each
vertex. The first figure below plots both the empirical harmonic centralities (in order of vertex label, from 1 to 34) and the mean values under the configuration model. The various centrality scores
are now placed in context, showing that they are largely driven by the associated vertex
degree, as demonstrated by the similar overall pattern seen in the configuration model networks.2
But, not all of the values are explained by degree alone. The second figure plots the difference
between the observed and expected centrality scores, ∆, where the line ∆ = 0 indicates no difference
between observed and expected values. If an observed value is above this line, then it is more
central than we would expect based on degree alone, while if it is below the line, it is less central.
When making such comparisons, however, it is important to remember that the null model defines
a distribution over networks, and thus the difference is also a distribution. Fortunately,
computing the expected centrality scores by drawing many instances from the configuration model
also produces the distribution of centrality scores for each vertex, which provides us with a quantitative notion of how much variance there is in the configuration model value. The grey shaded region
shows the 25% and 75% quantiles of the distribution of centrality scores for each vertex. When the
∆ = 0 line is outside of this range, we may claim with some confidence that the observed value is
different from the expected value.

2
Recall also that the Pearson correlation coefficient for harmonic centrality and degree was large, r² = 0.83, a fact
that reinforces our conclusion here.
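The ensemble of centrality scores and its interquartile range can be sketched with networkx and numpy (a sketch, with the instance count reduced from 1000 to 200 for speed; the normalization by n − 1 is one common convention):

```python
import numpy as np
import networkx as nx

# Distribution of each vertex's harmonic centrality under the configuration
# model null (multi-edges collapsed, self-loops dropped).
G_emp = nx.karate_club_graph()
degrees = [d for _, d in G_emp.degree()]
n = len(degrees)

scores = []
for seed in range(200):  # the lecture used 1000 instances
    G = nx.Graph(nx.configuration_model(degrees, seed=seed))
    G.remove_edges_from(nx.selfloop_edges(G))
    h = nx.harmonic_centrality(G)
    scores.append([h[v] / (n - 1) for v in range(n)])  # normalize to [0, 1]
scores = np.array(scores)

lo, hi = np.percentile(scores, [25, 75], axis=0)
h_emp = nx.harmonic_centrality(G_emp)
obs = np.array([h_emp[v] / (n - 1) for v in range(n)])
# Vertices (labels 1..34) whose observed centrality lies outside the model's
# interquartile range, i.e., plausibly more or less central than degree
# alone would predict:
print([v + 1 for v in range(n) if not (lo[v] <= obs[v] <= hi[v])])
```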
[Figure: left, harmonic centrality by vertex label (1 to 34) for the karate club and the mean under the configuration model; right, the difference ∆ between observed and expected harmonic centrality by vertex label, with the configuration model's interquartile range shaded in grey.]
This analysis shows that the main vertices (1 and 34, the president and instructor) are somewhat
more central than we would expect based on their degree alone. In fact, most vertices are more
central than we would expect, one is less central, and about a third of the vertices
fall in line with the expectation.
3 At home
1. Read Sections 13.3–13.11 (pages 445–483) in Networks