Limits of Graph Sequences
Contents
1 Warm-up: convergence in distribution
1.1 Starting case: empirical distribution on $k$ categories
1.2 Convergence of distributions on $\mathbb{R}^1$
1.3 Alternative view of convergence in distribution
1.4 Lessons
Figure 1: $S_2$ and $S_3$ are the simplices corresponding to the range of possible probability distributions on 2 and 3 categories, respectively. Each point in a simplex is a probability distribution, since each point is a vector with nonnegative entries summing to 1.
Figure 2: The empirical cdf (orange) is a step function which converges to the
true cdf (blue) as the sample size increases.
So, at any finite $n$, $p_n(x)$ takes jumps of size $1/n$ at (at most) $n$ distinct points. See Figure 2 for an example.
Theorem 1.1 (Glivenko–Cantelli). If $X_i \overset{\text{iid}}{\sim} \rho$, where $\rho$ is the true cdf, then
$$\max_x \, |p_n(x) - \rho(x)| \xrightarrow{\text{a.s.}} 0.$$
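As a sanity check, here is a minimal numerical sketch of the theorem (assuming NumPy and SciPy are available; the true distribution $N(0,1)$ and the sample sizes are illustrative choices):

```python
import numpy as np
from scipy.stats import norm

rng = np.random.default_rng(0)

for n in [100, 1000, 10000]:
    xs = np.sort(rng.standard_normal(n))    # n iid samples from N(0, 1)
    # Empirical cdf evaluated just after each jump point: p_n(x_(i)) = i/n.
    ecdf = np.arange(1, n + 1) / n
    rho = norm.cdf(xs)                       # true cdf at the jump points
    # The sup of |p_n - rho| is attained at a jump point; check both the
    # top (i/n) and the bottom ((i-1)/n) of each jump of size 1/n.
    dev = max(np.max(np.abs(ecdf - rho)), np.max(np.abs(ecdf - 1 / n - rho)))
    print(f"n = {n:6d}   max |p_n - rho| = {dev:.4f}")
```

The printed deviation shrinks as $n$ grows, consistent with the almost-sure convergence above.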
1.3 Alternative view of convergence in distribution
An alternative way to formalize convergence of distributions is to test them against integrals: we say a sequence of distributions $\mu_n$ converges to a distribution $\mu$ when
$$\int_{\mathbb{R}} f(x)\,\mu_n(dx) \;\longrightarrow\; \int_{\mathbb{R}} f(x)\,\mu(dx)$$
for all bounded and continuous $f$ (such $f$ are known as "test functions"). If this convergence holds, then we say that $\mu_n \xrightarrow{d} \mu$, i.e. that $\mu_n$ converges in distribution to $\mu$.
How do we apply this notion to the convergence of empirical distributions? In particular, the empirical distribution is a jump function, meaning it does not have a pdf in the standard sense. However, we can treat the empirical pdf as a mixture of Dirac delta functions, as follows. Put $\mu_n$ to be the empirical distribution from $n$ samples $x_1, \dots, x_n$, and let $\mu$ denote the distribution that all data were drawn from. Define the pdf of $\mu_n$ to be
$$\frac{1}{n} \sum_{i=1}^{n} \delta(x - x_i),$$
where the Dirac delta $\delta$ satisfies
$$\int_{\mathbb{R}} f(x)\,\delta(x)\,dx = f(0)$$
for any $f : \mathbb{R} \to \mathbb{R}$ continuous at $0$.
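Since $\delta$ is not a function in the ordinary sense, one hedged way to check this identity numerically is to stand in a narrow normalized Gaussian for $\delta$ (the bandwidth `eps` and the test function below are arbitrary illustrative choices):

```python
import numpy as np

def delta_approx(x, eps=1e-3):
    # Narrow normalized Gaussian; tends to the Dirac delta as eps -> 0.
    return np.exp(-x**2 / (2 * eps**2)) / (eps * np.sqrt(2 * np.pi))

dx = 1e-5
x = np.arange(-1, 1, dx)
f = np.cos                                        # any f continuous at 0
integral = np.sum(f(x) * delta_approx(x)) * dx    # Riemann sum for the integral
print(integral, "vs f(0) =", f(0))                # both approximately 1.0
```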
Now, putting $m_n$ to be this mixture of Dirac delta functions, we get
$$\begin{aligned}
\int_{\mathbb{R}} f(x)\, m_n(x)\, dx &= \frac{1}{n} \sum_{i=1}^{n} \int_{\mathbb{R}} f(x)\, \delta(x - x_i)\, dx && (1) \\
&= \frac{1}{n} \sum_{i=1}^{n} \int_{\mathbb{R}} f(y + x_i)\, \delta(y)\, dy && (2) \\
&= \frac{1}{n} \sum_{i=1}^{n} f(x_i). && (3)
\end{aligned}$$
So for $\mu_n \xrightarrow{d} \mu$, we need
$$\frac{1}{n} \sum_{i=1}^{n} \delta(x - x_i) \xrightarrow{d} \mu,$$
which we now know means that
$$\frac{1}{n} \sum_{i=1}^{n} f(x_i) \xrightarrow{n \to \infty} \int_{\mathbb{R}} f(x)\, \mu(dx)$$
for all bounded and continuous test functions $f$. Since
$$\frac{1}{n} \sum_{i=1}^{n} f(x_i) \in \mathbb{R} \quad \text{and} \quad \int_{\mathbb{R}} f(x)\, \mu(dx) \in \mathbb{R},$$
this reduces convergence in distribution to the ordinary convergence of sequences of real numbers, and for each fixed $f$ that convergence holds almost surely by the law of large numbers.
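To see (3) in action, here is a quick Monte Carlo sketch (assuming NumPy and SciPy; taking $\mu = N(0,1)$ and a particular bounded continuous test function purely for illustration):

```python
import numpy as np
from scipy.stats import norm
from scipy.integrate import quad

rng = np.random.default_rng(1)
f = lambda x: np.tanh(x) ** 2    # bounded, continuous test function

# Right-hand side: the integral of f against mu = N(0, 1).
target, _ = quad(lambda x: f(x) * norm.pdf(x), -np.inf, np.inf)

for n in [100, 10_000, 1_000_000]:
    x = rng.standard_normal(n)   # n iid samples from mu
    print(f"n = {n:8d}   (1/n) sum f(x_i) = {f(x).mean():.4f}   target = {target:.4f}")
```

The sample averages settle onto the target integral as $n$ grows, exactly as the displayed convergence predicts.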
1.4 Lessons
1. Observed data sets get represented as objects with lots of discreteness.
2. They tend towards continuous limit objects.
Figure 3: Graph isomorphism is determined by the graphs’ structure, not by
the label associated with each node.
Figure 4: Take three arbitrary nodes in g: do they have the same structure as
f ? Here, we identify two successful matchings between a triplet in g and f .
Figure 5: The subgraph induced by the three green nodes in $g$ is isomorphic to $f$. The same applies to the three pink nodes, and to several other sets of node triplets in $g$. The total number of such triplets that are isomorphic to $f$ is denoted by $\mathrm{Iso}(f, g)$.
Figure 6: There exists some $f' \supseteq f$ such that $f' \simeq G[k]$.
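In symbols, the event depicted in Figure 6 decomposes the injective homomorphism density over the possible supergraphs $f'$ of $f$ on the same node set. The original display is not preserved in these notes, so the indexing below is a reconstruction following the standard treatment of homomorphism densities:
$$t_{\mathrm{injective}}(f, g) \;=\; \sum_{\substack{f' \supseteq f \\ V(f') = V(f)}} t_{\mathrm{iso}}(f', g).$$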
Since this is a linear system of equations, we can invert it to express $t_{\mathrm{iso}}(f, g)$ as a linear combination of the $t_{\mathrm{injective}}(f, g)$. This implies that if we have all of the $t_{\mathrm{injective}}(f, g)$, we can always calculate all of the $t_{\mathrm{iso}}(f, g)$, and vice versa. Therefore, the isomorphism densities converge if and only if the injective homomorphism densities converge.
2.3 Homomorphisms
We previously went from a strong notion of graph matching, graph isomorphism,
to a weaker notion, injective homomorphism. We showed that convergence of
Figure 7: In a non-injective mapping, two different nodes in the domain graph
f can be mapped to the same node in the image graph g.
one implies the other, so that we could get away with working with the simpler
notion of injective homomorphism.
Naturally, the next step is to define an even weaker notion of graph matching, which will also be easier to calculate. To that end, we remove the restriction of injectivity (i.e. the requirement that distinct nodes of $f$ map to distinct nodes of $g$) to obtain the graph homomorphism.
Definition 7. A homomorphism from $f$ to $g$ is a mapping $\varphi : V(f) \to V(g)$ such that if $(i, j) \in E(f)$, then $(\varphi(i), \varphi(j)) \in E(g)$. Figure 7 shows an example where $f$ is injectively mapped to $g$, and another example where $f$ is non-injectively mapped to $g$.
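A brute-force sketch of Definition 7 (graphs are represented as node lists and edge sets over integer labels; the function name and representation are illustrative choices, not notation from the notes):

```python
from itertools import product

def count_homomorphisms(f_nodes, f_edges, g_nodes, g_edges):
    """Count maps phi: V(f) -> V(g) with (phi(i), phi(j)) in E(g) for every (i, j) in E(f)."""
    # Treat g as undirected by symmetrizing its edge set.
    g_adj = set(g_edges) | {(j, i) for (i, j) in g_edges}
    count = 0
    # Enumerate all maps V(f) -> V(g), repeats allowed (no injectivity).
    for phi in product(g_nodes, repeat=len(f_nodes)):
        assignment = dict(zip(f_nodes, phi))
        if all((assignment[i], assignment[j]) in g_adj for (i, j) in f_edges):
            count += 1
    return count

# Example: f = a single edge (K2), g = a triangle (K3); prints 6.
print(count_homomorphisms([0, 1], [(0, 1)], [0, 1, 2], [(0, 1), (1, 2), (0, 2)]))
```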
The notion of graph homomorphism, in contrast to the injective homomor-
phism, is akin to sampling nodes of g with replacement, in the sense that we are
allowing φ to assign the same node in g to more than one node from f .
Just as we defined $G[k]$ as the subgraph induced by picking $k$ nodes without replacement, we define $G'[k]$ to be the subgraph induced by picking $k$ (not necessarily distinct) nodes of $g$, i.e. with replacement. This leads to a new notion of motif density:
$$t_{\mathrm{hom}}(f, g) = \frac{\mathrm{Hom}(f, g)}{n^k} = \mathbb{P}(f \subseteq G'[k]).$$
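For instance, with $f = K_2$ (a single edge, so $k = 2$) and $g = K_3$ (a triangle, so $n = 3$), every ordered pair of distinct nodes of $g$ spans an edge, so $\mathrm{Hom}(f, g) = 3 \cdot 2 = 6$ and $t_{\mathrm{hom}}(f, g) = 6/3^2 = 2/3$; the missing $1/3$ comes from the three maps that collapse both endpoints of $f$ onto a single node of $g$.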