Random Graph Theo
Random Graph Theo
by
P . ERDŐS and A . RÉNYI
Dedicated to Professor P . Turán at
his 50th birthday.
Introduction
17
X(v)
if lim = 0
i
n- A(n)
(1) lim Pn,N(n) (A) _
n-+- N
if lim (n) __ +co .
n- A(n)
If (2) holds we shall say that A(n) is a „regular threshold function" for the
property A and call the function F(x) the threshold distribution function of the
property A .
For certain properties A there exist two functions A,(n) and A2(n)
Á2(n)
both tending monotonically to +- for n-->-+-, and satisfying lim = 0,
n- - Al(n)
such that
(n) - A,(n) _ -
if lim
i n-+- A2(n)
(3) lim Pn,N(n)(A)
n-.- N(n) - Al(n) _ +
if lim 00 .
n--+a A2 (n)
Clearly (3) implies that
N(n)
0 if lim sup < 1
n-a+m Al(n)
(4) lim Pn,N(n)(A) _
n-.- +(n)
~'
1 if liro inf > 1.
n - +ro A,.(n)
If (3) holds we call the pair (Al(n), A2(n)) a pair of °sharp threshold"-functions
of the property A . It follows from (4) that if (Aj(n), A2 (n)) is a pair of sharp
threshold functions for the property A then A, (n) is an (ordinary) threshold
function for the property A and the threshold distribution function figuring
in (2) is the degenerated distribution function
jr t ( x) _ 1
0 for x < 1
(1 for x>1
ON THE EVOLUTION OF RANDOM GRAPHS 19
and convergence in (2) takes place for every x 1 . In some cases besides
(3) it is also true that there exists a probability distribution function G(y)
defined for -- < y < -)- - such that if y is a point of continuity of G(y) then
NT(n) - A,(n) =
(ő) lim Pn .N(n)(A) = G(y) if lím y
n- n-- x A (n)
if (ő) holds we shall say that we have a regular sharp threshold and shall call
G(y) the sharp-threshold distribution function of the property A .
One of our chief aims will be to determine the threshold respectively
sharp threshold functions, and the corresponding distribution functions for
the most obvious structural properties, e . g. the presence in rn N of subgraphs
of a given type (trees, cycles of given order, complete subgraphs etc .) further
for certain global properties of the graph (connectedness, total number of
connected components, etc . ).
In a previous paper [7] we have considered a special problem of this
type ; we have shown that denoting by C the property that the graph is con-
nected, the pair C,(n) - 12 n log n. C2(n) = n is a pair of strong threshold
functions for the property C, and the corresponding sharp-threshold distri-
bution function is e -2v ; thus we have proved' that putting
A(12)
AT = 1 n log n + y n+ o(n.) we have
a
(Ó) lim PnMn)(C) = e_e 2Y (- - < y < + -)
n- -
i Partial result on this problem has been obtained already in 1939 by P . ERDős
and 11 . WHITNEY but their results have not been published .
2*
20 ERDÖS-RÉNYI
Comparing the method of the present paper with that of [10] it should
be pointed out that our aim is to obtain threshold functions resp . distributions,
and thus we are interested in asymptotic formulae for the probabilities con-
sidered. Exact formulae are of interest to us only so far as they help in determi-
ning the asymptotic behaviour of the probabilities considered (which is
rarely the case in this field, as the exact formulae are in most cases too compli-
cated). On the other hand in [10] the emphasis is on exact formulae resp .
on generating functions . The only exception is the average number of connected
components, for the asymptotic evaluation of which a way is indicated in
§ 5 . of [10] ; this question is however more fully discussed in the present paper
and our results go beyond that of [10] . -Moreover, we consider not only the
number but also the character of the components . Thus for instance we
point out the remarkable change occuring at N - n . If L-' - ne with c < 1/2
2
then with probability tending to 1 for n + - all points except a bounded
number of points of rn,N belong to components which are trees, while for
N tint with c > 1
this is no longer the case . Further for a fixed value of
2
n the average number of components of rn,N decreases asymptotically in a
linear manner with N, when N :< n,
while for N > n the formula giving
2 2
the average number of components is not linear in N .
In what follows we shall make use of the symbols O and o . As usually
a(n) = a (b(n)) (where b(n) > 0 for n = 1, 2, . . .~ means that Mim lI b(n)l - 0,
n-+-
la(n)j .
while a(n) = 0 (b(n)) means that is bounded . The parameters on
b(n)
ja(n) 1
2 edges . Thus in a complete graph of order k any two points are connected
by an edge .
A subgraph G' of a graph G will be called an isolated subgraph if all
edges of G one or both endpoints of which belong to G', belong to G' . A con-
nected isolated subgraph G' of a graph G is called a component of G . The
number of points belonging to a component G' of a graph G will be called the
size of G' .
Two graphs shall be called isomorphic, if there exists a one-to-one mapp-
ing of the vertices carrying over these graphs into another .
The graph G shall be called complementary graph of G if G consists
of the same vertices P l , P2, . . . . Pn as G and of those and only those edges
(P;, Pj ) which do not occur in G .
The number of edges starting from the point P of a graph G will be called
the degree of P in G .
A graph G is called a saturated even graph of type (a, b) if it consists of
a + b points and its points can be split in two subsets V, and V2 consisting
of a resp . b points, such that G contains any edge (P, Q) with P E V, and
Q E V2 and no other edge .
A graph is called planar, if it can be drawn on the plane so that no two
of its edges intersect .
We introduce further the following definitions : If a graph G has n
2 N
vertices and N edges, we call the number the "degree" of the graph.
V
21ti
(As a matter of fact is the average degree of the vertices of G .) If a graph
n
G has the property that G has no subgraph having a larger degree than G
itself, we call G a balanced graph .
We denote by P ( . . . ) the probability- of the event in the brackets, by
M(~) resp . D 2 (á) the mean value resp . variance of the random variable s .
In cases when it is not clear from the context in which probability space the
probabilities or respectively the mean values and variances are to be under-
stood, this will be explicitly indicated . Especially M n,N resp. Dn ,N will denote
the mean value resp . variance calculated with respect to the probabilities
Pn .N •
ON THE EVOLUTION OF RANDOM GRAPHS 23
1'
Y1
If however íl c I/Inn where c > 0 is a constant not depending on n., then the
appearance of trees of order 3 will have a probability which tends to a posi-
tive limit for n -* + -, but the appearance of a connected component consist-
ing of more than 3 points will be still very improbable . If N is increased while n
is fixed, the situation will change only if N reaches the order of magnitude
of n 2j3 . Then trees of order 4 (but not of higher order) will appear with a pro-
bability not tending to 0 . In general, the threshold function for the presence
k-2
of trees of order k is nk -1 (k = 3, 4, . . . ) . This result is contained in the
following
Theorem 1 . Let k >_
_ 2 and l (k - 1 < Z < 9 I be positive integers . Let
l lk
k„ denote an arbitrary not empty class of connected balanced graphs consisting
of k points and l edges . The threshold function for the property that the random
graph considered should contain at least one subgraph isomorphic with some ele-
2- k
ment of y k,, is n t .
Corollary 2. The threshold function for the property that a graph contains
a connected subgraph consisting of k >_ 3 points and k edges (i . e . containing
exactly one cycle) is n, for each value of k .
Corollary 3. The threshold function for the property that a graph contains
a cycle of order k is n, for each value of k _>_. 3 .
24 ERDŐS-RÉNYI
Corollary 4. The threshold function for the property that a graph contains
- 1 )
a complete subgraph of order k _> 3 is n z (i k-1, .
Corollary 5. The threshold function for the property that a graph contains
a saturated even subgraph of type (a, b) (i . e . a subgraph consisting of a + b
a+b
points P1 , . . . , Pa , Q I , . . . Q b and of the ab edges (P1 , Qj ) is n2 ab .
To deduce these Corollaries one has only to verify that all 5 types of
graphs figuring in Corollaries 1-5 . are balanced, which is easily seen .
Proof of Theorem 1 . Let Bk I >_ 1 denote the number of graphs belong-
ing to the class which can be formed from k given labelled points . Clearly
if Pn,N (&k,l) denotes the probability that the random graph rn,N contains
at least one subgraph isomorphic with some element of the class ^Vjk,l, then
nk 7-
Pn,N(&k,l) < Bk,1 l =o
(n n`
21-kJ
I
12
N
which proves the first part of the assertion of Theorem 1 . To prove the second
part of the theorem let -Vki denote the set of all subgraphs of the complete
graph consisting of n points, isomorphic with some element of 56 k 1 . To any
SE& let us associate a random variable e(S) such that E(S) = 1 or e(S) = 0
according to whether S is a subgraph of Pn N or not . Then clearly (we write
in what follows for the sake of brevity W instead of Mn,N)
n
l
On the other hand if SI and S2 are two elements of Z01)1 and if Sl and
82 do not contain a common edge then
f2J
N-21
-2l
If S,_ and S2 contain exactly s common points and r common edges (1 < r < l -1)
we have
-21+r
j 2nl
N - 21 + r - N21-r
M(s(Sl ) E(S2)) - .
n
0
f n41-2r
(l2
N
On the other hand the intersection of S I and S2 being a subgraph of S I (and S2 )
by our supposition that each S is balanced, we obtain r < l i . e . s z rk
S - k l
and thus the number of such pairs of subgraphs S I and S2 does not exceed
n-k 2k _rk
k n k
0 n
B2 `k kf k-~~ 1 .
Jz i
Thus we obtain
M (~E E(S)~~)
)
k,1
n
- 21
n
M(e(S)) +
n!Bk,1
k!2( n - 2 k)!
~2'
N
n
21
+OI
n
N1 kJ2 (n 2- 1 r
N
k
~
SE i k, 1
l2
;}T
1
Now clearly
In 2
1
2~
- 21 -l
lnJ
n! N -21 n 2 N-1
<
k! 2 (n - 2 k)! n' ~ k n 2
~2I ~21
1-
ERDŐS--RÉNYI
26
If we suppose that
n
it follows that we have
M(8 (8»2
(Z
r SE&(k,11
(1 .4) D2 1
e(S}) =0
I\ SEM
\ (n) ( CU
Pn,N ÍZ
ffi
e(S) - Z M(£(S)) > 1 .2 M(e(S»
SE k,1 SEM k,1 sETik ,1
and thus
(1 .5) Pn,N
L SEM (r' ),
e(S) < 1 2 M(E(S »)
2 SE Y~knj
=O
GJ
l1
I.
As clearly by (1 .2) if w -~ + then n M(e(S)) + - it follows not only
k,1
that the probability that 1'„ contains at least one subgraph isomorphic
with an element of k,I tends to 1, but also that with probability tending
to 1 the number of subgraphs of Fn N isomorphic to some element of ti 1; 1
will tend to +O° with the same order of magnitude as GJ 1 .
Thus Theorem 1 is proved .
It is interesting to compare the thresholds for the appearance of a sub-
graph of a certain type in the above sense with probability near to 1, with
the number of edges which is needed in order that the graph should have
necessarily a subgraph of the given type . Such -compulsory" thresholds
have been considered by P . TURÁN [11] (see also [12]) and later by P . ERDŐS
and A . 11 . STONE [ 17 ]) . For instance for a tree of order k clearly the compulsory
n(k - 2) ]
threshold is + 1 ; for the presence of at least one cycle the com-
I 2
pulsory threshold is n while according to a theorem of P . TURÁN [11] for
(k?)
complete subgraphs of order k the compulsory threshold is (n2 - r2) +
2(k - 1)
-v
where r=-n- (k-- 1) ~- In the paper [13] of T . KŐVÁ RI,
~21 Is - 11'
V. T . Sós and P . TuRÁN it has been shown that the compulsory threshold
for the presence of a saturated even subgraph of type (a, a) is of order of magni-
tude not greater than n2 a . In all cases the "compulsory" thresholds in
TURÁN's sense are of greater order of magnitude as our "probable" thresholds .
O\ THE EVOLUTION OF RANDOM GRAPHS 27
§ 2 . Trees
Now let us turn to the determination of threshold distribution functions
for trees of a given order . We shall prove somewhat more, namely that if
k-2
N e nk -1 where o > 0, then the number of trees of order k contained
in rn,N has in the limit for n -y + oo a Poisson distribution with mean value
_ (2 P )k 1 kk-z .
This implies that the threshold distribution function for
V
trees of order k is 1 - e - ~ .
In proving this we shall count only isolated trees of order k in Fn N , i- e.
trees of order k which are isolated subgraphs of I'n N' According to Theorem 1 .
this makes no essential difference, because if there would be a tree of order
k which is a subgraph but not an isolated subgraph of P n N , then rr,,N would
have a connected subgraph consisting of k + 1 points and the probability-
(
of this is tending to 0 if N = o l -n. k which condition is fulfilled in our
k-2
case as we suppose N - g .n k-I .
Thus we prove
N(
Theorem 2a . If lim k-2 = e > 0 and -r k denotes the number of isolated
n -~ n k-1
or j = 0, 1, . where
- (2 Q )k-1 kk-2
A
k!
r„ dJe - x
(2 .4) lim p ~~ -'n i = jl= (j = 0, 1, . . . )
n -. + . i=1 j!
(2 .5)
P11(j) = P ~Z
i-t
1„
Ene = ! . I
Clearly
(2:6) M
1Si,<iz< . . .<i .<1n j°r
thus it follows from (2 .3) that
- ~r
(2 .7) lim Z p, (j) (r = 1, 2, )
n>+mj=r r
l71 ri
uniformly in r .
It follows that for any z with I z < 1
(nz _~
(2 .8) hm 'Y Pn(9) II j ~ z r = ~, ()r = e' - 1
nyT' r=1 j-r lr •Í r. !
But
(2.9) Zr
,Z ~Pn(j) r1) -_ .G Pn(7) (1 + z)j - 1 .
r-1 (j=r l j=o
_1s however
j2 ,(k-1)r - 2Nrk
Sr )~ = -IV-r(k - 1) r2 l
(2 .14) M("(8002) . . . e( e n ( 1 O( n 2, 1
n ~r
(l2!
N
where the summation on the left hand side is extended over all r-tuples of
trees belonging to the set T(k) and the bound of the 0-term depends only on k .
Note that (2 .16) is valid independently of how N is tending to +- . This
will be needed in the proof of Theorem 3 .
Thus we have, uniformly in r
7!r
(2 .17) lim _ M(E(S1 ) e(S2) . . . E(S,,» = for r = 1, 2, . . .
-iP
k-s
nk-i
(2N _ 2N,k
ja2
I e n k;k -2
(2 .18) M(Tk) _
2N
n
k!
i+o ~n2 1~ l
Z
Fi .ure la.
i
o x (c) t 2c
F gure 1b .
32 ERDŐS-RÉNYI
Theorem 2h . If
N(n)
(2 .19) k-2~ + 00
n k-1
(2 .23) <P(x) = 1 x e - 2 du .
V2 n
where Mn,N is defined by (2 .22) . Now as well known (see [16], p . 176)
1 r!
(2 .24) - ag)
r
h l ! h 2 ! . . . h]-!
i~lRg=r, hi>1
ON THE EVOLUTION OF RANDOM GRAPHS 33
where v(Í) are the Stirling numbers of the second kind (see e . g . [16], p . 168)
defined by
r
(2 .25) xr = 60) x(x - 1) . . . (x -
Í=i
Thus we obtain
e a(ex 1) - 1=
+ +
orQ)
xr Aj_ _ W xr r
- d•(1)Ai
(2 .27)
Í=1 r=Í r~ r=1 91 .J=1
Thus it follows that
r dr l + a ~k e-
x
6rÍ)1 =
(2,28) , -- e .(e -1) I =
Í=1 dxr x=0 k=0k
We obtain therefrom
T, V
-~M""" ~ ~ - 1 11In,N r r2N
e Mn ry (k - MroN) ~ (1 + 0 ~- 2 ~ ~ .
(2 .29) M ~ (
+ •. 2k
Now evidently ki e -1 (k - ~t}r is the r-th central moment of the Poisson
distribution with mean value 2 . It can be however easily verified that the
moments of the Poisson distribution appropriately normalized tend to the
corresponding moments of the normal distribution, i . e . we have for r = 1, 2, . . .
+ ~ xa
Ak e
(2 .30) lim
).y+ . f
2 k i k.
( . - a )rl = - 1 -
V2 n 1
xr e 2 dx .
k.
(2 .31) í17 (n)= Inlogn+ I nloglogn+yn+o(n)
2k
hm Mn .N(n) (Tk) -
n-
Similarly from (2 .16) it follows that for r = 1, 2, . . .
~r
.N(n) ~~(sl) E(~2) . . . E(~r)) _
nhm ~ Mn r'
SIET k(n)
and the proof of Theorem 2c is completed by the use of our Lemma 1 exactly
as in the proof of Theorem 2a .
Note that Theorem 2c generalizes the results of the paper [7], where
only the case 1c = 1 is considered .
§ 3 . Cycles
Let us consider now the threshold function of cycles of a given order .
The situation is described by the following
Theorem 3a . Suppose that
(3 .1) N(n) -c n where e > 0 .
Let yk denote the number of cycles of order k contained in F, ,N (k= 3, 4, . . . ) .
Then we have
A1 e -a
(3 .2) "m Pn .N(n) (Yk 0, 1, . . .)
nom . ?
where
(3 .3) ` (2 c)k .
2k
Thus the threshold distribution corresponding to the threshold function A(n) = n
for the property that the graph contains a cycle of order k is 1 - e- 2k (2 1)k
It is interesting to compare Theorem 3a with the following two theorems :
Theorem 3b . Suppose again that (3 .1) holds . Let yk denote the number of
isolated cycles of order k contained in rn N (k = 3, 4, . . . ) . Then we have
w1
(3 .6) "M Pn,N(n) (ak = Í) _ (9=0,1, . . .)
where
_ (2 C e-2c)k k-3
(3 .7) w (1 + k + -
Z 2 + . . . -} ~ .
2k t 2! (k-3)!
2ív
e-2N
nn l k
M(fh)= 1 ~ ~( k - -1)!
I n
2k
As regards Theorem 3c it is known (see [10] and [15]) that the number
of connected graphs Gk.k (i" e . the number of connected graphs consisting
of k labelled vertices and k edges) is exactly
2 k}
I
1 ;~ ~, 2N k
8 n
(3 .11) M (S k)
= I
lti
- k' ~ -------
2k
- - I +k +
k2
2!
+ . . . +
kk-3
(k - 3)!,
3*
-RÉNYI
36
(3 .12) ~? kk
8
and thus
2N i-Zn ~k
~-
nee
(3 .13) M (k) -,• -- .
4k
e n
(3 .14) M(bk) .
4k
Using (3 .8), (3 .9) and (3 .11) the proofs of Theorems 3a, 3b and 3c follow
the same lines as that of Theorem 2a, using Lemma 1 . The details may be
left to the reader .
Similar results can be proved for other types of subgraphs, e . g. complete
subgraphs of a given order . As however these results and their proofs have
the same pattern as those given above we do not dwell on the subject any
longer and pass to investigate global properties of the random graph T,,,.
still almost all points (in fact n - 0(1) points) of I'n N belong to isolated
trees . There is however a surprisingly abrupt change in the structure of "n,N
with 1V - en, when c surpasses the value . If c > 1/2 in the average only a
2
positive fraction of all points Of rn,N belong to isolated trees, and the value
of this fraction tends to 0 for c + .
Thus we shall prove
Theorem 4b . Let ti' n, ,, denote the number of those points of Tn N which
belong to an isolated tree contained in h n,N . Let us suppose that
N(v)
(4 .2) hill = c > 0 .
n--- n
Then we have
(/ 1 for c < 1/2
M ( y ~n .N(n)) _ 1
(4 .3) lim - x(c) 1
n- . ;_ T n for c > -
2c 2
where x = x(c) is the only root satisfying 0 < x < 1 of the equation
(4 .4) x e - X= tee-2c,
which can also be obtained as the sum of a series as follows :
kk
(4 .5) x(c) _ 1 (2 c e 2c) k
k=1 V
Proof of Theorem 4b . 'e shall need the well known fact that the inverse
function of the function
(4 .6) y = x e-X (0 :< X :< 1)
Let T k denote the number of isolated trees of order k contained in -r. , N . Then
clearly
n
(4 .8) V n dN = N k -r k
k=1
and thus
n
(4 .9) M(i'n,N) _ ` k M(T I ) .
k=1
lim
1 M _ 1 k-2
V1
(2 c e -2c)k
(4 .10) (~k)
n-+ . n 2c kI
ERDŐS-RÉNYI
38
As however Vn,N(n) n and thus lim sup ~I(yrn,N(n)) < 1 it follows that
n -~ n
if (4 .2) holds and c < 1/2 we have
M (Vn,N(n)) = I .
(4 .14) lim
n->+ . n
Now let us consider the case c > 1. It follows from (2 .18) that if (4 .2)
(N(n) 1
where x = x I I is the only solution with o < x < 1 of the equation
` n J
2N(n)
2 N(n) -
xe - x = e n Thus it follows that if (4 .2) holds with c > 1/2
n
we have
M(Vn,N(n)) _ x(c)
(4 .17) lim
n-.+m n 2c
where x(c) is defined by (4.5) .
ON THE EVOLUTION OF RANDOM GRAPHS 39
The graph of the function x(c) is shown on Fig . la ; its meaning is shown
by Fig. 11) . The function
for c < 11 2
for c > 11 2
is shown on Fig . 2a .
Figure 2a.
Y Gfc1- 1- 2c)
Figure 2b.
ERDŐS-RÉNYI
40
Thus the proof of Theorem 41) is complete. Let us remark that in the
same way as we obtained (4 .16) we get that if (4 .2) holds with c < 1/2 we have
(4 .18) M( V n,N(n)) = n - 0(1)
where the bound of the 0(1) term depends only on c . (However (4 .18) is not,
true for c =1 as will be shown below .)
2
It follows by the well known inequality of Markov
n2
(4 .21) "M P 1, n,N(n) -
9 N(n) x
'rl
(1 n(n)
I ~ V
n a) n
where x = x ~ N(n)-~ is the only solution with 0 < x < 1 of the equation
l n
2N(n)
Te -x - 2 ~4T(n}
_e
n
n.
kk
Proof. We have clearly, as the series } (2 ce -2c)k is convergent,
h`_, k t
The proof follows the same lines as those of the preceding theorems .
Now (5 .1) follows easily, taking into account that (see (3 .8))
k 2N k
l2l
1
k2
M (Yk) 0
2 (k~ (k ) n 2k +
~2
N
42 ERDŐS--RÉNYI
Thus for e = 1 it is ,almost sure" that r n N(n) contains at least one cycle, while
2
for c < 1 the limit for n-> + co of the probability of this is less than 1 .
2
Proof. Let us suppose first c < . By an obvious sieve (taking into
account that according to Theorem 1 the probability that there will be in I'n,N(n)
with N(n) ~- no (c < 1/2) two circles having a point in common is negligibly
small) we obtain
_ lim M(H„,~v(„i)
(5 .7) lim Pn,N(n)(K ) = e - 1- 2 e eC+~~ .
Thus (5 .6) follows for c < 1/2 . As for c 1 /2 the function on the right of (5.6)
tends to 1, it follows that (5.6) holds for c = 1/2 too . The function y =
- 1 - V1 - 2c eC+C = is shown on Fig. 3 .
We prove now the following
Theorem 5c. Let Hn, N denote the total number of points of rnN which
belong to some cycle . Then we have for N = N(n) on with 0 e c <' 1/2
4 ca
(5 .8) lim M (H n,N(n)) -
n .+~ 1-2c
ON THE EVOLUTION OF RANDOM GRAPHS 43
,c
tJ=1 - v1-2r, .ec
0
1
2
Figure 3 .
(2c) 3 4c 3
M ( Hn ,N(n)) n k y4_
Z
k= t 2(1 -2c) 1-2c
The size of that part of I'n,N which does not consist of trees is still more
clearly shown by the following
Theorem 5d. Let ~0n,N denote the number of those points of rn N which
belong to components containing exactly one cycle . Then we have for N = N(n)
- en in case c * 1 12
k 0 -3
(5 .9) lien M(On,N(n)) =1 (2 ee-2c) k ~1 + + + .. . +
n -~ 2 g__g 1! 21 (k -3)!
while for c =1/2 we have
~1~
h
(5 .10) M 3 n 2,r3
( n,N(n)
12
k'
~I 1 )
3
M ('9i ;,N(n)) e n~ .ti b2 3 .
12
kk -3 1
1 ce -2c ) k j + 1k, + . . .
2 (J
(k - 3)11 4(1 - 2c) 2
2
[ log n ( kl r
-k - 1 - 0(nlog2 1~
M (Vn,N(n)) <_
1
k I
n 2 21 -~-~- 2 -
k=4 lk n
(12
Thus
X2(C)
M(~rz,N(n)) - }
lim _ - I x(e)
n--+m 9t 2C 22
where x =x(c) is the only solution satisfying 0 < x < 1 of the equation xe - z =
= Zee -2c , i . e .
kk-1 (2 CE 2c)is
x(C) _
k=1 k!
Proof of Theorem 6 . Let us consider first the case C < 1 . Clearly if we
2
add a new edge to a graph, then either this edge connects two points belong-
ing to different components, in which case the number of components is
decreased by 1, or it connects two points belonging to the same component
in which case the number of components does not change but at least one
new cycle is created. Thus 2
(6 .5) Sn,N - (n - N) < Hn,N
2 In fact according to a well known theorem of the theory of graphs (see [4], p. 29)
being a generalization of Euler's theorem on polyhedra we have N - n + ~n,N =
= xn,N, where xn,N - the ,cyclomatic number" of the graph rn,N - is equal to
the maximal number of independent cycles, in I'n ,N (For a definition of independent
cycles see [4] p . 28) .
ERDŐS-RÉNYI
46
x can be characterized also as the only solution satisfying 0 < x < 1 of the
equation xe_t = y . -
It follows that if 1V(i?) nc holds with c <'/2 we have
n2 2 N( n ) 4 2 (n) ~
(6 .8) MN(n)) _ - X - -{- 0(1) = n - 1' (n) -f- 0(1)
2 N(n) n 2n 2
which leads to a second proof of the first part of Theorem 6 .
To prove the second part, let us remark first that the number of compo-
nents of order greater than 4 is clearly < . Thus if n ,ti.(A) denotes the
(6 .11) M( ;n,N) e
.2k 1 k! n
and thus, according to (6 .6) if N(n) - cn holds with c > 1,12 We have
1.
(6 .12) lim MGn,N(n1) = x(c) - xu(c)
n -- n 2 e 2
1-c for 0 _
<c<
_ -1-
2
M( 5 n,N(n))
(6 .13) w(c) = lint
N(n) n 1 x2(c) ' 1
„ fx(c) - for c > / 2
2C -2 1
ON THE EVOLUTION OF RANDOM GRAPHS 47
I-Cfor0 :cs%2
z=zfc)=
(c)- x2cl) for c > %2
2c (x
Ő 1
-c
1
2
Figure 4 .
For the case N(n) - cn. with c >_ 1/2 one obtains by estimating the
variance of ~n,N(n) and using the inequality of Chebyshev that for any e > 0
Sn .N(n) _ x_ (e
(6.15) lim P I f x(C - < el - 1 .
n-, n 2c 2 ~ J
The proof is similar to that of (4 .21) and therefore we do not go into details .
and
where
(7 .3) e - a = 2 ee 1-2 c (i . e . a= 2 c- 1 -log 2 c
and thus a > 0 .)
Proof of Theorem 7a . We have clearly
(7 .4) P(d nMO >- z) = P f Zk > I < M(tk)
`k>x k?z
and thus by (2 .18)
ne ° x1
(7 .5)
POn,N(n) > z) - O - Z 5, 2
M(TZ2) a
5 ~`2
eaw -
(7 .7)
2cV27r
and
(7 .8) D 2 (r,,) = O(M (Zzs » .
Clearly
P(4 n,N(n) Z2) > P('rz E > 1) = 1 - P(txg = 0)
and it follows from (7 .7) and ('7 .8) by the inequality of Chebyshev that
Thus we obtain
(7 .10) P(4 n,N(n) >- z2) I - O(e -aw n) .
Thus (7 .2) is also proved .
Remark. If c < 1 the greatest tree which is a component of rn,N with
2
N r cn is - as mentioned above - at the same time the greatest component
ON THE EVOLUTION OF RANDOM GRAPHS 49
of I'n N , as Tn,N contains with probability tending to 1 besides trees only com-
ponents containing a single circle and being of moderate size . This follows
evidently from Theorem 4c . As will be seen in what follows (see § 9) for
c > 1 the situation is completely different, as in this case rn N contains
2
a very large component (in fact of size G(c)n with G(c) > 0) which is not a
tree . Note that if we put c = 1 log nn we have a = 1 log n and 1 log n k
2k k a
in conformity- with Theorem 2c .
We can prove also the following
Theorem 7b. If N- en, where c -/- 12 and e = 2ce 1-2 c then the number
5
of isolated trees of order h = flog n - loglog n, + l resp . of order >_ h (where
I is an arbitrary real -number such that h is a positive -integer) contained in
Fn ,N has for large it approximately a Poisson distribution with the mean value
_ a5!'2 e -al
a s 2 6-al
I = res 2
p. u
2c V2z 2cV2n(I -e - a)
1
Corollary . The probability that rn,N(n) with N(n) -nc where c
2
(7 .14) P dn ( N >
= n2-3 w<
n) =
k~~mn
M (t k) - O í 1
V ~0 n
which proves (7 .11) .
On the other hand, considering the mean and variance of z* _ Tk'
n z"
k ~, -
it follows that
M(a*) z A (03'2 where A > 0 and Dz(t*) = 0( 0)3!2)
and (7 .12) follows by using again the inequality of Chebyshev . Thus Theorem
7c is proved .
The following theorem can be proved by developing further the above
argument and using Lemma 1.
Theorem Let r(y) denote the number of trees of order > yn2 3 contained
7d. 1
= 1 e - x dx .
(7 .16)
V12 n x3; 2
s
In fact this is not true for >_ If2 as is shown by the following
c
ON THE EVOLUTION OF RANDOM GRAPHS 51
Theorem 8a. Let Y. N (d) denote the number of cycles of Gn,N of arbitrary
order which are such that exactly d diagonals of the cycle belong also to pn,N .
n
Then if N(n) = + ~n + o( V-n) where -~ < A < +-, we have
2
e-Q
(8 .1) lim P(Yn,N(n)(d) = y) _ eJ (j = 0, 1, . . . )
n-+w j1
where
AY 0
2 d
y 2a-iel'3 •e
(8 .2) 1
y•
P= 2 .6a d!
o
l 21
N
Vn
and thus if N (n) = n+ R + o(Vn)
2
n f k
3 k'
2n
M(Xn,N(n)(d) }^' k2d-1 e
(8 .4)
2 a +a ld! nd k=4
I 1
+ Vn
It follows from (8 .4) that
W a~ ~
1 yea-1 e Y3 2 dy .
ra lim M(Xn,N(n)(d))= 2
6a d
0
The proof can be finished by the same method as used in proving Theorem 2a .
Remark. Note that Theorem 8a implies that if N(n) = n + con V
2
with wn +°O then the probability that F,,,N(n) contains cycles with any
prescribed number of diagonals tends to 1, while if N(n) =n - w n Vn
2
the same probability tends to 0 . This shows again the fundamental difference
in the structure of ',,N between the cases N < n2 and N > 2n . This differ-
ence can be expressed also in the form of the following
4*
ERDŐS-RÉNYI
52
that the graph rn,N(,) is planar is tending to 1 while for c > 1 this probability tends
to 0 .
Proof of Theorem 8b . As well known trees and connected graphs contain-
ing exactly one cycle are planar . Thus the first part of Theorem Sb follows
from Theorem 5e . On the other hand if a graph contains a cycle with 3 dia-
gonals such that if these diagonals connect the pairs of points (P ;, P i ) (i =
= 1, 2, 3) the cyclic order of these points in the cycle is such that each pair
(P;, P,) dissects the cycle into two paths which both contain two of the other
points then the graph is not planar . Now it is easy to see that among the
k(k - 3)
2 k.
triples of 3 diameters of a given cycle of order k there are at least 6
3 (l
triples which have the mentioned property and thus for large values of k
approximately one out of 15 choices of the 3 diagonals will have the mentioned
property . It follows that if N(n) _ n + CJ', I/rn with wn +-, the proba-
bility that I'n
N(n) is not planar tends to 1 for n -~ +- . This proves Theorem
n
8b. We can show that for N(n) = + 7. with any real A the probability
2
of rn N(n) not being planar has a positive lower limit, but we cannot calculate
is value . It may even be 1, though this seems unlikely .
Theorem 9a . Let,-r, , ,N (A) denote the set of those points of rn,N which belong
to components of size >A, and let ff . ,N (A) denote the number of elements of
the set n,N (A) . If N1 (n) (c - s) n where e > 0, c - s > 1/2 and N2 (n) en
then with probability tending to 1 for n--->- +- from the H,,N,(n)(A) points
belonging to .-Tn,Nl(n)(A) more than (1 - b) Hn,N,(n)(A) points will be contained
in the same component of ~n,N2(n) for any b with 0 < b < 1 provided that
A > 50
(9 .2)
E2 62
where r > 0 is an arbitrary small positive number which will be chosen later
and
and let E;,1 ) denote the contrary event . It follows from what has been said
that
(9.5) lim P(E(,1 )) = 0 .
n- ;
1- a k 1 -{- a
2 _ <- j= ] a~ ` 2
(9.8) and
1-a n 1+a
-
< )' aj <_
2 j=k+1 2
(9 .10) H,,Nl(n)(A)
2 S min (Hn, H"t ) < max (Hn, Hn) < H,,,N,(n)(A) II -
2,
further no point of . °' is connected with a point of rn in F*,N2(n)
It follows that if a point P of the set '-- n Nl(n) (A) belongs to n (resp .
ten} then all other points of the component of I' n,NI(n) to which P belongs are
ON THE EVOLUTION OF RANDOM GRAPHS 55
wn
As by (9 .10)
~ Hn
1,1<
2 2
.-
b
+ ( 11á+b
=--<1a
2 2
H, 4 l 21 2 2
1
it follows that
1 H, .v,1n7(A) á ~e(1-3r)f2(A,e-e)n
(9.12) P(E(n3 ) < 2A ( 1-
2
and thus by (9.3) and (9 .6)
~(1+-r) log 2 e(1 - 3 i) f(Á, e s) á
(9 .13) P(En3 » < exp 1nf(A, c - e)
A 2 1
Thus if
(9 .14) A sá(1 - 3 z) f(A, c - s) > (1 + z) log o
then
(9.15) 1im P(EW) = 0 .
nom+
the inequality (9 .13) will be satisfied provided that r < 1 and A > 50
10 E2á2
Thus Theorem 9a is proved .
Clearly the "giant" component of rn,N2(n) the existence of which (with
probability tending to 1) has been now proved, contains more than
(1 -T)(1 -á)nf (A,c-s)
56 ERDŐS-E (;N l I
(9 .17)
an
2
< min (n', n') <
- max W, W) - < I1 - .~ n .
But the number of such divisions does not exceed 2 1 , and if the n points
are divided in this way, the number of ways N edges can be chosen so that
only points belonging to the same subset V' resp . V" are connected, is
n'~ + ~n"~)
2 2
l JI L 2
Cl N (n) N(n)a
(9 .18) P(B,(a, c)) < 2' ~1 - < 21 e 2
-I
2;
'e have seen that for N(n) - en with c > 112 l the random graph rn,N(n)
consists with probability tending to 1, neglecting o(n) points, only of isolated
n kk -2
trees (there being approximately (2c e-2c)k trees of order k) and of
2c k!
a single giant component of size G(c)n .
Clearly the isolated trees melt one after another into the giant compo-
nent, the "danger" of being absorbed by the "giant" being greater for larger
components . As shown by Theorem 2e for N(n) 1 log n only isolated
2k n
N(n) - 1/2 n log
trees of order < k survive, while for n- ~ +- the whole
n
graph will with probability tending to 1 be connected .
An interesting question is : what is the "life-time" distribution of an
isolated tree of order k which is present for AT(n) -cn ? This question is
answered by the following
Theorem 9r, . The probability that an isolated tree of order k which is present
in. 1,,N,( .) where N,(n) -~- cn and c > 1/2 should still remain an isolated tree
in rn M(n) where í1'2 (n) (c + t) n (t > 0) is approximately e -2kt ; thus the
„Life-time" of a tree of order k has approximately an exponential distribution
with mean value nand is independent of the "age" of the tree .
2 k-
Proof. The probability that no point of the tree in question will be
connected with any other point is
(n-k` -
N,(n) I 9 + k,
2
e -2kt
1
l=N,(n)+I
n,
~~ -
Suppose that
N(n)
(10 .1) lim = T o0
n + m n log n
Then we have for any e > o
p (2 c)J e-2c
(10 .3) lim P(Dn,N(n) ( k) _%) _ ~ (i - 0, 1, . . . )•
n--+ - ?
thus if N(n) -onn the degree of a given point has approximately a Poisson
distribution with mean value 2c . The number of points having the degree r
is thus in this case approximately
(2 C)r e -2c
n - (r = 0,1, . . )
r?
If N(n) _ (n log n) wn with co n -* +- then the probability that the degree of
2 N(n) (1 - 2 N(n)
a point will be outside the interval e) and (1 + E) is ap-
proximately
(2 o)„ •l ogn' ke - 2w„ log n -O ( 1
,k-21ogn .w,2,>e .2logn •w ,i k. neYwj
order . It is an open problem how large Ch (Pn N(n)) is for N(a) -on withc> 1/2 ?
A further result on the chromatic number can be deduced from our
above remark on independent vertices . If a graph T has the chromatic number
h, then its points can be divided into h classes, so that no two points of the
n
same class are connected by an edge ; as the largest class has at least points,
h
it follows that if f is the maximal number of independent vertices of r we have
n k) I
f >_ . IVOw we have seen that for N(n) = n _+ 2 (1 almost surely
h 2 J
j .2(1
f < k; it follows that for N(n) = - o( k~l almost surely Ch (rn,lY(n)) >
t2
6 0 ERDŐS-RÉNYI
Other open problems are the following : for what order of magnitude
r
of N(n) has n ,~~ n1 with probability tending to 1 a Hamilton-line (i .e . a path
which passes through all vertices) resp . in case n is even a factor of degree 1
(i .e . a set of disjoint edges which contain all vertices) .
An other interesting question is : what is the threshold for the appear-
ance of a "topological complete graph of order k" i .e . of k points such that
any two of them can be connected by a path and these paths do not inter-
sect . For k > 4 we do not know the solution of this question . For k = 4
it follows from Theorem 8a that the threshold is . It is interesting to
n2
compare this with an (unpublished) result of G . DIRAC according to which
if N >_ 2n: - 2 then GTn,N contains certainly a topological complete graph
of order 4 .
We hope to return to the above mentioned unsolved questions in an other
paper .
Remark added on May 16, 1960 . It should be mentioned that N . V .
SMIRNOV (see e. g . Marne~uamugecmwi C6opuux 6(1939) p . 6) has proved a
lemma which is similar to our Lemma 1 .
REFERENCES