Catalan Numbers and Random Matrices
Catalan Numbers and Random Matrices
Lawrence Ip
[email protected]
Department of Electrical Engineering
and Computer Sciences
University of California, Berkeley
supervised by Alan Edelman
May 19 1999
1 Introduction
Catalan numbers, the number of ways of pairing n brackets, arise naturally
in some random matrix results in the computation of moments of eigenvalue
distributions.
In Section 2 we show that Catalan numbers can be represented as the
number of Dyck paths. In Section 3 we show that this representation can be
used to drastically simplify some of the calculations that Wigner used for his
derivation of the semi-circle law and in Section 4 we show that Dyck paths
can be used to interpret an expression combinatorially to prove its validity.
This last result is the main result of this project.
1
However, this is not the only representation. In his new book, Stanley [2],
lists 66 combinatorial interpretations of Catalan numbers. We show that Cn
is also the number of Dyck paths of length 2n.
A Dyck path is a path on the square lattice with steps (1; 1) and (1; 1)
from (0; 0) to (2n; 0) that never falls below the x-axis. Consider a pairing
of n brackets. At each step, let i be the number of unmatched brackets to
the left of the ith bracket. So (()()) would be encoded 0,1,2,1,2,1,0. If we
now interpret the i as the heights of a Dyck path, we see that there is a
one to one correspondence between pairings of n brackets and Dyck paths of
length 2n.
We give a bijective proof of a formula for the number of Dyck paths and
hence a formula for Cn . Consider Dyck paths of length 2n. Now add a
downward step at the end of the Dyck path. There are now n + 1 downward
steps. Choose one of these steps to be special. Then we have (n + 1)Cn
possible congurations. Now cut the Dyck path into two parts, the part
before the special step and the part after it (including the special step).
Move the right part so that it starts from (0; 0) and append the left part to
the right part. We now have a special step going from (0; 0) to (1; 1) and
a path with steps (1; 1) and (1; 1) going from (1; 1) to (2n + 1; 1). We
can recover the Dyck path with the extra downward step and special step
by nding the leftmost minimum of the new path, cutting it into two parts
and reassembling. Thus this operation is a bijection. See Figure 1 for an
example.
Clearly the number of paths from (1; 1) to (2n + 1; 1) in is nn . So
2
we nd that
Cn = n + 1 2nn :
1
2
Figure 1: Bijection in derivation of formula for Cn.
3
Due to the symmetry of the distribution, the terms in the expression for
the th moment
X X
M = R 1
v i1 vi 1 ;
0 0
set i1 ;:::;i 1
length 2 + 1.
A type sequence of length 2 + 1 is a sequence that starts at 1 and ends
at 0, with successive members diering by 1 and all members non-negative.
By interpreting the type sequence as the heights of a Dyck path we see that
the number of type sequences t is the th Catalan number.
4
starting at 0 is (r), where (r) = 0 if r is odd, and
s
(2s) =
X 2s k k vk (v 1)s k
k =1
s 2s k
s
= v
X 2s s k (v 1)k
1
k k s =0
s
=
X 2 s 2s 2k + 1 (v 1)k :
k k 2s k + 1
=1
n 1
X
mn (z) = (1 + z) n (1 + z) j z n jC
n j:
2 +1 2 +1 1+
j =0
In the notation of McKay's paper, z = v 1 and s = n +1, and the expression
becomes
s 1
X
(2s) = v s v j (v 1)s j +1 C
s j: (1)
2 1 2 1
j =1
This formula also involves Catalan numbers and it turns out that it has
a straightforward combinatorial interpretation.
Let = ; ; : : : ; r be a walk of length r. We can think of this as a
0 1
traversal of the vertices of the graph. Let i be the depth of the traversal
at step i. We decrease i by one if we \backtrack" and increases i by one
otherwise. More formally,
m(d; i) = maxfj : j < i; j = dg
then
= 0,
0
i = i 1 if m
+1 ( i 1 ) ;i = i ,
+1
5
v 1
v 1
v 1 v 1 v 1
v 1 v 1 v 1
v v v v
Figure 2: Plot of i with s = 3, j = 2.
v 1
v 1 v 1 v 1
v 1 v 1
v v
Figure 3: Plot of i with s = 3, j = 1.
i = i + 1 if m
+1 ( i ;i
1 ) 6= i .
+1
Note that for all closed walks in the setting of Lemma 2.1 this denition is
equivalent to the denition given in McKay's paper because of the acyclic
condition of the subgraph. Thus the number of closed walks is given by the
number of walks with s = 0.
2
Now clearly all closed walks must end with a backtrack. The number of
walks that end with a backtrack is thus v s , the rst term of Equation 1.
2 1
steps. The remaining 2s 2j +1 steps must have i > j for all i > 2j 1.
2 1
Plotting the values of i we can think of this as one \upward" step followed by
a Dyck path of length 2(s j ). Now each upward step can occur in v 1 ways
and each \downward" step (corresponding to a backtrack), can occur in only
one way. So the number of ways of doing this will be (v 1)(v 1)s j Cs j .
Summing over j gives us Equation 1.
6
5 Conclusion
The representation of Catalan numbers as the number of pairings of n pairs
of brackets is the most well known but is not necessarily the most useful
one. In this project, we showed two applications where using the Dyck path
representation made it easier for the author to nd direct combinatorial in-
terpretations of various formulae. The reason Dyck paths may be easier to
work with is the graphical nature of their representation.
6 Acknowledgments
To Frank Calegari for recalling the key idea for the bijective proof of the
formula for the number of Dyck paths.
References
[1] B. D. McKay. The expected eigenvalue distribution of a large regular
graph. Linear Algebra and its Applications, 40:203{216, 1981.
[2] R. P. Stanley. Enumerative Combinatorics, volume 2. Cambridge Uni-
versity Press, 1999.
[3] E. P. Wigner. Characteristic vectors of bordered matrices with innite
dimensions. Annals of Mathematics, 62(3):548{564, November 1955.