Mit 18.786
Mit 18.786
H := {z ∈ C : Im(z) > 0}
For z ∈ H and α = a b
c d ∈ GL2 (R) we have
det(α) Im(αz)
Im(αz) = .
|cz + d|2
For det(α) > 0 we have |cz +d| 6= 0 and Im(αz) > 0, which implies that α 7→ αz is a holomorphic
automorphism of H for all α ∈ GL+ 2 (R) = {α ∈ GL2 (R) : det(α) > 0}.
:
Recall that a topological group G is a group object in the category of topological spaces (this
means the maps G × G → G and G × G defined by (g, h) 7→ gh and g 7→ g −1 are continuous). All
the topological groups we shall consider in this course are Hausdorff, but we won’t bake that
into the definition (some authors do). We view GL2 (R) and all its subgroups and quotients as
topological groups, where GL2 (R) has the subspace topology induced by GL2 (R) ⊆ M2 (R) ' R4 .
When a topological group G acts on a topological space X the action map G × X → X
defined by (g, x) 7→ g x is required to be continuous, in addition to satisfying the usual prop-
erties of a (left) group action. For each g ∈ G the map x 7→ g x is an automorphism of X (a
homeomorphism X → X ). For x ∈ X we have the G-stabilizer G x and G-orbit G x defined by
G x := {g ∈ G : g x = x} ⊆ G and G x := {g x : g ∈ G} ⊆ X
and we use G\X to denote the topological space defined by the G-orbits of X (with the quotient
topology), and we use G/G x to denote the topological space consisting of the right G x -cosets of
G (with the quotient topology).
Theorem 1.2. Let G be a second countable locally compact Hausdorff topological group acting
on a locally compact Hausdorff space X . Then for every x ∈ X the map g G x 7→ g x defines a
∼
homeomorphism G/G x −→ X.
Corollary 1.3. The map αSO2 (R) 7→ αi is a homeomorphism SL2 (R)/SO2 (R) −→
∼
H.
The corollary allows us to represent elements z of H as right cosets βSO2 (R), in which case
the action of αSL2 (R) on H is simply (left) matrix multiplication αz = αβSO2 (R) rather than a
linear fractional transformation.
polynomial t 2 − tr(α)t + det(α), whose sign determines the type of eigenvalues α has (distinct
complex conjugates, a single real (double) eigenvalue, or two real eigenvalues).
• α is elliptic if and only if its fixed points are z and z̄ for some z ∈ H.
• α is parabolic if and only if it has a unique fixed point x ∈ H∗ − H = R ∪ {∞}.
• α is hyperbolic if and only if it has distinct fixed points x 1 , x 2 ∈ H∗ − H = R ∪ {∞}.
In every case, the fixed points of α on P1 (C) lie in the extended upper half-plane H∗ , and they lie
in the upper half-plane if and only if α is elliptic.
Proof. This follows immediately from an analysis of the eigenvalues in each case.
GL2 (R)(p)
x
:= {α ∈ GL+
2 (R) x : α is parabolic or scalar}, GL+ + +
2 (R) x,x 0 = GL2 (R) x ∩ GL2 (R) x 0
:
• GL+
2 (R)i = SO2 (R);
• GL+ ×
2 (R)∞ = 0 d : a, d ∈ R , b ∈ R, ad > 0 ;
a b
(p)
• GL+ ×
2 (R)∞ =
a b
0 a :a∈R ,b∈R ;
• GL+ ×
2 (R)0,∞ = 0 d : a, d ∈ R , ad > 0 .
a 0
d x2 + d y2
p
dxd y
ds(z) = 2
and d v(z) =
y y2
We call the image Cz0 →z1 of an injective function φ : [0, 1] → H that is C ∞ at all but finitely
many points a path from z0 = φ(0) to z1 = φ(1). If we put φ(t) = x(t) + i y(t), the length of
Cz0 →z1 is
Z1 Z1p
(d x(t)/d t)2 + (d y(t)/d t)2
`(Cz0 →z1 ) := ds(φ(t)) = d t,
0 0 y(t)
which does not depend on the choice of φ,as long as it has image Cz0 →z1 .
A circle or line in H orthogonal to R is called a geodesic.
Lemma 1.7. For any two distinct z0 , z1 ∈ H, there is a unique shortest path Cz0 →z1 from z0 to z1 ,
which lies on a geodesic. For any z0 ∈ H and r > 0 the set {z ∈ H: `(Cz0 ,z ) = r} is a circle in H
orthogonal to every geodesic that contains z0 .
The isomorphism
H ' SL2 (R)/SO2 (R) allows us to uniquely represent α ∈ SL2 (R) in the
form α = ac db = hz kθ , where z = x + i y = αi ∈ H and θ = − arg(ci + d), with
y x cos θ sin θ
hz = y −1/2
0 1 ∈ SL2 (R), kθ = − sin θ cos θ ∈ SO2 (R).
d x d y dθ
dα = .
2π y 2
This measure is SL2 (R)-invariant: one can check that dα = dαβ = dβα for all α, β ∈ SL2 (R);
moreover, we have chosen the scaling so that dα is compatible with d v.
Theorem 1.8. SL2 (R) is unimodular, and for any measurable f : H → C we have
Z Z
f (z)d v(z) = f (αi)dα
H SL2 (R)
References
[1] Toshitsune Miyake, Modular forms, Springer, 2006.
In Lecture 1 we saw that the isometry group of the upper half-plane Isom(H) ' SL2 (R)/{±1}.
In any metric space X we can define the length of a path φ : [0, 1] → X as the supre-
Pn over all finite subdivisions 0 = t 0 < t 1 < · · · < t n < t n+1 = 1 of [0, 1] of the sum
mum
i=0 d(φ(t i ), φ(t i+1 ) (which need not be finite). Note that this length depends only on the im-
age φ, which we denote C x 0 →x 1 , where x 0 = φ(0) and x 1 = φ(1). Conversely, given a function
` assigning lengths to paths C x 0 →x 1 in X , we can define
and if the infimum exists for all x, y ∈ X , then d is a distance metric (positive on distinct points,
symmetric, and satisfies the triangle inequality), called the intrinsic metric. Metric spaces whose
distance functions can be defined in this way are called length spaces (or path metric spaces).
Definition 2.2. Shortest paths in in a length space (those that realize the infimum defining
the intrinsic metric) are called geodesic segments. A geodesic in a length space is the image
of a continuous map φ : R → X for which there exists ε > 0 such that for every t 0 , t 1 ∈ R
with t 0 < t 1 < t 0 + ε the image of the restriction of φ to [t 0 , t 1 ] is a geodesic segment. A
geodesic space is a length space in which every pair of distinct points are connected by a geodesic
segment, and if this geodesic segment is always unique, then X is a uniquely geodesic space.
In Lecture 1 we say that H is a uniquely geodesic space. More generally, we have the fol-
lowing theorem.
Theorem 2.3 (Hopf-Rinow). Every complete locally compact metric space is a geodesic space.
Definition 2.5. A (left) Haar measure µ on a locally compact Hausdorff group G is a nonzero
Radon measure that is translation invariant: µ(S) = µ(gS) for all g ∈ G and measurable S ⊆ G.
Replacing µ(gS) with µ(S g) yields a right Haar measure, and if µ is both a left and a right Haar
measure then µ and G are said to be unimodular.
It follows from the left and right invariance of the measure dα on SL2 (R) defined in Lecture 1
that SL2 (R) is unimodular. We used the homeomorphism SL2 (R)/SO(2) ' H defined by gSO2 →
g i to define this measure. But SL2 (R) ⊆ M2 (R) ' R4 inherits a natural metric space structure
with distance function d(α, β) = kα − βk, where
2
= a2 + b2 + c 2 + d 2 ,
a b
c d
Theorem 2.6 (Weil). Every locally compact Hausdorff group G has a Haar measure µ, and if µ0
is any other Haar measure on G then µ0 = λµ for some λ ∈ R>0 .
Definition 2.7. The action of a topological group G on a topological space X is a covering space
action if every x ∈ X has an open neighborhood U for which
ρ(G, U) := {g ∈ G : U ∩ g U 6= ;} = {1}.
If G is a covering space action, then π: X → G\X is a normal covering map and G ⊆ Deck(π).
Covering space actions are free actions, since G x ⊆ ρ(G, U) for every open U 3 x, but the
actions we are most interested have non-trivial stabilizers and cannot be covering space actions.
This leads to the following definition.
Lemma 2.9. For a Hausdorff group G acting on a Hausdorff space X the following are equivalent:
Lemma 2.10. For a Hausdorff group G acting on a Hausdorff space X the following are equivalent:
Recall that a continuous map f : X → Y is quasiproper if inverse images of compact sets are
compact, and if it is also a closed map then f is proper.
Lemma 2.11. Every quasiproper continuous map of locally compact Hausdorff spaces is proper.
Definition 2.12. A topological group G acts properly on a topological space X if the action map
λ: G × X → X × X defined by (g, x) 7→ (x, g x) is proper.
Proposition 2.13. Let G be a Hausdorff group acting properly on a Hausdorff space X . Then the
following hold:
Proof. (i)–(iii) follow from Lemma 2.10, since λ(G × X ) is closed, and (iv) follows from the fact
that the singleton set (x, x) ∈ X × X is compact, so G x ' G x × {x} = λ−1 (x, x) is too.
Theorem 2.14. Let G be a Hausdorff group acting on a locally compact Hausdorff space X . The
following are equivalent:
When X is metrizable and G acts via isometries, all six conditions are equivalent.
Proof. The action of Γ ⊆ SL2 (R) is is the same as that of its image in PSL2 (R), and Γ is discrete
if and only if its image in PSL2 (R) is discrete. For Γ ⊆ PSL2 (R) see [3, Proposition 34.7.2].
Definition 2.16. A Fuchsian group is a discrete subgroup of SL2 (R) or PSL2 (R).
References
[1] Martin R. Bridson and André Haefliger, Metric spaces of non-positive curvature, Springer,
1999.
[2] Joe Diestel and Angela Spalsbury, The Joys of Haar Measure, Amer. Math. Society, 2014.
Definition 3.4. A fundamental set for a group G acting on a topological space X is a subset
S ⊆ X for which (i) S 0 = S, (ii) GS = X , (iii) S 0 ∩ (gS)0 for all nontrivial g.
Let X be a completely locally compact geodesic space with metric d, and let Γ be a discrete
group of isometries acting properly on X .
is the hyperplane bisector separating x 1 and x 2 , and H(x 1 , x 2 )0 = {x ∈ X : d(x 1 , x) < d(x 2 , x)}.
Remark 3.6. Hyperplane bisectors need not be geodesics, but hyperplane bisectors in H are.
Definition 3.7. A set S ⊆ X is convex if it contains every geodesic segment connecting points
in S, and star-shaped with respect to x 0 ∈ S if it contains every geodesic segment connecting a
point in S to x 0 .
Proof. Consider any x ∈ H(x 1 , x 2 ) and any y on a geodesic segment from x to x 1 . Then
d(x 1 , y) + d( y, x) = d(x 1 , x). If y 6∈ H(x 1 , x 2 ) then d(x 2 , y) < d(x 1 , y) and
Corollary 3.9. The Dirichlet domain D(x 0 ) is closed and star-shaped with respect to x 0 .
Proof. D(x 0 ) is an intersection of Leibniz half-spaces that are closed and star-shaped.
Proof. For any x ∈ D(x 0 ) we can pick a bounded open U 3 x for which
\
U ∩ D(x 0 ) = U ∩ H(x 0 , γx 0 )
γi ∈F
where F is the finite set of Lemma 3.10 for S = U with elements of Γ x 0 removed. Therefore
\
U ∩ D(x 0 )0 = U ∩ H(x 0 , γx 0 )0
γ∈F
For γ ∈ Γ − (F ∪ Γ x 0 ) we have U ⊆ H(x 0 , γx 0 )0 , and if V is the set on the RHS of the corollary,
\
U ∩ D(x 0 )0 = U ∩ H(x 0 , γi x 0 )0 = U ∩ V,
γ∈Γ −Γ x 0
In the theorem below, by trivial, we mean an element or subgroup of Γ that acts trivially on
every x ∈ X (so for Γ ∈ SL2 (R) this would include ±1).
Theorem 3.14. Let X be a complete, locally compact geodesic space and let Γ be a discrete group
of isometries acting properly on X . If x 0 ∈ X has trivial stabilizer then D(x 0 ) is a locally finite
fundamental set for Γ that is star-shaped with respect to x 0 whose boundary consists of segments
of hyperplane bisectors.
Proof. Let x 0 ∈ X have trivial stabilizer. It follows from Lemma 3.8 that D(x 0 ) is star-shaped
with respect to x 0 , and Corollary 3.11 implies that its boundary is a union of segments of hy-
perplane bisectors. We now show that D(x 0 ) is a fundamental set. Corollary 3.11 implies that
D(x 0 )0 , so (i) holds. Now consider any x ∈ X . Then Γ x is discrete and
d(Γ x, x 0 ) = inf{d(γx, x 0 ) : γ ∈ Γ }
equipped with the metric induced by the hyperbolic length element and volume
q
d x 12 + d x 22 + d y 3 d x1 d x2 d y
ds = dV = .
y y3
Its boundary is the sphere at infinity P1 (C), analogous to the circle at infinity P1 (R) that forms
the boundary of the hyperbolic plane H2 := H.
The group SL2 (C) acts on P1 (C) via linear fractional transformations, and we extend this to
hyperbolic space by embedding C × R>0 in the Hamiltonians H = C + C j (where jz = x̄ j for
z ∈ C ⊆ H) via (x, y) 7→ x + y j and defining
az + b
a b
z=
c d cz + d
for z = x + y j ∈ H. One can then show that PSL2 (C) acts faithfully and transitively on H3 via
isometries, and that every orientation preserving isometry of H3 corresponds to the action of
an element of PSL2 (C) (and in general Isom(H3 ) ' PSL2 (C) o Z/2Z)), just as every orientation
preserving isometry of H2 corresponds to the action of an element of PSL2 (R) ' PGL+ 2 (R) (and
in general, Isom(H2 ) ' PGL2 (R))
Definition 3.15. A Kleinian group is a discrete subgroup of SL2 (C) or PSL2 (C).
As with Fuchisan groups (discrete subgroups of SL2 (R) or PSL2 (R)), Kleinian groups act
properly on H3 via isometries.
D(Γ ; z0 ) ∩ D(Γ0 ; u0 )
Proof. We first note that any Dirichlet domain D(x 0 ) for a Fuchsian or Kleinian group G (includ-
ing Γ and Γ0 ) is convex with geodesic boundary because it is a countable intersection of Leibniz
half-spaces H(x 0 , γx 0 ) over γ ∈ G; Leibniz half-spaces in H2 or H3 are convex with geodesic
boundary and G is countable (because G is discrete and X is locally compact second countable).
It follows that D(Γ ; z0 ) ∩ D(Γ0 ; u0 ) is a convex set with geodesic boundary.
Theorem 3.14 implies that D(Γ0 ; u0 ) is a locally finite fundamental set for Γ0 , and this implies
that D(γ; z0 ) ∩ D(Γ0 ; u0 ) is locally finite, since a subset of a locally finite set is locally finite. It
remains only to show that S := D(Γ ; z0 )∩ D(Γ0 ; u0 ) is a fundamental set; we have already shown
it is convex with geodesic boundary, so this will imply it is a fundamental domain.
The proof of Corollary 3.11 implies S 0 = S, since S is an intersection of Leibniz half-spaces,
so (i) holds. Now consider any z ∈ X , and choose γ ∈ Γ to minimize d(z0 , γz) and choose γ0 ∈ Γ0
so γ0 γz ∈ D(Γ0 ; u0 ) (note D(Γ0 ; u0 ) is a fundamental set). Then d(z0 , γ0 γz) = d(z0 γz), by the
minimality of γ, which implies that γ0 γz ∈ D(Γ ; z0 ) and therefore in S, thus Γ S = X and (ii)
holds. Finally, if γ ∈ Γ is nontrivial, either γ ∈ Γ0 and S 0 ∩ (γS)0 = ; because D(Γ0 ; u0 ) is a
fundamental set for Γ0 , or γ 6∈ Γ0 and S 0 ∩ (γS)0 6= ; implies d(z, z0 ) < d(γz, z0 ) ≤ d(z, z0 ), a
contradiction, as in the proof of Theorem 3.14; so (iii) holds.
Corollary 3.18. Let Γ by a Fuchsian group. If z0 ∈ H has trivial stabilizer then the Dirichlet domain
D(Γ ; z0 ) is a connected, convex, locally finite fundamental domain for Γ with geodesic boundary.
References
[1] Toshitsune Miyake, Modular forms, Springer, 2006.
This is summary of the material from §1.5, §1.7 of [1] and §37-38 of [2] presented in lecture.
• elliptic if disc(α) < 0, equivalently, α ∈ SL2 (R)z for exactly one z ∈ H and no z ∈ ∂ H;
• parabolic if disc(α) = 0, equivalently, α ∈ SL2 (R)z for no z ∈ H and exactly one z ∈ ∂ H;
• hyperbolic if disc(α) > 0, equivalently, α ∈ SL2 (R)z for no z ∈ H and exactly two z ∈ ∂ H.
For Γ ∈ SL2 (R) we let Z(Γ ) = Γ ∩ {±1} denote its trivial subgroup, so that Γ /Z(Γ ) is isomorphic
to the image of Γ in PSL2 (R).
Theorem 4.2. Let Γ ≤ SL2 (R) be a Fuchsian group and let z ∈ H = H ∪ ∂ H. The following hold:
(i) If z is an elliptic point of Γ then Γz is a finite cyclic group that strictly contains Z(Γ );
(ii) If z is a cusp of Γ then Γz /Z(Γ ) ' Z and ±Γz is SL2 (R)-conjugate to ± 10 1h with h > 0;
(iii) If z is a hyperbolic point of Γ then Γz ⊇ Γz,z 0 ) Z(Γ ) (for some z 0 6= z) then Γz,z 0 /Z(Γ ) ' Z
and ±Γz,z 0 is SL2 (R)-conjugate to ± 0u 10/u with u > 0.
Proof. (i) SL2 (R)z is conjugate to SL2 (R)i = SO2 (R) ' S 1 , a compact abelian group. Since Γ is
discrete, Γz = Γ ∩ SL2 (R)z is finite, hence cyclic, and it contains an elliptic γ 6∈ Z(Γ ).
(ii) WLOG z = ∞ and Γz = Γ∞ is upper triangular and contains some γ = 10 1h with
h 6= 0 (we can conjugate Γz to achieven this). Up to ±1 every element of Γz must be of this form:
a b −n 2n
if α = 0 1/a ∈ Γ x with |a| < 1 then α γα = 10 a 1 h ∈ Γz for all n ≥ 1, but Γz is discrete. Thus
Γ x /Z(Γ ) is isomorphic to a nontrivial discrete subgroup of R, which implies Γ x /Z(Γ ) ' Z.
(iii) WLOG z = 0 and Γz Γ0 ⊇ Γ0,∞ ) Z(Γ ), so that Γz,z 0 = Γ0,∞ is diagonal, consisting
of elements of the form 0u 10/u with u ∈ R× . It follows that Γ x /Z(Γ ) is a nontrivial discrete
Corollary 4.3. For any Fuchsian group Γ we may partition H = H ∪ ∂ H into elliptic points, cusps,
hyperbolic points, and ordinary points. In this partition, H consists of elliptic and ordinary points,
while ∂ H consists of parabolic, hyperbolic and ordinary points.
Proof. The theorem implies that a cusp cannot also be a hyperbolic point.
Corollary 4.4. Let Γ be a Fuchsian group. Every finite index Γ 0 ≤ Γ has the same set of cusps as Γ .
Definition 4.5. For a Fuchsian group Γ and z ∈ H = H ∪ ∂ H, the order of z with respect to Γ is
It follows from the theorem above that ez = 1 if z is ordinary, 1 < ez < ∞ if z is elliptic, and
ez = ∞ if z is a cusp or hyperbolic point.
Proof. The group ±Γ is a finite cyclic group whose unique element of order 2 is −1.
Definition 4.7. Let Γ be a Fuchsian group, let x ∈ ∂ H be acusp of Γ , and choose σ ∈ SL2 (R) so
σx = ∞. Then ±σΓ x σ−1 = ± 10 1h with h > 0. If 10 1h ∈ σΓ x σ−1 then x is a regular cusp,
and otherwise x is irregular. The latter can arise only when −1 6∈ Γ , and the regularity of x does
not depend on the choice of σ.
with elliptic points i = ζ4 of order 2, with Γi = 〈S〉, and ω = ζ3 of order 3 with Γω = 〈S T 〉. The
Dirichlet domain D(2i) also contains the elliptic point T ω on its boundary.
The closure of D(2i) in H∗Γ contains a single cusp ∞ with Γ∞ = 〈±T 〉 and PΓ = Γ ∞ = P1 (Q).
Then Y (1) := YΓ := Γ \H ' C and X (1) := X Γ := Γ \H∗Γ ' P1 (C).
Every subgroup of Γ = SL2 (Z) is also a Fuchsian group, but we will only be interested in those
that have finite index. An important example of such groups is given by congruence subgroups,
which are subgroups of SL2 (Z) that contain the group
Γ (N ) := γ ∈ SL2 (Z) : γ ≡ 1 mod N .
We note that not all finite index subgroups of SL2 (Z) are congruence subgroups.
i 2 = a, j 2 = b, i j = − ji.
A quaternion algebra is either a division ring (nonsplit) or isomorphic to M2 (k) (split). The
standard involution on B is the map α 7→ ᾱ that sends α = t + x i + y j −zi j to ᾱ = t − x i − y j −zi j.
The (reduced) norm of α ∈ B is N (α) = αᾱ = t 2 = a x 2 − b y 2 + a bz 2 ∈ k, which is multiplicative
and nonzero precisely when α is a unit in B. The set of norm-1 elements B 1 form a subgroup of
the unit group B × .
a,b
Let B = Q be a quaternion algebra over Q. We say that a place p of Q is ramified if B if
B ⊗Q Q p is a division ring, and unramified if B ⊗Q Q p is split. We call B definite if it is ramified at
−1,−1
the archimedean place of Q, equivalently, B∞ := B⊗Q R ' H = R and indefinite otherwise.
a,b
The quaternion algebra Q is definite if and only if a, b < 0.
The set of ramified places of B is finite of even cardinality; conversely, for every finite set
Σ of places of Q of even cardinality, there is a quaternion algebra with ramified exactly at the
places in Σ, which is unique up to isomorphism (this is true more generally for quaternion
algebras over any global field, see [2, Theorem 14.6.1]). The discriminant of B is the product
of the primes that ramify in B, which we negate when ∞ is ramified; the discriminant uniquely
determines the isomorphism class of a quaternion algebra over Q (and in fact |B| does).
If B is an indefinite quaternion algebra over Q,
a,b
equivalently disc(B) > 0 we can embed B in
M2 (R) via B ,→ B∞ ' M2 (R) by writing B = Q with a > 0 and defining
p
ι∞ : B ,→ M2 (Q( a)) ⊆ M2 (R)
p p
t+x a y +z a
t + x i + y j + zi j 7→ p p .
b( y − z a) t − x a
1+i + j +ij
O := Z ⊕ Zi ⊕ Z j ⊕ Zk, k= .
2
The Fuchsian group Γ := ι∞ (O1 ) has no cusps, and X Γ = YΓ = Γ \H = X (Γ ) is a good compact
complex 1-orbifold.
Lemma 5.5. Let Γ be a discrete subgroup of a second countable locally compact group G. Then
G/Γ admits a Haar measure if and only if ∆G (Γ ) = {1}.
Proof. Let π: G → G/Γ be the quotient map, let F be a measurable strict fundamental domain
for G/Γ as in Lemma 5.4, and let ν be a left Haar measure on G. For S ∈ Σ(G/Γ ) we define
µ(gS) = ν(F ∩ π−1 (gS)) = ν(F ∩ gπ−1 (S)) = ν(g −1 F ∩ π−1 (S)) = ν(F 0 ∩ T ),
where F 0 := g −1 F and T := π−1 (S). Now F 0 is also a measurable strict fundamental domain for
G/Γ , thus G = F 0 Γ = F Γ , and T Γ = T . If ∆G (Γ ) = {1} then ν is Γ -right invariant and
X X
µ(gS) = ν(G ∩ F 0 ∩ T ) = ν(F γ ∩ F 0 ∩ T ) = ν(F ∩ F 0 γ−1 ∩ T ) = ν(F ∩ G ∩ T ) = µ(S),
γ∈Γ γ∈Γ
implying that µ is a left G-invariant. The function µ is a nonzero Radon measure on G/Γ , since ν
restricts to a nonzero Radon measure on F (because F is measurable and G is second countable,
hence σ-compact). It follows that µ is a left Haar measure on G/Γ .
Conversely, given a left Haar measure µ on G/Γ , for any S ∈ Σ(G) we may define
X
ν(S) := µ(π(F γ ∩ S)).
γ∈Γ
The Haar measure µ on G/Γ lifts to a nonzero Radon measure on each of the measurable strict
fundamental domains F γ, which form a countable partition of G. It follows that ν is a nonzero
Radon measure on G. The G-invariance of µ implies that ν is a Haar measure, and we note
that ν is right Γ -invariant, since π is; it follows that ∆G (Γ ) = {1}.
Corollary 5.6. Let Γ be a lattice in a second countable locally compact group G. Then G is uni-
modular.
Proof. Γ ∈ ker ∆G , so ∆G factors through the map G/Γ → R× >0 induced by µ, and ∆G (G) must
be bounded, since µ is finite. But the only bounded subgroup of R×
>0 is the trivial group.
Lemma 5.7. Let G be a second countable locally compact group with a discrete cocompact sub-
group Γ . Then Γ is a lattice.
In other words, for discrete subgroups of second countable locally compact groups, cocom-
pact implies cofinite. The converse does not hold: SL2 (Z) ⊆ SL2 (R) is discrete and SL2 (R)
is unimodular, so SL2 (Z) is a lattice in SL2 (R), but it is not cocompact because the diagonal
subgroup of SL2 (R) has unbounded image in SL2 (R)/SL2 (Z). Cocompact lattices are uniform
Arithmetic groups are discrete (since Z is discrete in R) and cofinite, which implies they
are lattices (which may be uniform or non-uniform). Not all lattices are arithmetic (not even
all uniform lattices); triangle groups in SL2 (R) provide many examples (see the next section).
But in a semisimple Lie group G (no nontrivial connected normal abelian subgroup) of real
rank at least 2, every irreducible lattice Γ (infinite lattices for which Γ N is dense in G for every
noncompact closed N Å G 0 ) is arithmetic, by a theorem of Margulis [1].
The arithmetic lattices in SL2 (R) arise from orders O in quaternion algebras B over a totally
real field F in which exactly one of the real places splits (this includes both uniform and non-
uniform lattices). As in the previous lecture, we embed F in R via the unique real place that
splits and use this to embed B in M2 (R), then take the image of the norm-1 elements of O under
this embedding to get a discrete cofinite subgroup Γ (O1 ) ≤ SL2 (R).
When B is a division algebra, the arithmetic lattice Γ (O1 ) is uniform (cocompact) and has
no cusps, otherwise B is a matrix algebra and Γ (O1 ) is non-uniform and has at least one cusp.
Let Γ be a lattice in SL2 (R), equivalently, a finitely generated Fuchsian group of the first kind
with Z(Γ ) := Γ ∩{±1}. Then Γ /Z(Γ ) is generated by hyperbolic elements {α1 , . . . , α g , β1 , . . . , β g },
elliptic elements {γ1 , . . . , γs }, and parabolic elements {γs+1 , . . . , γs+t }, which satisfy
α1 β1 α−1 −1 −1 −1
1 β1 · · · α g β g α g β g γ1 · · · γs+t = 1
e
γi i = 1 (1 ≤ i ≤ s),
Definition 5.10. The signature of Γ is the tuple (g; e1 , . . . , es+t ), with es+1 = · · · = es+t . It
satisfies the inequality
s+t
1
X
2g − 2 + 1− > 0.
j=1
ei
1 1 1
+ + < 1.
e1 e2 e3
Proposition 5.11. Let (e1 , e2 , e3 ) and s be as above. Up to conjugacy in SL2 (R), the following
hold:
• If s ≥ 1 and ei is even for some i ≤ s then there is a unique triangle group of type (e1 , e2 , e3 )
and it contains −1.
• If g ≥ 2 and ei is odd for all i ≤ s then there are exactly two triangle groups of type (e1 , e2 , e3 ).
One contains −1 and the other of does not.
• If s = 0 or s = 1 and e1 is odd then there are exactly three triangle groups of type (e1 , e2 , e3 ).
One contains −1 and the other two do not and have index 2 in the one that does.
Theorem 5.13. Let Γ be an arithmetic triangle group. If t > 0 there are 9 possible types for Γ :
(2, 3, ∞), (2, 4, ∞), (2, 6, ∞), (2, ∞, ∞), (3, 3, ∞), (3, ∞, ∞), (4, 4, ∞), (6, 6, ∞), (∞, ∞, ∞)
• (2, 3, n) with n ∈ {7, . . . , 14, 16, 18, 24, 30} or (2, 4, n) with n ∈ {5, . . . , 8, 10, 12, 18};
• (2, 5, n) with n ∈ {5, 6, 8, 10, 20, 30} or (2, 6, n) with n ∈ {6, 8, 12};
• one of (2, 7, 7), (2, 7, 14), (2, 8, 8), (2, 8, 16), (2, 9, 18), (2, 10, 10), (2, 12, 12), (2, 12, 24),
(2, 15, 30), (2, 18, 18);
• (3, 3, n) with n ∈ {4, . . . , 9, 12, 15} or (3, 4, n) with n ∈ {4, 6, 12};
• one of (3, 5, 5), (3, 6, 6), (3, 6, 18), (3, 8, 8), (3, 8, 24), (3, 10, 30), (3, 12, 12);
• (4, 4, n) with n ∈ {4, 5, 6, 9} or one of (4, 5, 5), (4, 6, 6), (4, 8, 8), (4, 16, 16);
• one of (5, 5, 5), (5, 5, 10), (5, 5, 15), (5, 10, 10), (6, 6, 6), (6, 12, 12), (6, 24, 24), (7, 7, 7),
(8, 8, 8), (9, 9, 9), (9, 18, 18), (12, 12, 12), (15, 15, 15).
[3] Dave White Morris, Introduction to arithmetic groups, Deductive Press, 2015.
[4] Kisao Takeuchi, Arithmetic triangle groups, J. Math. Soc. Japan 29 (1977), 91-106.
Proof. Check.
Definition 6.3. Let k ∈ Z≥0 and γ ∈ GL+ 2 (R). The weight- k slash operator for γ acts on functions
f : H → C via
( f |k γ)(z) := (det γ)k/2 j(γ, z)−k f (γz).
We have ( f + g)|k γ = f |k γ + g|k γ and (c f )|k γ = c( f |k γ) for all γ ∈ GL+
2 (R), f , g : H → C, and
a ∈ C, thus the map f 7→ f |k γ is a linear operator on the vector space of functions H → C.
Moreover, for every γ1 , γ2 ∈ GL+ 2 (R) we have
f |k (γ1 γ2 ) = ( f |k γ1 )|k γ2 .
In other words, for each k ∈ Z the weight-k slash operator defines a linear right group action of
GL+2 (R) on the vector space of functions f : H → C.
We can omit the factor (det γ)k/2 if we only care about γ ∈ SL2 (R). For scalars a ∈ R× we have
f |k a = sgn(a)k f (so scalars, including −1, act trivially only when k is even).
Definition 6.4. Let Γ be a lattice in SL2 (R) (equivalently, a finitely generated Fuchsian group of
the first kind), and let k ∈ Z. We say that a meromorphic function f : H → C is an automorphic
function of weight k with respect to Γ if for every γ ∈ Γ we have
f |k γ = f .
Recall that a graded ring is a ring R whose additive group is an internal direct sum
M
R= Ri
i=I
indexed by a monoid I (typically Z or Z≥0 ) such that R i R j ⊆ R i+ j for all i, j ∈ I. Note that this
means each R i is an R0 -module.
alizes the C-vector space structure of Ωk (Γ ), which can be viewed as multiplication by constant
functions C ⊆ Ω0 (Γ )).
Suppose Γ has a cusp x with stabilizer Γ x , and pick σ ∈ SL2 (R) so that σx = ∞. Then
±σΓ x σ−1 = ± 10 1h
for all z ∈ H; this also holds when x is irregular and k is even, since f |k σ−1 |k − 1 = f |k σ−1 for
even k. There is then a function g : D0 → C (where D0 := {z ∈ C× : |z| < 1}) such that
Sk (Γ ) ⊆ Mk (Γ ) ⊆ Ak (Γ ) ⊆ Ωk (Γ ).
The ratio of automorphic forms of weight k and l is an automorphic form of weight k − l, thus
A(Γ ) is a field, as is its subspace A0 (Γ ), and all the Ak (Γ ) are A0 (Γ )-vector spaces. The field A0 (Γ )
is isomorphic to the field of meromorphic functions on the Riemann surface X Γ := Γ \H∗Γ .
Corollary 6.8. Let n≥1 an eπinz/h be the Fourier expansion of a cusp form f ∈ Sk (Γ ) at a cusp
P
of Γ . Then an = O(nk/2 ).
Remark 6.9. The bound in Corollary 6.8 can be improved to an = O(n(k−1)/2 ) when Γ is a
congruence subgroup [1, Thm. 4.5.17].
References
[1] Toshitsune Miyake, Modular forms, Springer, 2006.
[2] Fred Diamond and Jerry Shurman, A first course in modular forms, Springer, 2005.
[3] John Voight, Quaternion algebras, Springer, 2021.
ωi = (ω j ◦ ψi j )(dψi /dψ j )n
in which Ω (X ) ' C(X ) is the field of meromorphic functions on X . In particular, each C-vector
0
From now on we shall assume that X is compact. This implies that X is a smooth projective
curve over C with function field C(X ) ' Ω0 (X ) Recall that the categories of Riemann surfaces
(with holomorphic maps), smooth projective curves over C (with dominant morphisms), and
function fields of transcendence degree 1 over C (with field homomorphisms) are equivalent
categories. The functor from Riemann surfaces to curves is covariant, while the functor from
curves to function fields is contravariant.
dimC(X ) Ωn (X ) = 1.
( )
X
Div(X ) := n P P : n P ∈ Z with n P = 0 for all but finitely many P .
P∈X (C)
deg: Div(X ) → Z
D 7→ deg(D)
where ord P ( f ) is the order of vanishing of f at P. Divisors arising in this way are called principal.
For any f , g ∈ C(X )× we have div( f g) = div( f ) + div(g) and deg(div( f )) = 0. The set of
principal divisors form a subgroup Prin(X ) ≤ Div0 (X ) ≤ Div(X ), where Div0 (X ) := ker(deg).
We now define the divisor class group
The transition maps ψi j are holomorphic and nonzero, so ord P (ωi ) = ord P (ω j ) for Ui ∩ U j 6= ;.
This implies that ord P (ω) does not depend on the choice of Ui 3 P. We now define the divisor
X
div(ω) = ord P (ω)P,
P∈X (C)
which for ω ∈ Ω0 (X ) coincides with our previous definition. For any ω ∈ Ωm (X ) and ω0 ∈ Ωn (X )
with ω, ω0 6= 0 we have
div(ωω0 ) = div(ω) + div(ω0 ).
Recall that dimC(X ) Ωn = 1.
PThis implies that {div(ω) : 0 6= ω ∈ Ω (X )} is a divisor class.
n
which is a finite-dimensional C-vector space, with L(0) = C and L(D) = {0} for deg(D) < 0.
We denote its dimension by
`(D) := dimC L(D).
Theorem 7.3 (Riemann-Roch). Let X be a compact Riemann surface of genus g and let ω ∈ Ω1 (X )
be a nonzero meromorphic differential. Then for every D ∈ Div(X ) we have
Corollary 7.6. Let X be a compact Riemann surface of genus g. Then dimC Ω10 (X ) = g.
( f |k γ)(z) = (det γ)k/2 j(γ, z)−k f (γz) = j(γ, z)−k f (γz) = (cz + d)−k f az+b
cz+d ,
Theorem 7.7. Let A(Γ )even = n∈Z A2n (Γ ). Then A(Γ )even is isomorphic to Ω(X Γ ) as a graded
L
Let f ∈ A2n (Γ ) be nonzero and choose P ∈ X Γ (Z), z P ∈ π−1 (P), and z P ∈ U P ⊆ H∗Γ as above.
For z P ∈ H choose ρ ∈ SL2 (C) so that ρH = D and ρz P = 0. Choose ψ P : VP −→ ∼
WP ⊆ D so that
ψ P ◦ π(z) = (ρz) for all z ∈ U P , where e is the ramification index at P. Then
e
Now g P ◦ ρ −1 (w) = we , so
We also have
ordw ( f (ρ −1 w)(d(ρ −1 w)/d w)n ) = ordz0 ( f ),
1
ord P ( f ) := ordz0 ( f ),
e
and we then have
ord P (ω f ) = ord P ( f ) − n(1 − 1/e).
which shows that the definition of ord P ( f ) does not depend on the choice of z0 or ρ. When
n = 1 and z0 ∈ H we have
For z0 a cusp of Γ we choose σ ∈ SL2 (R) so σz0 = ∞ and define h > 0 as above so that
X
( f |2n σ−1 )(σz) = am (g P (z))m ,
m≥m0
References
[1] Toshitsune Miyake, Modular forms, Springer, 2006.
[2] Fred Diamond and Jerry Shurman, A first course in modular forms, Springer, 2005.
Div(X )Q = Div(X ) ⊗Z Q
ord ( f ).
otherwise
zP
−1
Here ord∞ ( f |2n σ ) is the index m of the first nonzero coefficient am in the Fourier expansion
−1
( f |2n σ )(z) = with ±σΓzP σ−1 = ± 10 1h . We also recall that e = #ΓzP /Z(Γ ) is
P 2πimz/h
e
equal to the ramification index of the map πΓ at e (the cardinality of π−1 Γ (Q) for every Q 6= P in
any sufficiently small neighborhood of P).
The lattice Γ has only finitely many inequivalent elliptic points and cusps (it is cofinite, hence
geometrically finite), so we may define
X
div( f ) := ord P ( f )P ∈ Div(X Γ )Q .
P∈X Γ (C)
Note that unlike div( f ), the divisor div(ω f ) ∈ Div(X Γ ) has integer coefficients (it is the divisor
of a differential form), and the Riemann–Roch theorem implies
where g := g(X Γ ) is the genus of X Γ , as is true for any meromorphic differential of degree n on
a compact Riemann surface of genus g.
The following theorem follows immediately from the formulas above.
where e P is the ramification index of P and g = g(X Γ ), with 1/e P = 0 when P is a cusp.
Now f1 − f2 is holomorphic, so ordQ (ω) ≥ −k/2 for every elliptic point or cusp Q 6= P, and
ordR (ω) ≥ 0 for every ordinary point R. It follows that
Remark 8.3. This corollary has some remarkable implications. If k(g − 1) + k(r + s)/2 < 0
the hypothesis always holds, implying that every f ∈ Mk (Γ ) is zero. It also implies that every
f ∈ Mk (Γ ) is uniquely determined by a finite prefix of its Fourier expansion at any cusp. This
yields an effective algorithm for testing whether two modular forms are equal or not, and if we
fix a basis for the finite dimensional C-vector space Mk (Γ ) it allows us to explicitly compute the
representation of any f ∈ Mk (Γ ) as a linear combination of basis elements.
and X
div( f ) := ord p ( f )P
P∈X Γ (C)
Theorem 8.4. Let k be an odd integer and assume −1 6∈ Γ . For each nonzero f ∈ Ak (Γ ) we have
X
div( f ) = 21 div(ω f 2 ) + 2k 1 − e1P P
XP
k 1
deg div( f ) = k(g − 1) + 2 1− eP
P
1
2 + Z if P is an irregular cusp
ord P ( f ) ∈ e1 + Z if P is an elliptic point
P
0 otherwise
¨
1 + 2Z if P is a regular cusp
ord P (ω f 2 ) ∈
2Z otherwise
Lemma 8.5. Let m be the lcm of the orders of all elliptic P ∈ X Γ (C). There is a nonzero f ∈ A2m (Γ )
which has no zeros or poles at any cusp or any elliptic point in X Γ (C).
Proof. Let f ∈ A2m (Γ ) be nonzero. Then div( f ) = div(ω f ) + 2k P 1 − e1P P ∈ Div(X Γ ), since
P
k/2 = m ∈ Z is divisible by every e P ∈ Z. Let S be the (finite) set of elliptic points and cusps in
X Γ (C), let g := g(X Γ ), and choose n ∈ Z so that
− deg(div(g)) − 1 + n > 2g − 2.
Fix Q 6∈ S and consider the divisors DQ := −div(g) + nQ and DP := −div(g) − P + nQ for each
P ∈ S. We have DQ ≥ DP , so L(DQ ) ) L(DP ), and deg(DQ ) > deg(DP ) > 2g − 2, so
contains some h ∈ C(X Γ ) ' A0 (Γ ) that does not lie in any of the L(DP ), with ord P ( f /h) = 0 for
P ∈ S, since ord P (h) = ord P ( f ) for P ∈ S. Then f /h ∈ A2m (Γ ) has not zeros or poles on any
cusp or elliptic point in X Γ (C).
Recall that v(X Γ ) is defined as the measure of any (measurable) fundamental domain for Γ
under the hyperbolic measure dv(z) := d xyd2 x on H, where z = x + i y, which is induced by the
Haar measure of SL2 (R).
where e P is the ramification index of P, with 1/e P = 0 when P is a cusp, and g is the genus of X Γ .
Proof sketch. Pick m and nonzero f ∈ A2m (Γ ) via Lemma 8.5, and a measurable fundamental
domain F for Γ such that f has no zeros or poles on ∂ F . Let v1 , . . . , vr be the vertices of F that
are cusps, and pick curves C1 , . . . , C r with Ci in a neighborhood of vi such that πΓ (Ci ) is a circle
around π(vi ) oriented counterclockwise. Let M be the compact set bounded by F and the curves
C1 , . . . , C r . Then Z
dx ∧ d y
v(X Γ ) = v(F ) = lim ,
Ci →x i
M y2
where Ci → x i means we are shrinking the neighborhoods of x i in which the Ci lie so that the
measure of the part of F outside M tends to zero. Applying Stokes Theorem yields
Z Z Z Z
dx ∧ d y
dz dz i i
= = + d(log f ) − d(log f ),
M y2 ∂M y ∂M y m m ∂M
by Cauchy’s residue formula, since we can assume f has no zeros or poles on ∂ M and its zeros
and poles in F all lie inside M . Applying Theorem 8.1 yields
X X
1 1 1
1 − e1p .
2π v(X Γ ) = m 2m(g − 1) + m 1 − eP = 2g − 2 +
P P∈C(X Γ )
Lemma 8.7. Let Γ ∈ SL2 (R) be a lattice with regular cusps P1 , . . . , Ps and irregular cusps Ps+1 , . . . , Pt .
Let f ∈ Ak (Γ ) be nonzero. We have the following isomorphisms of C-vector-spaces
Mk = { f h : h ∈ A0 (Γ ) : h = 0 or div( f h) ≥ 0}
= {h ∈ C(X Γ ) : h = 0 or div(h) + div( f ) ≥ 0}
= L(b(div( f )c).
If k is even then
f h ∈ Sk (Γ ) ⇔ ord Pi ( f h) ≥ 1 for 1 ≤ i ≤ t
⇔ ord Pi ( f ) + ord Pi (h) − 1 ≥ 0 for 1 ≤ i ≤ t
⇔ h ∈ L(bdiv( f ) − (P1 + · · · + Pt )c)
which implies Sk (Γ ) ' L(bdiv( f ) − (P1 + · · · + Pt )c). For k odd a similar argument works, except
we use the bound ord Pi ( f h) ≥ 1/2 for s < i ≤ t.
and we have d = v(X Γ )/(2π) > 0, by Theorem 8.6. We now consider k ∈ 2Z as follows:
• For k > 2. Fix any nonzero f ∈ Ak (Γ ) and let D = div( f ) − (P1 + · · · + Pt ), where P1 , . . . , Pt
are the cusps in X Γ (C). Then deg(D) = deg(div( f )) − t and
r
X
k 1
k
k−2
deg(bDc) = k(g − 1) + 2 1− ei + 2 −1 t ≥ 2 d + (2g − 2) > 2g − 2.
i=1
Lemma 8.7 and Riemann–Roch imply dim Sk (Γ ) = `(bDc) = deg(bDc) − g + 1, and we also
have dim Mk (Γ ) = dim Sk (Γ ) + `(bdiv( f )c) − `(bDc) = dim Sk (Γ ) + t.
Theorem 8.8. Let Γ ∈ SL2 (R) be a lattice, let e1 , . . . , e r be the orders of the elliptic points in X Γ (C),
let t be the number of cusps in X Γ (C), let g be the genus of X Γ and let k ∈ 2Z. Then
0 k < 0 or k = 0, t > 0
k=t =0
1
dim Sk (Γ ) =
g k=2
Pr k 1
k
(k − 1)(g − 1) + i=1 2 1 − ei + 2 − 1 t k > 2
and
0 k<0
k=0
1
dim Mk (Γ ) = g k = 2, t = 0
g −1+ t k = 2, t > 0
dim Sk (Γ ) + t k>2
Theorem 8.9. Let Γ ∈ SL2 (R) be a lattice with −1 6∈ Γ , let e1 , . . . , e r be the orders of the elliptic
points in X Γ (C), let u and v be the number of regular and irregular cusps in X Γ (C), let g be the
genus of X Γ and let k ∈ 1 + 2Z. Then
¨
0 k<0
d imSk (Γ = Pr
k 1
k−2 k−1
(k − 1)(g − 1) + i=1 2 1− ei + 2 u+ 2 v k≥3
and
0
k<0
dim Mk (Γ ) = dim S1 (Γ ) + 2u k=1
dim S (Γ ) + u
k≥3
k 2
Remark 8.10. No general formula is known for dim S1 (Γ ), even for congruence subgroups.
References
[1] Toshitsune Miyake, Modular forms, Springer, 2006.
If the sum defining F (z) converges, then by construction we have F |k γ = χ(γ)F for all γ ∈ Γ
and F ∈ Ωk (Γ , χ) is an automorphic form of weight k for Γ with character χ.
Theorem 9.1. Let φ, χ, Λ, Γ , zi , F be as above. Then F converges absolutely and uniformly on any
compact subset of H − {γzi |γ ∈ Γ , 1 ≤ i ≤ m} and F ∈ Ωk (Γ , χ).
Proof. See Theorem 2.6.6 in [1].
Now let x ∈ P1 (R) be a cusp of Γ , choose σ ∈ SL2 (R) so σx = ∞, and suppose that phi
also satisfies
(iv) if x is not a cusp of Λ then |(φ|k σ−1 )(z)| ≤ M |z|−1−ε on Im(z) > `, for some M , `, ε > 0.
(v) if x is a cusp of Λ then |(φ|k σ−1 )(z)| ≤ M |z|−ε on Im(z) > `, for some M , ` > 0 and ε ≥ 0.
Theorem 9.2. Let Γ , Λ, φ, χ, F be as above, let x 0 be a cusp of Γ such that (iv) and (v) hold for
x ∈ Γ x 0 . Then F is holomorphic at x 0 , and if ε > 0 in (v) then F vanishes at x 0 .
Proof. This is Theorem 2.6.7 in [1].
We now assume that Γ has at least one cusp x, with σx = ∞ and that
a condition that does not depend on the choice of σ and which implies that if χ is trivial, k is
odd, and −1 6∈ Γ then x is a regular cusp. Recall that
±σΓ x σ−1 = ± 10 1h
It is easy to check that for all k ≥ 3 the functions φm satisfy conditions (i) through (v) for
Γ , χ, Λ = Γ x , x, and we have the following theorem.
Definition 9.4. The Poincaré series Fk (z; φ0 , χ, Γ x , Γ ) in Theorem 9.3 is an Eisenstein series.
Theorem 9.5. Assume k ≥ 3 and let Γ , χ, φm , x be as above, satisfying conditions (i)–(v) and (1).
Let σx = ∞ and for m ∈ Z≥0 define
(m)
gk := Fk (z; φm , χ, Γ x , Γ ).
at x, then
¨
if m = 0
Z
¬
(m)
¶
(m) 0
f , gk = f (z)g k (z) Im(z)k d v(z) =
Γ \H am (4πm) 1−k k
h (k − 2)! if m > 0
proof sketch. Without loss of generality we may assume x = ∞ and σ = 1. As shown in the
proof of [1, Theorem 2.6.10], the integral in the Petersson inner product converges uniformly,
allowing us to swap the order of integration and the sum defining Fk . With z = x + i y we have
Z Z
(m)
X
f (z)g k (z) Im(z)d v(z) = χ(γ) f (z)e−2pim y z̄/h j(γ, z̄)−k Im(z)k d v(z)
Γ \H γ∈Γ x \Γ Γ \H
Z ∞Z h
= f (z)e−2πimz̄/h y k−2 d x d y
0 0
∞
X Z ∞ Z h
−2π(m+n) y/h
= an e y k−2
dy e2πi(n−m)x/h d x
n=1 0 0
¨
0 if m = 0
=
am (4πm) 1−k k
h (k − 2)! if m > 0
R1
where we have used the fact that 0
e2πinx d x vanishes for all nonzero integers n.
(m)
Corollary 9.6. Under the hypotheses of Thoerem 9.5, the set {g k (z) : m ≥ 1} generates Sk (Γ , χ).
(m)
¦ ©
V ⊥ := f : Sk (Γ , χ) : 〈 f , g k 〉 = 0 for m ≥ 1
Corollary 9.7. Let Γ ≤ SL2 (R) be a lattice and let x 1 , . . . , x r be the Γ -inequivalent cusps that satisfy
(1). Assume k ≥ 3 and let Γ , χ, φm be as above, satisfying conditions (i)–(v). Then the functions
g i (z) := Fk (z; φ0 , χ, Γ x i , Γ ) ∈ Sk (Γ , χ) (1 ≤ i ≤ r)
Proof. Consider f ∈ Mk (Γ , χ). If x is a cusp of Γ that does not satisfy (1) then f vanishes at x.
(i)
Otherwise, let a0 be the constant term in the Fourier expansion of f at x i . Theorem 9.3 implies
Pr (i)
that the function f (z)− i=1 a0 g i (z) lies in Sk (Γ , χ), therefore Mk (Γ , χ) = Sk (Γ , χ) V , where
L
V is the subspace of Mk (Γ , χ) spanned by the g i , and Theorem 9.5 implies that V is contained
in Ek (Γ , χ) = Sk (Γ , χ)⊥ , so Ek (Γ , χ) = V is spanned by the g i , and Theorem 9.3 implies that
the g i are linearly independent and thus form a basis.
References
[1] Toshitsune Miyake, Modular forms, Springer, 2006.
These notes summarize the material in §2.7–2.8 of [1] covered in lecture, along with some
relevant background on permutation modules.
R[H\G/H] ,→ R[H\G]
X
H g K 7→ H g0
H g 0 ∈{H gk:k∈K}
Elements in the image of this homomorphism are R-linear combination of sums of K-orbits in
[H\G], hence invariant under the action of K. Let R[H\G]K denote the R-submodule of R[H\G]
whose elements are fixed by K. Every such element must be an R-linear sum of K-orbits in
R[H\G], corresponding to the image of an R-linear sum of double cosets in [H\G/K] that lies
in the image of the map above. It follows that we then have an R-module isomorphism
and if ever K-coset in [G/K] has a finite H-orbit we similarly obtain an R-module isomorphism
We now that that everything above applies more generally to any semigroup ∆ ≤ G. The
semigroup ring R[∆] is defined in the same way as the group ring R[G] (and it is indeed a ring,
even though ∆ need not be a group). For H, K ⊆ ∆ we have the right R[∆]-module R[H\∆]
and the R[∆]-bimodule R[H\∆/K], and if every K orbit in [H\∆] and every H-orbit in [∆/K]
is finite, we have R-module isomorphisms
which implies that commensurability is transitive; it is clearly reflexive and symmetric, hence
an equivalence relation.
For g ∈ G, let H g denote g −1 H g ≤ G. For any H, K ≤ G and g ∈ G we have H ∼ K if
and only if H g ∼ K g . It follows that if H is commensurable with H g1 and H g2 then it is also
commensurable with H g1 g2 and therefore the commensurator
e := {g ∈ G : H ∼ H g }
H
−1 −1
with hi ∈ H ranging over H/(H ∩ K g ) representatives, of which there are m := [H : H ∩ K g ],
and k j ranging over (K ∩ H g )\K representatives, of which there are n = [K : K ∩ H g ].
This definition does not depend on the choice of the choice of the αi ∈ Γ αΓ 0 , since if Γ α0i = Γ αi
0 γ
for some then α0i = γi αi for some γi ∈ Γ and mαi = mγi αi = (mi )αi = mαi for all m ∈ M Γ . We
also note that m|Γ ααΓ ∈ M is Γ -invariant, since for any γ ∈ Γ , if we let α0i = αi γ0 then
0 0 0 0
0
X 0
X 0
(m|Γ αΓ 0 )γ = mα i γ = mαi = m|Γ αΓ 0 ,
i i
0 0
because Γ αΓ = i Γ αi
`
is just another right coset decomposition. This defines an R-module
0
homomorphism M → M Γ . Extend this R-linearly to an action of R[Γ \∆/Γ 0 ] yields an R-module
Γ
0
homomorphism R[Γ \∆/Γ 0 ] → HomR (M Γ , M Γ ).
with the summation over distinct double cosets Γ1 γΓ3 (not over all γ ∈ ∆). Note that cγ can
be nonzero only when Γ1 γΓ3 is equal to one of the finitely many double cosets Γ1 αi β j Γ3 , so the
product of Γ1 αΓ2 and Γ2 βΓ3 is an element of R[Γ1 \∆/Γ3 ], and this element does not depend on
the choice of the αi , β j , γ ∈ ∆. `
The decomposition Γ1 αΓ2 = i Γ1 αi defines an R-module isomorphism
as in §10.1. We can thus view R[Γ1 \∆/Γ2 ] as a Γ2 -invariant R-module on which R[Γ2 \∆/Γ3 ] acts,
and this action is given by the multiplication induced by (1), which extends to multiplication of
elements of R[Γ1 \∆/Γ2 ] by elements of R[Γ2 \∆/Γ3 ] via
! !
X X X
aα Γ1 αΓ2 bβ Γ2 βΓ3 = aα bβ (Γ1 αΓ2 · Γ2 βΓ3 ) ∈ R[Γ1 \∆/Γ3 ].
Γ1 αΓ2 Γ2 βΓ3 Γ1 αΓ2 ,Γ2 βΓ3
It is easy to verify that for all m ∈ M Γ1 , A1 ∈ R[Γ1 \∆/Γ2 ], A2 ∈ R[Γ2 \∆/Γ3 ] we have
and that for all A1 ∈ R[Γ1 \∆/Γ2 ], A2 ∈ R[Γ2 \∆/Γ3 ], A3 ∈ R[Γ3 \∆/Γ4 ] we have
If we now consider the case where Γ1 = Γ2 , the multiplication defined above makes R[Γ \∆/Γ ]
into a ring, in fact an R-algebra, with identity Γ = Γ 1∆ Γ .
• For all α ∈ ∆ the double coset Γ \Γ αΓ has a common` and right Γ -coset representa-
set of left `
r r
tives: there exist α1 , . . . , α r ∈ Γ αΓ such that Γ αΓ = i Γ αi = i=1 αi Γ .
• The Hecke algebra R[Γ \∆/Γ ] is commutative.
Proof. The existence of the involution ι implies that `decompositions of Γ αΓ into left or right
r
cosets all have the same cardinality. Indeed, if Γ αΓ = i=1 Γ αi then
r
a r
a r
a
Γ αΓ = ι(Γ αΓ ) = ι(Γ αi ) = ι(αi )ι(Γ ) = ι(αi )Γ .
i=1 i=1 i=1
where
cγ = #{(i, j) : Γ αi β j = Γ γ}
= #{(i, j) : Γ αi β j Γ = Γ γΓ }/#(Γ \Γ γΓ )
= #{(i, j) : Γ ι(β j )ι(αi )Γ = Γ ι(γ)Γ }/#(Γ \Γ ι(γ)Γ )
= #{(i, j) : Γ ι(β j )ι(αi ) = Γ ι(γ)}
= cγ0 .
This definition does not depend on the choice of the αi ; see [1, Theorem 2.8.1].
We now specialize to the case R = Z.
Theorem 10.4. Let Γ1 , Γ2 , Γ3 be finite index subgroups of a lattice Γ ≤ SL2 (R), let ∆ be a semigroup
containing Γ that lies in its commensurator in GL+ 2 (R), let k be an integer, and let χ be a character
of finite order on Γ extending to ∆ as above. The following hold:
Corollary 10.6. Let χ, ψ be distinct finite order characters of a lattice Γ ∈ SL2 (R). Then 〈 f , g〉 = 0
for all f ∈ Sk (Γ , χ) and g ∈ Mk (Γ , ψ).
References
[1] Toshitsune Miyake, Modular forms, Springer, 2006.
These notes summarize the material in §4.3 and §4.5 of [1] covered in lecture
A subgroup Γ ≤ Γ (1) that contains Γ (N ) for some N ≥ 1 is a congruence subgroup. The least
such N ≥ 1 is the level of Γ . The set of congruence subgroups of level N includes the groups
Γ0 (N ) := {γ ∈ Γ (1) : γ ≡ ( 0∗ ∗∗ ) mod N } ,
Γ1 (N ) := γ ∈ Γ (1) : γ ≡ 10 ∗∗ mod N .
Congruence subgroups are lattices in SL2 (R), so Fuchsian groups of the first kind.
and we can read off the Fourier coefficients of f (N z) from those of f (z). It follows that provided
we are willing to square the level when necessary, the study of modular forms for congruence
subgroups reduces to the study of modular forms for Γ1 (N ).
Now let χ be a Dirichlet character of modulus N ; this means that χ : Z → C is a periodic
multiplicative function that is the extension by zero of a group homomorphism (Z/N Z)× → C.
The set of all such χ form a group X (N ) that can be identified with the character group of
(Z/N Z)× , which is abstractly isomorphic to its dual (Z/N Z)× , since (Z/N Z)× is abelian.
We define a character χ : Γ0 (N ) → C via
χ ac db := χ(d)
Lemma 11.2. For each positive integer N we have C-vector space decompositions
M M
Mk (Γ1 (N )) = Mk (Γ0 (N ), χ) and Sk (Γ1 (N ) = Sk (Γ0 (N ), χ).
χ∈X (N ) χ∈X (N )
and just write Mk (N ) and Sk (N ) when the character χ is the trivial character of modulus N . For
any multiple M of N , each χ ∈ X (N ) induces a character χ ∈ X (M ), and we have
Mk (N , χ) ⊆ Mk (M , χ) and Sk (N , χ) ⊆ Sk (M , χ).
Definition 11.3. A Dirichlet character χ has even parity if χ(−1) = 1, and odd parity otherwise
(in which case we necessarily have χ(−1) = −1).
Lemma 11.4. If k and χ do not have the same parity then Mk (N , χ) = {0}.
Definition 11.5. Fix N ≥ 1. For each positive integer d coprime to N we define the diamond
operator 〈d〉 on Mk (Γ1 (N )) via
〈d〉 f := f |k α,
for any α = ac δb ∈ Γ0 (N ) with δ ≡ d mod N . The map ac db → d on Γ0 (N ) has kernel Γ1 (N ),
so the definition of 〈d〉 does not depend on the choice of α, since f = f |k γ for γ ∈ Γ1 (N ).
∈ GL+
ω(N ) = 0 −1
N 0 2 (Q).
ω(N )γω−1
N =
d −c
−bN a ,
−1
thus Γ0 (N )ω(N )= Γ0 (N ) and g|k γ = χ(a)g = χ̄(d) = χ̄(γ)g, since ad = 1.
Proof. We have q(z) = e2πiz , so q(z) = q(−z̄) and f˜(z) = f (−z̄) follows. For γ = ac db ∈ Γ0 (N ),
Proof. It suffices to consider Γ = SL2 (Z). Given α ∈ R× GL+ 2 (Q), we may choose c ∈ R so
×
+ −1 −1
that β = cα ∈ GL2 (Q) is an integer matrix, with α Γ α = β Γ β. Let m = det(β) ∈ Z≥1 so
that mβ −1 has integer entries. For γ ∈ Γ (m) the matrix mβ −1 γβ ∈ GL+ 2 (Q) has integer entries
−1 +
divisible by m, and it follows that β γβ ∈ GL2 (Q) is an integer matrix with determinant
1, hence an element of Γ . Therefore β −1 Γ (m)β = α−1 Γ (m)α ⊆ Γ ; it follows that αΓ α−1 ∩ Γ
contains Γ (m) and has finite index in Γ . Applying the same argument to α−1 ∈ R× GL+ 2 (Q)
and conjugating by α shows that αΓ α−1 ∩ Γ has finite index in αΓ α−1 . Thus Γ and αΓ α−1 are
commensurable, so α ∈ e Γ , which proves that R× GL+ 2 (Q) ⊆ Γ , since α was arbitrary.
e
+
Now suppose α ∈ Γ ⊆ GL2 (R). Then αΓ α is commensurable with Γ , as is βΓ β −1 , where
e −1
β is the inverse transpose of α. It follows that Γ , αΓ α−1 , βΓ β −1 all have the same set of cusps,
namely P1 (Q), since taking a finite index subgroup of a Fuchsian group does not change the
cusps, as proved in Lecture 4. So α−1 0, α−1 ∞, −1 −1
β 0, β 1 ∞ are all elements×of P+ (Q), and this
1
implies that every ratio of entries of α = ac db lies in P (Q), therefore α ∈ R GL2 (Q).
and for each α ∈ ∆∗0 (N ) there exist unique positive integers l|m with l ⊥ N such that
Γ0 (N )αΓ0 (N ) = Γ0 (N ) m
0 l Γ0 (N ).
0
Theorem 11.10. The Hecke algebras T(N ),T∗ (N ) are commutative, and for α ∈ ∆0 (N ) ∪ ∆∗0 (N )
the double coset Γ0 (N )αΓ0 (N ) admits a common set of representatives for its decomposition into
left and right Γ0 (N )-cosets.
Proof. By Theorem 10.3 proved in Lecture 10, it suffices to exhibit an anti-involution ι for which
ι(Γ0 (N )) = Γ0 (N ) and Γ0 (N )ι(α)Γ0 (N ) = Γ0 (N )αΓ0 (N ), for all α ∈ ∆0 (N ) and α ∈ ∆∗0 (N ). It
follows from Lemma 11.9 that the map
a c
cN d 7→ ( bN d )
a b
is such an involution.
χ(αγα−1 ) = χ(γ)
for all γ ∈ Γ and α ∈ ∆0 (N ) for which αγα−1 ∈ Γ0 (N ). By Lemma 11.9 we may assume
α = 0l m0 with l|m and l ⊥ N .
0: −1 a bl/m
∈ Γ0 (N ) then bl ≡ 0 mod m and γ0 = cN m/l d , in
For γ = cN d ∈ Γ0 (N ), if γ = αγα
a b
which case χ(γ0 ) = χ(αγα−1 ) = χ(γ0 ), as required. It follows from Theorem 10.4 of Lecture 10
that the Hecke algebra T(N ) acts on Mk (N , χ) and Sk (N , χ).
For ∆∗0 (N ) we extend χ to χ ∗ (α) = χ(d) and one similarly shows that T∗ (N ) acts on
Mk (N , χ) and Sk (N , χ).
Remark 11.11. For double cosets Γ0 (N )αΓ0 (N ) ∈ T(N ) ∩ T∗ (N ) the actions of Γ0 (N )αΓ0 (N )
as an element of T(N ) and T∗ (N ) need not coincide! However, the map α 7→ ω(N )−1 αω(N )
induces an isomorphism ∆0 (N ) ' ∆∗0 (N ) and we have
for α ∈ ∆0 (N ).
References
[1] Toshitsune Miyake, Modular forms, Springer, 2006.
These notes summarize the material in §4.3 and §4.5 of [2] covered in lecture
which act on Mk (N , χ) and Sk (N , χ) for all N ≥ 1 and Dirichlet characters χ of modulus N via
r
X
f |k Γ0 (N )αΓ0 (N ) = det(α)k/2−1 χ(αi ) f |k αi ,
i=1
r
X
= det(α)k−1 χ(αi ) j(αi , z)−k f (αi z),
i=i
`r
where Γ0 (N )αΓ0 (N ) = i=1 Γ0 (N )αi and we extend χ to ac db ∈ ∆0 (N ) via χ(α) := χ(a) and
to ac db ∈ ∆∗0 (N ) via χ ∗ (α) := χ(d). It follows from [2, Lemma 4.52] that
Γ0 (N )\∆0 (N )/Γ0 (N ) = Γ0 (N ) 0 m Γ0 (N )
l 0 : l, m ∈ Z≥1 with N ⊥ l|m ,
Γ0 (N )\∆∗0 (N )/Γ0 (N ) = Γ0 (N )
0 l Γ0 (N )
m0
: l, m ∈ Z≥1 with N ⊥ l|m ,
and distinct pairs (l, m) yield distinct double cosets on the RHS in each case. For m ⊥ N we
have
0 Γ (N ) = Γ (N ) m 0 Γ (N ),
Γ0 (N ) 0l m 0 0 0 l 0 (1)
but this does not contradict the statement above, we are just choosing a different representative
for this double coset depending on whether we are working with ∆0 (N ) or ∆∗0 (N ).
For positive integers l, m, n with l|m and l ⊥ N we define the following elements of T(N ), T∗ (N ):
T ∗ (m, l) := Γ0 (N ) m
T (l, m) := Γ0 (N ) 0l m
0 Γ (N )
0 and 0 l Γ0 (N ),
0
X X
T (n) := T (l, m) and T ∗ (n) := T ∗ (m, l),
l m=n lm=n
where the sums are taken over l, m ∈ Z≥1 with l|m and l ⊥ N as above, and we note that
T (n, n)T (l, m) = T (nl, nm) and T ∗ (n, n)T ∗ (m, l) = T ∗ (nm, nl).
Lemma 12.1. For any f ∈ Mk (N , χ) and positive integers l, m, n with l|m and l mn ⊥ N we have
χ ∗ (αi )/χ(αi ) = χ(d)/χ(a) = χ(d)χ(a) = χ(ad) = χ(ad − bc) = χ(det α) = χ(l m),
and X X
f |k T ∗ (n) = f k |T ∗ (m, l) = χ f |k T (l m) = χ(n) f |k T (n).
l m=n lm=n
for all λ ∈ C and u, v, w ∈ V is a (positive definite) Hermitian inner product. For every N ∈ Z≥1 ,
k ∈ Z and Dirichlet character χ of modulus N , the Petersson inner product is a Hermitian inner
product on the C-vector space Sk (N , χ), making it a (positive definite) Hermitian space.
If T is a linear transformation of a Hermitian space V , a linear operator T ∗ that satisfies
〈Tu, v〉 = 〈u, T ∗ v〉 for all u, v ∈ V is the adjoint operator of T ; it is necessarily unique and guar-
anteed to exist when V is finite dimensional (take the linear operator defined by the conjugate
transpose of a matrix representing T with respect to a 〈·, ·〉-orthonormal basis for V ).
For all u, v ∈ V and linear transformations S, T we have
thus T is the adjoint of its adjoint and the adjoint of the sum is the sum of the adjoints.
Linear operators on a Hermitian space may be classified as
• Hermitian: T = T ∗ ,
• unitary: T T ∗ = 1,
• normal: T T ∗ = T ∗ T .
Theorem 12.2 (Spectral theorem). Let T be a linear operator on a Hermitian space V of finite
dimension. Then T is normal if and only if V is the orthogonal sum of the eigenspaces of T .
Theorem 12.3. T (l, m) and T ∗ (m, l) are adjoint operators on the Hermitian space Sk (N , χ) with
the Petersson inner product, as are T (n) and T ∗ (n).
so that
r
Y
Γ0 (N )α0i ,
Γ0 (N ) m0
0 l Γ0 (N ) =
i
We have α−1 z = α0 z for any z ∈ H and αα0 = det(α) = det(α0 ). For γ, δ ∈ GL+ 2 (R) we have
j(γ, δz) = j(γδ, z)/ j(δ, z) and Im(δz) = det(δ)| j(δ, z)|−2 Im(z). Taking γ = α and δ = α0 yields
det(α)k/2 j(α, α−1 z)−k Im(α−1 z)k = det(α0 )k/2 j(α, α0 z)−k Im(α0 z)k
= det(α0 )k/2 (det(α0 )/ j(α0 , z))−k (det(α0 )| j(α0 , z)|−2 Im(z))k
= det(α0 )k j(α0 , z)−k Im(z)k .
Left linearity and right sesquilinearity of the Petersson inner product then implies
® r ¸
X
f |k T (l, m), g = det(αi ) k/2−1
χ(αi ) f |k αi , g
i=1
r
X
= det(αi )k/2−1 χ(α0i ) f |k αi , g
i=1
r
X
= det(α0i )k/2−1 χ ∗ (α0i ) f , g|k α0i
i=1
r
X
= f, det(α0i )k/2−1 χ ∗ (α0i )g|k α0i = f , g|k T ∗ (m, l) ,
i=1
proving that TP(l, m) and T ∗ (m, l) are adjoint operators as claimed, as are T (n) = T (l, m)
P
lm=n
and T ∗ (n) = l m=n T ∗ (m, l)
Corollary 12.4. For l|m and l mn ⊥ N the Hecke operators T (l, m) and T (n) are normal operators
on the Hermitian space Sk (N , χ).
T (l, m)T ∗ (m, l) = T (l, m)χ(l m)T (l, m) = χ T (l, m)T (l, m) = T ∗ (m, l)T (l, m),
which shows that T (l, m) commutes with its adjoint T ∗ (m, l), by Theorem 12.3. So T (l, m) is a
normal operator, and the same argument applies to T (n).1
Corollary 12.5. The C-vector space Sk (N , χ) has a basis of common eigenfunctions for the Hecke
operators T (n) and T (l, m) with l|m and l mn ⊥ N .
Proof. The Hecke algebra T(N ) is commutative, by Theorem 10.3, and the spectral theorem
implies that Sk (N , χ) admits a basis of simultaneous eigenvectors for the operators T (l, m) and
T (n) when they are normal, which holds for l mn ⊥ N , by the previous corollary.
Theorem 12.9. The Hecke algebra T(N ) is equal to the polynomial ring over Z generated by the
Hecke operators T (p), T (p, p), T (q) for prime numbers p - N and q|N .
Proof. That T (p), T (p, p), T (q) generate T(N ) follows from the commutativity of T(N ) and the
preceeding lemmas, and one can verify that they are algebraically independent over Q.
References
[1] Sheldon Axler, Linear algebra done right, Fourth Edition, Springer, 2023.
1. It makes modularity a much more powerful tool than it would otherwise be. Knowing that
an elliptic curve E/Q has the same L-function as a modular form is interesting precisely
because of the things we know about L-functions of modular forms. The LHS of the
BSD formula makes sense only because we know both the modularity theorem and that
L-functions of modular forms admit an analytic continuation and functional equation;
modularity alone is not enough.
2. It is a model for proving facts about L-functions of more general automorphic forms.
In the spirit of the second point, as a warmup we recall the proof of the analytic continuation
and functional equation for the Riemann zeta function, proved by Riemann in 1859, which was
the basis for Hecke’s results.
In order for this Dirichlet series to converge, we need bounds on the an . Note that |n−s | = n− Re(s) ,
so if we have |an | = O(nσ ) as n → ∞ then the Dirichlet series above converges absolutely and
uniformly to a holomorphic function on Re(s) > σ + 1. This function then uniquely determines
the Dirichlet coefficients an (no matter what σ is); they are intrinsic to the function.
−s
and n≥1 bn n−s converge absolutely at s = σ > 0.
P P
Lemma 13.1. Suppose n≥1 an n
If n≥1 an n−s = n≥1 bn n−s on Re(s) ≥ σ then an = bn for all n ≥ 1.
P P
The Riemann zeta function ζ(s) is defined by the Dirichlet series with 1 = a1 = a2 = · · · :
X
ζ(s) := n−s ,
which converges absolutely and uniformly to a holomorphic function on Re(s) > 1, as does any
Dirichlet series with bounded coefficients, since we can take σ = 0.
It is holomorphic on Re(s) > 0 and satisfies the functional equation Γ (s + 1) = sΓ (s), which
allows us to extend it to a meromorphic function on C whose only poles are simple poles at
s = −n for n ∈ Z≥0 , with residue (−1)n /n!. The gamma function has no zeros.
d m f (x)
sup x n < ∞.
x d xm
and is also Schwarz function. We also have the inverse Fourier transform
Z
(F −1 f )(x) := f ( y)e+2πi x y d y,
R
with F −1 F f = F F −1 f = f for all Schwarz functions f . We will need the following fact about
the Fourier function of the Guassian function.
2 2
Lemma 13.3. For any a > 0 the Fourier transform of e−πax is p1 e −π y /a .
a
provided that h(e−x ) is a Schwarz function. We also have the inverse Mellin transform
Z c+i∞
−1 1
(M f )(x) = x −s f (s)ds,
2πi c−i∞
defined for f (s) holomorphic on a vertical strip Re(s) ∈ (a, b) with c ∈ (a, b). We than have
M −1 M h = h and M M −1 f = f for suitable h: R>0 → C and f : C → C.
Theorem 13.4 (Poisson summation). Let f be a Schwarz function with Fourier transform fˆ. Then
X X
f (n) = fˆ(n).
n∈Z n∈Z
and we have X X X
f (n) = F (0) = cn = fˆ(n).
n∈Z n∈Z n∈Z
We now prove Riemann’s theorem for ζ(s), carefully spelling out each step (this is our model
for the next section where we won’t be as pedantic).
Theorem 13.5 (Riemann). Let Λ(s) = π−s/2 Γ 2s ζ(s). Then Λ(s) has a meromorphic continua-
1
tion to C and Λ(s) + 1s + 1−s is holomorphic on C and satisfies Λ(s) = Λ(1 − s).
Proof. For Re(s) > 1 we have
∞
X Z ∞
2 −s
Λ(2s) = (πn ) e−t t s−1 d t
n=1 0
∞ Z
X ∞
2
= e−πn t t s−1 d t
n=1 0
∞ X
Z ∞
2
= e−πn t
t s−1 d t,
0 n=1
P∞ −πn2 t
with a change of variable t 7→ πn2 t in the second line and n=1 e |t s−1 | < ∞ justifying
2
the third line (via Fubini). Applying Poisson summation and Lemma 13.3 to e−πx t yields
X 2 1 X −πn2 /t
e−πn t = p e ,
n∈Z t n∈Z
2
thus g(t) = n∈Z e−πn t satisfies the functional equation g(t) = p1t g( 1t ), and we have
P
Z∞
1
Λ(2s) = (g(t) − 1)t s−1 d t
2 0
Z 1 Z∞
1
1 1
= p g( ) − 1 t
t t
s−1
dt + s−1
(g(t) − 1) t d t
2 0 1
Z ∞ Z∞
1 p −1−s
= t g(t) − 1 t dt + s−1
(g(t) − 1) t d t
2 1 1
Z ∞ Z∞ Z∞ Z∞
1 1/2 −1−s 1 1/2−s −1 −1/2−s
= t g(t)t dt − − t t dt + t s−1
d t + (g(t) − 1)t d t
2 1 s 1 1 1
Z∞
1 1 1
t 1/2−s + t s (g(t) − 1) t −1 d t −
= − .
2 1 2s 1 − 2s
with a change of variable t 7→ 1t to get the third line. The last integral converges uniformly on
any compact subset and defines a holomorphic function on C, hence a meromorphic continua-
tion of Λ(s) to C for which Λ(s) + 1s + 1−s
1
is holomorphic. The last line is invariant under the
1
transformation s 7→ 2 − s, which implies Λ(s) = Λ(1 − s).
1
Corollary 13.6. The function ζ(s) + 1−s is holomorphic on C.
Lemma
P∞ 14.1. Fix σ ∈ R. If f (z) is a holomorphic function on H whose Fourier expansion
n=0 na e 2πinz
converges absolutely and uniformly on H with f (z) = O( y −σ ) as y → 0, then
|an | = O(nσ ) as n → ∞.
• the sum defining f (z) converges absolutely and uniformly on every compact subset of H;
• the function f (z) is holomorphic on H;
• | f (z)| = O( y −σ−1 ) as y → 0;
• | f (z) − a0 | = O(e−2π y ) as y → ∞.
Finally, we note the following lemma, which allows us to extend bounds on a holomorphic
function that hold at the edges of a vertical strip to the interior of the strip.
Lemma 14.3. Let σ1 , σ2 , a, c ∈ R with a, c > 0, and let φ(z)| be a function that is holomorphic
on an open set containing a vertical strip S bounded by the lines x = σ1 and x = σ2 .
a
Suppose |φ(z)| = O e| y| uniformly on S, with |φ(z)| = O(| y|c ) on the lines x = σ1 and
x = σ2 , as | y| → ∞. Then |φ(z)| = O(| y|c ) uniformly on S.
Now suppose f is a holomorphic on H with a Fourier expansion f (z) = n≥0 an e2πinz that
P
converges absolutely and uniformly on H, such that | f (z)| = O( y −σ ) as y → 0, for some σ > 0.
Then we also have an = O(nσ ) and | f (z) − a0 | = O(e−2π y ), by the first two lemmas above. Let
us call such f (z) nice holomorphic functions. P
To each nice holomorphic function f (z) = n≥0 an e2πinz we associate the Dirichlet series
∞
X
L( f , s) := an n−s ,
n=1
which converges absolutely and uniformly on Re(s) > σ + 1 and is holomorphic on this right
half plane. For each positive integer N we define the completed L -function
Theorem 14.4 (Hecke). Let f = n≥0 an e2πinz and g(z) = n≥0 bn e2πnz be nice holomorphic
P P
functions, and let k and N be positive integers. The following are equivalent:
p
(A) g(z) = (−i N z)−k f (− N1z ).
(B) The completed L-functions ΛN ( f , s) and ΛN (g, s) can be analytically continued to C, satisfy
ΛN ( f , s) = ΛN (g, k − s),
a0 a b0 b
with ΛN ( f , s) + s + k−s
0
and ΛN (s; g) + s + k−s
0
holomorphic and bounded on vertical strips.
Proof. (A)⇒(B): We have |an | = O(nσ ) and |bn | = O(nσ ) for some σ > 0, which implies that
X p
|an |e−2πnt/ N
n≥1
R∞ p
converges for t > 0, and that n≥1 0 |an |t c e−2πnt/ N t −1 d t converges on c > σ + 1, and
P
where the change of variable t 7→ t −1 gives the middle term in the first line, and (A) implies
p p p 1
g(i t/ N ) = (−i N (i t/ N ))−k f (− i t p N
) = t −k f ( t pi N ),
which we used in the second line. The fact that f and g are nice implies | f (i t) − a0 | = O(e−2πt )
and |g(i t) − b0 | = O(e−2πt ) as t → ∞, so the two integrals in the last displayed equation
valid for any α > 0. Applying this to all but the first term of f (i y) = n≥0 an e−2πn y yields
P
Z
1 X
f (i y) = an Γ (s)(2πn y)−s ds + a0
2πi n≥1 Re(s)=α
For α > σ + 1 the function L( f , s) = n≥1 an n−s is absolutely convergent and bounded on
P
Corollary 14.5. For any cusp form f ∈ Sk (N χ) the function ΛN ( f , s) is entire and satisfies
ΛN ( f , s) = i k ΛN ( f |k ω(N ), k − s).
When N = 1 the character χ is trivial, and f |k ω(1) = f , since ω(1) = 0 −1
1 0 ∈ Γ0 (1) = SL2 (Z),
so the functional equation becomes
ΛN ( f , s) = i k ΛN ( f , k − s).
For N > 1, if χ is trivial (in which case we should assume k is even), for f ∈ Sk (N ) we still have
f |k ω(N ) ∈ Sk (N ), but we do not quite have f |k ω(N ) = f , since ω(N ) 6∈ Γ0 (N ). But in fact, up
to a sign the functional equation above still holds provided that f is an eigenfunction for the
Fricke involution ωN , the linear operator defined by f 7→ f |k ω(N ). Note that
( f |k ωN )(z) = N −k/2 z −k f −1
Nz ,
so
(( f |k ωN )|k ωN )(z) = N −k/2 z −k N −k/2 ( −1
Nz )
−k
f (z) = (−1)k f (z) = f (z),
where the last equality follows from the fact that for f ∈ Sk (N ), either k is even or f = 0.
Thus ωN is an involution, and its eigenvalues are ±1. We will see later that if f ∈ Sk (N ) is an
eigenform for all the Hecke operators, then it is automatically an eigenfunction for the Fricke
involution, but in any case, for any f ∈ Sk (N ) that is an eigenfunction for ωN we have the
functional equation
ΛN ( f , s) = ±i k ΛN ( f , k − s).
More generally, if f ∈ Sk (N , χ) is an eigenform for all the Hecke operators, we will always have
a functional equation of the form
ΛN ( f , s) = "i k ΛN ( f , k − s),
References
[1] Toshitsune Miyake, Modular forms, Springer, 2006.
These notes summarize the material in §4.3, §4.6 of [3] and §11 of [1] presented in lecture.
In Lecture 11 we reduced the study of modular forms for congruence subgroups Γ to the
study of modular forms for Γ0 (N ) with character χ ∈ X (N ), where X (N ) denotes the group of
Dirichlet characters of modulus N . We did this via two standard reductions/decompositions
that can be applied to any weight k ∈ Z and level N ∈ Z≥1 :
• Mk (Γ ) ⊆ Mk (Γ1 (N 2 )), since Γ1 (N 2 ) ⊆ δ−1
N Γ (N )δN , where δN = 0 1 ,
: N 0
both of which preserve cusp forms. To ease notation we define Mk (N , χ) := Mk (Γ0 (N ), χ) and
let Sk (N , χ) ⊆ Mk (N , χ) denote the subspace of cusp forms.
Lecture 11 we also defined the diamond operator 〈d〉 defined by f 7→ f |k α, where α =
In
c δ ∈ Γ0 (N ) satisfies δ ≡ d mod N , and noted that Mk (N , χ) and Sk (N , χ) are χ-eigenspaces
a b
of diamond operators, they are the subspaces of Mk (Γ1 (N ) and Sk (Γ1 (N ) for which 〈d〉 f = χ(d) f
for all d ∈ (Z/N Z)× .
In Lecture 12 we showed that the Hecke algebra T(N ) := Z[Γ0 (N )\∆0 (N )/Γ0 (N ), where
∆0 (N ) := ac db ∈ M2 (Z) : c ≡ 0 mod N , a ⊥ N , ad − bc > 0 ,
The lemma below will be used to prove a key lemma due to Hecke that appears as Lemma
4.6.3 in [3] (the proof in [3] omits some details that are filled in by this lemma).
Lemma 15.2. Let f ∈ Mk (N , χ) with k > 0, let h > 1 be an integer prime to N . Suppose
c f (z) = f (z/h) for some nonzero c ∈ C (as functions on the upper half plane). Then f = 0.
Proof. Observe that f ∈ Mk (N , χ) implies f (z + 1) = f (z), and therefore
f ((z + 1)/h) = c f (z + 1) = c f (z) = f (z/h).
If f (z) = n≥0 a(n)e2πinz is the Fourier expansion of f at ∞, comparing coefficients of e2πinz/h
P
1 0 −1
= 10 /1m 6∈ Γ0 (N )
1
1 0 11
0 m 01 0 m
so αΓ0 (N )α−1 6∈ Γ0 (N
), and we can choose γ−1∈ Γ0 (N ) so that αγα
−1
6∈ Γ0 (N ). We have
−1
det(α)α = −c a ∈ ∆0 (N ), so det(α)αγα ∈ ∆0 (N ), and for some γ3 , γ4 ∈ Γ0 (N ) we
d −b
have
det(α)γ3 αγα−1 γ4 = 0u 0v ,
with u|v and u, v ∈ Z>0 . We have uv = det(α)2 with u 6= v, otherwise αγα−1 = γ−1 −1
3 γ4 ∈ Γ0 (N ).
Therefore h = u/v ∈ Z>1 . Now let c := hk/2 χ(γ3 )χ(γ)χ(γ4 ), and suppose that f |k α ∈ Mk (N , χ).
Then f ∈ Mk (αΓ0 (n)α−1 , χ), and f ∈ Mk (γ−1 −1
4 αΓ0 (N )α γ4 ), so we have
where M ranges over proper divisors of N divisible by m and l ranges over divisors of N /M . We
define the space of new cusp forms Sknew (N , χ) to be the orthogonal complement of Skold (N , χ)
with respect to the Petersson inner product.
Lemma 15.5. The spaces Skold (N , χ) and Sknew (N , χ) are stable under the action of T (n) for all
n ⊥ N.
The lemma implies that the old and new subspaces of Sk (N χ) each have bases of common
eigenfunctions for the Hecke operators T (n) with n ⊥ N . Moreover, each eigenfunction gener-
ates a one-dimensional subspace that is uniquely determined by the eigenvalues of the Hecke
operators T (n) for all n ⊥ N , or more generally for all n ⊥ L, for any integer L.
For each proper divisor Ni of N and divisor d of Ni the map δd : Sk (Ni , χ) → Sk (N , χ) defined
by f (z) 7→ f (dz) sends the new subspace of Sk (Ni , χ) to the old subspace of Sk (N , χ). The
images of these maps give us a complete decomposition of Sk (N , χ) into new subspaces
M
Sk (N , χ) ' Sknew (Ni , χ)⊕mi
cond(χ)|Ni |N
where mi is the number of divisors of N /Ni . As usual when we write Sk (M , χ) for an integer M
divisible by cond(χ) we understand χ to denote the unique Dirichlet character of modulus M
induced by the primitive character χ0 of modulus cond(χ) that induces χ.
Theorem 15.8. The newforms in Sknew (N , χ) are common eigenfunctions of T(N ) ∪ T∗ (N ) and
form a basis for Sknew (N , χ).
Proof. We have f |k T (n) = an f for all n ⊥ N . Consider T ∈ T(N ) and T ∗ ∈ T∗ (N ). Now T(N ) is
commutative, so T commutes with all the T (n), and similarly, T ∗ commutes with all the T ∗ (n).
We have
f |k T (n) = χ(N ) f |k T ∗ (n)
to T ∗ also commutes with T (n) for all n ⊥ N , and f |k T and f |k T ∗ are common eigenfunctions
of T (n) with the same eigenvalue an for all n ⊥ N . It follows that f |k T and f |k T ∗ must be
multiples of f , thus f is en eigenfunctions for T and T ∗ .
We have already shown that Sk (N , χ) has a basis of common eigenfunctions for T (n) with
n ⊥ N , a subset of which form a basis for Sknew (N , χ), and it follows that if we normalize these
so that a1 ( f ) = 1 we obtain a basis of newforms.
It follows from the theorem that for newforms f ∈ Sknew (N , χ) we have f |k T (n) = an f for
all n, not just for n ⊥ N , and the adjointness
of T (n) and T ∗ (n) implies that f |k T ∗ (n) = ān ( f ) f .
Moreover, if we put ωN := N 0 then
0 −1
Theorem 15.10. The action of ωN induces an isomorphism between Sknew (N , χ) and Sknew (N , χ),
and also between Skold (N , χ) and Skold (N , χ).
We conclude this section with two important theorems about newforms. The first is a strong
multiplicity one result, which implies that any newform f is uniquely determined by any subset
of its Fourier coefficients (equivalently, Hecke eigenvalues) an ( f ) that includes all n coprime to
some integer L that we are free to choose.
The second theorem characterizes the multiplicative relations between the Fourier coeffi-
cients of a newform, which together with the theorem above imply that every newform f is
uniquely determined by its Fourier coefficients a p ( f ) at almost all primes p.
Theorem 15.12. Let f ∈ Mk (N , χ). Then f is a newform if and only if the following hold:
• a1 ( f ) = 1;
• a p r ( f ) = a p ( f )a p r−1 ( f ) − χ(p)p k−1 a p r−2 ( f ) for all primes p and r ≥ 2;
• amn ( f ) = am ( f )an ( f ) for all integers m ⊥ n.
For newforms f ∈ Sknew (N , χ) one can take M = lcm(N , cond(ψ) cond(χψ)) in the lemma
above; see [1, Lemma 11.2.1].
References
[1] A.J. Best, J. Bober, A.R. Booker, E. Costa, J. Cremona, M. Derickx, M. Lee, D. Lowry-Duda,
D. Roe, A.V. Sutherland, and J. Voight, Computing classical modular forms, Arithmetic Ge-
ometry, Number Theory, and Computation, Simons Symposia (2021), 123–213.
[2] Fred Diamond and Jerry Shurman, A first course in modular forms, Springer, 2005.
[4] Kenneth A. Ribet, Twists of modular forms and endomorphisms of abelian varieties, Math.
Ann. 253 (1980), 43–62.
Kernel functions need not exist but are uniquely determined when the do (in which case H is
sometimes called a reproducing kernel Hilbert space or RKHS; we won’t use this terminology).
Indeed, if K and K 0 are both kernel functions of H then for every x, y ∈ X we have
® ¸
X X X X
〈 f , Ky〉 = ci f i , f i ( y) f i = ci f i ( y) = ci f i ( y) = f ( y).
i i i i
We now want to compute the kernel function of the Hilbert space H k2 (H). It follows from
[1, Corollary 2.62] that for any fixed k ∈ Z≥0 and z0 ∈ H there is a constant c such that
| f (z0 )| ≤ ck f k2k
for all f ∈ H k2 (H). Thus for any fixed z0 ∈ H the map f 7→ f (z0 ) is a continuous linear functional
on H k2 (H). Since H k2 (H) is a Hilbert space (so self-dual), there is therefor a unique gz0 ∈ H k2 (H)
for which Z
f (z0 ) = 〈 f , gz0 〉 = f (z)gz0 (z) Im(z)k d v(z),
H
K(w, z) := gz (w)
as the kernel function of H k2 (H). For any f ∈ H k2 (H) we have f (z1 ) = 〈 f , gz1 〉 = 〈 f , Kz1 〉 and
Z Z
f (z1 ) = 〈 f , Kz1 〉 = f (z2 )K(z2 , z1 ) Im(z2 )k d v(z2 ) = K(z1 , z2 ) f (z2 ) Im(z2 )k d v(z2 ).
H H
We now want to consider the behavior of K(z1 , z2 ) under the action of α ∈ SL2 (R) on H (via
linear fractional transformations). Recall that for α = ac db we define j(α, z) := cz + d, which
satisfies the identities
j(αα0 , z) = j(α, α0 z) j(α0 , z)
and we have
Im(z) Im(z)
Im(αz) = 2
= .
| j(α, z)| j(α, z) j(α, z)
For any α ∈ SL2 (R) and f ∈ H k2 (H), the function f (α−1 z) j(α, α−1 z)−k = f (α−1 z) j(α−1 , z)k lies
in H k2 (H), and we have d v(α−1 z) = d v(z) and Im(α−1 z) j(α−1 , z) = Im(z) j(α−1 , z)−1 , thus
Z
K(αz1 , αz2 ) j(α, z1 )−k j(α, z2 )−k f (z2 ) Im(z2 )k d v(z2 )
H
Z
−k
= j(α, z1 ) K(αz1 , z2 ) f (α−1 z2 ) j(α, α−1 z2 )−k Im(α−1 z2 )k d v(α−1 z2 )
ZH
= j(α, z1 )−k K(αz1 , z2 ) f (α−1 z2 ) j(α, α−1 z2 )k Im(z2 )k d v(z2 )
H
−k
= j(α, z1 ) f (α−1 (αz1 ) j(α, α−1 (αz1 ))k
= j(α, z1 )−k f (z1 ) j(α, z2 )k
= f (z1 )
Z
= K(z1 , z2 ) f (z2 ) Im(z2 )k d v(z2 ).
H
and
K(αz1 , z2 ) j(α, z1 )−k = K(z1 , α−1 z2 ) j(α−1 , z2 )−k .
Applying this to α = 10 1b yields
K(z1 + b, z2 + b) = K(z1 , z2 )
for all b ∈ R. If we let M := (z1 , z2 ) ∈ C2 : z1 ∈ H, z1 − z2 ∈ H and define
h(z1 , z2 ) := K(z1 , z1 − z2 ),
K(z1 , z2 ) = P(z1 − z 2 )
for all z1 , z2 ∈ H. If we now consider α = 0a a0−1 ∈ SL2 (R), we have P(a2 z) = a−2k P(z) and
P(i y) = y −k P(i)
z −k
P(z) = ck 2i
| f (z0 )|2 = |〈 f (z), K(z, z0 )〉|2 ≤ k f k22 kK(z, z0 )k22 = k f k22 K(z0 , z0 ) = ck Im(z0 )−k k f k22
p
so | f (z0 ) Im(z0 )k/2 | ≤ ck k f k2 for any z0 ∈ H, thus f ∈ H k∞ (H).
We now want to compute the constant ck in Theorem 16.1, and for this we need to define
the Fourier transform of f ∈ H k2 (H). For any y ∈ R>0 we define the function f y :− R → R via
f y (x) := f (x + i y).
R
Since k f k22 = | f (x + i y)|2 y k−2 d x d y < ∞, we have f y ∈ L 2 (R) for all y outside a set of
H R∞
measure zero; let fˆy (u) := f (x)e−2πiux d x of f y for all such y.
−∞
for u > 0, and let Gk (u) := 0 for u ≤ 0. Let Ĥ k2 denote the space of measurable functions
φ : R → C for which
Thus for f ∈ H k2 (H) we have fˆ ∈ Ĥ k2 , and in fact this defines an isomorphism of Hilbert spaces.
For φ ∈ Ĥ k2 we define
Z ∞
φ̂(z) := φ(u)e2πiuz du
−∞
Let K̂(u, z) denote the Fourier transform of K(z1 , z) as a function of z1 , for any fixed z. For
any φ ∈ Ĥ k2 we have
Z ∞
〈φ(u), K̂(u, z)〉 = 〈φ̂(z1 ), K(z1 , z)〉 = φ̂(z) = φ(u)e2πiuz dz,
0
and therefore Z ∞
z −k
Gk (4u)−1 e2πiuz du.
ck 2i =
0
Applying this with z = 2i yields
k−1
ck =
4π
and the following theorem.
k − 1 z1 − z̄2 −k
K(z1 , z2 ) = .
4π 2i
References
[1] Toshitsune Miyake, Modular forms, Springer, 2006.
These notes summarize the material in §6.3-4 presented in lecture. Recall the weight-k slash
operator associated to α ∈ GL+
2 (R) on functions f : H → C is defined by
neighborhood Ul = {z ∈ H| Im(z) > l} of ∞ and let an eπinz/h denote the Fourier espansion
P
of f |k σ, then
Z
∞> | f (z)|2 Im(z)k d v(z)
Γ \H
ZZ
1
≥ | f (σz) j(σ, z)−k |2 Im(z)k d v(z)
2 0≤Re(z)≤2h
l≤Im(z)<∞
Z ∞Z 2h X
1
= am ān e−π y(m+n)/h eπi x(m−n)/h y k−2 d x d y
2 l 0 m,n∈Z
Z ∞
2 −2π y n/h
≥ h|an | e y k−2 d y.
l
and X
K Γ (z1 , z2 ) := 1
#Z(Γ ) χ(γ)K(γz1 , z2 ) j(γ, z1 )−k ,
γ∈Γ
where #Z(Γ ) ∈ {1, 2} counts the trivial elements of Γ . In the previous lecture we proved
k − 1 z1 − z̄2 −k
K(z1 , z2 ) = , (2)
4π 2i
and we note that K(z2 , z1 ) = K(z1 , z2 ) implies K Γ (z2 , z1 ) = K Γ (z1 , z2 ), for all z1 , z2 ∈ H.
Theorem 17.3. For f ∈ L k1 (H) the sum in f Γ converges absolutely on H and f Γ lies in L k1 (Γ , χ),
and in H k1 (Γ , χ) if f is holomorphic. In particular, K Γ (z, z2 ) ∈ H k1 (Γ , χ) for every fixed z2 ∈ H.
Proof. This is [1, Theorem 6.3.3], but modulo issues of convergence, for f ∈ H k2 (Γ , χ) we have
Z Z
〈 f , KzΓ 〉 = f (z2 )K Γ (z2 , z1 ) Im(z2 )k d v(z2 ) = K Γ (z1 , z2 ) f (z2 ) Im(z2 )k d v(z2 )
1
Γ \H Γ \H
XZ
1
= #Z(Γ ) K(z1 , z2 ) f (z2 ) Im(z2 )k d v(z2 )
γ∈Γ γ−1 F
Z
= K(z1 , z2 ) f (z2 ) Im(z2 )k d v(z2 )
H
= f (z1 ),
since f ∈ H k2 (Γ , χ) ⊆ H k2 (H) and K(z1 , z2 ) is the kernel function of H k2 (H). and this implies that
K Γ (z1 , z2 ) is the kernel function of H k2 (Γ , χ), provided KzΓ ∈ H k2 , which one can show follows
1
from Kz1 ∈ H k2 (H).
Corollary 17.5. For any lattice Γ ≤ SL2 (R) and integer k > 2 we have the dimension formula
Z
dim Sk (Γ , χ) = K Γ (z, z) Im(z)k d v(z).
Γ \H
since as noted in Lecture 16, the RHS is the kernel of any finite dimensional Hilbert space with
orthonormal basis f1 , . . . , f r , and we have
r
X r Z
X Z
dim Sk (Γ , χ) = r = 〈 fi , fi 〉 = k
f i (z) f i (z) Im(z) d v(z) = K Γ (z, z) Im(z)k d v(z).
i=1 i=1 Γ \H Γ \H
with
χ(αγα−1 ) = χ(γ)
for all γ ∈ Γ and α ∈ ∆.
Fix k ∈ Z>2 . Recall that the Hecke algebra Z[Γ \∆/Γ ] acts on f ∈ Sk (Γ , χ) via
r
X
( f |k Γ αΓ )(z) := det(α)k−1 χ(αi ) j(αi , z)−k f (αi z),
i=1
`r
where Γ αΓ = i=1 Γ αi is any right coset decomposition and the action of an integer linear
combination of double cosets is computed by extending Z-linearly. For α ∈ ∆ we define
If T ∈ Z[Γ \∆/Γ ] is a sum of disjoint double cosets (a single double coset, for example) then the
` r summands of T is a subset of ∆ that admits a finite decomposition
union of the double coset
into right cosets T = i=1 Γ αi ; the Hecke operator T (n) ∈ T(N ) = Z[Γ0 (N )\∆0 (N )/Γ0 (N )] is
a notable example; it corresponds to the set {α ∈ ∆0 (N )| det(α) = n} ⊆ ∆0 (N ). We will regard
such Hecke operators as elements of Z[Γ \∆/Γ ] that are also subsets of ∆ we can sum over.
Theorem 17.6. Let T ∈ Z[Γ \∆/Γ ] be a sum of double cosets. The trace of T acting on Sk (Γ , χ) is
Z
1 X
tr(T ) = tr(T |Sk (Γ , χ)) = κ(α, z)d v(z)
#Z(Γ ) Γ \H α∈T
In order to compute the integral in Theorem 17.6 we need to treat cusps of Γ separately.
For x ∈ P1 (R) a cusp of Γ , and a Hecke operator T ⊆ ∆, let Tx := {α ∈ T : αx = x}. If U x is a
neighborhood of x that is stable under the action of Γ x := {γ ∈ Γ : γx = x} then
Z X Z X Z X
κ(α, z)d v(z) = κ(α, z)d v(z) + κ(α, z)d v(z)
Γ x \U x α∈T Γ x \U x α∈T −T x Γ x \U x α∈T x
Lemma 17.7. For any cusp x of Γ with Γ x -stable neighborhood U x and Hecke operator T ∈
Z[Γ \∆/Γ ] that is a sum of double cosets we have
Z X X Z
κ(α, z)d v(z) = κ(α, z)d v(z)
Γ x \U x α∈T −T x α∈T −T x Γ x \U x
Theorem 17.8. Let x be a cusp of Γ with σx = ∞ for some σ ∈ SL2 (R). then
Z X XZ
κ(α, z)d v(z) = lim+ κ(α, z) Im(z)−s | j(σ, z)|2s d v(z).
s→0
Γ x \U x α∈T x α∈T x Γ x \U x
R
We now consider the integrals Γ (α)\H
κ(α, z)d v(z). There are five different cases:
1. α is scalar;
2. α is elliptic (tr(α) < 4 det(α));
3. α is hyperbolic (tr(α) > 4 det(α)) with fixed points that are not cusps of Γ ;
4. α is hyperbolic (tr(α) > 4 det(α)) with a fixed point that is a cusp of Γ ;
5. α is parabolic (tr(α) = 4 det(α).
k − 1 z − z̄ −k −k
Z
2k−2
=a χ(α) a Im(z)k d v(z)
Γ \H 4π 2i
k − 1 k−2
= a χ(α)v(Γ \H).
4π
min(|η|, |ζ|)k−1
Z
κ(α, z)d v(z) = −χ(α sgn(ζ)k .
Γ (α)\H |ζ − η|
Let x be the fixed point of α; then x is a cusp of Γ and Γ (α) = Γz ; see the argument in [1].
ζλ
Choose σ ∈ SL2 (R) so that σx = ∞, and let σασ = 0 ζ . Let T p be the set of parabolic
p p
elements in T , and for each cusp x of Γ let Tx := T p ∩ Tx . Then x∈Γ \PΓ Tx is a complete
S
p
set of representatives for conjΓ (T p ). Let ±σΓ x σ−1 = 〈 10 1h 〉 with h > 0, and for α ∈ Tx , put
tr(T ) = t 0 + t e + t h + t p ,
References
[1] Toshitsune Miyake, Modular forms, Springer, 2006.
(this is always possible for X /C) and associate to D the linear form
XZ Qi
ω 7→ ω
i Pi
in Ω10 (X )∨ . Reducing modulo the lattice H1 (X , C) yields an element of Jac(X ). This is the Abel-
Jacobi map, which can be viewed as an extension of the map X → Pic0 (X ) that sends x ∈ X (C)
to the divisor x − x 0 , where x 0 ∈ X (C) is a fixed rational point.
If φ : X → Y is a nonconstant holomorphic map of compact Riemann surfaces, we have a
pullback map φ ∗ : C(Y ) → C(X ) defined by φ ∗ f = f ◦ φ, which extends to a linear map of
holomorphic differentials
φ ∗ : Ω10 (Y ) → Ω10 (X ).
Taking duals induces the pushforward map on Jacobians h∗ : Jac(X ) → Jac(Y ). There is also a
pushforward map h∗ : Pic0 (X ) → Pic0 (Y ) induced by the pushforward map Div0 (X ) → Div0 (Y )
that sends P ∈ C(X ) to h(P) ∈ C(Y ), and the two maps are compatible with the isomorphisms
Jac(X ) ≃ Pic0 (X ) and Jac(Y ) ≃ Pic0 (Y ).
There is also a pullback map h∗ : Pic0 (Y ) → Pic0 (X ) induced by the map Div0 (Y ) → Div0 (X )
that sends Q ∈ Y (C) to the sum of the points P ∈ φ −1 (Q), and a corresponding pullback map
Jac(Y ) → Jac(X ) that can be defined using the trace map Ω10 (X ) → Ω10 (Y ). These pullback maps
are also compatible with the isomorphisms Jac(X ) ≃ Pic0 (X ) and Jac(Y ) ≃ Pic0 (Y ).
Γ2 ←− Γ3 −→
∼
Γ3′ −→ Γ1
where the isomorphism is γ 7→ αγα−1 and the other arrows are inclusions. This induces mor-
phism of modular curves
π2 π1
X Γ2 ←− X Γ3 ≃ X 3′ −→ X Γ1
π−1
2 α π1
Γ2 z 7→ {Γ3 γi z} → {Γ3′ β j z} → {Γ1 β j z}
which defines an action of the Hecke algebra T = Z[Γ αΓ ] on the Jacobian Jac(X Γ ).
18.3 Eichler-Shimura
Now let f ∈ S2 (Γ1 (N )) be a newform, hence an eigenform of T(N ) = Z[Γ1 (N )∆(N )/Γ1 (N )], and
consider the eigenvalue map
λ f : T(N ) → C
defined by f |2 T = λ f (T ) f . Let I f := ker λ f := {T ∈ T(N ) : f |2 T = 0}. Since T(N ) acts on
Jac(X 1 (N )), we can view I f Jac(X 1 (N )) as a subgroup of the divisor class group Pic0 (X 1 (N ))
corresponding to an abelian subvariety of Jac1 (X 1 (N )).
Definition 18.1. The modular abelian variety A f associated to the newform f ∈ S2 (Γ1 (N )) is
the quotient Jac(X 1 (N ))/I f Jac(X 1 (N )).
The eigenvalues λ f (T ) and the Fourier coefficients an ( f ) are algebraic integers that gener-
ate a number field Q( f ). By letting Gal(Q/Q) act on the coefficients of f , we obtain a Galois
conjugate newform f σ associated to each σ ∈ Gal(Q/Q) (equivalently, we can view the coeffi-
cients of f as abstract elements of the number field Q( f ) and consider the different embedding
of Q( f ) into C.
The set of all Galois conjugates of f spans a subspace of S2 (Γ1 (N )) of dimension [Q( f ) : Q],
and this is equal to the dimension of A f ; indeed A f is invariant under the action of Gal(Q/Q)
and can be viewed as an abelian variety over Q. Replacing f with a Galois conjugate newform
does not change A f .
If we let Vf denote the subspace of S2 (Γ1 (N )) spanned by the Galois conjugates of f and
consider H1 (X 1 (N ), Z) ,→ Ω10 (X 1 (N )) ≃ S2 (Γ1 (N )), restricting H1 (X 1 (N ), Z) to its action on
elements of Vf ⊆ S2 (Γ1 (N )) yields a lattice Λ f ⊆ Vf , and we can alternatively define A f as the
abelian variety corresponding to the complex torus Vf /Λ f (which also admits a polarization).
References
[1] Fred Diamond and Jerry Shurman, A first course in modular forms, Springer, 2005.
which vanishes for odd k and converges for all k > 2. After replacing L with λL for some λ ∈ C×
we may assume L is the lattice spanned by [1, τ] for some τ ∈ H. We then have
X 1
Gk (z) := Gk ([1, τ]) = ,
m,n∈Z
(m + nz)k
(m,n)6=(0,0)
thus for k > 2 we may view Gk (τ) as a holomorphic function H → C that satisfies
and Gk (z) is holomorphic and nonvanishing at the cusps, makeing it a (non-cuspidal) modular
form of weight k for Γ (1) = SL2 (Z).
The Weierstrass ℘-function for L is defined by
1 X 1 1
℘(z : L) = 2 +
: − .
z (z − ω)2 ω2
ω∈L−{0}
and one can use this to show that ℘(z) = ℘(z; L) satisfies the differential equation
where g4 (L) := 60G4 (L) and g6 (L) := 140G6 (L). The map z 7→ (℘(z), ℘0 (z)) defines an isomor-
phism from the torus C/L (which is a Riemann surface of genus 1) to the elliptic curve
E L : y 2 = 4x 3 − g4 (L)x − g6 (L),
with nonzero discriminant ∆(L) := g4 (L)3 − 27g6 (L)2 , which sends ω ∈ L to the projective
point ∞ := (0 : 1 : 0) on E L , which serves as the identity element of the group E L (C). The
discriminant function ∆(τ) := ∆([1, τ]) is a cusp form of weight 12.
Definition 19.1. The j -invariant of the lattice L, and of the elliptic curve E L , is defined by
g2 (L)3 g2 (L)3
j(L) = 1728 = 1728 ∈ C.
∆(L) g2 (L)3 − 27g3 (L)3
4A3
j(E L ) = 1728 ,
4A3 + 27B 2
which defined the j-invariant of elliptic curves E/k for any field k whose characteristic is not 2
or 3 (and there are generalizations that work over any field).
Theorem 19.2. Lattices L and L 0 are homothetic if and only if j(L) = j(L 0 ), and if and only if the
corresponding elliptic curves E L and E L 0 are isomorphic.
Now consider j(τ) := j([1, τ]) as a holomorphic function H → C. We have j(γτ) = j(τ) for
all γ ∈ Γ (1) = SL2 (Z), thus j(τ) is a modular function. It is not a modular form (of weight 0)
because it is not holomorphic at the cusps: it has a pole at ∞.
The function j(τ) defines an isomorphism from Y (1) := H/Γ (1) to the affine line A1 (C) that
extends to an isomorphism from X (1) := H∗ /Γ (1) to the projective line P1 (C).
If we put q = e2πiτ the q-expansion of the j-function is
1 X
j(τ) = + 744 + an q n
q n≥1
with an ∈ Z. This explains the factor 1728 in the definition of j(τ); it is the least integer that
makes the q-expansion of j(τ) integral.
19.2 Isogenies
Morphisms of complex tori ϕ : C/L1 → C/L2 are induced by holomorphic functions f : C → C
for which the following diagram commutes
← f
C → C
←
π1 π2
→
←ϕ
C/L1 → C/L2
One can show that, up to homethety, we can always take f to be a multiplication-by-α map
z 7→ αz for some α ∈ C× for which αL1 ⊆ L2 . The corresponding homomorphism ϕα is then
defined by z + L1 7→ αz + L2 . We have an isomorphism of abelian groups
¦ © ¦ ©
α ∈ C : αL1 ⊆ L2 −→ ∼
ϕ : C/L1 → C/L2 = Hom(C/L1 , C/L2 ),
The connection between Γ0 (N ) and isogenies can be seen in two ways. First, there is a one-
to-one correspondence between cyclic subgroups H of Z/N Z ⊕ Z/N Z of order N and cosets of
Γ0 (N ) in Γ (1). Alternatively, if we fix a basis 〈P, Q〉 of Z/N Z ⊕ Z/N Z so that H = 〈P〉, then the
matrices in G L2 (Z/NZ) that send P to a multiple of P and therefore stabilize H are precisely
the upper triangular matrices; if we restrict to matrices of determinant 1, these are precisely the
reductions of elements of Γ0 (N ) modulo N .
This leads to the moduli interpretation of X 0 (N ). There is a one-to-one correspondence
between non-cuspidal points in X 0 (N )(C) and equivalence classes of pairs (E, 〈P〉), where E is
an elliptic curve and P ∈ E[N ] is a point of order N . We regard two pairs (E, 〈P〉) and (E 0 , 〈P 0 〉)
to be equivalent whenever there is an isomorphism ι : E → E 0 for which 〈P 0 〉 = 〈ι(P)〉.
Let us now consider the modular curve X 1 (N ). Matrices in Γ1 (N ) don’t just fix a cyclic
subgroup of Z/N Z × Z/N Z of order N , they fix a point of order N , and there is a one-to-
one correspondence between non-cuspidal points in X 1 (N )(C) and equivalence classes of pairs
(E, P), where E is an elliptic curve and P ∈ E[N ] is a point of order N , where we now regard
pairs (E, P) and (E 0 , P 0 ) as equivalent if there is an isomorphism ι : E → E 0 for which P 0 = ι(P).
For the modular curve X (N ) we have a one-to-one correspondence between non-cuspidal
points in X (N )(C) and equivalence classes of triples (E, P, Q) where E[N ] = 〈P, Q〉 and equiva-
lence involves an isomorphism ι : E → E 0 with ι(P) = P 0 and ι(Q) = Q0 .
Every matrix in GL2 (Z/N Z) defines an automorphism of X (N ) via its action on P and Q,
so given any subgroup H ≤ GL2 (Z/N Z) we can consider the quotient curve X (N )/H. If we
take H to be the subgroup of upper triangular matrices in GL2 (Z/N Z), this quotient is X 0 (N ),
and restricting to upper triangular matrices with a 1 in the upper left corner yields X 1 (N ).
Unlike the curves X 0 (N ) and X 1 (N ), which have models over Q, the curve X (N ) is defined
over Q(ζN )(so over Q only when N ≤ 2). But if H is a subgroup of GL2 (Z/N Z) for which
det(H) = (Z/N Z)× , the quotient X (N )/H will have a model over Q.
References
[1] Andrew V. Sutherland, Lecture notes for Elliptic Curves (18.783), 2023.
ρ E,m : GK → Aut(E[m]).
L
The abelian group E[m] ' Z/mZ Z/mZ is a free Z/mZ module of rank 2, hence its auto-
morphism group is isomorphic to GL2 (Z/mZ). After fixing a Z/mZ-module basis for E[m] we
obtain a linear representation
ρ E,m : GK → GL2 (Z/mZ).
Note that this representation depends on a choice of basis and is defined only up to conjugation.
When m is a prime ` we have Z/`Z ' F` and can view this as a 2-dimensional F` -representation.
The multiplication-by-` map P 7→ `P defines a surjective homomorphism E[`n+1 ] → E[`n ]
for each n ∈ Z≥0 . These maps uniquely determine an inverse system that is used to define the
`-adic Tate module
T` (E) := lim E[`n ],
←−
n
which is a free Z` -module of rank 2. If we fix a basis (P1 , Q 1 ) for E[`], we can then choose
(P2 , Q 2 ) so that `P2 = P1 and `Q 2 = Q 1 , since the multiplication-by-` map defines a surjection
E[`2 ] E[`] (it is a homomorphism whose kernel has cardinality #E[`] = `2 = [E[`2 ] : E[`]],
so it must be surjective). Continuing in this fashion yields a compatible system of bases that
allows us to identify Aut(T` (E)) with GL2 (Z` ) and we have the `-adic Galois representation
As above, the isomorphism with GL2 (Z` ) depends on a choice of basis, so this representation is
determined only up to conjugation in GL2 (Z` ).
More generally, for any m, n ∈ Z≥1 with n|m the multiplication-by-(m/n) map defines a sur-
jective homomorphism E[m] E[n], yielding an inverse system of torsion subgroups ordered
by divisibility, and we can consider the adelic Tate module
When k is a number field only the first two possibilities can occur, and when End0 (E) is an
imaginary quadratic field we say that E has complex multiplication. Such E are rare: for any
fixed number field K the set { j(E) : End(E) 6= Z} is finite, and even if we let K range over all
number fields of degree at most d, we still obtain a finite set.
We should note that End(E) is not invariant under base change; any endomorphisms defined
over K are necessarily defined over any extension L/K, but E may acquire additional endomor-
phisms over an extension L/K. If End(EK ) is an imaginary quadratic field then we say that E
has potential complex multiplication, and in this case there is a unique minimal extension L/K
of degree at most 2 for which End(E L ) = End(EK ): we can take L to be the compositum of K
and the imaginary quadratic field End(EK ).1
The endomorphism ring of EK has a profound impact on the Galois representation ρ E . If
End(E) is an imaginary quadratic order O (so E has complex multiplication) then E[m] is not
just a free (Z/mZ)-module of rank 2, it is also a free O /mO -module of rank 1. If we fix a basis
〈P, Q〉 = E[m] and consider the orbit of P under the action of End(E) ' O = [1τ], we obtain all
1
This fact is specific to elliptic curves; for a general abelian variety A/K whose geometric endomorphism algebra
is a field F , the minimal field L for which End(A L ) = End(AK ) is not necessarily K F .
This implies that the image of ρ E,m is an abelian subgroup of GL2 (Z/mZ), which forces it to be
much smaller than GL2 (Z/mZ); ignoring log factors, its cardinality will be quadratic in m, while
the cardinality of GL2 (Z/mZ) is roughly quartic in m. This is true for every m and implies that
ρ E (GK ) is an abelian subgroup of GL2 (Z
b ) and it cannot be open because it must have infinite
index.
Even when End(E) = Z, if End(EK ) is an imaginary quadratic order O (so E has potential
complex multiplication), then over a quadratic extension L/K we will have End(E L ) = O , in
which case ρ E (G L ) is an abelian subgroup of GL2 (Zb ) that has index 2 in ρ E (GK ); in this case
ρ E (GK ) will not be abelian (it be a generalized dihedral group), but it will still have infinite
index in GL2 (Z b ) and cannot be open.
Proof. The forward direction was argued above; the reverse direction is Théorèm 3 in [2].
References
[1] Andrew V. Sutherland, Lecture notes for Elliptic Curves (18.783), 2023.
[2] Jean-Pierre Serre, Propriétés galoisiennes des points d’ordre fini des courbes elliptiques, Invent.
Math. 15 (1972), 259–331.
φ(P1 ) = aP1 + c P2
φ(P2 ) = bP1 + d P2
we define
a b
ι(φ) := ,
c d
so that
ι(φ(P1 )) = ( ac ) = a b
c d 0 = ι(φ)ι(P1 )
1
ι(φ(P2 )) = db = a b
c d 1 = ι(φ)ι(P2 ).
0
This is consistent with the left action of GL2 (N ) on Z(N )2 . The isomorphism ι allows us to view
the mod-N Galois representation
ι
ρ E,N : Galk → Aut(E[N ]) −→
∼
GL2 (N )
b2
T (E) := E(k̄)tor = lim E[N ] ' Z
←−
N
is the adelic Tate module. The isomorphism ι allows us to view the Galois representation
ι
ρ E : Galk → Aut(T (E)) −→
∼
GL2 (Z
b)
as a continuous group homomorphism Galk → GL2 (Z b ). Recall that the level of an open subgroup
H ≤ GL2 (Zb ) is the least N ∈ Z≥1 for which H = π−1 (πN (H)), where πN : GL2 (Z b ) → GL2 (N ) is
N
the canonical projection associated to the inverse limit GL2 (Z) ' lim GL2 (N ).
b
←−N
Definition 21.1. For H ≤ GL2 (N ), an H -level structure on an elliptic curve E is an equivalence
class [ι]H of isomorphisms ι : E[N ] → Z(N )2 , where ι ∼ ι 0 whenever ι = h ◦ ι 0 for some h ∈ H,
where h ∈ H ≤ GL2 (N ) = Aut(E[N ]) acts as an automorphism of Z(N )2 ; this action gives the
abelian group Hom(E[N ], Z(N )2 ) the structure of a left H-module, and we can equivalently
view [ι]H = {h ◦ ι : h ∈ H} as the H-orbit of an isomorphism ι ∈ Hom(E[N ], Z(N )2 ).
For open H ≤ GL2 (Z b ) of level N , an H -level structure on E is a πN (H)-level structure on E;
equivalently, an H-level structure on E is the H-orbit of an isomorphism ι ∈ Hom(T (E), Z b 2 ).
When H ∈ GL2 (N ) is trivial, the H-orbits of isomorphisms ι ∈ Hom(E[N ], Z(N )2 ) are sin-
gletons, each corresponding to a choice of basis (P1 , P2 ) for E[N ]. This is the full level-N struc-
ture on E. As noted in the previous lecture, the non-cuspidal points on the modular curve
21.2 Twists
Definition 21.2. Two curves X , X 0 over a field k are twists (of each other) if X L ' X L0 for some
extension L/k. The same definition also applies to abelian varieties (and more generally to any
category of objects over a field k that has an appropriate notion of base change).
For a fixed curve X /k, the k-isomorphism classes of twists X 0 of X are parameterized by the
cohomology set H 1 (Galk , Aut(X k̄ )). Note that Aut(X k̄ ) need not be abelian, so this is a pointed
set and not necessarily a group. Twisting is a subtle topic in general, but it is quite simple when
Aut(X k̄ ) is a finite abelian group on which Galk acts trivially, which is true in almost all cases
where X is an elliptic curve.
For an elliptic curve E/k with j(E) 6= 0, 1728 the only non-trivial isomorphism of Ek̄ is
the map that sends P to −P, in which case Aut(Ek̄ ) = Aut(E) ' {±1} is a cyclic group of
order 2 with trivial Galk -action. In this situation H 1 (Galk , Aut(Ek̄ )) consists of all continuous
homomorphisms Galk → {±1}, each of which is either trivial (corresponding to a trivial twist
E 0 ' E) or has a kernel whose fixed field is a quadratic extension L/k over which E L0 ' E L for
some E 0 6' E that is unique up to k-isomorphism; this is the quadratic twist of E by L/k.
When k does not have characteristic 2 or 3, we can assume E is defined by a Weierstrass
equation y 2 = x 3 + Ax + B, in which case the map P 7→ −P simply negates the y-coordinate of
P, andpif we write the fixed field of the kernel of a non-trivial element of H 1 (Galk , Aut(Ek̄ )) as
L = k( d) for some non-square d ∈ k, the twist E 0 of E by L is isomorphic to d y 2 = x 3 + Ax + B
(and also to y 2 = x 3 + d 2 Ax + d 3 B).
H\GL2 (N )/A E ,
where A E := {ϕN : ϕ ∈ Aut(E)} with ϕN := ι(ϕ|E[N ] ). For j(E) 6= 0, 1728 we have A E = {±1}, and
if −1 ∈ H we can identify double cosets in H\GL2 (N )/{±1} with the right cosets in H\GL2 (N ).
Each σ ∈ Galk induces an isomorphism E σ [N ] → E[N ] defined by P 7→ σ−1 (P), where E σ
is the elliptic curve obtained by letting σ act on the coefficients of an equation defining E; let
σ−1 denote this isomorphism. We have a right Galk -action on YH (k̄) given by
and define YH (k) to be the fixed points of this action. Each point in YH (k) is represented by a
pair (E, [ι]H ), where E/k is an elliptic curve and for every σ ∈ Galk there is a ϕ ∈ Aut(Ek̄ ) and
h ∈ H such that
ι ◦ σ−1 = h ◦ ι ◦ ϕN .
Equivalently, YH (k) consists of pairs ( j(E), α) where j(E) ∈ k is the j-invariant of an elliptic
curve E/k and α = H gA E ∈ H is a double coset with H gρ E,N (σ)A E = H gA E for every σ ∈ Galk .
For E/k with j(E) 6= 0, 1728, if −1 ∈ H we can determine the k-rational points on YH corre-
sponding to E (if any) by computing the fixed points of ρ E,N (Galk ) ⊆ GL2 (N ) in the permutation
representation of GL2 (N ) given by the right action of GL2 (N ) on H\GL2 (N ). This amounts to
computing the rational points in the fiber of X H → X (1) above the point j(E) on X (1) ' P1k .
Proposition 21.3. Let H be an open subgroup of GL2 (Z b ) of level N and E/k be an elliptic curve. If
ρ E,N (GalK ) ≤ H then there exists an isomorphism ι : E[N ] −→ ∼
Z(N )2 for which (E, [ι]H ) ∈ YH (k).
Conversely, if (E, [ι]H ) ∈ YH (k), then for every twist E 0 of E there exists ι 0 : E 0 [N ] −→
∼
Z(N )2
0 0 0
with (E , [ι ]H ) ∈ YH (k), and if Aut(Ek̄ ) = {±1} for at least one twist E we haveρ E 0 ,N (Galk ) ≤ H.
Proof. If ρ E,N (Galk ) ≤ H then Galk fixes the trivial double coset HαA E with α = 1, and we
have ( j(E), α) ∈ YH (k). For the converse, if E 0 is a twist of E, then we have an isomorphism
φ : E 0 −→
∼
Ek̄ that induces an isomorphism φN : E 0 [N ] −→ ∼
E[N ], and if we put ι 0 = ι ◦ φN the
k̄
pairs (E, [ι]H ) and (E 0 , [ι 0 ]H ) are equivalent (as witnessed by φ) and represent the same point
in YH (k̄), so if (E, [ι]H ) lies in YH (k) then so does (E 0 , [ι 0 ]H ). See Problem 2 on Problem Set 7
for the final claim.
The modular curve X H is obtained from YH by adjoining cusps X H∞ , whose functor of points
we now describe. The cusps in X H∞ (k̄) correspond to double cosets H\GL2 (N )/U(N ), where
U(N ) = 〈± 10 11 〉 corresponds to the stabilizer of ∞ in SL2 (Z). We equip X H∞ (k̄) with a right
Galk -action hgu 7→ hgχN (σ)u, where the cyclotomic character χN (σ) := 0e 10 is defined by
σ(ζN ) = ζeN . Rational cusps in X H∞ (k) corresponds to double cosets in H\GL2 (N )/U(N ) that
are fixed by χN (Galk ).
References
[1] Pierre Deligne and Michael Rapoport, Les schémas de modules de courbes elliptiques, Modular
functions of one variable, II (Proc. Internat. Summer School, Univ Antwerp, 1972), 1973,
pp. 143–316, Lecture Notes in Mathematics 349, Springer.
[2] Jeremy Rouse, Andrew V. Sutherland, and David Zureick-Brown, with an appendix by John
Voight, `-adic images of Galois for elliptic curves over Q, Forum of Mathematics, Sigma 10
(2022), e62.
[3] Andrew V. Sutherland, Lecture notes for Elliptic Curves (18.783), 2023.
where π̂ E is the dual of π E (the unique endomorphism π̂ E for which π E π̂ E = deg π E = q) and
tr π E := π E + π̂ E ∈ Z is the trace of Frobenius.
For any positive integer m ⊥ q and endomorphism α ∈ End(E) with deg(α) ⊥ m, the action
of α on E[m] is given by a matrix αm ∈ GL2 (Z/mZ) that has the same characteristic polynomial
x 2 − tr(α)x + deg(α)
as α modulo m; this implies tr(αm ) ≡ tr(α) mod m and det(αm ) ≡ deg(α) mod m. For α = π E
the action of π E on E[m] is thus described by a matrix in GL2 (Z/mZ) with trace congruent to
tr(π E ) = q + 1 − #E(F p ) modulo m and determinant congruent to deg(π E ) = q modulo m; see
Lectures 6–7 of [7] for proofs of these facts.
Now tr(π E ) and deg(π E ) are integers that do not depend on m, and in fact there is an integer
matrix Aπ whose reduction modulo m describes the action of π E on E[m] for every m ⊥ q, as
proved by Duke and Tóth in [2].
Theorem 22.1 (Duke and Tóth). Let E/Fq be an elliptic curve with Frobenius endomorphism π E .
Define a, b, ∆ ∈ Z by a := tr π E , ∆ := disc(End(E) ∩ Q(π E )), and b := (a2 − 4q)/∆ if ∆ 6= 1,
p
with b := 0 if ∆ = 1. For a suitable choice of bases, the reduction modulo m of the integer matrix
a+bδ
b
Frob E := Frob(a, b, ∆) = 2
b(∆−δ) a−bδ (δ := ∆ mod 4 ∈ {0, 1})
4 2
Proof. This follows from [2, Theorem 2.1] and the Duering lifting theorem [7, Thm. 21.15].
Corollary 22.2. Let E/K be an elliptic curve over a number field, let p be a prime of good reduction
for E, and let π be the Frobenius endomorphism of the reduction of E to the residue field of p. Then
ρ E,m (Frobp ) ∈ GL2 (Z/mZ) is conjugate to the image of Frob E in GL2 (Z/mZ) for every m ⊥ p.
We note that the integers a, b, ∆ that define Frob E = Frob(a, b, ∆) satisfy the norm equation
4q = a2 − b2 ∆,
#X H (F p ) = #X H∞ (F p ) + #YH (F p ).
If we put G L2 (N ) := GL2 (Z/N Z) and let U(N ) = ± 10 11 , we then have
GalF
#X H∞ (F p ) = # H\GL2 (N )/U(N ) p
p
where GalFp acts via the matrix 0p 01 , since Frob p (ζN ) = ζN . We note that #X H∞ (F p ) depends
only on the value of p mod N , so we can determine #X H∞ (F p ) for every prime p - N by simply
counting double cosets in H\GL2 (N )/U(N ) fixed by 0a 01 with a varying over (Z/N Z)× .
where A E := {ϕN : ϕ ∈ Aut(Ek̄ )} is just {±1} for j(E) 6= 0, 1728 and the action of GFp is given by
ρ E,N (Frob p ) ≡ Frob E mod N . We note that while Frob E is not completely determined by j(E),
its action on the double-coset space H\GL2 (N )/A E is. Indeed, the elliptic curves E 0 /F p with
j(E 0 ) = j(E) are twists of E, and for j(E) 6= 0, 1728 there are just two F p -isomorphism classes
of twists, and their Frobenius matrices Frob E differ only by −1 ∈ A E .
For convenience we split the sum in (1) into four parts:
depending on whether j(E) = 0, j(E) = 1728, j(E) 6= 0, 1728 is ordinary, j(E) 6= 0, 1728 is
supersingular. Recall that E/F p is ordinary if tr π E 6≡ 0 mod p and supersingular otherwise. To
keep things simple, we will focus on the main term #YHord (F p ); see Lemma 5.1.1 and Table 6
in [6] for the other cases, all of which can be computed very quickly provided one knows the
class numbers h(−p) and h(−4p) (these are easy to compute as long as p is not too large and
precomputed values are available in the LMFDB for p < 1012 ).
For ordinary j(E) 6= 0, 1728 we will have ∆ < −4, in which case Frob E is determined up
to the signs of a and b by the norm equation 4q = a2 − b2 ∆, which has a unique solution
with a, b > 0. For each such solution the theory of complex multiplication implies that there
are exactly h(∆) j-invariants j(E) for which End(E) ∩ Q(π E ) = End(E) has discriminant ∆. It
thus suffices to enumerate solutions to 4q = a2 − b2 ∆ and for each solution count the right
cosets in H\GL2 (N ) that are fixed by the Frobenius Frob(A, B, ∆), which can be done using the
permutation character χH for H ≤ GL2 (N ), which for each g ∈ G counts the cosets in H\GL2 (N )
that are fixed by g. Note that this permutation character does not depend on p and can be
computed very quickly once one knows its values on generators for G, which can be done as a
precomputation. For all primes p > 3 we then have
X X
#YHord (F p ) = χH (Frob(a, b, ∆/b2 ))h(∆/b2 )
p
1≤a<2 p b| cond(∆)
∆=a2 −4p
Æ p
where cond(∆) := ∆/ disc(Q( p ∆)) is the conductor of the order of discriminant ∆ (its index
in the ring of integers of Q( ∆)).
This implies that for every prime p - N the Frobenius trace of JH at p is an integer linear com-
bination of the Frobenius traces of the A f at p. Now a p (JH ) = p + 1 − #X H (F p ), so we can
compute a p (JH ) by counting points on X H over the F p as described above.
If we have precomputed values of a p (A f i ) for 1 ≤ i ≤ r (which can be found in the LMFDB
for moderate values of p and N ), we can construct a linear system Ax = B, where A = (a pi ( f j ))i j
is an integer matrix with rows indexed by primes p j - N (up to some bound P) and a column
for each of the normalized eigenforms f1 , . . . , f m , while B is a column vector whose ith entry is
a pi (JH ) = pi + 1 − #X H (F pi ). For sufficiently large P the matrix A will have rank r and uniquely
determine the exponents ei that appear in the factorization of L(JH , s), thereby determining the
modular abelian varieties A f that appear as isogeny factors in JH along with their multiplicities.
The isogeny decomposition of JH has many applications. In particular, it allows us to com-
pute the analytic rank of the L-function of L(JH , s) as the sum of the analytic ranks of the modular
abelian varieties A f , which are easy to compute. Under a generalization of the BSD conjecture,
this analytic rank tells us the Mordell-Weil rank of JH , and if this rank is strictly less than the
genus of X H , we can (at least in principal) use the Chabauty-Coleman method to rigorously
determine the set X H (Q) and all elliptic curves E/Q that admit an H-level structure. Note that
Faltings’ theorem implies that X H (Q) is finite whenever g(X H ) ≥ 2.
Mazur famously determined the H-level structures that are compatible with rational torsion
(the 15 possible torsion subgroups that arise for E/Q, which as Z/N Z for 11 6= N ≤ 12 and
Z/2Z ⊕ Z/2N Z for N ≤ 4) in [4]. The corresponding X H have genus 0 (as conjectured by Ogg),
meaning that their rational points are parameterized by P1Q . Mazur also famously determined
the H-level structures that are compatible with a rational isogeny of prime degree [5], which
was soon extended to cyclic isogenies of any degree. Some of these X H ' X 0 (N ) have nonzero
genus, which is fortunate; the fact that X 0 (15) has only finitely many rational points played a
References
[1] Pierre Deligne and Michael Rapoport, Les schémas de modules de courbes elliptiques, Modular
functions of one variable, II (Proc. Internat. Summer School, Univ Antwerp, 1972), 1973,
pp. 143–316, Lecture Notes in Mathematics 349, Springer.
[2] William Duke and Árpád Tóth, The splitting of primes in division fields of elliptic curves,
Experiment. Math. 11 (2002), 555–565.
[3] Barry Mazur, Rational points on modular curves, Modular functions of one variable, V, 1977,
pp. 107-148, Lecture Notes in Math. 601, Springer.
[4] Barry Mazur, Modular curves and the Eisenstein ideal, Inst. Hautes Étudces Sci. Publ. Math.
47 (1977), 33–186.
[5] Barry Mazur, Rational isogenies of prime degree (with an appendix by D. Goldfeld), Invent.
Math. 44 (1978), 129–162.
[6] Jeremy Rouse, Andrew V. Sutherland, and David Zureick-Brown, with an appendix by John
Voight, `-adic images of Galois for elliptic curves over Q, Forum of Mathematics, Sigma 10
(2022), e62.
[7] Andrew V. Sutherland, Lecture notes for Elliptic Curves (18.783), 2023.
In this final lecture we survey some generalizations of the (classical) modular forms we have
been studying in this course (some of this material can be found in [4, Chapter 15] but much
of it is taken from the literature).
f |k γ = f
for every γ ∈ Γ and are holomorphic at the cusps of Γ , where the weight-k slash operator is
so the cocycle conduction holds. We have a surjective homomorphism Mp2 (R) → SL2 (R) defined
p
by (γ, ") 7→ γ whose kernel is {(1, ± z)} ' {±1}. This leads to the exact sequence
GL+ +
2 (K) = {α ∈ GL2 (K) : σi (α) ∈ GL2 (R) for 1 ≤ i ≤ r},
:
as the subgroup of matrices in GL2 (K) with totally positive determinant. The group GL+
2 (K) acts
+
on H via the isometric action of GL2 (R) on H. For functions f : H → C, vectors of integers
r r
k ∈ Z r , and α ∈ GL+
2 (K) we define the weight- k slash operator
r
Y
ki /2
( f |k α)(z) := det(σi (α)) j(σi (α), z) ki
f (αz),
i=1
which we note coincides with the usual weight-k slash operator when r = 1.
Let ZK denote the ring of integers of K (the integral closure of Z in K). The group GL+ 2 (ZK ) is
an arithmetic lattice in the locally compact group GL+ 2 (R) r
, as is any Γ ≤ GL +
2 (K) commensurable
with GL+2 (Z K ); we note that unlike the case r = 1, for r > 1 lattices in GL +
2 (R) are automatically
arithmetic.
Definition 23.1. Let K be a totally real number field of degree r > 1. For k ∈ Z r and lattices
Γ ≤ GL+ +
2 (K) commensurable with GL2 (ZK ), a Hilbert modular form f : H → C of weight k
r
Γ0 (n) := ac db ∈ GL+
2 (ZK ) : c ∈ n ,
and say that Hilbert modular forms for Γ0 (n) have level n. More generally, if c is any fractional
Z F -ideal, we can define
Γ0 (c, n) := ac db ∈ GL+ −1
2 (K) : a, d ∈ ZK , c ∈ cn, b ∈ c
and consider the space Mk (c, n) of Hilbert modular forms for Γ0 (c, n) of level (c, n). This is a finite
dimensional C-vector space, as is its subspace Sk (c, n) of cusp forms (Hilbert modular forms that
r
vanish at the cusps). These spaces have dimension zero unless k ∈ Z≥0 .
For suitable k (including parallel even weight k) there are dimension formulas and traces
formulas, as well as a nice theory of Hecke operators and newforms; we now have Hecke op-
erators Tp associated to each prime ideal p of ZK and newforms f have Fourier coefficients ap
equal to the eigenvalue of Tp on f (all of which are algebraic integers). We should note that the
field Q( f ) generated by these eigenvalues need not contain K; indeed, we may have Q( f ) = Q.
For parallel weight-2 newforms f there is a conjectural analog of the Eichler-Shimura con-
struction that yields modular abelian varieties over K that is known to hold in many cases (in-
cluding all cases where K has odd degree) due to a theorem of Hida [7]. When f has rational
Hecke eigenvalues (so Q( f ) = Q), one obtains an elliptic curve E/K.
Recall that for K = Q the modularity theorem gives a 1-to-1 correspondence between isogeny
classes of elliptic curves E/Q of conductor N and newforms f of weight 2 and level N with
Q( f ) = Q in which E and f have the same L-function. It is similarly conjectured that there
is a 1-to-1 correspondence between elliptic curves over totally real fields K of conductor n and
Hilbert modular newforms of parallel weight 2 and level n with Q( f ) = Q in which E and f
have the same L-function.
There is much recent progress toward this conjecture, which is now known to hold for all
real quadratic fields [6] all totally real cubic fields [5], and in many other cases. Unlike E/Q, it
is possible for E/K to have everywhere good reduction (meaning the its conductor is the trivial
ideal ZK ), corresponding to a Hilbert modular form for the full Hilbert modular group GL+ 2 (ZK ).
Remark 23.2. Some authors restrict to subgroups of SL2 (ZK ) ≤ GL+ 2 (ZK ) when discussing
Hilbert modular forms. This is a nontrivial restriction: the determinant of an element of GL+
2 (ZK )
is a unit that is positive in every real embedding (a totally positive unit), but this condition is
satisfied by the square of any unit, and there are infinitely many when K = Q.
−I g
0
of symmetric g × g matrices with positive definite imaginary part. Let J = Ig 0 ∈ M2g (Z).
The symplectic group
Sp2g (R) := {γ ∈ SL2g (R) : γT Jγ = J}
note that for g > 1 this expression involves matrices and the order of multiplication matters.
This extends to an isometric action of GSp+ 2g (R) on H g , where
and Λ∗ := { f ∈ V ∗ : Im f (Λ) ⊆ Z}. This is defines an isogeny from the abelian variety A = V /Λ
and the dual abelian variety A∗ = V ∗ /Λ∗ ; for principal polarizations this is an isomorphism.
Two elements τ, τ0 ∈ H g define isomorphic principally polarized abelian varieties Aτ ' Aτ0
if and only if they lie in the same Sp2g (Z) orbit. We thus have a bijection between isomor-
phism classes of principally polarized abelian varieties A/C of dimension g and the quotient
Sp2g (Z)\H g , which is a variety A g /Q of dimension g(g + 1)/2, the moduli space of principally
polarized abelian varieties of dimension g. For g = 1 the moduli space A1 corresponds to
the modular curve Y (1) = Γ (1)\H; note that Γ (1) = SL2 (Z) = Sp2 (Z) and H = H1 . As with
Y (1) = A1 , the moduli space A g is not compact, but it can be compactified and embedded into
projective space (analogous to the construction of X (1) from Y (1), but a bit more complicated).
We note that for g ≤ 3 the dimensions of A g and M g coincide. Here M g is the moduli space
of nice (smooth projective geometrically connected) curves; it has dimension 1 for g = 1 and
dimension 3g − 3 for g > 1. Thus almost all principally polarized abelian varieties of dimension
g ≤ 3 are Jacobians of nice curves of genus g, but this is not true for any g > 3.
We now define
Γ (N ) := {γ ∈ Sp2g (Z) : γ ≡ 1 mod N },
and call subgroups Γ ≤ Sp2g (Z) that contain Γ (N ) for some N congruence subgroups. For g > 1
a holomorphic function f : H g → C is a (classical) Siegel modular form of (parallel even) weight
k ∈ 2Z for a congruence subgroup Γ if it is invariant under the weight-k slash operator
( f |k γ)(τ) := (det J(γ, τ)−k f (γτ),
for all γ = CA DB ∈ Γ , where J(γ, τ) = Cτ + D. This definition coincides with the definition of
a modular form for a congruence subgroup when g = 1 except the holomorphicity condition at
the cusps is omitted because it is automatically satisfied (by Koecher’s principle).
This definition can be generalized in many ways, including to non-parallel weights and
vector-valued functions associated to representations of GL g (C), but for the purpose of this
introduction there is one particular generalization we want to consider, and to simplify matters
we will restrict to g = 2. Rather than a congruence subgroup Γ ≤ Sp2g (Z) we instead use a
paramodular group
Z NZ Z
Z
Z Z Z Z/N
K(N ) := ∩ Sp4 (Q).
Z NZ Z Z
NZ NZ NZ Z
y 2 + (x 3 + x 2 + x + 1) y = −x 2 − x,
which is the genus 2 curve with label 277.a.277.1 in the LMFDB. Infinite families of abelian
surfaces over Q for which the paramodular conjectures holds are now known [3], building on
the potential modular results of Boxer, Calegari, Gee, and Pilloni [1]. Very recently the same
authors have announced a proof that the paramodular conjecture holds for a positive proportion
of abelian surfaces over Q (but the new result uses different techniques).
One can define paramodular groups K(N ) ≤ Sp2g (Q) for even g > 2, but in general the
quotient K(N )\H g parameterizes abelian varieties whose endomorphism ring is an order in a
totally real field of degree g/2. For g > 2 these abelian varieties are not fully general and do
not account for a positive proportion of abelian varieties of dimension g.
References
[1] George Boxer, Frank Calegari, Toby Gee, and Vincent Pilloni, Abelian surfaces over totally
real fields are potentially modular, Publ. Math. IHES 134 (2021), 153–501.
[2] Armand Brumer and Ken Kramer, Paramodular abelian varieties of odd conductor, Trans.
Amer. Math. Soc. 366 (2013), 2463–2516 (but see 2018 correction in arXiv:1004.4699).
[3] Frank Calegari, Shiva Chidambaram, and David Roberts, Abelian surfaces with fixed 3-
torsion, Proceedings of the Fourteenth Algorithmic Number Theory Symposium (ANTS XIV),
Open Book Series 4, Mathematical Sciences Publishers, 2020.
[4] Henri Cohen and Fredrik Strömberg, Modular forms: A classical approach, Graduate Studies
in Mathematics 179, American Mathematical Society, 2017.
[5] Maarten Derickx, Filip Najman, and Samir Siksek, Elliptic curves over totally real cubic fields
are modular, Algebra Number Theory 14 (2020), 1791–1800.
[6] Nuno Freitas, Bao V. Le Hung, and Samir Siksek, Elliptic curves over real quadratic fields are
modular, Invent. Math. 201 (205), 159–206.
[7] Haruzo Hida, Hilbert modular forms and Iwasawa theory, Oxford University Press, 2006.
[8] Winfried Kohnen, Modular forms of half–integral weight on Γ0 (4), Math. Ann. 248 (1980),
249–266.
[9] Brooke Roberts and Ralf Schmidt, Local newforms for GSp(4), Lecture Notes in Mathematics
1918, Springer, 2007.
[10] Goro Shimura, On modular forms of half integral weight, Ann. of Math. 97 (1973), 440-
481.
[11] Nils–Peter Skoruppa and Don Zagier, Jacobi forms and a certain space of modular forms,
Invent. Math. 94 (1988), 113–146.