Special Relativity
Special Relativity
In this section we will discuss how to treat gravity. But first, lets discuss the Special Theory of Relativity
• laws of physics have the same form (invariant) in all inertial frames
• for all inertial observers, the speed of light in vacuum is a constant c and is independent of the relative
velocity with the source of the light
Einstein’s task was to find a set of transformation rules between inertial frames that satisfy the above two
statements. Such transformations were already worked out by Hendrik Lorentz.
where
1
= v/c , =p .
1 2
Throughout the course we will work in the natural unit ~ = 1, c = 1. With this choice, the LT reads
t0 = t x ,
0
x = x t , (16)
0 0
y = y, z =z
and
1
= v, =p . (17)
1 2
A set of reverse transformation (from S 0 ! S ) also exists – they can be obtained from the above set
by interchanging the primed and unprimed coordinates, and changing the direction of the velocity. For
the above set of relations, we have assumed the velocity along the x direction. For velocity in arbitrary
directions the relations are a little more complicated, but we do not need them for our discussions.
13
Figure 8: Two inertial frames with relative speed v.
Defining the two column vectors by x0µ ⌘ (x00 , x01 , x02 , x03 )T x⌫ ⌘ (x0 , x1 , x2 , x3 )T , the matrix by ⇤µ⌫ ,
and using the Einstein summation convention (repeated indices are summed over) we can write the above
relation as
X3
x0µ = ⇤µ⌫ x⌫ = ⇤µ⌫ x⌫ , µ, ⌫ = 0, 1, 2, 3 (19)
⌫=0
Here µ, ⌫ are called the Lorentz indices. Comparing the preceding equations we find ⇤00 = , ⇤01 = ,
etc.
One crucial observation is to be made here – the space and the time coordinates are no longer separate
entities – they transform together as single entity. It is called the spacetime. In analogy with the location of
a point in three dimensional space, lets introduce the concept of an event, defined as a set of four coordinates
xµ ⌘ (t, x, y, z) in the 4-dimensional spactime. The xµ is a vector in 4D spacetime that transforms like
(19) under a LT. It is noting but a location of an event in 4D spacetime. And it is analogous to specifying
the location of a point in a 3D space. Therefore, xµ is called a 4-vector
xµ = (x0 , x1 , x2 , x3 ) = (x0 , ~x) ,
where ~x is the usual three vector. Conventionally, the x0 is called the time component and x1 , x2 , x3 are
the space components.
3.1.2 4-vectors
The concept of 4vector extends beyond the location of an event in spacetime. Any four component object
~ that transforms like (19)
Aµ ⌘ (A0 , A1 , A2 , A3 ) = (A0 , A)
is called a four-vector. Remember that any set of four objects do not constitute a 4vector unless they
transform like the above.
In analogy with xµ , the 0th component A0 is called the time component and the other three components
are called the space components. In addition to xµ another 4vector that you should remember by heart is
the four momentum of a particle denoted as pµ . The time component of pµ is its energy E, and the space
components are just the components of its ordinary three-momentum p~. So
pµ = (E, p~) . (21)
14
p
If m is the mass of the particle then E = p~2 + m2 . Hence the squared of the 4momentum is
p µ pµ = E 2 p~2 = m2 .
where ! is the angular frequency, and k = 2⇡/ is the wave vector. In this case
pµ pµ = 0
measured from a coordinate system S (you can refer to the previous figure.) Suppose, we perform a LT of
the coordinate system to S 0 and the events are x0µ and x0µ + dx0µ . Let say the quantity ds2 is transformed
to (ds0 )2
ds02 = (dx00 )2 + (dx01 )2 + (dx02 )2 + (dx03 )2 .
Using LT relation dx0µ = ⇤µ⌫ dx⌫ we can show (try to do it yourself – will be roughly done in the class)
ds02 = ds2 ,
So ds2 = (dx0 )2 + (dx1 )2 + (dx2 )2 + (dx3 )2 is an invariant quantity. The ds is called spacetime interval,
or line element, or metric. This is analogous to the concept of length in a 3D space.
With the aid of the summation convention, the spacetime interval can be written in a compact form as
where in the last line we have introduced the Minkowski metric or simply the metric of spacetime
0 1
1 0 0 0
B0 1 0 0C
⌘µ⌫ =B
@0
C . (24)
0 1 0A
0 0 0 1
3.1.4 Definition of LT
For any 4-vector Aµ we can define ⌘↵⌫ A⌫ A↵ as the equivalent of length2 (see (23)). Since this is a scalar
it is invariant under the LT, i.e., A⌫ ⌘↵⌫ A↵ = A0µ ⌘ µ A0 . Equating and simplifying
A⌫ ⌘↵⌫ A↵ = A0µ ⌘ µA
0
= ⇤µ⌫ A⌫ ⌘ µ ⇤ ↵ A↵ ,
hence
⇤µ⌫ ⌘ µ⇤ ↵ = ⌘↵⌫ . (25)
15
Any matrix ⇤ that satisfies this condition is LT matrix. In fact, it defines Lorentz transformation.
The rotation of a 3-dimension coordinate system, for example the rotation of Cartesian coordinate in
the x y plane constitutes a LT (easy to prove, try!)
0 1
1 0 0 0
B0 cos ✓ sin ✓ 0C
⇤µ⌫ = B@0
C, (26)
sin ✓ cos ✓ 0A
0 0 0 1
where ✓ is the angle of rotation. Another example of LT is boost, which is thought rotations between space
and time directions” 0 1
cosh sinh 0 0
B sinh cosh 0 0C
⇤µ⌫ = B
@ 0
C. (27)
0 1 0A
0 0 0 1
Aµ ⌘ ⌘µ⌫ A⌫ = ( A0 , A1 , A2 , A3 ) .
The time component of contravariant vector is negative of the time component of its covariant form. For
two di↵erent 4-vectors Aµ and B µ we can write scalar product as
⌘µ⌫ Aµ B ⌫ = A0 B 0 + A1 B 1 + A2 B 2 + A3 B 3 = ~B
A0 B 0 + A. ~ = Aµ B µ . (30)
Corresponding to each covariant vector there exist a contravariant vector and vice-versa. But there are
some vectors that appear more often in the contravariant form and others in covariant. For example, dxµ
is more often contravariant, and the gradiant operator
@
@µ ⌘ (33)
@xµ
is more often covariant. The transformation rule of the gradiant operator is
@ @x⌫ @
= ,
@x0µ @x0µ @x⌫
16
As x0⌫ = ⇤⌫µ xµ , it follows that
@x⌫
= ⇤µ⌫ , (34)
@x0µ
hence,
@ @
@µ0 = 0µ
= ⇤µ⌫ ⌫ = ⇤µ⌫ @µ . (35)
@x @x
Equivalently,
@ 0µ = ⇤µ⌫ @ ⌫ . (36)
It is easy to see that the divergences @µ Aµ and @ µ Aµ are invariant. Another useful invariant operator is
the d’Alembertian, the product of the gradiant with itself
@2
⇤2 = ⌘ ↵ @↵ @ = + r2 . (37)
@t2
⌘ µ⌫ A⌫ = Aµ . (39)
⌘µ⌫ = ⌘ µ⌫ . (40)
3.1.7 Light-cone
The trajectories of particles in 4D spacetime are called world lines. The world line of light is special.
Consider the line element
The wavefront of light has velocity c = |d~x/dx0 | = 1 (since we have c = 1) which means the worldline of
light corresponds to
ds2 = 0 . (41)
One can draw the world line of a light ray in a space vs time plot. Since it is impossible to plot in four
dimension, we will resort to a two dimensional plot where the horizontal axis corresponds to space and the
vertical axis is time. The world line of a light ray makes 45o angle with respect to the space axis as shown
in 9. This is also called the lightlike world line. In 4-dimension this would actually be a cone, hence 9 is
called the light cone. For ordinary particles |dx/dt| < 1, therefore, the world lines for any massive particles
makes larger angle with the space axis.
Due to the relative negative sign between the time and space components, ds2 in eq.(23) can be of three
types
• timlike, ds2 < 0: Two events separated by ds2 < 0 are causally connected, i.e., information can travel
from one event to the other. An example is two events taking place at the same spatial position but
at di↵erent times.
• spacelike, ds2 > 0: Spacelike event are causally disconnected. Example would be two events are
simultaneous in time but separated in space.
• null or lightlike, ds2 = 0: Causally connected and lie in the world line of light.
17
Figure 9: Light cone and your world line.
Suppose there is a clock attached to the rest frame of an observer. The observer observes two consecutive
ticks in the clock, t1 = 0 and t2 = 0 + dt. Since the clock is at rest dx1 = 0, dx2 = 0, dx3 = 0, and
dx0 = t2 t1 = dt. So, the interval between the two ticks is
i.e., d⌧ is the measure of the ellapsed time dt in the clock of the observer. For this reason, d⌧ is known as
the proper time of an observer. Proper time is invariant, i.e., d⌧ 0 = d⌧ .
Exercise: There is a clock in the rest frame of an observer O. The time interval between two consicutive
ticks on the clock measured by the observer is dt. Another observer O0 is moving with a velocity v with
respect to the O’s frame. Show using the invariance of proper time, that the observer O0 would notice that
the clock is moving slower, as if the time is dialated.
Exercise: The length contraction exercise.
18
Then we can write the components as
dt 1
U0 = p =p =
1 2 dt 1 2
(46)
dxi 1 dxi
Ui = p =p = .
1 2 dt 1 2 dt
The norm of the 4velocity can be easily shown to be U µ Uµ = 1 (this would have been equal to c2 had
we not put c = 1). The negative norm of the four velocity reflects the fact that it is defined for timelike
world lines.
Exercise: Work out the relativistic addition of velocities.
The four acceleration can be defined as
dU µ
aµ ⌘ , (47)
d⌧
and the relativistic four-force is
dU µ
f µ = maµ = m . (48)
d⌧
In an inertial frame, for speed of a particle that is much smaller than 1 (in other words, speed of light),
this boils down to Newton’s laws. The constant m defined here is called the rest mass of the particle. The
four acceleration is perpendicular to the four velocity. It can be easily shown that
aµ U µ = 0 (49)
pµ p µ = m2 U µ U µ = m2
) E 2 = (p0 )2 = |~
p|2 + m2 (51)
This is the famous relativistic energy-momentum relation. Here we have the energy E = p0 . It is easy to
see that in the limit of small velocities, beta ⌧ 1, and ' 1 + 2 /2, p0 . and pi are
1
p0 ' m + mv 2 , pi = mv i .
2
The second term of p0 is Newtonian kinetic energy and pi are the ordinary momentum. This is why pµ
is called the energy-momentum vector. Note that the energy has two components m called the rest mass
energy and the rest is the kinetic energy. For particles at rest the only energy is the rest mass energy
E = m, (52)
which is the most famous equation in science (we have taken c = 1, otherwise E = mc2 ). Expression of
only the relativistic kinetic energy of a particle is obtained by subtracting the rest mass energy as
p
T = p2 + m m . (53)
For a massless particle like the photon, the four-momentum vector is given by
where ! is the frequency, and ~k is the wave vector, and we have purposefully kept the Planck constant ~.
For photon, p2 = pµ pµ = 0.
19