The Meaning of Einstein Field Equations
General relativity explains gravity as the curvature of spacetime. It's all about geometry. The basic equation of general relativity is called Einstein's equation. In units where $c = 8\pi G = 1$, it says
$$G_{\alpha\beta} = T_{\alpha\beta}. \tag{1}$$
It looks simple, but what does it mean? Unfortunately, the beautiful geometrical meaning of this equation is a bit hard to find in most treatments of relativity. There are many nice popularizations that explain the philosophy behind relativity and the idea of curved spacetime, but most of them don't get around to explaining Einstein's equation and showing how to work out its consequences. There are also more technical introductions which explain Einstein's equation in detail -- but here the geometry is often hidden under piles of tensor calculus. This is a pity, because in fact there is an easy way to express the whole content of Einstein's equation in plain English. In fact, after a suitable prelude, one can summarize it in a single sentence! One needs a lot of mathematics to derive all the consequences of this sentence, but it is still worth seeing -- and we can work out some of its consequences quite easily. In what follows, we start by outlining some differences between special and general relativity. Next we give a verbal formulation of Einstein's equation. Then we derive a few of its consequences concerning tidal forces, gravitational waves, gravitational collapse, and the big bang cosmology. In the last section we explain why our verbal formulation is equivalent to the usual one in terms of tensors.
Before stating Einstein's equation, we need a little preparation. We assume the reader is somewhat familiar with special relativity -- otherwise general relativity will be too hard. But there are some big differences between special and general relativity, which can cause immense confusion if neglected. In special relativity, we cannot talk about absolute velocities, but only relative velocities. For example, we cannot sensibly ask if a particle is at rest, only whether it is at rest relative to another. The reason is that in this theory, velocities are described as vectors in 4-dimensional spacetime. Switching to a different inertial coordinate system can change which way these vectors point relative to our coordinate axes, but not whether two of them point the same way. In general relativity, we cannot even talk about relative velocities, except for two particles at the same point of spacetime -- that is, at the same place at the same instant. The reason is that in general relativity, we take very seriously the notion that a vector is a little arrow sitting at a particular point in spacetime. To compare vectors at different points of spacetime, we must carry one over to the other. The process of carrying a vector along a path without turning or stretching it is called `parallel transport'. When spacetime is curved, the result of parallel transport from one point to another depends on the path taken! In fact, this is the very definition of what it means for spacetime to be curved. Thus it is ambiguous to ask whether two particles have the same velocity vector unless they are at the same point of spacetime. It is hard to imagine the curvature of 4-dimensional spacetime, but it is easy to see it in a 2-dimensional surface, like a sphere. The sphere fits nicely in 3-dimensional flat Euclidean space, so we can visualize vectors on the sphere as `tangent vectors'. 
If we parallel transport a tangent vector from the north pole to the equator by going straight down a meridian, we get a different result than if we go down another meridian and then along the equator:
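The path dependence described here can be checked numerically. The following is a minimal sketch, assuming a unit sphere embedded in flat 3-dimensional space and using the fact that parallel transport along a great-circle arc is a rigid rotation about the axis perpendicular to the plane of that arc. The helper names (`cross`, `dot`, `transport_along_great_circle`) are illustrative and not part of the original text.

```python
import math

def cross(a, b):
    """Cross product of two 3-vectors."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def dot(a, b):
    """Dot product of two 3-vectors."""
    return sum(x * y for x, y in zip(a, b))

def transport_along_great_circle(v, start, end):
    """Parallel transport the tangent vector v from `start` to `end`
    along the shorter great-circle arc of the unit sphere.
    Assumes start and end are unit vectors that are neither equal
    nor antipodal."""
    axis = cross(start, end)
    norm = math.sqrt(dot(axis, axis))
    n = tuple(x / norm for x in axis)          # unit rotation axis
    theta = math.atan2(norm, dot(start, end))  # arc length on the unit sphere
    c, s = math.cos(theta), math.sin(theta)
    n_cross_v = cross(n, v)
    n_dot_v = dot(n, v)
    # Rodrigues' rotation formula
    return tuple(v[i] * c + n_cross_v[i] * s + n[i] * n_dot_v * (1 - c)
                 for i in range(3))

north = (0.0, 0.0, 1.0)
a = (1.0, 0.0, 0.0)  # point on the equator at longitude 0
b = (0.0, 1.0, 0.0)  # point on the equator at longitude 90 degrees
v = (1.0, 0.0, 0.0)  # tangent vector at the north pole

# Route 1: straight down the meridian through a.
direct = transport_along_great_circle(v, north, a)
# Route 2: down the meridian through b, then along the equator to a.
detour = transport_along_great_circle(
    transport_along_great_circle(v, north, b), b, a)

angle = math.acos(max(-1.0, min(1.0, dot(direct, detour))))
print("direct:", direct)
print("detour:", detour)
print("angle between results (degrees):", math.degrees(angle))
```

For this particular pair of routes the two transported vectors come out at a right angle to each other, which matches the area (one eighth of the sphere, i.e. $\pi/2$) enclosed by the two paths.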
Because of this analogy, in general relativity vectors are usually called `tangent vectors'. However, it is important not to take this analogy too seriously. Our curved spacetime need not be embedded in some higher-dimensional flat spacetime for us to understand its curvature, or the concept of tangent vector. The mathematics of tensor calculus is designed to let us handle these concepts `intrinsically' -- i.e., working solely within the 4-dimensional spacetime in which we find ourselves. This is one reason tensor calculus is so important in general relativity. Now, in special relativity we can think of an inertial coordinate system, or `inertial frame', as being defined by a field of clocks, all at rest relative to each other. In general relativity this makes no sense, since we can only unambiguously define the relative velocity of two clocks if they are at the same location. Thus the concept of inertial frame, so important in special relativity, is banned from general relativity! If we are willing to put up with limited accuracy, we can still talk about the relative velocity of two particles in the limit where they are very close, since curvature effects will then be very small. In this approximate sense, we can talk about a `local' inertial coordinate system. However, we must remember that this notion only makes perfect sense in the limit where the region of spacetime covered by the coordinate system goes to zero in size. Einstein's equation can be expressed as a statement about the relative acceleration of very close test particles in free fall. Let us clarify these terms a bit. A `test particle' is an idealized point particle with energy and momentum so small that its effects on spacetime curvature are negligible. A particle is said to be in free fall when its motion is affected by no forces except gravity. In general relativity, a test particle in free fall will trace out a `geodesic'. 
This means that its velocity vector is parallel transported along the curve it traces out in spacetime. A geodesic is the closest thing there is to a straight line in curved spacetime.
Again, all this is easier to visualize in 2d space rather than 4d spacetime. A person walking on a sphere `following their nose' will trace out a geodesic -- that is, a great circle. Suppose two people stand side-by-side on the equator and start walking north, both following geodesics. Though they start out walking parallel to each other, the distance between them will gradually start to shrink, until finally they bump into each other at the north pole. If they didn't understand the curved geometry of the sphere, they might think a `force' was pulling them together. Similarly, in general relativity gravity is not really a `force', but just a manifestation of the curvature of spacetime. Note: not the curvature of space, but of spacetime. The distinction is crucial. If you toss a ball, it follows a parabolic path. This is far from being a geodesic in space: space is curved by the Earth's gravitational field, but it is certainly not so curved as all that! The point is that while the ball moves a short distance in space, it moves an enormous distance in time, since one second equals about 300,000 kilometers in units where $c = 1$. This allows a slight amount of spacetime curvature to have a noticeable effect.
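The two walkers can be simulated in a few lines. This is a rough sketch, assuming a unit sphere and walkers who start on the equator a small longitude difference apart and walk north at unit speed, so that the distance walked equals the latitude in radians. The function name `separation` and the chosen numbers are illustrative assumptions.

```python
import math

def separation(latitude, dlon):
    """Great-circle distance on the unit sphere between two points at the
    same latitude whose longitudes differ by dlon (all angles in radians).
    Uses the haversine form, which is numerically stable for small angles."""
    return 2.0 * math.asin(abs(math.cos(latitude)) * math.sin(dlon / 2.0))

dlon = 0.01                   # initial longitude difference of the walkers
d0 = separation(0.0, dlon)    # initial separation on the equator

for lat_deg in (0, 30, 60, 89):
    ratio = separation(math.radians(lat_deg), dlon) / d0
    print(f"latitude {lat_deg:2d} deg: separation / initial = {ratio:.4f}")

# Relative acceleration at the start, by a central second difference.
# The walkers begin with zero relative velocity, so the shrinking is a
# second-order effect, just like the ball of test particles in the text.
h = 1e-3
accel = (separation(h, dlon) - 2.0 * d0 + separation(-h, dlon)) / h**2
print("relative acceleration / separation:", accel / d0)
```

The last number comes out very close to $-1$, which is minus the curvature of the unit sphere: the `force' the walkers might imagine is proportional to their separation and to the curvature of the surface.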
Einstein's Equation
To state Einstein's equation in simple English, we need to consider a round ball of test particles that are all initially at rest relative to each other. As we have seen, this is a sensible notion only in the limit where the ball is very small. If we start with such a ball of particles, it will, to second order in time, become an ellipsoid as time passes. This should not be too surprising, because any linear transformation applied to a ball gives an ellipsoid, and as the saying goes, ``everything is linear to first order''. Here we get a bit more: the relative velocity of the particles starts out being zero, so to first order in time the ball does not change shape at all: the change is a second-order effect. Let $V(t)$ be the volume of the ball after a proper time $t$ has elapsed, as measured by the particle at the center of the ball. Then Einstein's equation says:
$$\left.\frac{\ddot V}{V}\right|_{t=0} = -\frac{1}{2}\left(\begin{array}{l} \text{flow of } t\text{-momentum in the } t \text{ direction}\; + \\ \text{flow of } x\text{-momentum in the } x \text{ direction}\; + \\ \text{flow of } y\text{-momentum in the } y \text{ direction}\; + \\ \text{flow of } z\text{-momentum in the } z \text{ direction} \end{array}\right)$$
where these flows are measured at the center of the ball at time zero, using local inertial coordinates. These flows are the diagonal components of a $4 \times 4$ matrix $T$ called the `stress-energy tensor'. The components $T_{\alpha\beta}$ of this matrix say how much momentum in the $\alpha$ direction is flowing in the $\beta$ direction through a given point of spacetime, where $\alpha, \beta = t, x, y, z$. The flow of $t$-momentum in the $t$-direction is just the energy density, often denoted $\rho$. The flow of $x$-momentum in the $x$-direction is the `pressure in the $x$ direction', denoted $P_x$, and similarly for $y$ and $z$. It takes a while to figure out why pressure is really the flow of momentum, but it is eminently worth doing. Most texts explain this fact by considering the example of an ideal gas. In any event, we may summarize Einstein's equation as follows:
$$\left.\frac{\ddot V}{V}\right|_{t=0} = -\frac{1}{2}\left(\rho + P_x + P_y + P_z\right). \tag{2}$$
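To get a feel for the sizes involved, here is a small numerical sketch of this statement: the second time derivative of the ball's volume, divided by the volume, equals minus one half of the sum of the energy density and the three pressures. The first function works in the units of the text. The second restores $G$ and $c$, which is our own assumption about how to convert to SI units (in that form the prefactor becomes $4\pi G$ and pressures are divided by $c^2$). The function names and the rounded constants are illustrative.

```python
import math

G = 6.674e-11  # Newton's constant in m^3 kg^-1 s^-2 (rounded)
C = 2.998e8    # speed of light in m/s (rounded)

def volume_acceleration_ratio(rho, px, py, pz):
    """(d^2 V / dt^2) / V at t = 0, in units where c = 8*pi*G = 1.
    rho is the energy density; px, py, pz are the three pressures."""
    return -0.5 * (rho + px + py + pz)

def volume_acceleration_ratio_si(mass_density, px=0.0, py=0.0, pz=0.0):
    """Same quantity with G and c restored (an assumed unit conversion).
    mass_density in kg/m^3, pressures in Pa, result in s^-2."""
    return -4.0 * math.pi * G * (mass_density + (px + py + pz) / C**2)

# Isotropic radiation has pressure equal to one third of its energy
# density, so pressure doubles the effect of the energy density alone.
print(volume_acceleration_ratio(1.0, 1 / 3, 1 / 3, 1 / 3))

# A small ball of test particles inside water at atmospheric pressure.
with_pressure = volume_acceleration_ratio_si(1000.0, 1.0e5, 1.0e5, 1.0e5)
without_pressure = volume_acceleration_ratio_si(1000.0)
print(with_pressure, without_pressure)
```

For everyday matter the pressure terms are suppressed by a factor of $1/c^2$ and are utterly negligible, which is why Newtonian gravity only sees mass density. Pressure matters for radiation and in extreme settings such as gravitational collapse.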
This equation says that positive energy density and positive pressure curve spacetime in a way that makes a freely falling ball of point particles tend to shrink. Since $E = mc^2$ and we are working in units where $c = 1$, ordinary mass density counts as a form of energy density. Thus a massive object will make a swarm of freely falling particles at rest around it start to shrink. In short: gravity attracts. We promised to state Einstein's equation in plain English, but have not done so yet. Here it is: Given a small ball of freely falling test particles initially at rest with respect to each other, the rate at which it begins to shrink is proportional to its volume times: the energy density at the center of the ball, plus the pressure in the $x$ direction at that point, plus the pressure in the $y$ direction, plus the pressure in the $z$ direction. In the final section of this article, we will prove that this sentence is equivalent to Einstein's equation. The reader who already knows general relativity may be somewhat skeptical of this claim. After all, Einstein's equation in its usual tensorial form is really a bunch of equations: the left and right sides of equation (1) are $4 \times 4$ matrices. It is hard to believe that the single equation (2) captures all that information. It does, though, as long as we include one bit of fine print: in order to get the full content of the Einstein equation from equation (2), we must consider small balls with all possible initial velocities -- i.e., balls that begin at rest in all possible local inertial reference frames. Before we begin, it is worth noting an even simpler formulation of Einstein's equation that applies when the pressure is the same in every direction: Given a small ball of freely falling test particles initially at rest with respect to each other, the rate at which it begins to shrink is proportional to its volume times: the energy density at the center of the ball plus three times the pressure at that point. This version is only sufficient for `isotropic' situations: that is, those in which all directions look the same in some local inertial reference frame. But, since the simplest models of cosmology treat the universe as isotropic -- at least approximately, on large enough distance scales -- this is all we shall need to derive an equation describing the big bang!
According to the Einstein convention, when an index variable appears twice in a single term it implies summation of that term over all the values of the index. So where the indices can range over the set {1, 2, 3},
$$y = \sum_{i=1}^{3} c_i x^i = c_1 x^1 + c_2 x^2 + c_3 x^3$$
is simplified by the convention to
$$y = c_i x^i.$$
The upper indices are not exponents but are indices of coordinates, coefficients or basis vectors. For example, $x^2$ should be read as "x-two", not "x squared", and typically $(x^1, x^2, x^3)$ would be equivalent to the traditional $(x, y, z)$. In general relativity, a common convention is that
the Greek alphabet is used for space and time components, where indices take values 0, 1, 2, 3 (frequently used letters are $\mu$, $\nu$, ...), while the Latin alphabet is used for spatial components only, where indices take values 1, 2, 3 (frequently used letters are $i$, $j$, ...).
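The summation convention is easy to mirror in code, where a repeated index simply becomes a loop (or a `sum`) over that index. The sketch below uses plain Python lists; the function names `contract` and `apply_matrix` and the sample numbers are illustrative assumptions. Note that Python lists are indexed from 0, whereas the spatial indices in the text run over 1, 2, 3.

```python
def contract(c, x):
    """y = c_i x^i: the repeated index i is summed over."""
    if len(c) != len(x):
        raise ValueError("index ranges must match")
    return sum(c_i * x_i for c_i, x_i in zip(c, x))

def apply_matrix(a, x):
    """y^i = A^i_j x^j: j is repeated, so it is summed; i is free."""
    return [contract(row, x) for row in a]

c = [2.0, -1.0, 0.5]
x = [1.0, 4.0, 6.0]
y = contract(c, x)  # c_1 x^1 + c_2 x^2 + c_3 x^3
print(y)

identity = [[1.0, 0.0, 0.0],
            [0.0, 1.0, 0.0],
            [0.0, 0.0, 1.0]]
print(apply_matrix(identity, x))
```

Here `y` evaluates to $2 \cdot 1 - 1 \cdot 4 + 0.5 \cdot 6 = 1$.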
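The metric tensor $g_{\mu\nu}$ is assigned to each point in spacetime. In a coordinate basis its components form a symmetric $4 \times 4$ matrix:
$$g_{\mu\nu} = \begin{pmatrix} g_{00} & g_{01} & g_{02} & g_{03} \\ g_{10} & g_{11} & g_{12} & g_{13} \\ g_{20} & g_{21} & g_{22} & g_{23} \\ g_{30} & g_{31} & g_{32} & g_{33} \end{pmatrix}$$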
The components of this tensor are used to define the concept of line element:
$$ds^2 = g_{\mu\nu}\, dx^\mu\, dx^\nu$$
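As a concrete illustration, the line element is a double sum over the metric components and the coordinate displacements, $ds^2 = g_{\mu\nu}\, dx^\mu\, dx^\nu$. The sketch below evaluates it for the flat metric of special relativity. The sign convention $(-,+,+,+)$, the units with $c = 1$, and the names `line_element_squared` and `MINKOWSKI` are assumptions made for the example; the text itself does not fix a signature.

```python
def line_element_squared(g, dx):
    """ds^2 = g_{mu nu} dx^mu dx^nu, summing over both repeated indices."""
    n = len(dx)
    return sum(g[mu][nu] * dx[mu] * dx[nu]
               for mu in range(n) for nu in range(n))

# Flat spacetime in inertial coordinates (t, x, y, z), assumed signature
# (-, +, +, +) and units where c = 1.
MINKOWSKI = [[-1.0, 0.0, 0.0, 0.0],
             [0.0, 1.0, 0.0, 0.0],
             [0.0, 0.0, 1.0, 0.0],
             [0.0, 0.0, 0.0, 1.0]]

# A particle moving at 0.6 c in the x direction for one unit of time.
timelike = line_element_squared(MINKOWSKI, [1.0, 0.6, 0.0, 0.0])
# A light ray moving in the x direction.
lightlike = line_element_squared(MINKOWSKI, [1.0, 1.0, 0.0, 0.0])
print(timelike, lightlike)
```

With this sign convention a negative result marks a timelike displacement (here the elapsed proper time is $\sqrt{0.64} = 0.8$), and a light ray gives zero.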
Here, $ds$ tells us the distance travelled by a point particle if infinitesimally small changes were made to its coordinates. Thus, the metric encodes the structure of spacetime. In fact, according to relativity, gravity is not the usual kind of force. It merely curves spacetime so as to produce an illusion of force, but the particle being affected is in fact only traversing the spacetime geodesic (the straightest possible route) like it should, in the absence of any force! Hence, the Earth does not pull us towards itself directly; it merely warps the spacetime geodesic that we are supposed to travel, and thus (very cunningly) causes us to fall towards its surface! It must be noted that the geodesic being talked about is the spacetime geodesic, and not the space geodesic. For instance, when a projectile takes a parabolic curve, it is most certainly not travelling the space geodesic, because the space could not have been curved to such a drastic extent! If this fact doesn't sound too obvious to you, you can take it as a postulate of general relativity: objects travel spacetime geodesics in the absence of non-gravitational forces. Non-gravitational forces, like the electromagnetic force, are of a completely different character. They dare not touch the structure of spacetime; they have to rely on the not-so-cunning method of altering the usual course of the particle, and causing it to deviate from the geodesic. In terms of the metric, we will now define the following symbol, known as the Christoffel symbol:
$$\Gamma^{\lambda}_{\mu\nu} = \frac{1}{2}\, g^{\lambda\sigma}\left(\partial_\mu g_{\sigma\nu} + \partial_\nu g_{\sigma\mu} - \partial_\sigma g_{\mu\nu}\right)$$
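where $g^{\lambda\sigma}$ is the inverse of the matrix $g_{\lambda\sigma}$ (using Einstein notation for summation). The Riemann curvature tensor is given by:
$$R^{\rho}{}_{\sigma\mu\nu} = \partial_\mu \Gamma^{\rho}_{\nu\sigma} - \partial_\nu \Gamma^{\rho}_{\mu\sigma} + \Gamma^{\rho}_{\mu\lambda}\Gamma^{\lambda}_{\nu\sigma} - \Gamma^{\rho}_{\nu\lambda}\Gamma^{\lambda}_{\mu\sigma}$$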
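The Christoffel symbols can be computed numerically from any metric by differentiating its components. The following is a rough sketch for the simplest curved example, the unit 2-sphere with coordinates $(\theta, \phi)$ and metric $\mathrm{diag}(1, \sin^2\theta)$, using central finite differences and the standard formula $\Gamma^{\lambda}_{\mu\nu} = \frac{1}{2} g^{\lambda\sigma}(\partial_\mu g_{\sigma\nu} + \partial_\nu g_{\sigma\mu} - \partial_\sigma g_{\mu\nu})$. The choice of example, the step size, and all function names are assumptions for illustration; the inverse is hard-coded for the 2-dimensional case to keep the sketch short.

```python
import math

def sphere_metric(p):
    """Metric g_{ij} of the unit 2-sphere at p = (theta, phi)."""
    theta, _phi = p
    return [[1.0, 0.0], [0.0, math.sin(theta) ** 2]]

def inverse_2x2(g):
    """Inverse of a 2x2 matrix, giving g^{ij} from g_{ij}."""
    det = g[0][0] * g[1][1] - g[0][1] * g[1][0]
    return [[g[1][1] / det, -g[0][1] / det],
            [-g[1][0] / det, g[0][0] / det]]

def metric_derivative(metric, p, k, h=1e-5):
    """Central-difference estimate of d g_{ij} / d x^k at the point p."""
    plus, minus = list(p), list(p)
    plus[k] += h
    minus[k] -= h
    gp, gm = metric(plus), metric(minus)
    n = len(p)
    return [[(gp[i][j] - gm[i][j]) / (2.0 * h) for j in range(n)]
            for i in range(n)]

def christoffel(metric, p):
    """gamma[l][m][nu] approximates Gamma^l_{m nu} at p (2-d metrics only)."""
    n = len(p)
    g_inv = inverse_2x2(metric(p))
    dg = [metric_derivative(metric, p, k) for k in range(n)]  # dg[k][i][j]
    return [[[0.5 * sum(g_inv[l][s] * (dg[m][s][nu] + dg[nu][s][m] - dg[s][m][nu])
                        for s in range(n))
              for nu in range(n)]
             for m in range(n)]
            for l in range(n)]

theta = 1.0
gamma = christoffel(sphere_metric, (theta, 0.3))
print("Gamma^theta_{phi phi}:", gamma[0][1][1],
      "expected", -math.sin(theta) * math.cos(theta))
print("Gamma^phi_{theta phi}:", gamma[1][0][1],
      "expected", math.cos(theta) / math.sin(theta))
```

The two printed pairs should agree to many decimal places, and the remaining independent components vanish. The same routine would feed directly into the Riemann tensor, which needs one more derivative of these symbols.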
The scalar curvature is usually denoted by $S$ (other notations are $Sc$, $R$). It is defined as the trace of the Ricci curvature tensor with respect to the metric:
$$S = \operatorname{tr}_g \operatorname{Ric}$$
The trace depends on the metric since the Ricci tensor is a (0,2)-valent tensor; one must first raise an index to obtain a (1,1)-valent tensor in order to take the trace. In terms of local coordinates one can write
$$S = g^{ij} R_{ij} = R^{j}{}_{j}$$
where $R_{ij}$ are the components of the Ricci tensor in the coordinate basis:
$$\operatorname{Ric} = R_{ij}\, dx^i \otimes dx^j$$
Given a coordinate system and a metric tensor, scalar curvature can be expressed as follows:
$$S = g^{ab}\left(\Gamma^{c}_{ab,c} - \Gamma^{c}_{ac,b} + \Gamma^{d}_{ab}\Gamma^{c}_{cd} - \Gamma^{d}_{ac}\Gamma^{c}_{bd}\right)$$
where a comma denotes a partial derivative, e.g. $\Gamma^{c}_{ab,c} = \partial_c \Gamma^{c}_{ab}$.
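As a sanity check on the coordinate expression for the scalar curvature, one can evaluate $S = g^{ab}(\Gamma^{c}_{ab,c} - \Gamma^{c}_{ac,b} + \Gamma^{d}_{ab}\Gamma^{c}_{cd} - \Gamma^{d}_{ac}\Gamma^{c}_{bd})$ on a round sphere of radius $r$, where the answer is known to be $2/r^2$. The sketch below hard-codes the well-known nonzero Christoffel symbols of the sphere in coordinates $(\theta, \phi)$ and differentiates them by central differences. The example, the function names, and the step size are assumptions for illustration.

```python
import math

def sphere_christoffel(p):
    """gamma[c][a][b] = Gamma^c_{ab} on a round sphere, p = (theta, phi).
    These closed forms do not depend on the radius."""
    theta, _phi = p
    gamma = [[[0.0, 0.0], [0.0, 0.0]] for _ in range(2)]
    gamma[0][1][1] = -math.sin(theta) * math.cos(theta)
    gamma[1][0][1] = gamma[1][1][0] = math.cos(theta) / math.sin(theta)
    return gamma

def sphere_inverse_metric(p, radius):
    """Inverse metric g^{ab} of a sphere of the given radius."""
    theta, _phi = p
    return [[1.0 / radius**2, 0.0],
            [0.0, 1.0 / (radius * math.sin(theta)) ** 2]]

def scalar_curvature(gamma_fn, g_inv, p, h=1e-5):
    """Evaluate S from Christoffel symbols and the inverse metric at p."""
    n = len(p)
    gamma = gamma_fn(p)
    # dgamma[k][c][a][b] approximates d Gamma^c_{ab} / d x^k.
    dgamma = []
    for k in range(n):
        plus, minus = list(p), list(p)
        plus[k] += h
        minus[k] -= h
        gp, gm = gamma_fn(plus), gamma_fn(minus)
        dgamma.append([[[(gp[c][a][b] - gm[c][a][b]) / (2.0 * h)
                         for b in range(n)]
                        for a in range(n)]
                       for c in range(n)])
    total = 0.0
    for a in range(n):
        for b in range(n):
            ricci_ab = 0.0
            for c in range(n):
                # Derivative terms: Gamma^c_{ab,c} - Gamma^c_{ac,b}
                ricci_ab += dgamma[c][c][a][b] - dgamma[b][c][a][c]
                for d in range(n):
                    # Quadratic terms
                    ricci_ab += (gamma[d][a][b] * gamma[c][c][d]
                                 - gamma[d][a][c] * gamma[c][b][d])
            total += g_inv[a][b] * ricci_ab
    return total

radius = 2.0
point = (1.0, 0.3)
s = scalar_curvature(sphere_christoffel,
                     sphere_inverse_metric(point, radius), point)
print("numerical S:", s, "expected 2 / r^2:", 2.0 / radius**2)
```

The result is independent of the point chosen on the sphere (away from the poles, where these coordinates break down), as it should be for a surface of constant curvature.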