1 Scalars and Vectors: 1.1 What Is A Vector?
1 Scalars and Vectors: 1.1 What Is A Vector?
1.1
What is a vector?
The three numbers (x, y, z) specify the vector fully. However, their actual numerical values depend on the particular coordinate axes chosen: In other words,
a set (
ex , ey , ez ) of mutually perpendicular unit vectors is chosen, and the values
(x, y, z) are with reference to this set1 . This choice is not unique. A dierent
orthonormal basis will give rise to a dierent set of numbers, say, (x , y , z ) associated with the same position vector. How are the two sets related? It is useful
to relabel the coordinates as
x1 = x , x2 = y , x3 = z
1
This is called an orthonormal basis. Each vector has unit magnitude, and they are mutually
orthogonal (perpendicular): ex ey = 0 etc.
3
!
Rij xj
(1.1)
j=1
where the nine numbers Rij are the coecients showing how the two sets of axes
are related. This can be rewritten in matrix form if we represent the coordinates
as a column vector and Rij as the entries in a matrix R with i as the row label
and j as the column label.
x1
x1
R11 R12 R13
x2 = R21 R22 R23 x2 .
(1.2)
x3
x3
R31 R32 R33
We can show that the matrix R satisfies the conditions (RT is the transpose of
the matrix R)
RT = R1 or RT R = R RT = I
and det R = 1 ,
(1.3)
(1.4)
where I is the (3 3) unit matrix. The first condition ensures that the magnitude
of vectors remain unchanged in all bases2 . The second condition preserves orientation, i.e., the basis vectors always form a right-handed triad. Such matrices
are called rotation matrices because two dierent choices of basis vectors are
related by rotations. For instance, consider the case when two bases have one
common basis vector, ez = ez , i.e., they have the same z-axis. The coordinates
are related by
x
cos sin 0
x
y = sin cos 0 y
(1.5)
z
0
0
1
z
i.e., the two x- and y-axes are related by a rotation by an angle about the
z-axis.
We can now generalise this to arbitrary rotations. Arbitrary rotations are
specified by an axis of rotation n
and an angle of rotation about the direction
of n
. Let us denote the corresponding rotation matrix by R(
n, ). Thus, the
rotation matrix given above corresponds to R(
ez , ). One can give an explicit
form for the general rotation matrix but we shall not do so now.
We can now provide the definition of a vector. A vector is a set of three
quantities that transform, under rotations of the coordinate axes, exactly as the set of coordinates itself transform.. We usually represent the
2
Actually it also ensures that the new basis vectors are unit vectors and orthogonal to each
other.
v1
three quantities as a column vector, v = v2 . Under the change of coordinates
v3
given by Eqn.(1.1), the components transform as
vi =
3
!
Rij vj
(1.6)
j=1
1.2
(1.7)
1.3
&
0
1
(1.8)
i = j
i=j
(1.9)
(Note the summation over i as per our convention.) The invariance of the dotproduct follows from the condition RT R = I satisfied by rotation matrices. Explicitly, this condition is
RT R = I Rji Rjk = ik .
Here i and k are free indices and the dummy index j is summed over from 1 to
3. The following steps show how the dot product is unchanged when going from
one basis to another
vj wj = Rji vi Rjk wk = Rji Rjk vi wk = ik vi wk = vi wi
The dot-product when we choose v = w
gives the square of the magnitude of
a vector. That is
|v|2 = vi vi = v12 + v22 + v32 ,
(1.10)
which is clearly positive definite. Note that |v| = 0 implies v = 0. For any two
vectors u and v , we can show that
u v = |u||v| cos ,
where is the angle between the two vectors u and v . Recall that in elementary
vector analysis this formula was used to define the scalar product. We now see
that it follows as a consequence of the definition u v = ui vi .
1.4
(1.11)
,
written symbolically as u = v w.
Eqs. (1.12) can be written in compact form
as
ui = ijk vj wk (v w)
i
(1.12)
where ijk is the Levi-Civita tensor (also called the permutation symbol)
defined as: the conditions:
123 = 1
ijk = jik = ikj = kji
The second condition implies that the Levi-Civita tensor is totally antisymmetric
and is non-vanishing if and only if all three indices are distinct. Thus, 123 =
231 = 312 = 1 and 132 = 321 = 213 = 1. All other components are vanishing.
1.5
As mentioned earlier, scalars and vectors are not the only kinds of objects that
one encounters in physical situations. A simple example of a more complicated
object - a tensor, is given by Aij = vi wj , where v and w
are any two vectors.
Under a change of basis, one can see that
Aij = vi wj = Rik Rjl vk wl = Rik Rjl Akl
(1.13)
Objects which transform in this fashion are called tensors of rank two (because
its components are specified by two indices i and j). Not all tensors of rank two
can be written as vi wj and thus one can use the above equation as the defining
condition. An example of such an object is given by the moment of inertia of a
rigid body. Other examples include the stress and strain tensors, the dielectric
tensor and so on.
Clearly, a tensor of second rank is the simplest generalisation of a vector.
Generalising further, tensors of rank r are objects which have r-indices. Thus,
scalars are tensors of rank zero and vectors are tensors of rank one. The
Levi-Civita tensor has rank three and the Kronecker delta has rank two. The
elastic constants that relate stress and strain in linear materials is a tensor of
rank four.
Exercise Using the transformation law for a tensor of rank two, show that the
Kronecker delta is invariant under rotations of the coordinate axes.
Exercise Using the transformation law for a tensor of rank three and the antisymmetry of the Levi-Civita tensor, show that
ijk = (detR) ijk
1.6
An important identity
As we just saw, the Kronecker delta and Levi-Civita tensor are invariant under
change of basis. They are related by the following important identity which
enables one to derive just about all vector identities:
ijk ilm = (jl km jm kl )
(1.14)
Here we must remember that the index i is repeated, and so a summation over
i from i = 1 to i = 3 is implicit. The most straightforward proof is to explicitly
verify that this is true for all values of the free indices (j, k, l, m). This is left
to the student. However, some simple checks are: both the LHS and RHS are
antisymmetric under the exchanges: j k and k l.
As an application of the identity (1.14), consider the multiple cross-product:
u (v w).
The i-th component of this is
(u (v w))
i = ijk uj (v w)
k = ijk uj klm vl wm = kij klm uj vl wm
Using the identity (1.14), we obtain
(u (v w))
i = (il jm im jl ) uj vl wm = um vi wm uj vj wi
which can be rewritten as
(u (v w))
= (u w)
v (u v )w
(1.15)
1.7
We saw that rotations are given by matrices that satisfy R T = R1 and detR = 1.
Are there transformations which satisfy only the first of these two conditions?
Taking the determinant on both sides, we find that detR2 = 1, which suggests
that one consider transformations such that detR = 1. It is not hard to see
that reflection about any plane is such an operation. Under reflections about the
xy-plane, one obtains
x
x
y = y .
z
z
P : y y
(1.16)
z
z
One can show that any matrix satisfying RT = R1 with determinant 1 can be
obtained by following the parity operation with a rotation. Thus, it suces to
consider parity alone.
A scalar is invariant under both rotations and parity while a pseudo-scalar
is one that is invariant under rotations but changes sign under parity.
A vector is one that transforms identical to the position vector under both
rotations as well as parity while a pseudo-vector transforms like the position
vector under rotations but is invariant under parity.
is a
The momentum p of a particle is a vector, while its angular momentum L
pseudo-vector. This follows from the fact that both r and p are vectors and hence
= r p does not change sign, and hence is a
change sign under parity. Thus L
pseudo-vector. In general, the cross-product of two vectors gives rise to a pseudovector, while the cross-product of a vector with a pseudo-vector is a vector. The
dot product of two vectors is a scalar (e.g. the kinetic energy of a particle), while
the dot product of a vector and a pseudo-vector is a pseudo-scalar. Vectors are
sometimes called polar vectors, to distinguish them from axial vectors which
is another name for pseudovectors.
The Lorentz force law
+ v B)
F = q(E
enables us to fix the behaviour of the electric and magnetic fields under parity.
Both F and v are vectors and q is a scalar, it follows that the electric field is a
vector while the magnetic field is a pseudo-vector. We usually decide if an
object is a pseudo-vector or pseudo-scalar by considering simple known equations.
Never equate a vector to a pseudo-vector or a scalar to a pseudo-scalar.
1.8
The full power of working with objects which transform nicely under change of
basis/rotations is that the equations which describe the physical motion
of particles can be written in manifestly invariant manner i.e., they retain
the same form in dierent bases. For instance, Newtons law
F = ma
is the same in all bases. The LHS is a vector and the RHS is the product of a
scalar and a vector. Hence the RHS is a vector as well.
1.9
1.9.1
Given two vectors u and v, one can construct a parallelogram with two edges u
and v (see Fig. 1.1). By considerations of elementary geometry, the area of the
v sin
A = (u v) .
(1.17)
Note that there is an ambiguity of an overall sign in the definition, for we could
also have chosen it to be (v u). We will return to this point when discussing
the unit vector perpendicular to an area element.
1.9.2
Volume as a scalar
Just as two vectors can define a parallelogram, three (non co-planar) vectors u,
v and w
can define a parallelepiped which has the three vectors as three of its
v
u
(1.18)
The above expression seems to treat the three vectors dierently. but this is not
so. We can re-express the above formula by using the Levi-Civita tensor
Volume = ijk ui vj wk = u (v w)
= v (w
u) = w
(u v )
(1.19)
We note again that there is a sign ambiguity in the above definition. This can
be removed by taking the magnitude in the RHS of the above equation, since we
require the volume to be non-negative. Note that if u, v, w
are co-planar, the
volume of the parallelepiped vanishes automatically, as it must.
DEPARTMENT OF PHYSICS
INDIAN INSTITUTE OF TECHNOLOGY, MADRAS
PH101 Physics I
Problem Set 1
5.8.2002
1. Vector identities: This problem is meant to show you how easily various
vector identities are derived using the index notation introduced in class.
Using this notation and the properties of the symbols ij and ijk , etc.,
prove the following standard vector identities:
~ (B
~ C)
~ = (A
~ B)
~ C
~ (A
~ C)
~ B
~
(a) A
~ B)
~ (C
~ D)
~ = [A
~ (C
~ D)]
~ B
~
(b) (A
~ B)
~ (C
~ D)
~ = (A
~ C)
~ (B
~ D)
~
(c) (A
~ (C
~ D)]
~ A
~
[B
~ D)
~ (B
~ C)
~
(A
(b)
~
~
d ~ ~
dA
~ A + @A
~ dB A
(A B) = @
B
dt
dt
dt
(c)
(d)
~
~
dA
d ~ ~
~ A + @A
~ dB A
(A B) = @
B
dt
dt
dt
~
d @ ~ dB
A
dt
dt
0
~
d @ ~ dB
A
dt
dt
2~
~
dA
~A = A
~d B
B
dt
dt2
2~
~
dA
~A = A
~d B
B
dt
dt2
~
d2 A
~
B
dt2
~
d2 A
~
B
dt2
2 (
na
) n
.