0% found this document useful (0 votes)
101 views

1 Scalars and Vectors: 1.1 What Is A Vector?

This document provides an overview of scalars and vectors in physics. It begins by explaining that physical laws must be independent of the observer's frame of reference. Vectors and tensors are introduced as objects that transform in a defined way under changes of reference frame, allowing equations to remain invariant. Vectors are defined as quantities that transform like the coordinates themselves under rotations. The dot and cross products are introduced to construct scalars and vectors from other vectors. Tensors of higher rank are also discussed.

Uploaded by

anupam10311
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
101 views

1 Scalars and Vectors: 1.1 What Is A Vector?

This document provides an overview of scalars and vectors in physics. It begins by explaining that physical laws must be independent of the observer's frame of reference. Vectors and tensors are introduced as objects that transform in a defined way under changes of reference frame, allowing equations to remain invariant. Vectors are defined as quantities that transform like the coordinates themselves under rotations. The dot and cross products are introduced to construct scalars and vectors from other vectors. Tensors of higher rank are also discussed.

Uploaded by

anupam10311
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 11

1

Physics I Course Material

Scalars and Vectors

Contents: Scalars and pseudo-scalars; Vectors and pseudo-vectors; dot-product


and the Einstein summation convention; cross-product and the Levi-Civita tensor; applications.
In physics, we are interested in obtaining laws (in the form of mathematical
equations) which govern the behaviour of dierent systems. These laws should
hold at dierent places as well as times. They should also be independent of the
observer testing them. We thus infer that the physical laws and values of experimentally measurable quantities must be frame independent, i.e., independent of
the choice of location of the origin of coordinates and the coordinate axes that
we choose. In order to check this, we need to be able to convert results in one
frame to equivalent results in another frame.
Suppose we are able to write the physical laws, i.e., the mathematical equations which describe them, in terms of objects that transform in a definite manner
under a change of frame, the rule for conversion is manifest. Examples of such
objects are scalars, vectors and more generally, tensors. When the equations are
written in terms of these objects, we say that the equations are form invariant.
For instance, an equation such as Newtons second law, F = ma, has vectors
on both sides of the equation. So while two dierent observers have dierent
numerical values for each component of their vectors, they will both agree that
the equation holds.
However, the elementary or high-school definition of a vector (a quantity
with both magnitude and direction) is rather unsatisfactory. In this unit, we
shall first provide a better definition of a vector through its properties under a
change in coordinate axes (change in basis vectors).

1.1

What is a vector?

The standard example of a vector in three dimensions is the position vector


r = x
ex + y
ey + z
ez

The three numbers (x, y, z) specify the vector fully. However, their actual numerical values depend on the particular coordinate axes chosen: In other words,
a set (
ex , ey , ez ) of mutually perpendicular unit vectors is chosen, and the values
(x, y, z) are with reference to this set1 . This choice is not unique. A dierent
orthonormal basis will give rise to a dierent set of numbers, say, (x , y , z ) associated with the same position vector. How are the two sets related? It is useful
to relabel the coordinates as
x1 = x , x2 = y , x3 = z

1
This is called an orthonormal basis. Each vector has unit magnitude, and they are mutually
orthogonal (perpendicular): ex ey = 0 etc.

V. Balakrishnan & S. Govindarajan

The relation between the two sets of coordinates can be written as


xi =

3
!

Rij xj

(1.1)

j=1

where the nine numbers Rij are the coecients showing how the two sets of axes
are related. This can be rewritten in matrix form if we represent the coordinates
as a column vector and Rij as the entries in a matrix R with i as the row label
and j as the column label.


x1
x1
R11 R12 R13
x2 = R21 R22 R23 x2 .
(1.2)
x3
x3
R31 R32 R33
We can show that the matrix R satisfies the conditions (RT is the transpose of
the matrix R)
RT = R1 or RT R = R RT = I
and det R = 1 ,

(1.3)
(1.4)

where I is the (3 3) unit matrix. The first condition ensures that the magnitude
of vectors remain unchanged in all bases2 . The second condition preserves orientation, i.e., the basis vectors always form a right-handed triad. Such matrices
are called rotation matrices because two dierent choices of basis vectors are
related by rotations. For instance, consider the case when two bases have one
common basis vector, ez = ez , i.e., they have the same z-axis. The coordinates
are related by


x
cos sin 0
x
y = sin cos 0 y
(1.5)
z
0
0
1
z
i.e., the two x- and y-axes are related by a rotation by an angle about the
z-axis.
We can now generalise this to arbitrary rotations. Arbitrary rotations are
specified by an axis of rotation n
and an angle of rotation about the direction
of n
. Let us denote the corresponding rotation matrix by R(
n, ). Thus, the
rotation matrix given above corresponds to R(
ez , ). One can give an explicit
form for the general rotation matrix but we shall not do so now.
We can now provide the definition of a vector. A vector is a set of three
quantities that transform, under rotations of the coordinate axes, exactly as the set of coordinates itself transform.. We usually represent the
2
Actually it also ensures that the new basis vectors are unit vectors and orthogonal to each
other.

Physics I Course Material


v1
three quantities as a column vector, v = v2 . Under the change of coordinates
v3
given by Eqn.(1.1), the components transform as
vi =

3
!

Rij vj

(1.6)

j=1

Exercise Given a vector v , find a rotation matrix R (or equivalently, a change


of basis) such that v = (0, 0, v)T . Is R unique? Explain.

1.2

The Einstein summation convention

This convention is very useful in writing in a compact and uncluttered form,


formulae involving several summations. The convention is simply that any index
that occurs twice is summed over. The summation always runs from 1 to 3 since
we are in three dimensions. Further, no index can occur more than twice in any
equation. Thus, we can rewrite Eqn. (1.6) as
vi = Rij vj

(1.7)

where the summation of j is implicitly assumed under the Einstein summation


convention and i is a free (i.e., unsummed) index. The index that is summed
over is referred to as a dummy index. Care should be taken to see that dierent
letters are used to indicate dierent summations. Further, a simple check is
to see that the same free indices occur on either side of any equation.

1.3

The dot-product: Scalars from vectors

A scalar is defined as an object that is unchanged or invariant under change of


basis/rotations.
Given two vectors v = (v1 , v2 , v3 )T and w
= (w1 , w2 , w3 )T , we can construct a
scalar from them. It is called the dot product, and is defined by
v w
vi ij wj
where
ij =

&

0
1

(1.8)

i = j
i=j

is the Kronecker delta. When it is written as a 3 3 matrix, it is nothing but


the identity matrix I. Thus, it is not hard to see that
v w
=w
v = vi wi

(1.9)

V. Balakrishnan & S. Govindarajan

(Note the summation over i as per our convention.) The invariance of the dotproduct follows from the condition RT R = I satisfied by rotation matrices. Explicitly, this condition is
RT R = I Rji Rjk = ik .
Here i and k are free indices and the dummy index j is summed over from 1 to
3. The following steps show how the dot product is unchanged when going from
one basis to another
vj wj = Rji vi Rjk wk = Rji Rjk vi wk = ik vi wk = vi wi
The dot-product when we choose v = w
gives the square of the magnitude of
a vector. That is
|v|2 = vi vi = v12 + v22 + v32 ,
(1.10)
which is clearly positive definite. Note that |v| = 0 implies v = 0. For any two
vectors u and v , we can show that
u v = |u||v| cos ,
where is the angle between the two vectors u and v . Recall that in elementary
vector analysis this formula was used to define the scalar product. We now see
that it follows as a consequence of the definition u v = ui vi .

1.4

The cross-product: New vectors from old

Given two vectors v and w,


one can construct a new vector u by the cross
product defined below: the components of the cross product are
u1 = v2 w3 v3 w2
u2 = v3 w1 v1 w3
u3 = v1 w2 v2 w1

(1.11)
,

written symbolically as u = v w.
Eqs. (1.12) can be written in compact form
as
ui = ijk vj wk (v w)
i
(1.12)
where ijk is the Levi-Civita tensor (also called the permutation symbol)
defined as: the conditions:
123 = 1
ijk = jik = ikj = kji
The second condition implies that the Levi-Civita tensor is totally antisymmetric
and is non-vanishing if and only if all three indices are distinct. Thus, 123 =
231 = 312 = 1 and 132 = 321 = 213 = 1. All other components are vanishing.

Physics I Course Material


For any two vectors u and v , one can show that
|u v | = |u||v| sin

where is the angle between the two vectors u and v . Clearly, u u = 0.

1.5

More complicated objects: Tensors

As mentioned earlier, scalars and vectors are not the only kinds of objects that
one encounters in physical situations. A simple example of a more complicated
object - a tensor, is given by Aij = vi wj , where v and w
are any two vectors.
Under a change of basis, one can see that
Aij = vi wj = Rik Rjl vk wl = Rik Rjl Akl

A tensor like Aij can be represented as a 3 3 matrix A. Then, the above


transformation law can be written as
A = RT A R

(1.13)

Objects which transform in this fashion are called tensors of rank two (because
its components are specified by two indices i and j). Not all tensors of rank two
can be written as vi wj and thus one can use the above equation as the defining
condition. An example of such an object is given by the moment of inertia of a
rigid body. Other examples include the stress and strain tensors, the dielectric
tensor and so on.
Clearly, a tensor of second rank is the simplest generalisation of a vector.
Generalising further, tensors of rank r are objects which have r-indices. Thus,
scalars are tensors of rank zero and vectors are tensors of rank one. The
Levi-Civita tensor has rank three and the Kronecker delta has rank two. The
elastic constants that relate stress and strain in linear materials is a tensor of
rank four.
Exercise Using the transformation law for a tensor of rank two, show that the
Kronecker delta is invariant under rotations of the coordinate axes.
Exercise Using the transformation law for a tensor of rank three and the antisymmetry of the Levi-Civita tensor, show that
ijk = (detR) ijk

Thus, the Levi-Civita tensor is invariant under rotations since detR = 1.

1.6

V. Balakrishnan & S. Govindarajan

An important identity

As we just saw, the Kronecker delta and Levi-Civita tensor are invariant under
change of basis. They are related by the following important identity which
enables one to derive just about all vector identities:
ijk ilm = (jl km jm kl )

(1.14)

Here we must remember that the index i is repeated, and so a summation over
i from i = 1 to i = 3 is implicit. The most straightforward proof is to explicitly
verify that this is true for all values of the free indices (j, k, l, m). This is left
to the student. However, some simple checks are: both the LHS and RHS are
antisymmetric under the exchanges: j k and k l.
As an application of the identity (1.14), consider the multiple cross-product:
u (v w).
The i-th component of this is
(u (v w))
i = ijk uj (v w)
k = ijk uj klm vl wm = kij klm uj vl wm
Using the identity (1.14), we obtain
(u (v w))
i = (il jm im jl ) uj vl wm = um vi wm uj vj wi
which can be rewritten as
(u (v w))
= (u w)
v (u v )w

(1.15)

One can further show that


(u v ) w
= u (v w)
,
i.e., the order of the cross-product is important. The student may explicitly verify
this.

1.7

Reflections and Parity

We saw that rotations are given by matrices that satisfy R T = R1 and detR = 1.
Are there transformations which satisfy only the first of these two conditions?
Taking the determinant on both sides, we find that detR2 = 1, which suggests
that one consider transformations such that detR = 1. It is not hard to see
that reflection about any plane is such an operation. Under reflections about the
xy-plane, one obtains

x
x
y = y .
z
z

Physics I Course Material

The matrix R = diag(1, 1, 1) clearly has determinant 1. Another example, is


the parity operation under which all coordinates change sign:


x
x

P : y y
(1.16)
z
z
One can show that any matrix satisfying RT = R1 with determinant 1 can be
obtained by following the parity operation with a rotation. Thus, it suces to
consider parity alone.
A scalar is invariant under both rotations and parity while a pseudo-scalar
is one that is invariant under rotations but changes sign under parity.
A vector is one that transforms identical to the position vector under both
rotations as well as parity while a pseudo-vector transforms like the position
vector under rotations but is invariant under parity.
is a
The momentum p of a particle is a vector, while its angular momentum L
pseudo-vector. This follows from the fact that both r and p are vectors and hence
= r p does not change sign, and hence is a
change sign under parity. Thus L
pseudo-vector. In general, the cross-product of two vectors gives rise to a pseudovector, while the cross-product of a vector with a pseudo-vector is a vector. The
dot product of two vectors is a scalar (e.g. the kinetic energy of a particle), while
the dot product of a vector and a pseudo-vector is a pseudo-scalar. Vectors are
sometimes called polar vectors, to distinguish them from axial vectors which
is another name for pseudovectors.
The Lorentz force law
+ v B)

F = q(E
enables us to fix the behaviour of the electric and magnetic fields under parity.
Both F and v are vectors and q is a scalar, it follows that the electric field is a
vector while the magnetic field is a pseudo-vector. We usually decide if an
object is a pseudo-vector or pseudo-scalar by considering simple known equations.
Never equate a vector to a pseudo-vector or a scalar to a pseudo-scalar.

1.8

Form invariance of physical laws

The full power of working with objects which transform nicely under change of
basis/rotations is that the equations which describe the physical motion
of particles can be written in manifestly invariant manner i.e., they retain
the same form in dierent bases. For instance, Newtons law
F = ma
is the same in all bases. The LHS is a vector and the RHS is the product of a
scalar and a vector. Hence the RHS is a vector as well.

V. Balakrishnan & S. Govindarajan

In more complicated situations, the equations need not be relations between


vectors but those between tensors of the same rank. That way, invariance is
guaranteed since both sides change in an identical fashion under change of bases
and reflections. The simple way to verify this is to check that free indices are
the same on both sides of any equation. One must also be cautious and check
that the dummy (non-free) indices occur only twice. If any index occurs three or
more timee, theres been a mistake somewhere.

1.9
1.9.1

Geometric meaning of the dot product and cross product


Area element as a vector

Given two vectors u and v, one can construct a parallelogram with two edges u
and v (see Fig. 1.1). By considerations of elementary geometry, the area of the

v sin

Figure 1.1: Area of a parallelogram


parallelogram is
Area = |u||v| sin ,
where is the angle between the two vectors u and v . Thus, one can see that the
area can be written as Area = |u v|.
It turns out that it is better to think of the area of a parallelogram (a planar
figure) as the vector (u v ). Clearly, the direction of this vector is normal to
the surface of the parallelogram. We thus define the area vector to be

A = (u v) .

(1.17)

Note that there is an ambiguity of an overall sign in the definition, for we could
also have chosen it to be (v u). We will return to this point when discussing
the unit vector perpendicular to an area element.
1.9.2

Volume as a scalar

Just as two vectors can define a parallelogram, three (non co-planar) vectors u,
v and w
can define a parallelepiped which has the three vectors as three of its

Physics I Course Material

v
u

Figure 1.2: Volume of a parallelepiped


edges (see Fig. 1.2). Again by means of elementary geometry, one can show that
the volume of the parallelepiped is given by
Volume = u (v w)
.

(1.18)

The above expression seems to treat the three vectors dierently. but this is not
so. We can re-express the above formula by using the Levi-Civita tensor
Volume = ijk ui vj wk = u (v w)
= v (w
u) = w
(u v )

(1.19)

We note again that there is a sign ambiguity in the above definition. This can
be removed by taking the magnitude in the RHS of the above equation, since we
require the volume to be non-negative. Note that if u, v, w
are co-planar, the
volume of the parallelepiped vanishes automatically, as it must.

DEPARTMENT OF PHYSICS
INDIAN INSTITUTE OF TECHNOLOGY, MADRAS
PH101 Physics I

Problem Set 1

5.8.2002

1. Vector identities: This problem is meant to show you how easily various
vector identities are derived using the index notation introduced in class.
Using this notation and the properties of the symbols ij and ijk , etc.,
prove the following standard vector identities:
~ (B
~ C)
~ = (A
~ B)
~ C
~ (A
~ C)
~ B
~
(a) A
~ B)
~ (C
~ D)
~ = [A
~ (C
~ D)]
~ B
~
(b) (A
~ B)
~ (C
~ D)
~ = (A
~ C)
~ (B
~ D)
~
(c) (A

~ (C
~ D)]
~ A
~
[B
~ D)
~ (B
~ C)
~
(A

2. Time-dependent vectors: Very frequently, we have to deal with vector


~
~
quantities that depend on a scalar variable like the time, t. Let A(t),
B(t)
~
and C(t)
be such vectors. The rules of dierentiation with respect to t
(such as the chain rule, etc.) are then essentially the same as those for
the dierentiation of ordinary functions of t, paying attention to the order
of the factors in cross products, etc. The following relationships are easily
proved. Do so.
(a)

(b)

~
~
d ~ ~
dA
~ A + @A
~ dB A
(A B) = @
B
dt
dt
dt

(c)

(d)

~
~
dA
d ~ ~
~ A + @A
~ dB A
(A B) = @
B
dt
dt
dt

~
d @ ~ dB
A
dt
dt
0

~
d @ ~ dB
A
dt
dt

2~
~
dA
~A = A
~d B
B
dt
dt2

2~
~
dA
~A = A
~d B
B
dt
dt2

~
d2 A
~
B
dt2
~
d2 A
~
B
dt2

(e) The following is a very useful relationship in several physical applications:


~
~ dA = A dA ,
A
dt
dt
1

~ It follows at once that the vector dA/dt


~
where A |A|.
is perpen~
~
dicular to the vector A if and only if A is a vector of constant
magnitude (that is, its direction may change with time, but its length
does not).
3. The laws of reflection and refraction can be written in a precise and
compact form using vectors. Consider a ray of light that is partially reflected and partially refracted at a plane surface (with unit normal vector
n
) separating vacuum and a medium of refractive index . Let a
, b and c be
the unit vectors along the incident, refected and refracted rays, respectively.
(a) Write the laws of reflection and refraction in terms of the unit vectors
a
, b, c and n
and . Your answers should not explicitly involve any
angles (such as the angles of incidence, refraction, etc.).
(b) Show that b can in fact be written in terms of a
and n
, as
b = a

2 (
na
) n
.

4. Force on a charged particle in a magnetic field: We know that the


force on a particle of mass m and charge q in the presence of a magnetic
~ is given by
field B
d~v
~ .
F~ = m
= q (~v B)
dt
A number of physical properties of the motion can be proved directly from
~ be a steady, uniform
this equation, without solving it explicitly. Let B
magnetic field pointing in some arbitrary direction given by the unit vector
n
. Do not assume that this is along any particular coordinate axis.
(a) Resolve both ~v and F~ into components respectively parallel and perpendicular to the direction n
of the magnetic field, as ~v = ~vk + ~v? and
~
~
~
F = Fk + F? , and write the respective equations for F~k and F~? .
(b) Show that ~vk remains constant in time.
(c) Using this, show that if ~vk is initially equal to zero, then the motion
of the particle is restricted to a plane for all time.
(d) If this plane is given by the equation ax + by + cz = constant, where
~
a, b and c are real constants, find the direction of the magnetic field B
in terms of these constants.

You might also like