Applied Mathematics
Applied Mathematics
UNIT OBJECTIVES:
At the end of this unit each student will able to:
Know representation of vectors geometrically
Understand operation of vectors algebraically and geometrically.
Realize properties of operations on vectors.
Learn about projection of a vector along another vector.
Understand the dot product and cross product of two vectors.
Know the norm /magnitude of a vector.
Develop parametric and non-parametric equation of a given line.
Develop the parametric form and normal form of equation of a plane.
Understand about vector spaces and subspaces
Identify base and dimension of vector space.
1
Definition of Points In n-Space
by .
When n 2 or 3, it is customary to use the terms ordered pair and ordered triple,
respectively, rather than ordered 2-tuple and ordered 3-tuple. When n 1, each ordered n-
tuple consists of one real number, so may be viewed as the set of real numbers. It is
usual to write rather than for this set. It might have occurred to you in the study of
3-space that the symbol a1, a2 , a3 has two different geometric interpretations: it can be
interpreted as a point, in which case, a , a , and a are the coordinates (Figure 1.1.1a), or
1 2 3
it can be interpreted as a vector, in which case a , a , and a are the components (Figure
1 2 3
1.1.1b). It follows, therefore, that an ordered n-tuple a1, a2 , a3 ,..., an can be viewed either
Figure 1.1: The ordered triple a1, a2 , a3 can be interpreted geometrically as a point or as a
vector
Definition 1.2: a) Two vectors u u1,u2 ,u3 ,..., un and v v1, v2 , v3 ,..., vn in are
2
b) The sum u v is defined by u v u1 v1 ,u 2 v2, ,..., u n vn
c) If k is any scalar, the scalar multiple ku is defined by ku ku1, ku2 , ku3 ,..., kun The
operations of addition and scalar multiplication in this definition are called the standard
operations on .
d) The zero vector in is denoted by 0 and is defined to be the vector .
e) If u u1,u2 ,u3 ,..., un is any vector in , then the negative (or additive inverse) of
u is denoted by u and is defined by u u1,u2 ,u3 ,..., un .
f) The difference of vectors in n is defined by v u v u or, in terms of components
v u v1 u1, v2 u2 ,v3 u3 ,..., vn un
.
In this section, vectors in 2-space and 3-space will be introduced geometrically, arithmetic
operations on vectors will be defined, and some basic properties of these arithmetic
operations will be established.
Geometric Vectors
3
Definition 1.3: If v and w are any two vectors, then the sum is the vector
determined as follows: Position the vector w so that its initial point coincides with the
terminal point of v. The vector is represented by the arrow from the initial point of v
to the terminal point of w (Figure 1.3 a).
In Figure 1.3b we have constructed two sums, (color arrows) and (gray arrows). It is
evident that and that the sum coincides with the diagonal of the
parallelogram determined by v and w when these vectors are positioned so that they have
the same initial point.
The vector of length zero is called the zero vector and is denoted by 0. We can easly see
that
for every vector v. Since there is no natural direction for the zero vector, we shall agree that
it can be assigned any direction that is convenient for the problem being considered. If v is
any nonzero vector, then v , the negative of v, is defined to be the vector that has the
same magnitude as v but is oppositely directed (Figure 1.2.3). This vector has the property
(Why?) In addition, we define .
4
The negative of v has the same length as v but is oppositely directed(Figure1.4).
Subtraction of vectors is defined as follows:
Definition 1.4: If v and w are any two vectors, then the difference of w from v is defined by
(Figure1.5a).
Definition 1.5: If v is a nonzero vector and k is a nonzero real number (scalar), then the
product is defined to be the vector whose length is k times the length of and whose
Figure 1.6 illustrates the relation between a vector v and the vector and .
Note that the vector has the same length as v but is oppositely directed. Thus
is just the negative of v; that is,
5
Figure 1.6
A vector of the form is called a scalar multiple of v. As evidenced by Figure 1.6,
vectors that are scalar multiples of each other are parallel. Conversely, it can be shown that
nonzero parallel vectors are scalar multiples of each other. We omit the proof.
6
Figure 1.8
If v v1, v2 and k is any scalar, then by using a geometric argument involving similar
and
( )
Since, , it follows from Formulas 1 and 2 that
. (Verify.)
Figure 1.9
Vectors in 3-Space
Just as vectors in the plane can be described by pairs of real numbers, vectors in 3-space
can be described by triples of real numbers by introducing a rectangular coordinate
system. To construct such a coordinate system, select a point O, called the origin, and
choose three mutually perpendicular lines, called coordinate axes, passing through the
origin. Label these axes x, y, and z, and select a positive direction for each coordinate axis
7
as well as a unit of length for measuring distances (Figure 1.10a).Each pair of coordinate
axes determines a plane called a coordinate plane. These are referred to as the xy -plane,
the xz -plane,and the yz -plane. To each point P in 3-space we assign a triple of numbers
(x, y, z), called the coordinates of P, as follows: Pass three planes through P parallel to the
coordinate planes, and denote the points of intersection of these planes with the three
coordinate axes by X, Y, and Z (Figure 1.10b). The coordinates of P are defined to be the
signed lengths and .
In Figure 1.11a we have constructed the point whose coordinates are (4, 5, 6) and in Figure
1.11b the point whose coordinates are (-3, 2, -4)
Figure 1.10
Figure 1.11
Rectangular coordinate systems in 3-space fall into two categories: left-handed and right-
handed. A right-handed system has the property that an ordinary screw pointed in the
positive direction on the z-axis would be advanced if the positive x-axis were rotated 90°
toward the positive y-axis (Figure 1.12a); the system is left-handed if the screw would be
retracted (Figure 1.12b).
8
Remark: In this module we shall use only right-handed coordinate systems.
If, as in Figure 1.13, a vector v in 3-space is positioned so its initial point is at the origin of
a rectangular coordinate system, then the coordinates of the terminal point are called the
components of v, and we write
Figure 1.13
If v v1 , v 2 , v3 and w w1 , w2 , w3 are two vectors in 3-space, then arguments similar to
those used for vectors in a plane can be used to establish the following results
v and w are equivalent if and only if
and
9
Sometimes a vector is positioned so that its initial point is not at the origin. If the vector
That is, the components of are obtained by subtracting the coordinates of the initial
point from the coordinates of the terminal point. This may be seen using Figure 1.14.
Figure 1.14
The vector ⃗⃗⃗⃗⃗⃗⃗⃗ is the difference of vectors ⃗⃗⃗⃗⃗⃗⃗ and ⃗⃗⃗⃗⃗⃗⃗ , so
The following theorem lists the most important properties of vectors in 2-space and 3-
space.
Theorem 1.6: Properties of Vector Arithmetic
10
If u, v, and w are vectors in 2- or 3-space and k and l are scalars, then the following
relationships hold.
(a) u v v u (e) k lu kl u
(b) u v w u v w (f) k u v ku kv
(c) u 0 0 u u (g) k l u ku lu
Before discussing the proof, we note that we have developed two approaches to vectors:
geometric, in which vectors are represented by arrows or directed line segments, and
analytic, in which vectors are represented by pairs or triples of numbers called components.
As a consequence, the equations in Theorem 1.6 can be proved either geometrically or
analytically. To illustrate, we shall prove part (b) both ways. The remaining proofs are left
as exercises.
Proof of part (b) (analytic) we shall give the proof for vectors in 3-space; the proof for
2-space is similar. If, ,and then
Also,
Therefore,
11
Figure 1.15 The vectors and are equal
The most important arithmetic properties of addition and scalar multiplication of vectors in
n are listed in the following theorem. The proofs are all easy and are left as exercises
Theorem 1.7 : Properties of Vectors in
If u u1,u2 ,u3 ,..., un , v v1, v2 , v3 ,..., vn , and w w1, w2 , w3 ,..., wn are vectors in
12
The magnitude of a vector which is a length of a vector and also called norm of a vector.
The length of a vector u is often called the norm of u and is denoted by . It follows
(Figure 1.16a).
Let be a vector in 3-space. Using Figure 1.16b and two applications of the
Theorem of Pythagoras, we obtain
Thus
Similarly, if and are points in 2-space, then the distance between them
is given by
13
Figure 1.17
From the definition of the product ku , the length of the vector ku is k times the length of u.
Exercise
(a)
(b)
(c)
2. Find the distance between and .
(a)
(b)
14
(c) Use the result in part (a) to find a unit vector that is oppositely directed to the vector
.
Definition 1.8 : IIf u and v are vectors in 2-space or 3-space and θ is the angle between u
and v, then the dot product or Euclidean inner product u.v is defined by
(1)
15
Figure 1.19
For purposes of computation, it is desirable to have a formula that expresses the dot
product of two vectors in terms of the components of the vectors. We will derive such a
formula for vectors in 3-space; the derivation for vectors in 2-space is similar.
Let u (u1 , u 2 , u 3 ) and v (v1 , v 2 , v3 ) be two nonzero vectors. If, as shown in Figure1.20
, θ is the angle between u and v, then the law of cosines yields
(2)
or
Substituting
And
16
Figure 1.20
If and are two vectors in 2-space, then the formula
corresponding to 3 is
(4)
(5)
Example 2: Dot Product Using (3)
Consider the vectors and . Find u.v and determine the angle
θ between u and v.
Solution:
Thus, 60 0 . The following theorem shows how the dot product can be used to obtain
information about the angle between two vectors; it also establishes an important
relationship between the norm and the dot product.
Theorem 1.9 : Let u and v be vectors in 2- or 3-space.
17
Proof (a): Since the angle θ between v and v is 0, we have
Proof (b): since θ satisfies 0 , it follows that θ is acute if and only if cos 0 , that
θ is obtuse if and only if cos 0 , and that if and only if cos 0 . But cos has
2
the same sign as u.v since , u 0 , and v 0 .
Therefore, u and v make an obtuse angle, v and w make an acute angle, and u and w are
perpendicular.
Orthogonal Vectors
Perpendicular vectors are also called orthogonal vectors. In light of the Theorem above b,
two nonzero vectors are orthogonal if and only if their dot product is zero. If we agree to
consider u and v to be perpendicular when either or both of these vectors is , then we can
state without exception that two vectors u and v are orthogonal (perpendicular) if and only
18
(6)
Since the vector runs along the line (Figure 1.21), we need only
show that n and are perpendicular. But on subtracting the equations in (6), we obtain
which can be expressed in the form
Figure 1.21
The following theorem lists the most important properties of the dot product. They are
useful in calculations involving vectors.
Theorem 1.10 : Properties of the Dot Product
If u, v, and w are vectors in 2-or 3-space and k is a scalar, then
(a) u.v v.u
(b) u. v w u.v u.w
Similarly,
19
In many applications it is of interest to ―decompose‖ a vector u into a sum of two terms,
one parallel to a specified nonzero vector a and the other perpendicular to a. If u and a are
positioned so that their initial points coincide at a point Q, we can decompose the vector u
as follows (Figure 1.3.7): Drop a perpendicular from the tip of u to the line through a, and
construct the vector from Q to the foot of this perpendicular. Next form the difference
Figure 1.22
The vector u is the sum of and , where is parallel to a and is
perpendicular to a.
The following theorem gives formulas for calculating and
Theorem 1.11 :
20
(8)
Taking the dot product of both sides of 8 with a and using the above Theorems yields
(9)
Since , we obtain
As a check, the reader may wish to verify that the vectors and a are
perpendicular by showing that their dot product is zero.
A formula for the length of the vector component of u along a can be obtained by writing
Which yields
(10)
If.θ denotes the angle between u and a, then , so 10 can also be written
as
21
(11)
(Verify.) A geometric interpretation of this result is given in Figure 1.23.
As an example, we will use vector methods to derive a formula for the distance from a
point in the plane to a line.
(in the interval 0, that a makes with the positive x ,y, and z -axes. (See Figure 1.24)
Figure 1.24
The cosines of these direction angles cos , cos , and cos , are called the direction
cosines of the vector a. Using the formula of angles between two vectors , we obtain
a i a a a
cos 1 , cos 2 , and cos 3
a i a a a
22
Solution: Since a 12 2 2 3 2 14 then
1 2 3
cos , cos , and cos
14 14 14
1 2 3
cos1 740 , cos1 580 and cos1 37 0
14 14 14
Exercise
1. Find .
(a)
(b)
2. In each part of Exercise 1, find the cosine of the angle θ between u and v.
3. Determine whether u and v make an acute angle, make an obtuse angle, or are
orthogonal.
(a)
(b)
4. Find the orthogonal projection of u on a.
(a)
(b)
5. In each part of Exercise 4, find the vector component of u orthogonal to a.
Definition 1.12 : The cross product of two vectors and , unlike the dot
product, is a vector. For this reason it is also called the vector product. Note that
23
Definition 1.13: If 〈 〉 and b 〈 〉, then the cross product of and is
the vector
〈 〉
This may seem like a strange way of defining a product. The reason for the particular form
of Definition 1.13 is that the cross product defined in this way has many useful properties,
as we will soon see. In particular, we will show that the vector is perpendicular to both and
. In order to make Definition 1.13 easier to remember, we use the notation of determinants.
Observe that each term on the right side of Equation 2 involves a number in the first
from the left side by deleting the row and column in which appears. Notice also the
minus sign in the second term. For example,
If we now rewrite Definition 1 using second-order determinants and the standard basis
vectors and , we see that the cross product of the vectors and
24
In view of the similarity between Equations 2 and 3, we often write
Although the first row of the symbolic determinant in Equation 4 consists of vectors, if we
expand it as if it were an ordinary determinant using the rule in Equation 2, we obtain
Equation 3. The symbolic formula in Equation 4 is probably the easiest way of
remembering and computing cross products.
The second method is slightly easier; however, many textbooks don‘t cover this method as
it will only work on determinant. This method says to take the determinant as listed
above in equation 4 and then copy the first columns onto the end as shown below.
We now have three diagomals that move from left to right and three diagonals that move
from right to left. We multiply along each diagonal and add those that move from left to
right and subtract those that move from right to left.
25
One of the most important properties of the cross product is given by the following
theorem.
Theorem 1.14: The vector is orthogonal to both and .
26
Since a vector is completely determined by its magnitude and direction, we can now say
that is the vector that is perpendicular to both and whose orientation is
determined by the right hand rule,whose length is
In fact, that is exactly how physicists define .
Proof: Two nonzero vectors a and b are parallel if and only if . In either case
The geometric interpretation of Theorem 1.15 can be seen by looking at Figure 2. If and
are represented by directed line segments with the same initial point, then they determine
a parallelogram with base | | altitude | |
Thus we have the following way of interpreting the magnitude of a cross product.
27
Example 3: Find a vector perpendicular to the plane that passes through the points
we obtain
Observe that
28
So the associative law for multiplication does not usually hold; that is, in general,
However, some of the usual laws of algebra do hold for cross products. The following
theorem summarizes the properties of vector products.
These properties can be proved by writing the vectors in terms of their components and
using the definition of a cross product. We give the proof of Property 5 and leave the
remaining proofs as exercises.
Proof Of Property 5
There is an important difference between the dot product and cross product of two
vectors—the dot product is a scalar and the cross product is a vector. The following
theorem gives some important relationships between the dot product and cross product
and also shows that is orthogonal to both u and v.
Theorem 1.17:- Relationships Involving Cross Product and Dot Product
If u, v, and w are vectors in 3-space, then
(a)
(b)
(c)
29
(d
(e)
(1)
and
(2)
the proof can be completed by ―multiplying out‖ the right sides of 1 and 2 and verifying
their equality.
Proof (d) and (e) As an Exercise.
Example 5: Is Perpendicular to u and to v
Consider the vectors
Then we can show that
Since
and
is orthogonal to both u and v, as guaranteed by the Theorem above.
Exercise
(a)
(b)
(c)
2.Find a vector that is orthogonal to both u and v.
(a) ,
(b) ,
30
A line in the xy-plane is determined when a point on the line and the direction of the line
(its slope or angle of inclination) are given. The equation of the line can then be written
using the point-slope form. Likewise, a line L in three-dimensional space is determined
when we know a point P0 x0 , y 0 , z 0 on Land the direction of L . In three dimensions the
direction of a line is conveniently described by a vector, so we let be a vector parallel to L .
Let Px, y, z be an arbitrary point on L and let r0 and r be the position vectors of P0 and
P(that is, they have representations and ). If a is the vector with representation
Figure 1.26
which is a vector equation of L . Each value of the parameter t gives the position vector
r of a point on L. In other words, as t varies, the line is traced out by the tip of the vector r.
As Figure 1.6.2 indicates, positive values of t correspond to points on L that lie on one side
of P0 , whereas negative values of t correspond to points that lie on the other side of P0 . If
the vector v that gives the direction of the line L is written in component form as
v a, b, c , then we have tv ta, tb, tc . We can also write r x, y, z and r0 x0 , y 0 , z 0
, so the vector equation (1) becomes x, y, z x0 ta, y0 tb, z 0 tc
Two vectors are equal if and only if corresponding components are equal. Therefore we
have the three scalar equations:
31
Where t . These equations are called parametric equations of the line L through the
point P0 x0 , y 0 , z 0 and parallel to the vector v a, b, c . Each value of the parameter t
Figure 1.27
Example 1:
(a) Find a vector equation and parametric equations for the line that passes through thepoint
5,1,3 and is parallel to the vector i 4 j 2k .
Figure 1.28
(a) Here r0 5,1,3 5i j 3k and v i 4 j 2k , so the vector equation(1) becomes
r 5i j 3k t i 4 j 2k r 5 t )i 1 4t j 3k 3 2t k
Or Parametric equations are x 5 t , y 1 4t , z 3 2t
(b) Choosing the parameter value t= 1 gives x=6, y=5, and z=1 so 6,5,1 is a
32
point on the line. Similarly t=-1, gives the point 4,3,5 .
The vector equation and parametric equations of a line are not unique. If we change the
point or the parameter or choose a different parallel vector, then the equations change.
Forinstance, if, instead of 5,1,3 , we choose the point 6,5,1 in Example 1, then the
parametric equations of the line become x 6 t , y 5 4t , z 1 2t Or, if we stay with
the point 5,1,3 but choose the parallel vector 2i 8 j 4k , we arrive at the equations
direction of a line L , then the numbers a, b, and c are called direction numbers of L.
Since any vector parallel to v could also be used, we see that any three numbers
proportional to a, b, and c could also be used as a set of direction numbers for L. Another
way of describing a line L is to eliminate the parameter from Equations 2. If
none of a, b, or c is 0, we can solve each of these equations for t, equate the results, and
obtain
These equations are called symmetric equations of L. Notice that the numbers a, b, andc
that appear in the denominators of Equations 3 are direction numbers of L , that is,
components of a vector parallel to L . If one of a, b, or c is 0, we can still eliminate t . For
instance, if a = 0, we could write the equations of L as
y y0 z z 0
x x0 , This means that L lies in the vertical plane x= x0 .
b c
Example 2:
(a) Find parametric equations and symmetric equations of the line that passes through
the points A2,4,3 and B3,1,1 .
(b) At what point does this line intersect the xy-plane?
Solution:
33
Figure 1.29
(a) We are not explicitly given a vector parallel to the line, but observe that the vector v
with representation line AB is parallel to the line and v 3 2,1 4,1 (3) 1,5,4
Thus direction numbers are a=1, b=-5, and c=4 . Taking the point 2,4,3 as P0 , we see
that parametric equations (2) are x 2 t , y 4 5t , z 3 4t and symmetric equations
x2 y4 z3
(3) are
1 5 4
(b) The line intersects the xy-plane when z = 0 , so we put z =0 in the symmetric equations
x2 y4 3 11 1
and obtain This gives x and y , so the line intersects the xy-
1 5 4 4 4
11 1
plane at the point , ,0 .
4 4
In general, the procedure of Example 2 shows that direction numbers of the line L
through the points P0 x0 , y 0 , z 0 and P1 x1 , y1 , z1 are x1 x0 , y1 y 0 , and z1 z 0 and so
symmetric equations of L are
x x0 y y0 z z0
x1 x0 y 1 y 0 z1 z 0
The vector equation of a line through the (tip of the) vector r0 in the direction of a vector v
is . If the line also passes through (the tip of) , then we can take
and so its vector equation is r r t r1 r0 1t r tr1
0 0
The line segment from to is given by the parameter interval 0 t 1 .
34
PLANES
Although a line in space is determined by a point and a direction, a plane in space is more
difficult to describe. A single vector parallel to a plane is not enough to convey the
―direction‖ of the plane, but a vector perpendicular to the plane does completely specify its
direction. Thus a plane in space is determined by a point P0 x0 , y 0 , z 0 in the plane and a
vector that is orthogonal to the plane. This orthogonal vector is called a normal
vector. Let Px, y, z be an arbitrary point in the plane, and let and be the position
vectors of P0 and P . Then the vector is represented by a line P0P. (See Figure
below.) The normal vector is orthogonal to every vector in the given plane. In particular,
is orthogonal to and so we have ( ) = 0 which can be rewritten as
And this Equation is called a vector equation of the plane. To obtain a
scalar equation for the plane, we write a, b, c , x, y, z , and x0 , y 0 , z 0 .
Figure 1.30
Example-1: Find an equation of the plane through the point 2,4,1 with normal Vector n
35
they -intercept is 4 and the z-intercept is 3. This enables us to sketch the portion of the
plane that lies in the first octant (see Figure below)
Figure1.31
Example-2: Find an equation of the plane that passes through the points P1,3,2, Q3,1,6
and R5,2,0
Solution: The vectors a and b corresponding to line PQ and line PR are a= 2,4,4 ,
b 4,1,2 .Since both a and b lie in the plane, their cross product axb is orthogonal to
the plane and can be taken as the normal vector.
i j k
Thus, n = axb = 2 4 4 = 12i+20j+14k With the point P1,3,2 and the normal vector
4 1 2
Note : Two planes are parallel if their normal vectors are parallel. For instance,
the planes x 2 y 3z 4 and 2 x 4 y 6 z 3 are parallel because their normal vectors
are = 1,2,3 and = 2,4,6 and =2 . If two planes are not parallel, then they
intersect in a straight line and the angle between the two planes is defined as the acute
angle between their normal vectors.
36
Definition 1.18 : Definition of a Vector Space
Let V be a set on which two operations (vector addition and scalar multiplication)
are defined. If the following axioms are satisfied for every u, v and w in V and for every
scalar (an element of a field F) c and d, then V is called vector space over a field F
Addition:
A-1 u + v is in V ------------------------- Closure under addition
A-2 u + v = v + u---------------------------Commutative Property
A-3 u + (v+ w) = (u + v) + w --------------Associative property
A-4 V has a zero vector 0 such that ----Existence of Additive Identity for every u in
V, u+0 = u
A-5 For every u in V, there is ----------- Existence of additive inverse a vector in V
denoted by-u Such that u + (-u) =0 where 0 is the identity element
Scalar Multiplication:
A-6 cu is in V ------------------ Closure under scalar multiplication
A-7- c (u + v) = cu + cv ---------- Distributive property
A-8- (c+d) u=cu + du -------- Distributive Property
A-9- c (du) = (cd) u = d (cu) -------- Associative Property
A-10- 1 (u) = u, where 1 F --- -- Scalar Identity
Remark
1. It is important to realize that a vector space consists of four entities: a set of vectors,
a set of scalars, and the two operations. When you refer to a vector space V, be sure
that all four entities are clearly stated or understood,
2- A vector space together with the defined operation addition is Abelian group.
Class Activity-
1- Is a vector space V can be empty set? What ever your answer comes with justification
2 - Identify the difference between vector space axioms 7 and 8 (A-7 & A-8)
3 - Is the zero vectors invertible under vector addition? If so, find its additive inverse
37
Examples of different models of a vector space
Throughout mathematics one encounters many examples of mathematical objects that can
be added to each other and multiplied b y numbers. Geometric vectors and vectors in n-
space studied in the preceding chapter are examples of such objects. Other examples are
real-valued functions, complex numbers, and polynomials over the real. In this chapter we
discuss a general mathematical concept which includes these examples and many more.
Throughout this chapter, by a ―field‖ we shall mean either the field of rational numbers, we
shall, how ever, use a neutral letter k, since it is necessary to deal with each of these fields.
The elements of k will also be called numbers or scalars.
Definition 1.19: A vector space v over a field is a set of objects which can be added and
multiplied by elements of such that the following axioms are satisfied
Axiom 1: (closure under addition)
For all v1 , v2 v, there exists a unique element of v called the
Sum of v1 and v2 , denoted v1 v2
Examples of Vectorspaces
Example-1 Let V= Define: x1 x 2 x1 x 2 x1 , x 2 R , usual product in
and , then show that V is a vector space over with 1 as the zero
element.
Solution:
1. Let x1 , x 2 V then x1 x 2 x1 x 2
38
= x2 x1
x1 x2 x2 x1 x1 , x2 R
Vector addition is commutative
3. Let x1, x2 , x3 V then
( x1 x 2 ) x3 = ( x1 x2 ) x3 by definition of addition in V
= ( x1 x 2 ) x3 , by definition of addition in V
= x1 ( x 2 x3 ) , (Why?)
= x1 ( x 2 x3 ), by definition of ‗+' in V
Thus, ( x1 x 2 ) x3 x1 ( x 2 x3 ) x1 , x 2 , x3 V
Now, e x x
e.x x, By definition of ‗+' in V
e 1 , ( How?)
The zero vector in V = is 1
Therefore, such that 1+ x x x 1x V
Additive identity exists in V
5. Let x V R suppose y V is
An additive inverse of x then
x y 1 y x
But, x y 1
x. y 1
y 1 / x (Why?)
39
V is closed under scalar multiplication.
7. Let a and x, y V then
a( x y) a ( x. y ) By definition of '+' in V
= ( x. y ) a by definition of '.' in V
= x a .y a (Why?)
= ax ay by definition of '.' in V =
Therefore, a ( x y ) ax ay Va and x, y V
8. Let a, b and x V then
= ax bx , by definition of ‗.' In V
( a b) x ax bx a, b And x V
9. Let a, b and x V then
( x b ) a , by definition of ‗.' in V
x ba , Why?
x ab
( ab) x
40
1. Let u , v V then u ( x,2 x ) for some x and v ( y,2 y ) for some y .
Now, u+v ( x,2 x) ( y,2 y )
( x y ,2 x 2 y ), By definition ‗+' in V
( x y,2( x y ))
( z , 2 z ), Where z x y
u v ( z , 2 z ) For some z x y
( x y) z, (2 x 2 y) 2 z , by definition of '+' in V
41
0+u (0,0) ( x,2 x) (0 x,0 2 x)
( x, 2 x )
u
0u u u V
Therefore additive identity element exists in V and is the additive identity
element in V
5. Let u ( x,2 x) V then u ( x,2 x) V and
u+-u ( x,2 x) ( x,2 x)
( x x,2 x 2 x)
(0,0)
u u 0
u V , u V Such that u u 0
That is, every element in V has an additive inverse.
6. Let a and u ( x, 2 x ) V then
a . u a ( x, 2 x )
( y,2 y ); Where y ax
a . u ( y ,2 y ) For some y ax
au V , why?
Therefore V is closed under scalar multiplication.
7. Let a and u ( x,2 x), v ( y,2 y ) V then a(u v) a( x,2 x) ( y,2 y)
a ( x y,2 x 2 y ), By definition of ' +' in V
= a ( x, 2 x ) a ( y , 2 y )
au av
a (u v) au av a u , v V
42
(a b) x, (a b)2 x) ; By definition of scalar multiplication in V
(ax bx,2ax 2bx) ; Why?
au bu
(a b)u au bu a, b And u V
u
1.u uu V
Therefore, by 1-10 we've V is a vector space over
43
a( x1 , x 2 , x3 ) (ax1 , ax2 , 0)
1.(1,1,1) (1,1,1) = u
1. u u
Remark
A single failure of one of the ten vector space axioms suffices to show that a set is not a
vector space. That is to show that a set V is not a vector space we need only find one thing
wrong regardless of how many of the ten axioms are satisfied.
Subspaces
Definition 1.20: (definition of subspace of a vector space)
Let V be a vector space over a field F and W be a subset of V, then we say that W is a
subspaces of V if W is itself a vector space over F under the operations of addition and
scalar multiplication in V.
Class Activity
1. Let V= the set of all real valued continuous functions defined on 0,1 and W be the
To establish that a set W is a vector space we must verify all the ten vector space axioms.
However if W is a subset of a larger vector space V (and the operations defined on W are
the same as those defined on V), then most of the ten properties are inherited from the
44
larger space and need no verification. The following theorem tells us that it is sufficient to
test for closure in order to establish a subset of a vector space is a subspace.
multiplication.
A-1 and A-6 are satisfied ----- ( )
Also, let u, v, w W and a, b F then
u,v,w V and a,b F ;Since WcV
A-2, A-3, A-7, A-8, A-9, and A-10 are satisfied automatically, by definition
of V --------
45
every element in W is invertible with respect to the induced additive
operation
A 5 is satisfied ---------
Thus, by , , and we have W is a vector space over F.
W is a subspace of V over F.
Remark:
1. If W is a subspace of a vector space V over a field F then both W and V must have
the same zero vector O. Justify!
2. Since a subspace of a vector space is itself a vector space, it must contain the zero
vector.
Infact, the simplest subspace of a vector space is the one consisting of only the zero
Vector = 0 W 0
Observe that Improper subspaces of a vector space always exist while the proper
Subspace of vector spaces may not exist.
Example (1) Let V be the set of all real valued functions defined on and
W = f : / f is even function then show that
A) With the standard operations, is a vector space over
B) W is a subspace of V over
Solution A) Left as an exercise to you.
B) Let V= f / f : is afunction and
46
W= f : / f ( x) f ( x)x
I) Clearly, W V ; why?
Also, consider the function f 0 that has a value of zero for all x.
f 0 W
W . Thus, W cV
II) Let F , G W then F , G : Such that both are even functions and hence
F G : such that F+G is even function,
Since (F+G) ( x) F ( x) G ( x) F ( x) G ( x); why?
( F G )( x)x
af ( x ); Why?
(af )( x)x
af W
Exercise:
47
1. Determine whether the set W is a subspace of 3 over with standard operations.
Justify your answer
a) W (a, b,2a 3b) : a, b
b) W (r, r t , t ) : r , t
c) W ( x, y, xy) : x, y
d) W ( x, y,3) : x, y
2. Which of the following subset of C (- , ) are subspaces of C (- , ) over ?
a) The set of all real valued nonnegative functions: f ( x) 0x
b)The set of all real valued even functions: f ( x) f ( x) x
c) The set of all real valued odd functions: f ( x) f ( x)x
d) The set of all real valued constant functions: f ( x) cx
e) The set of all functions such that: f (0) 0;0
f) The set of all functions such that: f (0) 1;0,1
3. Prove that a non empty set W is a subspace of a vector space V over field F if and only
if ax by is an element of Wx, y Wanda, b F .
4. Give an example showing that the union of two subspace of a vector space V over a
filed F is not necessarily a subspace of V over F.
Spanning Sets
Definition 1.23: Definition of linear combination of vectors
Let V be a vector space over a field F and v, x1 , x 2 , , x n be vectors in V then we
v 1 x1 2 x2 n xn ; Where i F ,1 i n
48
Example 2: Finding a liner combination
Let v (1,1,1), x1 (1,2,3), x2 (0,1,2) and x3 (1,0,1) be vectors in 3 then write a linear
combination of x1 , x2 and x3 .
But v 1 x1 2 x 2 3 x3
( 1 3 ,21 2 ,31 2 2 3 )
1 3 1
21 2 1
3 2 1
1 2 3
Hence; v 2 x1 3x2 x3 . other choices for r would yield other ways to write v as a linear
combination of x1 , x2 and x 3 .
49
xx1 yx2 zx3 We have S x1 , x 2 , x3 spans or generates 3
Note: The spanning sets given in example -3 are called the standard spanning sets
of and p 2 ,respectively
Definition 1.25: Let V be a vector space over a filed F and S be non empty subset of V
then the set of all finite linear combination of elements of S is called span of S and span of
S is denoted by span (S). That is span
n
(s) = i xi : i F and xi S i such that 1 i n
i 1
Proof: Exercise
Definition 1.27: Let V be a vector space over a field F and S = x1 , x2 , , xn be subset
n
of V then S is called linearly independent if the vector equation x
i 1
i i 0 has only the
trivial solution. If there are also non-trivial solutions, then S is called linearly dependent.
That is,
a) If ∑ , then S is a linearly independent set.
50
n
b) If 1 , 2 , , n F not all zero such that
i 1
i xi 0 then S is linearly
dependent set
Proof: Exercise
Example-4
Let S 1 x 2 x 2 ,2 5 x x 2 , x x 2 then show that S is linearly dependent subset o P2
51
Solution:
Let f1 1 x 2 x 2 , f 2 2 5 x x 2 and f 3 x x 2 thenone can observethat
Example 5:
Let V be a vector space over a field F and u, v V then show that u and v are linearly
dependent if and only if one is a scalar multiple of the other.
Solution:
It is a immediate consequence of theorem 2.6. It is a simple test for determining
Whether two vectors are linearly dependent
Exercise :
generates p 3
52
Bases and Dimension of a Vector space
Remark:
1. The above definition tells us that a basis has two features. A basis S must have enough
vectors to span V, but not so many vectors that one of them could be written as a linear
combination of the other vectors in S.
2. An infinite set S is said to be a basis for V if
i- S spans V and ii- S is linearly independent. That is, if every element of V is a finite
linear combination of elements of S and every finite subset of S is linearly independent.
Thus a basis of a vector space V may consists of infinite number of vectors.
Class Activity:
1. Show that s(1, 0, 0), (0, 1, 0), (0, 0,1 is a basis for R3. S is called the standard basis
for R3.
2. Let e1 (1, 0, ,0), e2 (0, 1, 0, ,0), , en (0,0, ,1)
then show that
S= e1 , e2 , , en is a basis for Rn S is called a standard basis for Rn.
3. Show that
a) S (1, 2, 3), (0, 1, 2), (2, 0, 1) is a non standard basis for R3
53
Let V be a vector space over field F and s x1 , x 2 , x n be a basis for V then every
vector in V can be written in one and only one way as a linear combination of vectors
in S.
Proof : Exercise
containing more than n vectors in V is linearly dependent. That is, the maximum number
of linearly independent vectors in V is n.
Proof: Exercise
Example-3: Linearly Dependent sets in R3, P3 & Rn
a) Since R3 has a basis consisting of three vectors, the set
(1,2,1), (1,1,0), (2,3,0), (5,9,1) must be linearly dependent.
b) Since P3 has a basis consisting of four vectors, the set
1, 1 x, 1 x,
1 x x 2 ,1 x x 2 must be linearly dependent.
c) Since, Rn has the standard basis
Containing of n vectors, it follows from theorem 2-8 that every set of vectors in Rn
congaing more than n vectors must be linearly dependent.
Proof:
Let S1 x1 , x 2 , , x n and S 2 y1 , y 2 , , y m be basis for V over F then since
54
is a basis for V and S1 is linearly independent we get n m (**)
Now, Combing (*) and (**) we have m n and n m which intern implies m=n.
Example-4
a) Since the standard basis for R3, (1,0,0), (0,1,0), (0,0,1) i, j, k has three vectors
b) Since the standard basis for p3 , 1, x, x 2 , x 3 has four elements we have the set
x 2, x 2
, x 3 1,3x 1, x 2 2 x 3 is not a basis for P3.
Example -5
a) Since e1, e2 , e3 , , en form a basis for Rn over R we have dim (Rn) = n
Exercise:
55
b) (1, i for overR
4. Let w1 (2t , t ) :t R
w2 (0, t ) :t R
w3 (2r , r ,r ) : r R
w4 (2s t , s, t ) : s, t R
a) Show that w1 and w2 are subspaces of R2 over R
b) Show that w3 and w4 are subspaces of R3 over R
SUMMARY OF CHAPTER-ONE
vectors are physical quantities that have both magnitude and direction, and thus can not
be described or represented by a single real number. Geometirically vectors are
represented by a directed line segment or an arrow. We define a vector in n-space to be
an n-tuple of numbers: ( x1 , x 2 , , x n )
56
g) (rs)v r ( sv) s(rv)
h) 1v v
i) 0v 0 r 0
The above properties are properties of vector addition and scalar multiplication.
In the xy-plane a vector v whose initial point is the origin and terminal point is
A( x, y ) is represented by or where and
and we have:
b) If p ( x1 , y1 ) then vectorAP A P ( x, y) ( x1 , y1 ) ( x x1 , y y1 )
c) k v k ( x, y ) (kx, ky)
d) If u (a, b ) then v u ( x a, y b ) and v u ( x a, y b)
a) u v is a vector in n
b) u v v u
c) (u v) w u (v w)
d) u 0 u
e) u (u ) 0
f) cu is a vector in n
g) c(u v) cu cv
h) (c d )u cu du
i) c(d u)=(c d)u=d(c u)
j) 1 u u
57
The above properties are properties of vector addition and scalar multiplication
Let u ( x1 , x 2 ,, x n ) and v ( y1 , y2 , , yn ) be vectors in n then
a) u x12 x 22 x n2
b) u.v x1 y1 x 2 y 2 x n y n
c) d (u, v) u v ( x1 y1 ) 2 ( x n y n ) 2
3.
4. There is an object 0 in V, called a zero vector for V , such that
for all
u in V .
5. For each u in V , there is an object in -u , called a negative of u , such that
.
6. If k is any scalar and u is any object in V , then ku is in V .
7.
8.
9.
10. 1.u = u
58
Miscellaneous Exercises
e)
2. (a) Find the components of the vector w if w 5 and the direction angle for w is
3000
3. (a) Let x and y be unit vectors and the angle between them is then find
6
x y and x y
5. Use the given functions f and g in C 1,1 and the definition
1
f.g = the dot product of f and g
1
f ( x ) g ( x ) dx to find
(i) f .g (iii) g
(ii) f (iv) d ( f , g )
(a) f ( x) x 2 , g ( x) x 2 1 ( c ) f ( x ) x, g ( x ) x 2 x 2
(b) f ( x) x, g ( x) e x (d ) f ( x) 1, g ( x) 3x 2 1
59
6. (a) Find a unit vector that is orthogonal to both u i 4 j k and v 2i 3 j
(b) Find the area of a parallelogram that has u 3i 4 j k and v 2 j 6k
as adjacent sides.
7. Find an equation of the plane passing through the given points
a) (1, 2, 1), (1, 1, 7) and (2, 1, 3)
b) (0, 1, 0), (1, 1, 0), (2, 1, 2)
c) (0, 0, 0), (1, 1, 0), (0, 1, 1)
d) (1, 2, 7), (4 , 4, 2), (3, 3, 4)
8. Prove that
a) If u v u v then u v
2i kj and ki j is 3 then k 1 or k 2
10
References
1. Bernard Kolman & David R. Hill, Elementary linear algebra, 8th ed., Prentice
Hall, 2004
2. Larson/Edwards, Elementary Linear Algebra, D.C. Heath and Company,
Lexington, 1988
3. H. Anton and C Rorres, Elementary linear algebra 8th ed, John Wiley &Sons,
INC., 1994
4. K. Hoffman & R. Kunze, Linear Algebra, 2nd ed., Prentice Hall INC.1971
5. S. Lipschutz, Theory and problems of linear algebra, 2nd ed.McGraw-Hill1991
6. D. C. Lay, Linear algebra and its applications, Pearson Addison Wesley, 2006
7. Demissu Gemeda, An Introduction to Linear Algebra, Department of
Mathematics, AAU, 2000
8. Serge Lang; Linear Algebra
9. Schaum‘s Outline in Linear Algebra
10. Piage and swift: Linear Algebra
11. Beaumont: Linear Algebra
12. Halms: Finite Dimensional Vector space
13. Nomizu: Fundamentals of Linear Algebra
14. Isaak and Manougian, Basic Concepts of Linear Algebra, 1st ed., George J.
McLead Limited, 1976
15. J.N. Sharma and et al, Linear Algebra, Krishna Prakashan Media(P) Ltd., 2003
16. Otto Bretscher, Linear algebra with application, 3rd ed., Prentice Hall, 2005
60
CHAPTER-TWO
MATRICES AND DETERMINANTS
UNIT OBJECTIVES:
At the end of this unit each student should able to:
Learn a matrix and know properties of matrices
Know properties of matrix operations
Realize inverses of a matrices
Know types of matrix
Work with elementary row/column operations
Understand about determinant of a matrix
Understand how to solve system of linear equations by using Cramer‘s rule.
Work with eigenvalues and eigenvectors of a matrix
Introduction
2 x1 3x 2 7 x3 5 x 4 1
x1 10x 2 x3 2 x 4 1
8 x1 4 x2 11x3 8 x4 0
61
Moreover, certain abstract objects in higher mathematics can also be represented by
matrices. Thus, matrices can be used to solve systems of linear equations. They have also
numerous applications.
a11 a1 2 a1 n
a 21 a 2 2 a 2 n
' ' '
m rows
' ' '
' ' '
a m1 a m 2 a m n
Remarks
1. The numbers in the above matrix are called the matrix entries and are denoted by
ai j ,Where i, j are indices (which are natural numbers) with 1 i m and
1 j n.
The index i is called the row index or row subscript because it gives the position
in the horizontal lines, and j is the column index or column subscript because it
gives the position in the vertical lines. So aij is the entry which appears in the ith
62
2. The plural of matrix is matrices. If each entry of a matrix is a real number /
complex number, then the matrix is called a real/ complex matrix.
3. An mxn matrix is said to be of order mxn. If m=n, then the matrix is square of
order n. For a square matrix, the entries a11 , a2 2 , a3 3 , are called the main
diagonal entries.
4. We can abbreviate the above matrix by writing A (ai j )mxn.
[ √ ]
Since A has three rows and four columns, we say that A is a 3x4 matrix or A is a matrix of
order 3x4. Moreover, a11 0, a12 1, a13 2, a14 3
a 21 e, a 2 2 2, a 2 3 1, a 2 4 5, a31 1
2 , a3 2 3, a3 3 0, a3 4 10
0 2 1 3 [ ]
Equality of matrices
63
That is, two matrices of the same order are said to be equal if their corresponding entries
are equal.
x 1 0 2 y
5
z 1
w 2
Solution:
Because two matrices are equal only if the corresponding entries are equal, we conclude
that: x 1 2, y 0, w 5 and z 1 2
x 1, y 0, w 5, z 1.
given by A B ai j bi j That is, we add the two matrices of the same order by adding
their corresponding entries and the sum of two matrices of different orders is undefined.
Solution:
a) Since A and B have different orders, we have A+B is undefined.
2 1 0 1 0 0
b) A+C = 4 0 1 + 0 1 0
3 2 2 0 0 1
64
Let A and B be two Matrices of the same order then the difference of A and B, denoted
by A-B , is defined to be the matrix obtained by subtracting each element of B from the
corresponding elements of A. That is if ( ) ( ) then
( ) .
1 2 3 2 3 0 1 2 23 3 0 1 1 3
A B =
0 1 4 1 2 5 0 (1) 1 2 4 5 1 1 3
Note that: Two matrices of the same order are said to be conformable for addition and
subtraction. Only conformable matrices can be added or subtracted.
mxn matrix given by: bA bai j . That is, bA is an mxn matrix obtained from A by
12 4 2 0 0
Let A= 3 0 1 and B 1 4 3 then find
2 1 2 1 3 2
a) 3A b) -B c) 3A-B
1 2 4 3(1) 3(2) 3(4) 3 6 12
Solution: a) 3A 3 3 0 1 3(2) 3(0) 3(1) 9 0 3
2 1 2 3(2) 3(1) 3(2) 6 3 6
65
3 6 12 2 0 0 1 6 12
c) 3 A B 9 0 3 1 4 3 10 4 6
6 3 6 1 3 2 7 0 4
Note:
(1) If A is any matrix and b is any scalar then bA=Ab
(2) -A represent the scalar product (-1) A.
Matrix Multiplication
It is a basic matrix operation. At first glance the following definition may seem unusual.
You will see later, however, that this definition of the product of two matrices has many
practical applications.
n
mxp matrix given by AB ci j where c ai k bk j
ij
k 1
ai 1 b1 j ai 2b2 j ai 3 b3 j ai n bn j
That is, the ij th element of AB is obtained by multiplying each element of the ith row of A
(denoted by Ai) in to the corresponding element of the jth column of B (denoted by Bj ) and
adding the products or equivalently, the ijth element of AB is the dot product of the ith row
vector of A by the jth column vector of B and hence AB ci j where ci j Ai .B j
Solution:
First note that the product AB is defined because A has order 3x2, and will take the form.
66
1 3 c11 c12
4 2 3 2 c
4 1 21 c 22
5 0
c31 c32
To find c11 (the entry in the first row and first column of the product) we take the dot
product of the first row of A and the first column of B That is,
[ ]* +
Similarly, to find c12 , we tare the dot product of the first row of A and the second column
of B and hence c1 2 A1 B 2
2
(1 3).
2
(1) (2) (3)(1) 2 3 1
Remarks
1. The product of two matrices A and B is defined if the number of columns Of A is
equal to the number of rows of B, say if A is an mxn matrix and B is an nxp matrix.
In this case, the product AB is an mxp matrix.
67
A1 B1 A1 .B 2 A1 B 3 A1 .B p
AB= A2 .B1 A2 B 2 A3 B 3 A2 B p
A B1 A B 2 A B 3 A B p
m m m m
Where A i . B j is the dot product of the ith row of A and jth column of B
3. If the product that AB is defined, then it is not necessary that BA must also be
defined. For instance, if A is of order 3x3 and B is of order 3x2, then clearly AB is
defined but BA is not defined, as the number of columns of B is not equal to the
number of rows of A.
6. (c d ) A cA dA Distributive property A
Proof:
Proof of these six properties follows directly from the definitions of matrix addition, scalar
multiplication, and the corresponding properties of scalars. For example, to prove the
commutative property of matrix addition, we let A ai j and B bi j .Then, using the
commutative property of scalars, we have A B a b b
ij ij ij ai j B A similarly,
to prove property 5, we use the distributive property for scalars of multiplication over
addition and hence c( A B) c(ai j bi j ) cai j cbi j cA cB.
68
Note: If A is an mxn matrix and o mn is the mxn matrix consisting entirely of zeros
then one can observe that A+ o mn =A. We call o mn a zero matrix or null matrix and it
serves as the additive identity for the set of all mxn matrices. For example, the matrix
0 0 0
023 serves as the additive identity for the set of all 2x3 matrices.
0 0 0
b) A ( A) Om n
c) cA Om n c o or A Om n
Remarks :
1. Property b) above can be described by saying that the matrix-A is the additive inverse
of A
2. The algebra of real numbers (scalars) and the algebra of matrices have many
similarities.
Theorem 2.9: Properties of Matrix Multiplication
Let A,B and C be matrices with orders such that the given matrix products are defined and
b is a scalar, then the following properties are true.
a ) A( BC ) ( AB )C ---------------------Associative property of multiplication
Proof: Exercise
Class Activity
1 2 3 4
1. Let A and B then solve for X in the equation 3X+A=B
0 3 2 1
69
1 7 0 1 1
1 0 5
2. Let A , B 3 1 0 and C 2 0 then show that
7 2 0 1 0 5 0 4
A(BC) =(AB)C
2 3
1 2 3 3 4 5
3. Consider A 1 1 , B andC
5 6 7
then show that
3 4 5
0 4
a) A( B C ) AB AC
b) ( B C ) A BA CA
1 3 2 1
4. Let A and B= then show that AB BA and conclude that matrix
2 1 0 2
multiplication is not commutative.
1 0 0 0 0
0 1 0 0 0
That is, I n 0 0 1 0 0
0 0 0 0 1
b) I m A A
70
c) If m=n then A I n I n A A
Note: For repeated multiplication of square matrices, we use the same exponential
notation used with real numbers.
That is, A1 A
A 2 AA and for k N
We define Ak AA A K factors
1) Row Matrix: A matrix which has only one row. It is a 1xn matrix, where n is arbitrary
natural number. It is also called a row vector
71
2
Example-2: 1 is a 3 by 1 column matrix.
5
3) Zero matrix: is a matrix of arbitrary size in which all the entries are zero. It is also
called null matrix and denoted by O.
Example-3: The following are null matrices
0 0
0 0 0 0
0 0 0 0
0 0
4) A square matrix: Is a matrix where the number of rows is equal to the number of
columns. It is an nxn matrix; where n N.
1 0 3
2 1 4 1 3
Example 4:
3 8 6 7 1 2
b) The sum of the main diagonal entries of A is called the trace of A and it is denoted by
n
Tr ( A) That is, Tr ( A) a11 a22 a33 ann aii
i 1
1 2 3 1 0 0
(a) A 0 2 4 (b) B 0 1 0
3 1 3 0 0 1
3
Solution: (a) T r ( A) aii 1 2 3 2
i 1
3
(b) Tr ( B) bii b11 b22 b33 1 1 1 3
i 1
72
5) Diagonal Matrix: Is a square matrix in which all entries that are not on the main
diagonal are zero.
Example-6: The following are Diagonal matrices
1 0 0 2 0 0
(a) 0 2 0 (b) 0 3 0
0 0 2 0 0 0
a11 0 0 0
0 a 0 0
22
(c) 0 0 a33 0
0 0 0 a nn
Note: If A (ai j )nxn is a diagonal matrix then ai j O i j while aii can be zero or
0 , if i j
ai j Where c is a scalar, then A is called scalar matrix.
c , if i j
3 0 0
Example-7: D 0 3 0
0 0 3
7) Identity Matrix: Is a diagonal matrix in which all the main diagonal elements are equal
to 1. It is also called a unit matrix.
1 0 0
1 0
Example-8: 1 , 0 1 0 are unit matrices of order 1,2 and 3 respectively
0 1 0 0 1
Note
1. The nxn identity matrix is denoted by I n
73
1 0 1
3- I n Are some short hand ways of drawing the matrix I n .
0 1 1
8) Upper triangular Matrix: Is a square matrix in which all the elements below the main
diagonal are zero. That is, if A (ai j )nxn is a square matrix such that
Example-9:
1 0 0 0
1 2 3 0 1
0 0
A 0 2 0 , B are upper triangular matrices of order 3 and 4
0 0 1 0
0 0 8
0 0 0 1
respectively.
9) Lower triangular matrix: Is a square matrix whose entries above the main diagonal
are 0.
1 0 0 0
1 0 0 0 1 0 0
Example-10: 1 0 0, are lower triangular matrices of order 3
0 0 1 0
2 3 1
0 0 0 1
and 4 respectively.
Note :
1) We often indicate that a whole region in a matrix consists of zeros by leaving it
blank or by putting in a single O.
2) We use * to indicate an arbitrary undetermined entry of a matrix.
[ ] [ ]
74
3 2 1
3
(a) A (b) B 4 1 0
4 0 0 1
3 4 0
Solution: (a) A 3 4 (b) B 2 1 0
t t
1 0 1
a11 a 21 a31 a m1
a a a32 a m 2
12 22
(c) C a13 a 23
t
a33 a m 3
a1n a 2 n a3n a mn
Theorem 2.13: Properties of Transposes
Let A and B be matrices with orders such that the given matrix operations are defined and c
is a scalar, then the following properties are true
(a) ( At ) t A Transpose of transpose
(e) ( I n ) t I n
Proof: Exercise
Definition 2.14: A matrix A is said to be symmetric if it is equal to its transpose. That is, if
At =A then A is called symmetric matrix.
75
1 1 2
Example-12: Let A 1 0 3 then since A=At we have A is symmetric
2 3 7
2 0
Example-13: Let A , then A A and hence A is skew symmetric.
t
2 0
1 2 3 4 5 6 1 2 1
Consider the matrices A 4 5 6 and B 1 2 3, C 2 1 2,
0 0 1 0 0 1 3 1 1
2 4 2 1 2 1 1 2 1
D 2 1 2 E 3 1 1 and F 0 7 4
3 1 1 2 1 2 2 1 2
Observe that:
I) B is obtained from A by interchanging the first and second rows of A
II) D is obtained from C by multiplying the first row of C by -2
III) F is obtained from matrix E by replacing the second row of E by a new row
This is made up by multiplying the first row by -3 and adding to the second row of E.
Such operations on rows of a matrix as described above in (i),(ii) and (iii) are called
elementary row operations. These operations will be very useful in finding inverse of a
matrix and solving systems of linear equations.
Similarly, elementary column operations can be defined. Hence we have the following
definition
76
Type II: The multiplication of any row or (column) by a non zero number
Type III: The addition of a multiple of one row (or column) to another row (or column)
Notations: We shall use the following notations for the three types of elementary
operations.
1. The interchange of ith and jth rows (columns) is denoted by Ri R j (Ci C j )
2. The multiplication of the ith row (column) by a none zero number k is denoted by
Ri kR i (Ci kCi )
3. The addition of k times the jth row (column) to ith row (column) is denoted by
Ri Ri KR j (C Ci kC j )
Note:
a) Ri kRi means replace the ith row by k times the ith row
b) Ri Ri kR j Means replace the ith row by the sum of itself and k times the jth
row.
c) Although elementary operations are simple to perform, they involve a lot of
arithmetic Because it is easy to make a mistake, we suggest that you get in the
habit of noting the elementary operation performed in each step so that you can
go back to check your work
Example-1
77
1 2 0 1 3 3
Let A 3 1 2 , B 1 4 3 then show that
2 3 2 1 3 4
1 2 0
(a) A 0 7 2
row row
(b) B I 3
0 0 0
Solution:
(a) 1 2 0 1 2 0
A 3 1 2
R2 3 R R
1
2 0 7 2
3 3 2 2 3 2
1 2 0 1 2 0
0
R3 2 R1 R3
7 2 0 7 2
R3 R2 R3
1 3 3 1 3 3 1 3 3
(b) B 1 4 3 0 1 0 0 1 0
R2 R2 R3 R3 R3 R1
1 0 3 1 0 0
0 1 0 0 1 0
R1 R1 3 R3
R1 R1 3 R2
Elementary Matrices
Definition 2.18: Definition of Elementary Matrix
Let A be an nxn matrix such that A can be Obtained from In by a single elementary
row /column operation then A is called an elementary matrix.
78
1 0 0 0
1 0 0 0 1 0 0
(a) A I n (b) B 0 4 0 (c ) C
0 0 1 0
0 0 1
0 0 0 0
1 0 0 1 0 0
1 0 0 1 0
(d) D (e) E 0 0 1 ( f ) F 0 2 0 ( g ) G
0 1 0 0 1 0 0 0 1 4 1
Solution:
(a) Since A I n can be obtained from itself by multiplying any one of its rows by 1.
1 1R1
(i.e. I n R A) we' ve I n is elementary matrix.
elementary matrix. Note that, row multiplication must be anon zero constant.
(d) D is not elementary because it is not square marix.
2 R3
(e) Since I 3 R E we've E is elementary.
(f) This matrix is not elementary because two elementary row operations are required
to obtain it from I 3 .
2 4 R1 R2
(g) Since I 2 R G we have G is elementary.
Note Elementary Matrices are useful because they enable us to use matrix
multiplication to perform elementary row/ column operations.
Example-3: Elementary Matrices and Elementary Row Operations
0 1 0 1 0 0 1 0 0
Let E1 1 0 0, E 2 0 1 2 0 and E3 2 1 0
0 0 1 0 0 1 0 0 1
0 2 1 1 0 4 1 1 0 1
Let A1 1 3 6, A2 0 2 6 4 and A3 2 2 3
3 2 1 0 1 3 1 0 4 5
Then
79
1 3 6
(a) E1 A1 0 2 1 , this product matrix is a matrix obtained by interchanging the first
3 2 1
two rows of A. And E1 is an elementary matrix in which the first two rows of I 3 has been
interchanged.
1 0 4 1
(b) E2 A2 0 1 3 2
0 1 3 1
Here, E2 is an elementary matrix in which the second row of I 3 has been multiplied by 1
2
1 R
and A 2 2 E 2 A2
2 R
1 0 1 2 2 R1 R2
I 3 R E3 and
(c) E3 A3 0 2 1 Here,
2 2 R1 R2
A3 R E3 A3
0 4 5
R R cR
(c) I n
i
i
j E then put E 1 E2 where E2 is an elementary matrix
R R cR
such that I n
i
i
j E2
80
And to check that E E 1 I n E 1 E, apply theorem 3.10
Example-5:
0 1 0 1 0 0 1 0 0
Let E1 1 0 0 ,
E 2 0 1 0 and E3 0 1 0 then
0 0 1 2 0 1 0 0 12
1 0 0 1 0 0
E1
1
E1 , E 1
2 0 1 0 and E 1
3 0 1 0
2 0 1 0 0 2
1. Row-Echelon Matrix
1 2 3 4 1 2 1 2
(c) 0 2 1 1 (d ) 0 0 0 0
0 0 1 3 0 1 1 4
81
Solution: The matrices shown in parts (a) and (b) are in row-echelon form, while the
matrices shown in parts (c) and (d) are not in row-echelon form.
Note: One can show that every matrix is row equivalent to a matrix in row-echelon
form.
Note that: It can be shown that the reduced row echelon matrix obtained from a
given matrix A by elementary operations is unique. That is, that it does not depend on
the particular sequence of operations used.
Theorem 2.23: Let A be an mxn matrix, then A is row equivalent to an mxn reduced
row-echelon form of a matrix.
Proof: Exercise
0 2 1 4
Example- 3 Let A 3 2 0 2 then find the unique reduced row-echelon matrix that
3 3 3 4
3 2 0 2 1 2 3 0 2 3
0 2 1 4
1 R2
R1 3 R1
1
A R 0 2 1 4
3 3 3 4 3 3 3 4
82
1 2 3 0 2 3 1 2 3 0 2 3
0 2 1 4 0 1 1 2 2
3 3 R1 R3
1
R
R2 R
2 2
1 0 0 2 3
0 1 0 2
R1 13 R3 R1
0 0 1 0
R2 12 R3 R2
1 0 0 2 3
Thus the reduced row-echelon matrix which is equivalent to A is 0 1 0 2 .
0 0 1 0
Example-14:
Find the inverse of the following matrices
1 1 2 1
(a) A (b) B
0 1 5 3
Solution:
(a) To find the inverse of A. We try to solve the matrix equation
AX I for X But , AX I
1 1 x11 x12 1 0
x x
0 1 21 22 0 1
83
x x 21 x12 x 22 1 0
11
x 21 x 22 0 1
Now, by equation the corresponding entries, we obtain the following two systems of linear
equations.
x11 x 21 1 and x12 x22 0
x 21 0 x22 1
x x 1 1
x 11 12
x 21 x 22 0 1
1 1
Therefore, the inverse of A is A 1 x
0 1
b11 b12
(b) Suppose B 1 1
be the inverse of B then B B I 2 B B
1
But,
22 22
b b
B B 1 I 2
2 1 b11 b12 1 0
b b 0 1
5 3 21 22
b b 3 1
B 1 11 12 =
b21 b22 5 2
Infact, one can check that B B 1 I 2 B 1 B
Remarks
84
1. Non square matrices do not have inverses. To see this, note that if A is of order mxn
and B is of order nxm (where m n) , then the products AB and BA are of different
orders and therefore could not be equal to each other.
1 2
2. Not all square matrices possess inverse. For example, the matrix doesn't have
1 2
inverse
3. We will see later that square matrix A is invertible if there is a square matrix B such
that either one of the two relations AB I or BA I holds, or then B is the inverse
of A.
Theorem 2.25: Uniqueness of an Inverses Matrix
If A is an invertible matrix, then its inverse is unique
Proof:
Since A is invertible, we know that it has at least one inverse B such that AB I BA.
Suppose that A has another inverse C such that AC I CA, then
B I B (CA) B
C ( AB ); Why ?
CI j why ?
=C
B C And it follows that the inverse of a matrix is unique
(b) ( A k ) 1 A 1 A 1 A 1 K factors
(c) (cA) 1 c 1 A 1 ; c 0
(d) ( At ) 1 ( A 1 ) t
Proof:
(a) Suppose A 1 is the inverse of A
A 1 A I A A 1
85
the inverse of A 1 is A ; by def int ion of an inverse
( A 1 ) 1 A
Note: For nonsingular matrices, the exponential notion used for repeated
multiplication of square matrices can be extended to include exponents that are
negative integers. This may be done by defying A k to be
A k A 1 A 1 A 1 k factors
With this convention we can show that the following properties.
(a) A j A k A j k
(b) ( A j ) k A j k For any integers j and K.
A( I ) A 1
( AI ) A 1
AA 1
I
And in a similar way we can show that
( B 1 A 1 ) ( AB) I we have ( AB) CB 1 A 1 ) I ( B 1 A 1 ) and hence we get B 1 A 1
is the inverse of AB.
86
B 1 A 1 ( AB ) 1 and AB is invertibe.
Remarks
1. The above Theorem says that the inverse of a product of two invertible matrices is
the product of their inverses taken in the reverse order. This can be generalized to
include the product of several invertible matrices.
( A1 A2 An ) 1 An1 An11 A31 A21 A11
Proof:
(a) Suppose AC BC , then sin ce C is invertible we have ( AC ) C 1 ( BC ) C 1
A(C C 1 ) B(C C 1 )
AIBI
AB
(b) Similar to (a)
Note If C is not invertible then cancellation is not usually valid and hence theorem
2.28 can be applied only if C is an invertible Matrix.
Recall that an elementary matrix is nonsingular as the inverse of the elementary operation
Ri R j , Ri CRi and Ri CR j Ri are Ri R j , Ri 1c Ri and
Ri Ri CR j respectively.
87
A is invertible or non singular.
row
() Suppose A is an invertible then by theorem 3.13 we have A B ; where B is a
reduced row-echelon matrix.
Claim B I n
guarantes that B I n .
It has at lest i 1 zeros before the first non-zero entry then one of the rows after the i th row
must be a row of zeros, Hence, in both cases, we have at least one row of zeros in B. But B
is invertible and hence B 1 exists. Thus BB 1 I n has at least one row of zeros; which is
contradiction. Therefore, B I n .
Proof: Exercise
88
2. If possible, row reduce A to I using elementary row operations on the entire
matrix A / I . The result will be the matrix I / A 1 . If this is not possible, then
A is not invertible
3. Check your work by multiplying to see that A A 1 I A 1 A .
Example-5: Finding the Inverse of a matrix
1 1 0
A 1 0 3 then find its inverse.
6 2 3
Solution
We begin by adjoining the identity matrix to A to form the matrix
2 3 1
Therefore, A is invertible and its inverse is A 3 3 1
1
2 4 1
Class Activity
1. Find the inverse of the following matrix by using Gauss-Jordan elimination
1 1 2 1
2 2 4 2 3 4 1
3 1
A 2 3 2 (b) (c )
2 2 3 3 3 1
1 1 1
1 2 3 1
2. Show that the following matrices have no inverses
89
1 2 0
3 1
(a) 3 1 2 b)
6 2
2 3 2
Note: - Using Gauss-Jordan elimination to find the inverse of a matrix works well
(even as a computer technique) for matrices of order 3x3 or greater. For 2x2
matrices, however, many people prefer to use a formula for the inverse, rather than find
the inverse by Gauss-Jordan elimination. The formula is given by:
d b a b
A1 1
ab bc c a where A c d with ad bc 0.
Thus A is invertible if and only if ad bc 0. . Why?
Definition 2.30: To every square matrix A with elements from the set of real numbers
there is assigned a specific real number called the determinant of A. It is usually denoted
by de t ( A) or A .
Remarks:
90
(i). Observe that determinant is a function from the set of all square matrices to the set
of real numbers, and it is defined only for square matrix. In fact, if A (ai j ) is a
square matrix over a field F ai j F then A F . In particular for F The set
of complex numbers then det A . From now on wards unless we have stated we
consider matrices over R.
(ii).The determinant of an nxn matrix is called determinant of order n. Before we go to
the definition of determinant of order n(n N ) we shall consider determinants of
order 1, 2 and 3.
You recall that a single number a is considered as a 1x1 matrix. Hence if A=
a11 thendet er min ant of A is defined to be the number itself. That is, det( A) a11 a11 .
a b
Definition 2.31: The determinant of the 2x2 matrix A is defined to be the
c d
a b
number ad-bc. That is, det A= ad bc
c d
A convenient method for remembering the formula is to note that ad bc is just the
difference of the products of diagonally opposite entries.
a b
That is, A ad bc
c d
Example-1: Find the determinants of the following matrices
2 3 3 1 1 2
(a) A (b) B (c) C
1 2 6 2 6 4
Solution:
2 3
(a) A (2) (2) (3) (1) 4 3 7
1 2
3 1
(b) B (3) (2) (6) (1) 6 6 0
6 2
91
1 2
(c) C (1) (4) (6) (2) 4 12 8
6 4
Remark: Note that the determinant of a matrix A can be positive, zero, or negative.
Definition 2.32 : Definition of Minors and cofactors of a matrix
Let A ai j be a square matrix then
1-The determinant of the matrix obtained by deleting or removing the i th row and j th
column of A is called the minor of the element ai j and it is denoted by M i j .
That is, M i j Ai j ; where Ai j is a matrix in which the i th row and j th column of A are
removed.
2- The product of the minor of the element ai j and the number (1) i j is called the
21 0
Example-2: Find all the minors and cofactors of A 1 1 4
3 2 5
1 4
Minor of a12 M 12 A12 (1( (5) (4) (3) 17,
3 5
1 1
Minor of a13 M 13 A13 (1( (5) (1) (2) (1) (3) 5,
3 2
1 0
Minor of a21 M 21 A21 (1) (5) 0.2 5,
2 5
2 0
Minor of a22 M 22 A22 (2) (5) 0(3) 10,
3 5
2 1
Minor of a23 M 23 A23 (2) (2) (1) (3) 7,
3 2
92
1 0
Minor of a31 M 31 A31 (1) (4) (0) (1) 4,
1 4
2 0
Minor of a32 M 32 A32 (2) (4) (0) (1) 8,
1 4
2 1
Minor of a33 M 33 A33 (2) (1) (1) (1) 1,
1 1
Remark
(i). The minors and cofactors of a matrix differ at most in sign. To obtain the cofactors of a
matrix, first find the minors and then apply the following checker board pattern of +‘s
and –‗s.
Sign Pattern for Cofactors
3x3 marix 4 x 4 matrix nxn matrix
Note that odd positions (where i+j is odd) have negative signs, and even positions
(where i+j is even) have positive signs.
ii) If A is an nxn matrix, then the order of Ai j is (n 1) x(n 1) )
93
= a11 (1)11 M 11 a12 (1)12 M 12 a13 (1)13 M 13
a 22 a 23 a 21 a 23 a 21 a 22
= a11 a12 a13
a32 a33 a31 a33 a31 a32
= a11 a 22 a33 a12 a 23 a31 a13 a 21 a32 a11 a 23 a32 a12 a 21 a33 a13 a 22 a31
Thus, A a11 C11 a12 C12 a13 C13 , First row exp ansion
2 1 0 1
(b) Since C21 2, C22 4,
0 1 4 1
0 2
And C23 (8) 8 we have
4 0
94
A a11 C11 a 21C 21 a31 C31 , First column exp ansion
When this definition is used to evaluate a determinant, we say that we are expanding by
cofactors.
n
A ai j ci j a1 j c1 j a2 j c2 j anj cnj ( j th column exp ansion)
i 1
Proof: Exercise
Note: when expanding by cofactors we do not need to evaluate the cofactors of zero
entries, because a zero entry times its cofactor is zero. That is, ai j ci j (0) ci j 0 .
Thus the row (or column) containing the most zeros is usually the best choice for
expansion by cofactors.
1 2 3 0
1 1 0 2
Example-4 Find the determinant of A
0 2 0 3
3 4 0 2
95
Solution:
Inspecting this matrix, we see that in third column of A we have more number of zeros than
the other rows and columns. Thus we can eliminate some of the work in the expansion by
using the third column.
Therefore,
A a13 C13 a 23 C 23 a33 C33 a 43 C 43
= 3 C 13
But
1 1 2 1 1 2
13
C 13 (1) 0 2 3 0 2 3
3 4 2 3 4 2
Assuming that the theorem is true for any upper triangular matrix of order k-1, we now
consider an upper triangular matrix A of order k. Expanding by the k th row, we obtain
A o ck1 o ck 2 o ckk 1 akk ckk
= a kk c kk
96
akk (1) k k M kk akk (1) 2 k M kk
a kk M kk
= a kk Akk
= a11 a 22 a33 a kk
This completes the proof
1 0 0 0 0
0 3 0 0 0
Example-5 (a) Let A 0
0 2 0 0 and B I n
0 0 0 4 0
0 0 0 0 2
Then since a diagonal matrix is both upper and lower triangular we have
A (1) (3) (2) (4) (2) 48 and B (1) (1) (1) 1 I n
Properties of Determinant
It should be clear that calculating the determinant of a large matrix from the definition can
be quite long and cumbersome. For instance, the determinant of a 6x6 matrix involves six
5x5 cofactors. Each of these in turn involves five 4x4 cofactors each of which involves
four 3x3 cofactors for a total of 120x3x3 cofactors to be calculated. Consequently other
methods are often used to compute determinants. These methods depend on the properties
of determinants which h are stated as follows.
97
(c) If B is obtained from A by multiplying a row (or a column) of A by a non zero c
constant c, then B c A .
Proof: Exercise
Example-1: Show that the determinant of an elementary matrix E of
(i) First kind (add a multiple of one row to another) is 1
(ii) Second kind (row interchange) is -1
(iii) Third kind (multiply a row by anon zero constant c) is c
Solution: We know that I n 1. Thus, we have
=1
(ii) (i) E I n , by property1 (a)
.
(iii) E c I n , by property1 (c)
=c.1=c
Example-2: Evaluating a determinant using elementary Row operations.
2 3 10
Let A 1 2 2 then find det er min ant of A .
0 1 3
Solution:
98
2 3 10 1 2 2
A 1 2 2 2 3 10 , by R1 R 2
0 1 3 0 1 3
1 2 2
0 7 14 , by R2 2 R 1 R2
0 1 3
Note: In the above solution we have changed the given matrix in to triangular
matrix.
determinant)
3 1 2
Example-3: Show that A At for the matrix A 2 0 0
4 1 5
3 2 4
A 1 0 1 We expand by cofactors down the second
t
To find the determinant of
2 0 5
3 2 4
1 1
column to get A 1 0 1 2
t
23 6
2 5
2 0 5
Thus, A 6 At A At
Notation
Let A is a square matrix of order n with columns A1, A2, A3… Aj…and An
And rows A1, A2, A3…A j…and An then we can also denote det (A) by
det (A1, A2, A3,…Aj,…, An) or det (A1, A2, A3…A j…, An)
99
Remarks - property-1 can be expressed as
(1) a. det (A1, A2, A3,…cAj,…, An) = c det (A1, A2, A3,…Aj,…, An)
b. det (A1, A2, A3…cA j…, An) = c det (A1, A2, A3…A j…, An) where c is a non zero
constant
(2) a. det (A1, A2, A3,…Ai,…, Aj,…, An) = - det (A1, A2, A3,…Aj,…Ai…, An)
b. det (A1, A2, A3…Ai,…,A j…, An) = - det (A1, A2, A3…A j,…,Ai,…, An)
(3) a. det (A1, A2, A3,…,cAj+ Ai,…, Aj,…, An)
= det (A1, A2, A3,…, Ai,…, Aj,…, An)
b. det (A1, A2, A3… cAj+ Ai,…,A j…, An) = det (A1, A2, A3… ,Ai,…,A j…, An) where
c is constant number.
Let A is a square matrix of order n and any of the following conditions is true, then
A0
100
(a) Without loss of generality suppose that the first row of A is a row of zeros then
det (A)=det (0, A2, A3…A j…, An) = det (0.0, A2, A3…A j…, An)
=0 det (0, A2, A3…A j…, An) = 0
Hence,the proof.
(b) Without loss of generality assume that A1=A2 then
det (A)=det (A1, A2, A3…A j…, An) = det (A1-A2, A2, A3…A j…, An)
= det (0, A2, A3…A j…, An), as A1=A2
= 0,by (a)
(c) Without loss of generality assume that A1= cA2 then
det (A)=det (A1, A2, A3…A j…, An) = det (cA2, A2, A3…A j…, An)
= c det (A2, A2, A3…A j…, An) = c.0=0, by (b)
2 4 5 1 2 4 1 2 3
Example- 4: Let A 0 0 0 B 0 1 2 and C 2 1 6 then
3 5 2 1 2 4 2 0 6
Proof: cA detcA det cA1 , cA 2 , cA 3 ,..., cA n
c det A1 , cA 2 , cA 3 ,..., cA n
c 2 det A1 , A 2 , cA 3 ,..., cA n
c det A , A , A ,..., cA
3 1 2 3 n
c n det A1 , A 2 , A 3 ,..., A n
c n det A c n A
101
Example- 5: If A is a matrix of order 3 and At 5 then find determinant of 10A
Theorem 2.38: Let E be an elementary matrix and A be an arbitrary square matrix then
EA E A
EA C A , by property (1)
= E A , since E = C
III. By adding a multiple of one row of I to another row of I, then E =1 and hence
EA A , by property ( 1 )
=1 A
= E A , since E =1
A1 A 2 Ak A1 A2 Ak .
102
Proof: Exercise
A A1 1
A 1 1
A
0 2 1 1 0 3
(a) A 3 2 1 (b) A 3 2 1
3 2 1 3 2 1
Solution:
(a) Because A 0 we conclude that A has no inverse
a 21 x1 a 22 x 2 a 23 x3 a 2 n x n b2
103
a m1 x1 a m 2 x 2 a m3 x3 a mn x n bm
Remark:
The double subscript notation indicates that ai j is the coefficient of x j in the i th
equation.
a 21 x1 a 22 x 2 a 2 n x n b2
a m1 x1 a m 2 x2 a mn xn bm
104
b1
b
B 2,
bm
And AX is the matrix product
Here, in matrix notation AX B, we have the following definitions.
(1) A ai j mxn is called the coefficient matrix of the system.
* +[ ] * +
(ii) A 0 1 2
3 4 6
is the coefficient matrix of the given system and
105
0 1 2 2
(iii) A B
3 4 6 1
is the associated augmented matrix of the system.
Proof : Exercise
The general procedure for using Gaussian elimination with back substitution to solve a
system of linear equations is summarized as follows:
Step-1 Write the augmented matrix of the system
Step- 2 Use elementary row operations to rewrite the augmented matrix in
Row-echelon form
Step- 3 Write the system of linear equations corresponding to the matrix in
Row-echelon form, and use back-substitution to find the solution.
Example-2
Use Gauss Ian elimination with back-substitution to solve the following system.
x 2 x3 2 x 4 3
106
x1 2 x 2 x3 2
2 x1 4 x 2 x3 3x 4 2
x1 4 x 2 7 x3 x 4 19
Solution:
Step-1: The augment matrix for this system is M A B where,
0 1 1 2 3
1 2 1 0 2
M A B =
2 4 1 3 2
1 4 7 1 19
Step-2: using the elementary operations
R1 R2 , R3 2 R1 R3 , R4 R1 R4 , R4 6 R2 R4 , R3 13 R3 and R4 13
1
R4
respectively we get the row-echelon form of the matrix M,
With Gaussian elimination, we apply elementary row operations to a matrix to obtain a row
equivalent row-echelon form. A second method of elimination called Gauss Jordan
elimination after Carl Gauss and Wilhelm Jordan (1842-1899), Continues the reduction
process until a reduced row-echelon form is obtained.
107
Example-3: use Gauss-Jordan elimination method to solve the system
x 2 y 3z 9
x 3 y 4
2 x 5 y 5 z 17
Solution
Step-1: The associated augmented matrix of the system is
Step-2: You can show that the reduced row-echelon form of the matrix M is given by
Note:
(1) If in the elimination process, you obtain a row with zeros except for the last entry, it
is unnecessary to continue the elimination process. You can simply conclude that
the system is inconsistent.
(2) If in the row-echelon form system the number of equations and the number of
unknowns are equal then the system will have a unique solution.
(3) Observe that if in the row-echelon form system the number of equations is less than
the number of unknown, then the system will have infinitely many solutions.
108
Cramer’s Rule, named after Gabriel Cramer (17 04- 17 52), is a formula that uses determinants to
solve a system of n linear equations in n-variables. This rule can be applied only to systems of
linear equations that have unique solutions.
Proof: Exercise
Example: Use Cramer‘s Rule to solve the following system of linear equations
for x, y &z
1 2 3 x
Solution: Let A 2 0 1 , x y and [ ], then the system can be represented
3 4 4 z
by . And since
| | | |
1 2 3 1 1 3 1 2 1
0 0 1 2 0 1 2 0 0
2 4 4 3 2 4 3 4 2
x , y , and z
10 10 10
x 4
5 , y 3
2 and z 8
5
109
Exercises:
Theorem 2.47: If A is an invertible matrix, then for each matrix b, the system
of equations has exactly one solution, namely,
Example: solve the following system of linear equation using inverse method
110
Definition 2.48: Eigenvalue and Eigenvector
Let A be an nxn matrix over a field F, then the scalar is called an Eigenvalue of A if
there is a non zero vector v F n such that The vector is called an Eigen vector
of A corresponding to .
Remarks:
1. The term Eigen value is derived from the German word eigenwerte, meaning
"proper value"
2. An eigenvector v cannot be zero. Allowing v to be the zero vector would
render the definition meaningless because A0 0 is true for all values of .
An Eigen value of 0, however, is possible.
3. A matrix can have more than one Eigen value.
Example-1:
2 0
Let A then verify that v1 (1, 0) and v2 (0, 1) are eigenvectors of A
0 1
corresponding to the Eigen values 1 2 and 2 1 respectively.
Solution:
111
2 0 1 2 1
Since Av1 0 0 20 2v1 , we have v1 (1, 0) is an eigenvector of A
0 1
corresponding to the Eigen values 1 2.
2 0 1 0 0
Also, since Av2 0 1 11 1v2
0 1
We have v2 (0, 1) is an eigenvector of A corresponding to the Eigen value 2 1.
Note
1. The sum of two Eigenvectors with the same Eigen value is also an
Eigenvector with Eigen value .
2. A nonzero multiple of an Eigenvector with Eigen value is also an
1 0 0 0 0
ii. Let v2 (0, y) then Av 2 y 1 y 1 v2
0 1 y
The eigenvectors on the corresponding to 2 1 are the non zero vectors on
the y-axis and hence
A1 (0, y) : y 0 is eigenspace of A belonging to eigenvalue 2 1.
1 0 x x
Av y y , which is a reflection in the y-axis.
0 1
112
Thus, the eigenspace corresponding to 1 1 is the x-axis and the eigenspace
Proof:
Let x F n be an eigen vector of A with eigen value then
Ax x
Ax I n x, sin ce I n x x x F n
( A I n )x 0
A I n is sin gular, why ?
A I n det ( A I n ) 0
This completes the proof.
Note:
1. The equation A I n 0 is called the characteristic equation of A. And the
113
1 6 2 12
(a) A (b) B
5 2 1 5
Solution:
(a) The characteristic equation of A is
A I2 0
1 6 1 0
5 0
2 0 1
1 6
0
5 2
2 3 28 0
( 7 ) ( 4) 0
7 or 4.
The corresponding eigenvectors are now found
Let 7 then ( A 7 I 2 ) x 0
1 6 1 0 x 0
7
5 2 0 1 y 0
6 6 x 0 6 x 6 y 0
0
5 5 y 5 x 5 y 0
yx
Hence, any vector of type (1, 1), where is a non zero scalar, is an eigenvector
corresponding to eigenvalue 7.
Let 4 then ( A 4 I 2 ) x 0
1 6 1 0 x 0
4
5 2 0 1 y 0
5 6 x 0 5 x 6 y 0
5 6 y 0 5 x 6 y 0
y 5 6 x
Hence, any vector of type (1, 5 6 ) ; where is a non zero scalar, is an eigenvector
corresponding to eigenvalue 4
Try checking that Ax i x for the eigenvalues and eigenvectors found in this example.
114
(b) Exercise
Remark:
A matrix is a rectangular array of numbers. The numbers in the array are called the
entries in the matrix.
Two matrices are defined to be equal if they have the same size and their
same size, then A =B if and only if , or, equivalently, aij bij for all i and
j.
If A and B are matrices of the same size, then the sum A B is the matrix obtained by
adding the entries of B to the corresponding entries of A, and the difference A B is
the matrix obtained by subtracting the entries of B from the corresponding entries of A.
Matrices of different sizes cannot be added or subtracted.
If A is any matrix and c is any scalar, then the product cA is the matrix obtained by
multiplying each entry of the matrix A by c. The matrix cA is said to be a scalar
multiple of A.
115
If A is an m r matrix and B is an r n matrix, then the product AB is the m n
matrix whose entries are determined as follows. To find the entry in row i and column j
of AB , single out row i from the matrix A and column j from the matrix B. Multiply
the corresponding entries from the row and column together, and then add up the
resulting products.
If A is any m n matrix, then the transpose of A, denoted by A t , is defined to be the
n m matrix that results from interchanging the rows and columns of A; that is, the first
column of A t is the first row of A, the second column of A t is the second row of A, and
so forth.
If A is a square matrix, then the trace of A, denoted by tr(A) , is defined to be the sum
of the entries on the main diagonal of A. The trace of A is undefined if A is not a square
matrix.
If R is the reduced row-echelon form of an n n matrix A, then either R has a row of
zeros or R is the identity matrix I n .
Miscellaneous Exercises
1 1 1 1
1 5 20 2 1 2
(d ) 0 4 8
3
( e)
0 1 2 1
0 0 6
0 0 7 3
2. Let A and B be 3x3 matrices such that
det( A) 4 and det B 5 then find the value of :
(a) AB (c) 2 AB
(b) 3 A (d ) A 1 B
116
1 3 1 1 1 1
1 2
(a) A (b) 2 1 1 (c) A 0 1 1
3 1
2 2 1 0 0 1
4. Use Cramer’s Rule to solve for each of the following systems
(a) x 2 y 3z 8
2x y z 7
y z 1
a 1 1 1 1
1 a 1 1 1
5. Evaluate 1 1 a 1 1
1 1 1 a 1
1 1 1 1 a
2 2 4
6. a) Find the ad joint of
2 3 2
1 1 1
b) If A is a triangular matrix, then show that adj ( A) is also triangular
a b 0 0
c d 0 0
7. (a) Let A , then find rank ( A)
e f g h
x y z w
(b) Let A be a square matrix of order n, then show that A is invertible if rank (A) = n
i j k
AxB a1 a2 a3 where i (1, 0, 0)
b1 b2 b3
j (0, 1 0)
k (0, 0, 1) are the s tan dard unit vectors in n
117
1 0 0 0 a b c d
0 1 0 0 e f g h
9. Let A = B
a b c d 0 0 1 0
e f g h 0 0 0 1
Then prove that
c d a b d
a) det A b) detB
g h e f h
10. Let A and B square matrices of order n then prove or disprove the following
a) det ( A B ) det ( A) det(B)
det ( A B)
b) det ( A B) 2
2
References
1. James Stewart, Calculus early trancedental, 6th ed., Prentice Hall, 2008
2. Howard Anton,Calculus a new horizon, 6th ed., John wiley and Sons Inc
3. Robert Ellis and Denny Gulick, Calculus with analytic geometry, 6th ed,
Harcourt Brace Jovanovich, Publishers, 5th ed, 1993
4. Adams, Calculus: A complete course, 5th ed, Addison Wesley, 2003
5. R. Wrede and M. R. Spiegel, Theory of advanced calculus, 2nd ed.,
McGraw-Hill, 2002.
6. A. E. Taylor and W. R. Mann, Advanced calculus, 3rd ed, John-Wiley and
Son, INC, 1995.
7. R. T. Smith and R. B. Minton, Calculus concepts and connections, McGram-
Hill book company, 2006
8. D. V. Widder, Advanced calculus, Prentice-Hall, 1979
9. Ross L. Finney et al, Calculus, Addison Wesley, 1995
10. E. J. Purcell and D. Varberg, Calculus with analytic geometry, Prentice-Hall
INC., 1987
11. Lynne, Garner: Calculus and Analytic Geometry, Dellen Publishing
Company
118
12. John A. Tierney: Calculus and Analytic, 4th edition, Allyn and Bacon, Inc. –
Boston.
13. Earl W. Swokowski: Calculus with Analytic Geometry, 2nd edition, Prindle,
Weber and Schmidt.
14. Goldstein, L. J., Lay D. C. , and Schneider D. I (1987). Calculus and its
applications, 4th, edn, Prentice-Hall, Inc, London
15. Kresying, Advanced Engineering Mathematics.
16. Alan Jeffry , Advanced Engineering Mathematics
CHAPTER 3
LIMITS AND CONTINUITY
Objectives:
By the end of this chapter, students will be able to:
Understand the formal definition of limit and continuity;
State some limit theorems;
Evaluate limits of functions;
Determine points of discontinuity of functions;
Apply Intermediate Value Theorem;
Introduction
The topic that we will be examining in this chapter is that of Limits. Limits are very
important in the study of calculus. We will be see limits in a variety of places once we
move out of this chapter. In particular we will see that limits are part of the formal
119
definition of the other two major topics. Here is a quick listing of the material that will be
covered in this chapter.
Objectives
At the end of this section, students should able to:
Define informal definition of limit;
Define formal definition of limit;
Use definition to show that the limit of at is L;
Find a non-existence of limit.
Definition 3.1: Let be defined on an open interval about except possibly at itself.
If gets arbitrarily close to (as close to L as we like) for all sufficiently close to
we say that approaches the limit as approaches and we write
Solution: The given formula defines ƒ for all real numbers except (since we
cannot divide by zero). For any x we can simplify the formula by factoring the
numerator and canceling common factors:
The graph of ƒ is thus the line with the point (1, 2) removed. This removed point is shown
as a ―hole‖ in Figure 3.1. Even though is not defined, it is clear that we can make the
120
value of ƒ as close as we want to 2 by choosing close enough to 1 and we say that
approaches the limit 2 as approaches 1 and we write,
The graph of is identical with the line except at where is not defined.
Example 2: The identity and constant functions have limits at every point. If is the
identity function , then for any value of (Figure 3. 2).
If is the constant function (function with the constant value k), then for any
value of (Figure 3.3)
121
Now that we have gained some insight into the limit concept, working intuitively with the
informal definition, we turn our attention to its precise definition. We replace vague
phrases like ―gets arbitrarily close to‖ in the informal definition with specific conditions
that can be applied to any particular example. With a precise definition we will be able to
prove conclusively the limit properties given in the preceding section, and we can establish
other particular limits important to the study of calculus.
To show that the limit of ) as equals the number L, we need to show that the gap
between ) can be made ―as small as we choose‖ if x is kept ―close enough‖ to
arbitrary number.
Definition 3.2 (Formal Definition of Limits)
Let be a function defined at each point of some open interval containing possibl at
itself,then umber L is the limit of as approaches (or is the limit of f at ) if for
every number there is a number such that
| | | | .
If is the limit of ) as approaches then we write
If such an can be found we say that the limit of at exists or that has a limit at or that
exists.
122
Let us look at the following graph and let us also assume that the limit does exist
What the definition is telling us is for any number that we pick we can go to our
graph and sketch two horizontal lines at as shown on the graph above.
Then somewhere out here in the world is another number 0. Which we will need to
determine that will allow us to add in two vertical lines to our graph at .
Now, if we take any in the vertical strip, between , then this will be
close to than either of and or | | . If we now identify the point on
the graph that our choice of gives, then this point on the graph will lie in the intersection
of the horizontal and vertical strip. This means that, this functional value of will be
close to than either of and or | | . So, if we take value of in the
horizontal strip then the graph for those values of will lie in the vertical strip.
Notice that there are actually an infinite number of ‘s that we can choose. In fact, if we go
back and look at the graph above it looks like we could have taken slightly large and still
gotten the graph from that horizontal strip to be completely contained in the vertical strip.
Example 1: Use the definition of limit to prove the following limit
Solution: We need to show that given then there exists such that
| | | |
To choose an appropriate we start with | | then we have
|5x- | | |
Now we can Choice
123
| | | | =5| 2|
| |
Therefore,
if we restrict to lie in some interval centered at 3. Infact, since we are interested only
in values of that are close to 3, it is reasonable to assume that is within a distance 1
from 3, that is , | | then so Thus we have | |
, and so is a suitable choice for the constant But now there are two restrictions on
| | namely
| | and | |
To make sure that both of these inequalities are satisfied , we take to be the small of the
124
II. Showing that this works .
Given , let , - . If | |
| | | || |
Solution: Referring to the quarter of the unit circle in figure 3.5, along with the
accompanying line segments, we see that | | . So that
| | | | for .
| | | | …………………… (1)
Now let , let be any positive number less than both and . From (1), it follows
that if | | , then | | | | .
We conclude that
To show that , observe form figure 3.5 that
| | | | | | | | For . Since , it follows that
Now let , let be any positive number less than both and . From (2) we find,
125
Next, we present an example of a function that does not have a limit at a certain point. For
a function f not to have a limit at a means that for every real number L, the statement ― is
the limit of at ‖ is false. What does it mean for that statement to be false?
Definition 3.3 (Negation of the Existence of a Limit)
― is the limit of at ‖ means that for every there is a number such that
if | | then | |
For this statement to be false there must be some such that for every it is false
that
if | | then | |
But to say that ( ) is false is the same, as to say that there must be a number such that
| | then | |
Thus to say that the statement is false is the same as to that there is some
such that for every there is a number satisfying
| | then | |
Example 5: Let be defined by
has no limit at 0.
Solution: Let be any number. We will prove that the statement ― is the limit of at ‖
is false by letting and showing that for any there is an satisfying
| | and | |
126
Let be any positive number. If , then we let and note that so
that
| | | | | |
| | | | | || | | |
Exercise
d). h)
| |
i) * |√ √ | +
√ √
127
Even if we have developed important techniques of solving limit problems by using the
formal definition, by now we have realized that it is not that easy to use this definition to
solve each and every problem. Nevertheless, the student had encountered in his or her
earlier studies of calculus rather easy ways of evaluating limits by the help of different
rules. Here we state and prove some of them by using definition 3.2 and use those to
evaluate more complex limit cases.
Theorem 3.4:
i. If the limit of a function at exists, then this limit is unique.
ii. If and are both limits of at then
Theorem 3.5: Assume that l exist and c is any constant.
Then
1. [ ]
2. [ ]
3. .
4.
5. If exist then .
128
| | | |
By triangle inequality. Since there exists a number such that for
all x
| | then | |
| | .
a. b.
Solution:
By law 1 and 2
=2 by 3
by 9,7 and 7
b. (by law 5)
(By 1, 2 and 3)
(By 9, 8 and 7
A function with the direct substitution property is called continuous at However, not all
limits can be evaluated by direct substitution as the following examples show.
Example 2: Find
129
Solution: Let ⁄ . We can‘t find the limit by substitution
because isn‘t defined nor can we apply the quotient rule because the limit of the
denominator is 0. Instead, we need to do some preliminary algebra. We factor the
numerator as a difference of square:
The numerator and denominator have a common factor of . When we take the limit as
approaches 1, we have and so . Therefore, we can cancel the common
factor and compute the limit as follows:
=
Example 3: Find where
130
b. From the constant multiple theorems we find that.
√ √ √
d. By the quotient Theorem along the results of part (a)and(c), we find that
b) c) d)
Note:
Another widely used form equivalent to is
(1)
and this is obtained by replacing by .
Using the sum and constant multiple theorems and (1) we can show that the sine cosine
functions have limits at any number.
Example 5:Show that for any number a, and
Solution: Using the trigonometric identities
(2)
(3)
We will prove that and beginning with the sine
function, from (2) it follows that
131
The other trigonometric functions also have the property that the limit at a point in the
domain by evaluating the function at . Thus
Although the quotient theorem does not guarantee the existence of , when
Example 6: Find √
Solution: Let y is the expression since approaches 1 as x approaches 0, it
follows those y approaches 1, and we have
√ = √ 1.
Example 7:
Find
132
⁄ )= ⁄ = ⁄ =0
Activities :
Find the following limits
⁄ b) ⁄ √ c) √
rule because does not exists, since the value of the function lies in the
for
and .
That is
.
133
Therefore,
.
=1
Solution: Using figure 3.8 we obtain the following equations which are valid for
.
so that
and (1)
and
we obtain
, for | |
exists and 1
134
Exercises
1. Given that
Find the limits if exist.If the limit does not exist, explain why.
a) [ ] d)
b) e) √
c) √ f)
2. Evaluate the limit and justify each step by indicating the appropriate limit laws.
√
a) e) √
b) f)
c) g) √
d) ( √ )
3. Evaluate the limit if it exists.
a) e)
b) f) √
c) g) ( )
√
d) h)
4. Find the limit, if it exists. If the limit does not exist, explain why.
135
a) | | e) ( | |
)
b) f) | |
| |
c) ( ) g) | |
and | |
| |
| |
d) h) | |
5. Let {
a) Find and
b) Does exist?
6. Let | |
. then find and
136
Definition 3.8:
a ) Let be defined on some open interval A number L is the limit of as
approaches from the right (or the right-hand limit of at ) if for every there
is a number such that
then | | .
In this case, we write .
b) Let be defined on some open interval A number L is the limit of ) as
approaches a from the left (or the left-hand limit of at ) if for every there is
a number such that
if – then | |
In this case we write
137
Squaring both sides of this last inequality gives if
If we choose we have implices √ or implices
that |√ |
From figure 1.12 we see that ⁄ has neither a right-hand nor a left-hand limit as
approaches zero.
138
Theorem 3.9: Let be defined on an open interval about except possible at it self.
Then if and only if both one sided
limits, exists and
In this case
| | ,
Since | | | |
For , we have | | and so | |
Therefore, by Theorem 3.9
| |
Example 2: Let {
find .
Solution: since and by theorem 3.9 we have
Hence =5
| |
Example 3: Prove that does not exist.
| | | |
Solution : . And
139
Since the right and the left limits are different, it follows that from theorem 1.9 that
| |
does not exist.The graph of the function | |⁄ is shown in Figure
Solution: The graph of the greatest integer function is shown in Figure 3.15
For we have.
⟦ ⟧
Because of one side limits are not equal. ⟦ ⟧ does not exist by Theorem 3.9.
Exercises
1. Let {
140
a) b) c)
d) f
2. If n is an integer ,evaluate.
a) ⟦ ⟧ b) ⟦ ⟧ c) ⟦ ⟧
3. Which of the following statements about the function sketched in figure 3.16
are true, and which are false?
e) i)
f) j)
d) h) l)
Now we consider the limit of as becomes larger and larger in absolute value. Here we
see precise definition of limit.
Definition 3.10
a) if for every there is a number M such that
if then | |
b) if for every 0 there is a number N such that
if , 141 | |
The symbol for infinity does not reprsent a real number. We use to describe the
behavior of a function when the values in its domain or range out grow all finite bounds.
For example, the function ⁄ is define for all (Figure 3.17). When is
positive and becomes increasingly large, ⁄ becomes increasingly small. When is
negative and its magnitude becomes increasingly large. ⁄ again becomes small. We
summarize these observation by saying ⁄ at infinity and negatve infinity.
implices ⟨ ⟨ ⟨ ⟨
prove ⁄
142
If then ⟨ ⟨ ⟨ ⟨ , but then if then
√
if then ⟨ ⟨ ⟨ ⟨
In the above examples that is the horizontal asymptote of the graphs of ⁄ and
⁄ .
Example 3:
As grows larger and larger you can see that the values of get closer and closer to 1
(figure 3.18). In fact ,it seems that we can make the values of as close we like 1 by
taking sufficient large.This situation is expressed symolically by writing
The curve illustrated in Figure 18 has the line as a horizontal asymptote because
143
Theorem 3.12 If is a rational number, then
To determine the limit of a rational function as , we can divide the numerator and
denominator by the highest power of in the denominator.Then depends on the degrees of
the polynomails involved. Example 4 Numerator and denominator of the same degree.
Example 4: Evaluate
( )
(by limit law 5)
( )
Example 5: Evaluate
(√ )
144
Solution: Because both √ and are large when is large, it is difficult to see what
happens to their difference, so we use algebra to rewrite the function. We first multiply
numerator and denominator by the conjugate radical.
√
(√ ) (√ )√
√ √
now divide numerator and denominator by √ for
√ √
145
Theorem 3.13
If L, M and K are real number and
and , then
1. Sum Rule:
2. Difference Rule:
3. Product Rule:
4. Constant Multiple Rule:
5. Quotient Rule: , M
⁄
provided that is real number.(If s is even we assume that .)
known limits
√
b. √ x
√ product rule
The notation
is used to indicate that the values of become large as becomes large. Similar
meanings are attached to the following symbols.
146
In fact ,we can make as big as we like by taking large enough.
Therefore we can write
Means that for every positve number M there is a corresponding positve number N
such that
If
Similar definition apply when the symbol is replace by
Infinite limits can also be defined in a precise way. Let us look again at the function
as the values of grow with out bound, that is
right (see Figure 3.21) therefore be haves near 1 exactly the way y behaves
near 0.
147
=
b. near
Solution:
a. As appraoches zero from either side,the values of are positive and
148
Instead of requiring to lie arbitrarily close to a finite for all sufficiently close to ,
the definition of infinte limits require to arbitrarily far from the origin. Except for this
change, the langauge is identical with what we have seen before.
Definetion 3.15 (Precise definition of infinite limits).
Let be a function defined on some open interval that contains the number except
possible at itself. Then
means that for vevery positve number M there is positve number such that
whenver | |
This says that the values of can be made arbitrarily large (larger than any given
Number taking close enough to (within a distance , where depends on , but
with ). A geometric illustration is shown in Figure 3.23.
Given any horizontal line , we can find a number such that if we restrict to lie in
the interval but , then the curve lies above the line .
You can see that if a larger is chosen, then a smaller may be required.
Solution: 1. guessing a value for . Given that , we want to find such that
Whenever | |
That is whenever | |
or | | whenever | | work
√
149
Thus Whenever | |
Therefore, .
Definition 3.16: Let be a function defined on some open interval that contains
the number except possibly at itself. Then
means that for every negative number here is a positive number such that
Notice that the distance between a point on the graph of ⁄ and the y-axis approaches
zero as the point moves vertically along the graph and away from the origin
Exercises
b) c)
150
e) f) ⁄
i)
√
√
( √ )
d) f e) f f)
151
Notice that definition 3.18 requires three properties if is continuous at
1. is defined (that is , is in the domain of )
2. exists.
3.
Figure 3.26 shows an example of function that is continuous at every point on its domain
152
Example 2: Figure 3.27 shows the graph of a function At which number is
discontinuous?
Solution: It looks as if there is a discontinuity where because the graph has a break
there. The main reason that is discontinuous at 1 is that is not defined.The graph
also has a break when , but the reason for the discontinuity is different. Here, is
defined , but does not exist (because the left and right limits are different). So
is discontinuouts at 3. What about Here, is defined and exist
(because the left and right limits are the same). But, So is
discontinuous at 5
Class activity
Where are each of the following functions discontinuous?
a. b. { if c. {
153
A continuous functions need not be continuous on every interval. For example ⁄ is
not continuous on [-1,1] (Figure 3.28), but it is continuous at its domain .
154
It follows from theorem 3.21 and definition 3. 20 that if and are continuous on an
intervals, then so are the functions if g is never zero 0) ⁄
Theorem 3.22
a. Any polynomial is continuous every where, that is continuous on
b. Any rational function is continuous wherever it is defined, that is continuous on
its domain.
Proof:
a. A polynomail is a function of the form:
Example 4: Find
155
Another way of combining continuous functions and to get a new continuous function
is to form the composite function this fact is a consequence of the following theorem.
Theorem 3.23 If is continuous at and
Theorem 3.24 tells us that the composite of two continuous functions at a given number is
continuous.
Theorem 3.24
If is continuous at and is continuous at , then the composite given
by is continuous at
Solution:
a. Let and , we have
Now is continuous on since it is polynomial and is continuous everywhere. Thus
is continuous on by theorem 3.24.
b. We know from theorem 3.22 that is continuous and
(because and continuous). Therefore by Theorem 3.24
is continuous, where it is defined. Now is defined when . So it
is undefined when and this is happened when . Thus is
discontinuous when is an odd multiple of and is continuous on the intervals between
these values.
Definition 3.25
a) A function is continuous on ,if it is continuous at every point in
b) A function is continuous on [ ], if it is continuous on , and is also continuous
from the right at and continuous from the left at b.
Exercises
156
1.Write an equation that expresses the fact that a function if continuous at the number
4.
2. If is continuous on ,what can you say about its graph?
3. a) From figure 3.29. State the number at which is discontinuous and explain why.
b) For each of the numbers stated in part (a) determine whether is continuous from
the right, or from the left ,or neither.
√ √
| |
{
5. Find the values for the constant k, that makesthe following functions are continuous at
or continuous every where.
a) { b) {
{ d) {
e) {
157
3.6- Intermediate value theorem
Objectives
At the end of this section, students will be able to:
State Intermediate value Theorem;
Use the intermedaite value Theorem to show that there is a root of the given
equation in the specified interval.
Functions that are continuous on intervals have properties that make them particularly
useful in mathematics and its applications. One of these is the Intermediate Value Property.
A function is said to have the Intermediate Value Property if whenever it takes on two
values, it also takes on all the values in between.
The intermediate value Theorem states that a continuous function takes on every
intermediate value between the functional values and It is illustrated by Figure
3.30 below. Note that the value N can be taken on once [as in part (a)] or more than once
[as in part (b)].
If we think of a continuous function as a function whose graph has no hole or break, then it
is easy to believe that the intermediate value theorem is true. In geometric terms it says that
if any horizontal line is given between and as in Figure 1.32,
then the graph of cannot jump over the line.it must intersect somewhere.
158
The intermediatevalue theorem is not true in general for discontinuous functions.
One use of the intermediate value theorem is in locating roots of equations as in the
following example.
Example 1: Show that the expression has at least one root between 1
and 2.
Solution: Let we are lookig for a solution of the given
equation , that is a number c between 1 and 2 such that . Therefore ,we take
and by the theorem, we have
159
Summary
I. Let be a function defined at each point of some open interval containing possibly at
itself, then a number L is the limit of as approaches (or is the limit of at ) if
for every number there is a number such that
| | then | | .
If is the limit of ) as approaches then we write
If such an L can be found we say that the limit of at exists or that has a limit at or
that exists.
160
Review Exercises
d) If then e) ( √ )
f)
√
b) c) d)
e) (√ √ )
√
f) g) (√ )
h)
3. Find two function and such that does not exist but
exists.
4. [ ] [ ] Then find
[ ], provided that the limits exist at
5. Explain why the function is discontinuous at the given number
a) | |
b) {
c {
is discontinuous?At which of these points is continuous from the right, from the left.
7. For what value of the constant c is the function continuous on (
161
8. Find the values of and that make continuous every where.
{
9. Show that is continuous on (
a) {
√
b) {
10.Use the intermedaite value Theorem to show that there is a root of the given equation in
the specified interval.
a) b) c) ,
d) √ e)
References
1. James Stewart, Calculus early trancedental, 6th ed., Prentice Hall, 2008
2. Howard Anton,Calculus a new horizon, 6th ed., John wiley and Sons Inc
3. Robert Ellis and Denny Gulick, Calculus with analytic geometry, 6th ed, Harcourt
Brace Jovanovich, Publishers, 5th ed, 1993
4. Adams, Calculus: A complete course, 5th ed, Addison Wesley, 2003
5. R. Wrede and M. R. Spiegel, Theory of advanced calculus, 2nd ed., McGraw-Hill,
2002.
6. A. E. Taylor and W. R. Mann, Advanced calculus, 3rd ed, John-Wiley and Son, INC,
1995.
7. R. T. Smith and R. B. Minton, Calculus concepts and connections, McGram-Hill book
company, 2006
8. D. V. Widder, Advanced calculus, Prentice-Hall, 1979
9. Ross L. Finney et al, Calculus, Addison Wesley, 1995
10. E. J. Purcell and D. Varberg, Calculus with analytic geometry, Prentice-Hall INC.,
1987
11. Lynne, Garner: Calculus and Analytic Geometry, Dellen Publishing Company
12. John A. Tierney: Calculus and Analytic, 4th edition, Allyn and Bacon, Inc. –Boston.
162
13. Earl W. Swokowski: Calculus with Analytic Geometry, 2nd edition, Prindle, Weber
and Schmidt.
14. Goldstein, L. J., Lay D. C. , and Schneider D. I (1987). Calculus and its applications,
4th, edn, Prentice-Hall, Inc, London
15. Kresying, Advanced Engineering Mathematics.
16. Alan Jeffry , Advanced Engineering Mathematics
CHAPTER-FOUR
DERIVATIVES AND APPLICATION OF DERIVATIVES
Objectives
At the end of this chapter, students will be able to:
Define the derivative and differntiability of a function;
Find the slope and the equation of the tangent line to the graph of a given function
at a given point;
State and prove some techniques of differntiation;
Compute the derivative of a given function;
To find higher order of derivative of functions.
Introduction
163
In this chapter,we will discuss the concept of tangent line and normal line to graph of a
function at a given point. With the concept of geometric interpretation of tangent line
to a curve at a point,we will see the formal definition of the derivative of a function at
point ,which will be followed by the derivative of a function at any point along with
the usual notations for the derivative. We will also duscuss the concept of
differentiablility and its relationship to continuity. We will also develop the Chain
Rule to find the derivative of composition of functions. we will also emphasize the
application of the derivative to graphing functions. We will learn how to determine
where the graph of a differentiable function rise and where it falls: where it has peak
and where it has valleys: where it curves upward and where it curves downward. The
concepts we will introduce have application not only to graphing functions but also to
problems in such widely varying areas.
(1)
(2)
If the limit in (2) exists, we say that has a derivative at and is defferentiable at or
that exists.
The derivative of a function at a point can also be defined as
164
[ ] [ ]
Differentaible functions
The derivative of function may or may not exist at particular point . If the limit in (3) fails
to exist, say at we say is not differentaible at
Definition 4.2: Let a function be defined in open interval containig the point c. The
function is differentaible at c if and only if the derivative exists. If is differentaile
at every point of its domain we say simply that is differentaible.
Example 2: Let Then determine the set of values of for which is
differentaible.
Solution:
Let
Since this expression is defined for every real number , is defferentaible in the whole
real line.
Theorem 4.3: If differentaible at point , then it is continuous at
Proof: To prove that is continuous at , we have to show that .
We do this by showing that the difference approaches 0.
The given information is that is differentiable at , that is
165
Taking the limit on both side of equation, we get
[ ]
Therefore, is continuous at .
The converse of this theorem is not true ; that is ,there are functions that are continuous at a
point but not differentaible at that point.
Example 3: Show that the function | | is continuous at 0, but not differentiable at .
Solution: The function | | continuous at 0, since
| | | |
For
| |
And for
| |
| |
does not exists, because the right and the left hand side limits are not
etc.
Exircses
1. Find
a) b) c)
d) e) f) f(x
166
g)
√
h) √
2. Each limit represents the derivative of some function at some number . State such an
and in each case.
√
a) b) c) ⁄
d) e) f)
3. Determine whether
4. Given {
Show that :
a) is continuous for all values of
b) is differentiable for all value of
c) is not continuous at
Tangent Lines
Activity
Q1.What is a tangent line? a normal line?
Q2. Is any line that touches a curve at one point a tangent line?
For circle, tangency is striaght forward. A line L is tangent to a circle at a point P if L
passes through P perpendicular to the raduis at P (Figure 4.1). Such a line just touches the
circle. But what does it mean to say that a line L is tangent to some other curve C at a
Point P? Generalizing from the geometry of the circle, we might say that it means one of
the following:
1. L passes through P perpendicular to the line from P to the center of C.
2. L passes through only one point of C, namely P.
3. L passes through P and lies on one side of C only.
167
L Passes through P perpendicular to radius OP.
Most curves do not have centers, and a line we may want to call tangent may intersect C at
other points or cross C at point of tangency (Figure 4.2).
In this section, we make use of limit concept to find the equation of a line tangent to the
graph of a function at a given point.
Definition 4.4: The tangent line to the curve at the point is the line
through p with slope
(4)
Example 1:
Find an equation of the tangent line to the parabola at the point .
Solution: Here we have and , so the slope is
168
Using the point slope form of the equation of the line, we find that an equation of the
tangent line at is or
There is another expression for the slope of tangent line that is sometimes easier to use.
Let so that
So the slope of the secant line PQ is
See Figure 4.3 where the case is illustrated and Q is to the right of P. If it happened
that However, Q would be to the left of P.
Notice that as approaches approaches (because so that the expression
for the slope of the tangent line in definition 4.4 becomes
Example 2:
Find an equation of the tangent line to the hyperbola ⁄ at the point
Solution: Let ⁄ . Then the slope of tangent at is
169
=
which simplifies to
Definition 4.5: The slope of the line tangent to the graph of the function at
is equal to
the derivative of at
The geometric interpretion of a derivative is shown in Figure 4.4.
If we use the point - slope form of the equation of a line, we can write an equation of the
tangent line to the curve at the point (
Example 3: Find the equation of the tangent line to the parabola at the
point (3,-6).
Solution: We know that the derivetive of at the number is
. Therefore, the slope of the tangent line at (3,-6) is
170
Thus, the equation of the tangent line to the parabola is:
Then we say that the graph of has a vertical tangent line at . In that case the
vertical line is called the line tangent to the graph of at .
Example 4:
⁄
Let . Show that the graph of has vertical tangent line at and find an
equation for it.
Solution: We observe that is continuous at 0 an d that
⁄
⁄
By definition 2.6 the graph has a vertical tangent line at and an equation of the
tangent is
Normal Lines
If the tangent line has slope then the normal line will have slope , and the
171
It is clear that if the tangent line is horizontal then the normal lne will be vertical, and vice
varsa.
Example 5: Find the equation of the normal line to the function at
Solution: Let us compute the slope of the tangent line to the curve at . This is done in
the following ways.
= .
That is , so the normal line is will have slope , hence the equation
of normal line is
Or
Execises
1. Find the slope of the tangent line to the curve at the point
a) using definition 1
b) using equation 3
2. Find an equation of the tangent line to the curve given point.
a) ,
b) √ ,
c) ⁄ , (3,2)
d) ⁄ ,
172
b) Find the slope of the tangent lines at the point whose x-coordinates ar
, and
4. Compute tangent line if it exists for each curve at the given value of a.
⁄
a) b) c) any
d) any e) f)
5. Find the equation of tangent line if it exists to the given curve at given point .
a) , b) , c) ,
d) √ , e) | |, f)
√
g) ,( )
6. Explian why there is no tangent line to the given curve at the given point.
| | ⁄
a) , b) √ , c)
d) | | e) | | f) ,
7. Find the equation for the normal line to the given curve at the given point.
⁄
a) , ( ⁄ ⁄ ) b) , c) √ , (
d) , e) ( f) √ , (√ √ )
Here, we will discuss rules stated as theorems that help us differentiate combinations of
functions. The proofs of these theorems depend mainly on the appropriate limit theorems.
173
Example 1: Let
Similarly,
When n is any positive integer ,then we have form the definition of derivative
so that [ ]
[ ]
[ ]
.
Therefore
174
The power rule (general version)
If n is any real number , then .
Activities: Differentiate
a. b. √ c. d)
√
[ ]
Example 2:
If the functions and are differentiate at a point a, then so are and and
1 sum rule
difference rule
Proof
1. Using the limit theorem , we find that
175
a. b. c.
Now, and
⁄
[ ]
176
Proof : Since exists by hypothesis, it follows that is continuous at so that
, because by hypothesis.
Therefore, ⁄ is defined throught some open interval about , and the following limits
exist:
⁄ ⁄
⁄
. / . /
[ ] [ ] [ ]
Therefore,
⁄
[ ]
Solution:
( )
177
[ ]
Activities.
Differentaite the following function.
a. b. c.
1. 4. ( 7.
2.( 5. ( )
3.( f 6.
Exercises
1. Find the derivative of in two ways: by using the product Rule and
performing the multiplication first. Do your ansewrs agree?
2. Find the derivative of the function
√
√
In two ways:by using the Quotient Rule and by simplifing first. Show that your answers are
equivalent.Which method do you prefer?
3. Differentiate.
a) f)
b) g)
c) h) ( )
d) √ i) ( √ )
e) j)
178
b) d)
6. If , where ,Find .
7. If and find ( )⟨
8. Find equation of the tangent line and normal line to the given curve at the specified
point.
√
a) , b) , (
a) c. b) d)
√
Then find;
a) b)
[ ] [ ]
179
Derivatives of Polynomial Functions
The constant multiple rule, sum rule the difference rule can be combined with the power
rule to find any polynomial, as the following example.
Example 1: Given the polynomial function , its
derivative is given as
The Quotient Rule and the other differentiation formulas enable us to compute the
derivative of any rational function, as the next example illustrates.
180
Example 3: Let . Then
Activity
1. State and describe the basics of derivatives of trignometric functons.
2. Show that the derivative of is .
In this discussion, we will find the derivatives of all the trignometric functions. There are
six important formulas for differentiating trignometric functons.
Theorem 4.8:
The trignometric functons are differentiable whenever they are defined and
1. 4.
Proof:
1. Using the definition of derivative, we have ,
181
Notice that and
Therefore,
2. To show (3), we make use of the definition and the quotient rule.
( )
Example 4:
Differentiate the following functions.
a. b.
Solution:
a. Using the sum and product rule,we get,
(x)=2x
b. Use the product and the quotient rule , we have,
182
Exercises
4. 5. 6.
7. 8. 9.
13. 14 . 15.
19. 20 . 21.
32.
33. Let
c) Show that your answers to parts (a) and (b) are equivalent.
35. For what values of x does the graph of f have a horizontal tangent?
183
Let us try to find the derivative of the exponential function using the definition
of a derivative .
( )
We can use chain Rule to differentiate an exponential function with any base
let then and Chain Rule gives
( )
In this section, we use implicit differentaition to find the derivatives of the logarithmic
function and in particular, the natural logarithmic function .
184
Proof: Let then Differentiating this equation implicity with
respect to and using formula (1) we get
and so
If we put in formula 1, then the factor on the right side becomes and
we get the formula for the derivative of the natural logarithmic function
By comparing formula 1 and 2 ,we see one of the main reasons that natural logarithms
(logarithms with base e) are used in calculus : The differentiation formula is simplest when
become In general , if we combine formula 2 with the chain rule we get
[ ]
Example 6: Differentiate
Solution: let and now by formula (3) we have
[ ]
Example 7: Find .
Example 8: Differentiate
Solution: Using formula 1 with , we have
Solution : Since {
It follows that
185
{
The result is
| |
Logarithmic differentiation
Solution : We take logarithms of both sides of the equation and use the law of logarithms
to simplify:
( )
186
The Power Rule:
Hence
Remark:
You should distinguish carefully between the Power Rule [ ] where the base
is variavble and the exponent is constant, and the rule for differentiating exponential
functions [ ] where the base is constant and the exponent is variable.
In general there are four cases for exponent and bases:
2. [ ] [ ] (x)
3. [ ]
example.
√
√
(√ ) √ (√ )
√ √
√ √
Another method is to write ( )
( √ ) ( √ ) √ (√ )
187
√ ( )
√
Activity:
⁄
Show that
Note:
If we put ⁄ in the above limit, then as and so an alternative
expression for is
( )
( )
188
The chain rule can be written in the form
( ( )) ( )
√ ( ) √ √
Activity 3.10
Differentiate a) √ b) ( ) c)
The chain rule assumes a very suggestive form in the Leibniz notation.
Suppose the function and in the chain rule are already given and let
Therefore
( ( ))
189
Or more concisely, (8)
If [ ] then we can write where . By using the chain
rule and the power rule we get
Therefore [ ]
(9)
Example 4: Find if .
√
⁄
Solution: Let and √ . So, and
( ) √
⁄
( )
⁄
( )
⁄
Example 5: Suppose the radius of a ballon varies with respect to time according to
the equation . Find the rate of change of the ballon‘s volume with respect to
time.
Solution: Let be the volume, then , while by assumption
190
The compound chain rule
Activity:
Find where √
But
So,
Therefore,
(10)
In the formula, the derivative of at the number appears first, then the
derivative of at the number and finally the derivative of at the number .
Example 6: Let . Find and calculate ⁄
Solution: Let and
Then ,
From (9) we have
( )
In particular,
( ⁄ ) ( ⁄ ) ( ⁄ )
√ √
⁄ ( )
191
Exercises
b. √ , find
5. If , where f'(5)=32 and g'(5)=6,
2 find
6. If ( ) where
and 6 find .
7. If √ where and find
192
s
193
194
195
4.2.2. Hyperbolic and Inverse hyperbolic Functions
196
this reason they are collectivelly called hyperbolic functions and individually called
hyperbolic sine and hyperbolic cosine and so on. We also define the rest of the
hyperbolic functions interms of these functions.
Definition 4.10:
The graphs of hyperbolic sine and hyperbolic cosine can be sketched using graphical
addition as in the Figure belows.
Note that has domain and range . While has domain and range [ ]
The hyperbolic functions satisfy a number of identities that are similar to well - known
trigonometric identities. We list some of them here and leave most of the proof as an
exercises.
Hyperbolic Identities
4.
Proof:
1. Trivial!
197
2. By definition, we have
( ) ( )
3. Trivial!
4. Left as an exercise
We list the differentiation formula for the hyperbolic functions below. We prove (1) and
(2). The remaining proof are left as exercise.Obserbe that the analogy with the
differentiation formulas for trigonometric functions, but be aware that the signs are
different in some cases.
1. 4.
2. 5.
3. 8.
Proof:
( )
( )
It can be seen from their graphs that and are one-to-one functions and so they
have inverse functions denoted by and . But, is not one –to-one.
However, when restricted to the domain [ ] it become one-to-one. The inverse
hyperbolic cosine function is defined as the inverse of this restricted function.
iff
iff
iff
The remaining invese hyperbolic functions are defined similarly.
198
We can sketch the graphs of ,
Since the hyperbolic functions are definded in terms of exponential funfions ,it is not
surprising to learn that the inverse hyperbolic functions can be expressed in terms of
logarithms. In particular,we have :
Theorem 4.11:
1. ( √ )
2. ( √ )
3. | |
√
4. ,
Our aim now is to write as a function of x. Using the quadratic formula and the fact that
,we deduce that
√
√
( √ )
199
( √ )
Exercises
d) e) ( ) f) √
g) h) √ i) [ ]
2. Find an equation of the tangent line to the curve at the given point.
a) b) ,
3. find the numerical value of the expression.
a) b) c). d)
e) f) g) ( √ )
4.Prove following function:
(a). (b)
(c). (d)
200
These are called the first derivative, the second derivative, the third derivative and so
forth. The notation of a derivative of arbitrary order is
and is read as order derivative.
Activity : Find
a) , where
b) ⟨ , where
.
.
.
( )
Therefore,
201
[ ]
[ [ ]] [ ]
[ [ ]] [ ]
In general, we write
[ ]
(11)
or more briefly
202
and , differentiating three more times, we have
Exercises
3. find and
a)
b)
c)
d)
4. a) Use the Quotient Rule to differentiate the function
find
a) ( ⁄ ) ⁄ .
Up to now, we have been concerned with differentiating functions that are expressed in
the form , that is, functions that can be expressed in one variable explicitly.
Because the variable appears alone on one side of the equation. However, sometimes
203
functions are defined by equations in which is not alone on one side: For example,
the equation
Thus, we say that (1) defines implicitly as a function of , the function being
Thus,
204
An equation of the tangent to the circle at (3,4) is therefore
or
⁄
√
So,
√
and an equation of the tangent is
Solution:
( )
Activity.
a) Use implicit differentiation to find for the equation
(1)
Substituting (1) into (2) and simplifying using the original equation, we obtain
⁄
Example 5: Find if
206
Exercises
1. Find
a) d)
b) e) √
c) √ f) √
a) f)
⁄
b) g)
c) h)
d) √ i) √
e)
3. Find the slope of the tangent line to the curve at the given points in two ways:
first by solving for interns of and differentiating and then by implicit
differentiation.
a) ( )
√ √ √ √
b)
4. Find an equation of the tangent line to the hyperbola
at the point
5. Find the value of a and b for the curve if the point (1,1) is on
its graph and the tangent line at (1,1) has the equation
207
4.5. Applications of Derivatives
Objectives:
In this section , we will give emphasize the application of the derivative to graphing
functions. We will learn how to determine where the graph of a differentiable function
rise and where it falls: where it has peak and where it has valleys: where it curves
upward and where it curves downward. The concepts we will introduce have
application not only to graphing functions but also to problems in such widely varying
areas.
209
Part (a)-(e) of figure 4.12 show that a continous function may or may not have relative
maximum or minimum on an infinite interval or on a finite open interval. However,
theorem 4.14 will show us that a continous function must have both an absolute
maximum and an absolute minimum on every finite closed interval(see part (f) of
figure 4.12)
Figure 4.13 shows an example of a function which has minimum value and no
maximum value.
Example 2. the function is continous, and hence is guaranteed to have
both an absolute maximum and an absolute minimum on every finite closed interval
and, in particular, on the interval [ ] the absolute minimum occures at and the
absolute maximum occurs at at which points the absolute minimum and
maximum values are and .
210
absolute extrema. However, knowing that something exists and finding it are two
separate things, so we will now address the problem of finding the absolute extrema.
Solution: First we find the critical point has extreme values because it is continuous
on [ ], since is differentiable,
√
.
211
√ √
Since is not in the interval [ ], the only critical number in our focus is .
√
( ) √
√
Solution: From example 4, the critical point is , therefore for
all in . Hence the extreme value of on [2,4] must occur at the end points of the
interval. Since , we conclude that is the maximum value
and occurs at , whereas is the minimum value and occur at .
Solution:
or
or
But is not in [ ] so
Activity
⁄
a) Find the critical number of the function:
b) Find the absolute maximum and absolute minimum value of f on the given
interval 212 [ ]
Example 7: A landowner wishes to use mile of fencing to enclose a rectangular
region of maximum area, what should the lengths of the sides be?
Solution: Any rectangular region the landowner could enclose must have a length
and width (figure 4.14)
for
The problem has been reduced to finding the maximum value of on [ ], since
and thus only when , can have its maximum value
( ) and
Thus the maximum value of occurs for (mile) (see figure 3.4 ); since
Exercise .
b) e)
213
c) f)
2. Find all extreme values (if any) of the given function on the given interval.
Determine at which number in the interval these value occur
a) [ ]
b) [ ]
c) [ ]
d) [ ]
⁄
e) [ ]
3. If a and b are positive number , find the maximum value of
(1)
214
or, equivalently,
(2)
Proof: We introduce an auxiliary function that allows us to simplify the proof
by using Rolle‘s Theorem. The function is defined by
* +
215
Solution: since
The Mean Value Theorem is one of the most important results in calculus. We will
use it now to prove two very different theorems; the first implies that if two functions
have identical slopes at each number in an interval; then the functions differ by a
constant on that interval.
Theorem 4.19
Proof:
To prove (a), let and be arbitrary numbers in with . By the Mean Value
Theorem there is a number in such that
(1)
Therefore, .
216
It follows that assigns the same value at any two points in , so is constant on .
which yhields
Exercises:
217
1. Let , where A, B, and C are costants with . Show that
for any interval [a, b], the number c guaranteed by the Mean Value Theorem is
the mid point of [a, b].
2. Let | |. Show that , but there is no number c in (-2, 2) such
that . Does this result contradict Rolle’s Theorem? Explain.
The main singificance of the Mean Value Theorem is that it enables us to analyze the
nature of graphs of functions. Our immediate use of this principle is to prove the basic
fact concerning increasing and decreasing functinos. But, before stating the theorem let
us see the following definitions.
218
Example 1: Let On which interals is strictly increasing
and strictly decreasing.
Solution: For all x, we have
Using sign chart Method to know where and where .this depends
on the sign of the three factors of , namely, .
From figure 3.6, we conclude that decreasing on and
and increasing on .
Exercises:
Find the intervals on which the given function is strictly increasing and those on
which it is strictly decreasing.
1.
2.
3.
4.
219
i. If a continuous is increasing on the portion of an interval I to the left of c
and decreasing on the portion to the right of c, then is the maximum
value of on I.
ii. If a continuous is decreasing on the portion of an interval I to the left of c
and increasing on the portion to the right of c, then is the minimum
value of on I.
The derivative changes from positive to negative at c if there exists some number
such that for all in and for all in .
The definition of changes from negative to positive at c, results from replacing
by and vice versa.
⁄ ⁄
Example 1: Let . Determine where changes from positive to
negative and where it changes from negative to positive.
⁄ ⁄
220
a. if changes from positive to negative at , then has a relative maximum at
( )
221
Example 3: Let . Sketch the graph of .
[ ]
From figure 4.19, is strictly increasing on the interval [ ] and [ and strictly
decreasing on the intervals ] and [ ]. We also find that changes from
negative to positive at 1 and 3 and from positive to negative at 2. As a result, the first
derivative test implies and are relative minimum value and
is relative maximum value. Figure 4.20 shows the graph of .
222
Theorem 4.25 (Second Derivate Test )
Assume that
a. If then is a relative maximum value of .
b. If then is a relative minimum value of .
If then from this test alone we cannot draw any conclusion about a
relative exterme value at .
and
an
We know from the second derivative test that is a relative maximum value
of , whereas is relative minimum value of .
Example 5: Use the Second Derivative Test to determine the relative extreme values
of the function
and
Now,
223
Thus is the only critical number of . More over, . Therefore,
by the Second Derivative Test relative maximum value.
⁄ ⁄
Example 6: Sketch the graph of the function
⁄
And ⁄ ⁄
, for all we have for and
0for all .
Figure 4.21
Exercise
c.
d.
224
2. Use the first derivative test to determine the relative extreme values(if any)
of the function
a.
b.
c.
e.
3. Use the second derivative test to determine the relative extreme value of
the function
a.
b.
d.
c.
4. Suppose is contionous on
a. If and . What cann you say about ?
b. If and . What cann you say about ?
To solve many applied problems, one needs to find a maximum or minimum value of a
suitable function on an interval I. Recall from the Extreme Value Theorem that if is
continuous on a closed, bounded interval [ ], assumes a maximum and a
minimum value. Moreover thes values can be assumed only at the end points a and bof
the interval or at critical points in . The following examples illustrate this idea.
Example 1: A farmer has 2400 ft of fencing and wants to fence off a rectangular field that
borders a straight river. He needs no fence along the river. What are the dimensions of the
field that has the largest area?
Solution: In order to get a feeling for what is happening in this problem, let‘s experiment
with some special cases.
We see that when we try shallow, wild fields or deep, narrow fields, we get relatively small
areas. It seems plausible that there is some intermediate configuration that produces the
225
largest area. Figure 4.22 shows the general case. We wish to maximize the area A of the
rectangle. Let and be the depth and width of the rectangle in feet. Then we express A in
terms of and :
Figure 4.22
Note that and (otherwise A < 0). So the function that we wish to
maximize is
Which gives . The maximum value of A must occur either at this critical number
or at an end point of the interval. Since , , and ,
the closed interval method gives the maximum value as .
Therefore, the rectangular field should be 600 ft deep and 1200 ft wide.
Example 2: Find the two nonnegative numbers whose sum is 18 and whose product is as
large as possible.
Solution: Let and be positive numbers such that with
. Let the product be . Then,
226
.
Now, is the critical number of A. From this we have
and
Thus, by the closed interval method, we have that is the largest possible
product of the numbers and
Related Rates
In this subtopic, we are going to study problems involving variables that are changing with
respect to time. If two or more such variables are related to each other, then their rates of
change with respect to time are also related.
For instance, suppose that and are related by the equation . If both variables
are changing with respect to time then their rates of change will also be related by the
equation
Solution:
a. In this case we assume is differentiable with respect to .
Thus,
b. Similarly
227
Example 4: A ladder 10 ft long rests against a vertical wall. If the bottom of the ladder
slides away from the wall at a rate of 1 ft per second, how fast is the top of the ladder
sliding down the wall when the bottom of the ladder is 6 ft from the wall?
Solution:
We first draw a diagram and label it as in the figure. Let feet be the distance from the
bottom of the ladder to the wall and y feet the distance from the top of the ladder to the
ground. Note that x and y are both functions of time t measured in seconds.
We are given that ⁄ and we are asked to find when . In this problem,
Differentiating each side with respect to t using the chain rule, we have
When , the Pythagorean Theorem gives and so, substituting these values we
have
⁄ ⁄ .
228
The fact that is negative means that the distance from the top of the ladder to the
ground is decreasing.
Example 5: A water tank has the shape of an inverted circular cone with base radius 2m
and height 4m . If water is being pumped in to the tank at a rate of 2 ⁄ , find the rate
at which the water level is raising when the water is 3m deep.
Solution:
Let V, r, and h be the volume of the water, the radius of the surface, and the height of the
water at a time t, where t is measured in minutes.
We are given that 2 ⁄ and we are asked to find when h is 3m. The given
V= ( ) .
so
229
Example 6: Air is being pumped into a spherical balloon at the rate of 4.5 ⁄ . Find
the rate of change of the radius when the radius is 2cm.
Solution: Let r be the radius of the sphere, then the volume V of the sphere is given by
Example 7: The radius r of a circle is increasing at a rate of 3cm/min. Find the rate of
change of the area when r = 8cm.
Solution: If A is the area of the circle and r is its radius, then
A=
Exercises
1. Find two real numbers whose difference is 16 and whose product is as small as
possible.
2. Find the area of the largest rectangle that can be inscribed in a semicircle.
3. A metal box (without top) is to be constructed from a square sheet of metal that is
10m on a side by first cutting square pieces of the same size from each corner of the
sheet and then folding the sides. Find the dimensions of the box with largest
volume.
4. A cylindrical can is to be made to hold 1 L of oil. Find the dimensions that will
minimize the cost of the metal to manufacture the can.
5. A ladder 5m long rests against a vertical wall. If the bottom of the ladder slides
away from the wall at a rate of 0.25m/sec, how fast is the top of the ladder sliding
down the wall when the bottom of the ladder is 3m from the wall?
Definition 4.26: Let be differentiable at and let be the line tangent to the graph
of at ( )
230
a. The graph of is concave upward at ( ) if there is an open interval
about c such that if is in and then lies above .
b. The graph of is concave downward at if there is an open interval
about c such that if is in and , then lies below .
c. The graph of a function is concave upward (respectively concave
downward) on an open interval I if it is concave upward (respectively concave
downward) at for each in I.
Notice from figure 4.25 (a) that the graph of lies below all its tangents and the slopes
of the tangents decrease from left to right. Thus, the graph is concave down ward.
Similarly, the graph of in figure 4.25 (b) lies above all its tangents. Thus it is
concave upward.
A point where a curve changes its direction of concavity is called an inflection point.
Solution:
231
and that is a relative minimum value of f. Now we determine the sign
of .
From figure 4.26 and theorem 4.27 we deduce that the graph of is concave upward
on and ⁄ and is concave downward on ⁄ . From this information
we conclude the graph of is as shown in figure 4.27.
Activity
232
2. For each value of c found in step 1, determine whether changes sign at c.
3. If changes sign at c, the point ( ) is an inflection point.
Exercises
1. Find the interval on which the graph of the function is concave upward
and those on which it concave downward. Then sketch the graph of the
function.
a.
b.
c.
d. √
e.
2. Find the interval of concavity and the inflection points
a.
b.
c.
233
5. Suppose the derivative of function is
.
On which interval is f increasing?
Curve sketching
Table 4.1 lists the items that are must important in graphing a function.
Table 4.1
Property Test
has intercept
has intercept
Graph of is symmetric with
respect to {
has a relative maximum value at
{
234
Solution:
intercept
, so the y –intercept is 2.
intercept
it is false for all so has no intercept.
Symmetry:
and
Concavity
Asymptote
235
Which means that - axis is a horizontal asymptote of the graph.
We are now ready to sketch the graph of shown in figure 3.20
Activity
Exercises
1. Skecth the graph of the given function noting all relevant proberties listed
in table3.1.
a.
b.
c.
d.
a.
b. c.
Summary
or exists.
236
If the derivative of a function is exist at each point in its domain then
we say that a function is differentiable.
If is differentiable at then f is contionous.
A function is differentiable on [ ] if it is differentiable on
and if the left and right sided limit are exist.
Differentiation rule: if and are differentiable function and c be any
constant number then:
( )
237
If on an interval then is decreasing on .
If changes from positive to negative at , then has a relative
maximum at Were c is a critical number.
If changes from negative to positive at , then has a relative
minimum value at . Where c is a critical number.
If then is a relative maximum value of . Where c is a
critical number.
If then is a relative minimum value of . Where c is
a critical number.
If for all in , then the graph of is concave upward on .
If for all in , then the graph of is concave downward on .
Review exercises
b. d.
2. Find
a. √ c.
b. √ d.
a. (√ )
b.
c. √ √
238
5. Assume that and are differentiable function of . find in terms of
and
a. b.
6. Let be differentiable at and let . Show that
7. What is the equation of a tangent line to the parabola at
8. If √ , find
9. If , find
10. If find
11. Find the relative extreme value of the function on the given interval.
Determine at which numbers in the interval they are assumed.
a. [ ]
b. √ [ ]
⁄
c. [ ]
d. [ ]
c. √ √
d. √
14. For what value of the constant and is a point of inflection of the
curve ?
15. Determine the intervals on which f is increasing and those on which f is
decreasing,
a.
⁄
b.
239
c.
16. Let
b.
c.
18. Sketch the graph of the function, indicating all relevant properties listed in
table3.1
a. d.
b. e.
c. √
19. Suppose the distance a car can travel on one tank of gas at a velocity
of v miles per hour is given by
√ ⁄ ⁄
for
b.
c.
21. A 10 cm ladder is leaning against a house. The base of the ladder is pulled
away from the house at a rate of 0.25m/sec. How fast is the top of the
ladder moving down the wall when the base is
a. 6 m from the house
240
b. 8 m from the house
c. 9 m from the house
22. A water tank is in the shape of an inverted circular cone with base radius
3m and height 5m. If water is being pumped in to the tank at a rate of
2 , find the rate at which the water level is raising when the water
is 3m deep.
differentiable wrt y.
24. The radius r of a sphere is increasing at a rate of 3cm/min. Find the rate of
change of the volume when
a) r = 2 cm b) r = 3 cm.
References
241
1. James Stewart, Calculus early trancedental, 6th ed., Prentice Hall, 2008
2. Howard Anton,Calculus a new horizon, 6th ed., John wiley and Sons Inc
3. Robert Ellis and Denny Gulick, Calculus with analytic geometry, 6th ed, Harcourt
Brace Jovanovich, Publishers, 5th ed, 1993
4. Adams, Calculus: A complete course, 5th ed, Addison Wesley, 2003
5. R. Wrede and M. R. Spiegel, Theory of advanced calculus, 2nd ed., McGraw-Hill,
2002.
6. A. E. Taylor and W. R. Mann, Advanced calculus, 3rd ed, John-Wiley and Son, INC,
1995.
7. R. T. Smith and R. B. Minton, Calculus concepts and connections, McGram-Hill book
company, 2006
8. D. V. Widder, Advanced calculus, Prentice-Hall, 1979
9. Ross L. Finney et al, Calculus, Addison Wesley, 1995
10. E. J. Purcell and D. Varberg, Calculus with analytic geometry, Prentice-Hall INC.,
1987
11. Lynne, Garner: Calculus and Analytic Geometry, Dellen Publishing Company
12. John A. Tierney: Calculus and Analytic, 4th edition, Allyn and Bacon, Inc. –Boston.
13. Earl W. Swokowski: Calculus with Analytic Geometry, 2nd edition, Prindle, Weber
and Schmidt.
14. Goldstein, L. J., Lay D. C. , and Schneider D. I (1987). Calculus and its applications,
4th, edn, Prentice-Hall, Inc, London
15. Kresying, Advanced Engineering Mathematics.
16. Alan Jeffry , Advanced Engineering Mathematics
CHAPTER-FIVE:
INTEGRATION
242
UNIT OBJECTIVES:
At the end of this unit each student will able to:
Know antidifferentiation
Understand indefinite and definite integrals.
Realize and apply techniques of integration.
Understand the Fundamental Theorem of Calculus.
Know properties of indefinite and definite integrals
Learn about improper integrals.
Introduction
In the past chapter we‘ve been given a function, , and asking what the derivative of this
function was. Starting with this section we are not going to turn things around. We now
want to ask what function we differentiated to get the function .
Definition 5.1
243
Solution :
244
Theorem 5.2 : If F(x) is an antiderivative of f(x) on an interval I, then for any constant C
the function F(x) + C is also an antiderivative of f(x) on that interval. Moreover, each
antiderivative of f(x) on the interval I can be expressed in the form F(x) + C by choosing
the constant C appropriately.
Proof: Exercise
245
The process of finding antiderivatives is called antidifferentiation or integration
Definition 5.3
246
Note that often we will just say integral instead of indefinite integral (or definite integral
for that matter when we get to those). It will be clear from the context of the problem that
we are talking about an indefinite integral (or definite integral).
The process of finding the indefinite integral is called integration or integrating f(x). If
we need to be specific about the integration variable we will say that we are integrating
f(x) with respect to x.
A couple of warnings are now in order. One of the more common mistakes that students
make with integrals (both indefinite and definite) is to drop the dx at the end of the integral.
This is required! Think of the integral sign and the dx as a set of parenthesis. You already
know and are probably quite comfortable with the idea that every time you open a
parenthesis you must close it. With integrals, think of the integral sign as an ―open
parenthesis‖ and the dx as a ―close parenthesis‖.
If you drop the dx it won‘t be clear where the integrand ends. Consider the following
variations of the above example.
You only integrate what is between the integral sign and the dx.
Knowing which terms to integrate is not the only reason for writing the dx down. In the
Substitution Rule section we will actually be working with the dx in the problem and if we
aren‘t in the habit of writing it down it will be easy to forget about it and then we will get
247
the wrong answer at that stage. The moral of this is to make sure and put in the dx! At this
stage it may seem like a silly thing to do, but it just needs to be there, if for no other reason
than knowing where the integral stops.
On a side note, the dx notation should seem a little familiar to you. We saw things like this
a couple of sections ago. We called the dx a differential in that section and yes that is
exactly what it is. The dx that ends the integral is nothing more than a differential. The next
topic that we should discuss here is the integration variable used in the integral. Actually
there isn‘t really a lot to discuss here other than to note that the integration variable doesn‘t
really matter. For instance,
Changing the integration variable in the integral simply changes the variable in the answer.
It is important to notice however that when we change the integration variable in the
integral we also changed the differential (dx, dt, or dw) to match the new variable. This is
more important that we might realize at this point.
Another use of the differential at the end of integral is to tell us what variable we are
integrating with respect to. To see why this is important take a look at the following two
integrals.
The second integral is also fairly simple, but we need to be careful. The dx tells us that we
are integrating x‘s. That means that we only integrate x‘s that are in the integrand and all
other variables in the integrand are considered to be constants. The second integral is then,
So, it may seem silly to always put in the dx, but it is a vital bit of notation that can cause
us to get the incorrect answer if we neglect to put it in.
248
Now, there are some important properties of integrals that we should take a look at.
Not listed in the properties above were integrals of products and quotients. The reason for
this is simple. Just like with derivatives each of the following will NOT work.
249
With derivatives we had a product rule and a quotient rule to deal with these cases.
However, with integrals there are no such rules. When faced with a product and quotient in
an integral we will have a variety of ways of dealing with it depending on just what the
integrand is.
We can now answer this question easily with an indefinite integral.
250
Solution:
cos
EXAMPLE 7: Evaluate d
sin 2
Solution:
t 2 2t 4
EXAMPLE 7: Evaluate dt
t4
Solution:
Activity:
Evaluate each of the following indefinite integrals
Objectives:
251
By the end of this section, students will be able to:
Evaluate indefinite integrals using integration by substitution
Find indefinite integrals using integration by parts & by partial fraction
Determine the value of trigonometric integrals
Apply integration by trigonometric substitution to calculate integrals
Over view: In this section, we are going to discuss Integration by substitution, by parts and by
partial fraction, Trigonometric integrals, Integration by trigonometric substitution.That is we
develop techniques for using the basic integration formulas
To obtain indefinite integrals of more complicated functions. Integration is not as
Straightforward as differentiation; there are no rules that absolutely guarantee obtaining an
indefinite integral of a function. Therefore we discuss a strategy for integration.
of integration so that these integrals (and many others) can be evaluated by using the basic
integration formulas which are given in section 5.1.
252
2. Integrate with respect to u.
3. Replace u by g(x) in the result.
253
254
The success of the substitution method depends on finding a substitution that changes
an integral we cannot evaluate directly into one that we can. If the first substitution fails,try
to simplify the integrand further with an additional substitution or two. Alternatively, we
can start fresh. There can be more than one good way to start,as in the next example.
Solution:
255
Integration by Parts
xe dx , and ln x dx
x
If we try to evaluate integrals of the type by using the method of
substitution we obviously fail. But don‘t worry the next theorem will enable us to evaluate
not only these, but also many other types of integrals.
Every differentiation rule has a corresponding integration rule. For instance, the
Substitution Rule for integration corresponds to the Chain Rule for differentiation. The rule
that corresponds to the Product Rule for differentiation is called the rule for integration by
parts.
Integration by parts is a technique for simplifying integrals of the form f xg xdx It is
useful when ƒ can be differentiated repeatedly and g can be integrated repeatedly without
difficulty.
Theorem 5.5 : Integration by parts
If and are differentiable and f ' and g ' are continous then
Proof:
256
257
258
259
Example 6 : A Reduction Formula
260
Example 7 : Using a reduction formula evaluate
261
Integration by Partial fraction
262
263
264
265
266
Note : we can determine the constants that appear in partial fractions using different
methods such as differentiation method and assigning numerical Values to x
Example 6 : Find A, B, and C in the equation using differentiation method
267
Example 7: Find A, B, and C in the equation using assigning numerical value for x
Exercise
268
I- Evaluate the following integrals using integration by substitution
5.
1
1ln x4dx 6. tan x dx
x
7. sec x dx
II- Evaluate the following integrals using integration by Parts
269
In this subsection we use trigonometric identities to integrate certain combinations of
trigonometric functions. We start with powers of sine and cosine.Trigonometric integrals
involve algebraic combinations of the six basic trigonometric functions. In principle, we
can always express such integrals in terms of sines and cosines,but it is often simpler to
work with other functions, as in the integral
The general idea is to use identities to transform the integrals we have to find into integrals
that are easier to work with.
270
271
272
Example 5 Evaluate
∫
273
EXAMPLE 7 Evaluate
274
∫
EXAMPLE 8 Evaluate
∫
275
EXAMPLE 9 Evaluate
Exercise:
276
5.2.3. Integration by Trigonometric Substitution
The functions in these substitutions have inverses only for selected values of . For
reversibility,
277
278
∫
( )
√
( )
( ) √
279
EXAMPLE 4 :Evaluate
280
Exercise
281
Objectives:
By the end of this section, students will be able to:
Define Definite Integral
Evaluate definite integral
Apply properties of definite integrals to calculate integrals
State fundamental theorem of calculus and apply it.
Over view: In this section we will formally define the definite integral and give many of the
properties of definite integrals ,then we will present Fundamental Theorem of Calculus ,which is
central theorem of integral calculus. Let‘s start off with the definition of a definite integral.
Definite Integral
The number ―a‖ that is at the bottom of the integral sign is called the lower limit of the
integral and the number ―b‖ at the top of the integral sign is called the upper limit of the
integral. Also, despite the fact that a and b were given as an interval the lower limit does
not necessarily need to be smaller than the upper limit. Collectively we‘ll often call a and b
the interval of integration.
282
283
284
Formulas
EXAMPLE 4: Using the formulas and properties from above determine the value of the
following summation.
Solution:
The first thing that we need to do is square out the stuff being summed and then break up
the summation using the properties as follows,
EXAMPLE 5: Using the definition of the definite integral compute the following.
285
We can now compute the definite integral.
286
287
EXAMPLE 6: Evaluate each of the following.
288
EXAMPLE 7: Evaluate the definite integral.
EXAMPLE 8:
289
EXAMPLE 9
290
291
EXAMPLE -10 : Differentiate each of the following
292
293
Using the chain rule as we did in the last part of this example we can derive some general
formulas for some more complicated problems.
Next, we can get a formula for integrals in which the upper limit is a constant and the lower
limit is a function of x. All we need to do here is interchange the limits on the integral
(adding in a minus sign of course) and then using the formula above to get,
Finally, we can also get a version for both limits being functions of x. In this case we‘ll
need to use Property 5 above to break up the integral as follows,
294
EXAMPLE -11 : Differentiate the following integral
Solution:
295
296
EXAMPLE -13 : Evaluate the following definite integral
297
Exercise
2.
298
Objectives:
By the end of this section, students will be able to:
Define Improper Integral
Evaluate Improper integral
Over view: In this section we will formally define the improper integral and evaluate improper
integrals.
we define the integral of f (not necessarily a positive function) over an infinite interval as
the limit of integrals over finite intervals.
299
300
Therefore,
301
302
303
Exercise
304
SUMMARY OF CHAPTER-FIVE
305
Look for an Obvious Substitution
Classify the Integrand According to Its Form (Trigonometric functions,
Rational functions, Integration by parts., Radicals.)
Miscellaneous Exercises
References
1. James Stewart, Calculus early trancedental, 6th ed., Prentice Hall, 2008
306
2. Howard Anton,Calculus a new horizon, 6th ed., John wiley and Sons Inc
3. Robert Ellis and Denny Gulick, Calculus with analytic geometry, 6th ed, Harcourt Brace
Jovanovich, Publishers, 5th ed, 1993
4. Adams, Calculus: A complete course, 5th ed, Addison Wesley, 2003
5. R. Wrede and M. R. Spiegel, Theory of advanced calculus, 2nd ed., McGraw-Hill, 2002.
6. A. E. Taylor and W. R. Mann, Advanced calculus, 3rd ed, John-Wiley and Son, INC,
1995.
7. R. T. Smith and R. B. Minton, Calculus concepts and connections, McGram-Hill book
company, 2006
8. D. V. Widder, Advanced calculus, Prentice-Hall, 1979
9. Ross L. Finney et al, Calculus, Addison Wesley, 1995
10. E. J. Purcell and D. Varberg, Calculus with analytic geometry, Prentice-Hall INC.,
1987
11. Lynne, Garner: Calculus and Analytic Geometry, Dellen Publishing Company
12. John A. Tierney: Calculus and Analytic, 4th edition, Allyn and Bacon, Inc. –
Boston.
13. Earl W. Swokowski: Calculus with Analytic Geometry, 2nd edition, Prindle,
Weber and Schmidt.
14. Goldstein, L. J., Lay D. C. , and Schneider D. I (1987). Calculus and its
applications, 4th, edn, Prentice-Hall, Inc, London
15. Kresying, Advanced Engineering Mathematics.
16. Alan Jeffry , Advanced Engineering Mathematics
CHAPTER-SIX
APPLICATION OF INTEGRATION
307
UNIT OBJECTIVES:
At the end of this unit each student should able to:
Apply integration to find area
Apply integration to find volume
In this chapter, we will see some applications of integration such as: Area of a region
bounded by curves of continuous defined on a closed interval [a, b] and volume of a solid
of revolutions.
6.1. Area
Let f be a non-negative continuous function on [a, b], then the area of the region R
bounded by f and the x-axis between x =a & x = b (as shown in the fig. below) is given by:
∫ …………………………….(*)
Example:
1) Find the area of the region bounded by the graph of the function
and the x-axis between x=0 & x=3.
Solution: The region R is the shaded region as shown below.
Now let R1, R2 and R3 be the sub-regions b/n x=0 and x=1, x=1 and x=2, and x=2 and
x=3, respectively. Then, using (*):
Where,
308
∫ ∫ ⟨
∫ ⟨
Since the sub-region R2 is below the x- axis, the negative sign is important. Similarly,
∫ ⟨ .
We can see that the area of a region bounded by a continuous curves y = f(x), y = g(x) and
the lines x = a, x = b such that for all x in [a, b] is:
∫ [ ] ……………………(*)
∫ [ ] ∫ [ ] ∫
⟨ .
309
Note that the intersection points can be obtained as: .
Solving this equation for x, we get , then , as indicated in
the graph above.
Example 3: Find the area of the region bounded by the curves y = sin x and y = cos x,
between x = 0 & x = .
The points of intersection occur when sin x = cos x, that is, when (since ).
Observe that cos x sin x when but cos x sin x when . Therefore,
∫ [ ] ∫ [ ]
[ ]| [ ]| √ .
Exercise:
1. Find the area of the region enclosed by the graph of f(x) = sin x and the x-axis
between and .
2. Find the area of the region in the first quadrant which is enclosed by the y-axis and
the curves of f(x) = cos x and g(x) = sinx.
3. Determine the area of the region enclosed by the graphs of and
4. Find the area of the region enclosed by the graphs and the line .
310
6.2. Volume
In section, we will apply integral calculus to determine the volume of a solid region by
considering cross sections.
Suppose a region rotates about a straight line, then a solid figure called a solid of
revolution, is formed. The volume of such a solid is said to be a volume of revolution and
the line about which the region rotates is an axis of symmetry.
Now consider the solid of revolution generated by revolving the region between the curve
and the x-axis from to as shown below.
Every cross section which is perpendicular to the x-axis at x a circular region with radius, r =
f(x). Thus, the area of the cross section A(x) is .
Thus, we define the volume of revolution V as by:
∫ ∫ ……………………..( )
Example 1: Find the volume of the solid obtained by rotating about the x-axis the region
under the curve √ from 0 to 1.
Solution: The region and the solid figure rotated about the x-axis from 0 to 1 are shown in
fig.(a) and fig.(b) below.
∫ ∫ ⟨ .
311
Note that if f and g are continuous, non-negative on [a, b] such that for all x in
[a, b]. let R be a region bounded by f(x) and g(x), on [a, b] as in the following figure. Then,
the volume of the solid of revolution generated by revolving the region R about the x-axis is
given by:
∫ , where [( ) ( ) ].
Example 2: The region R enclosed by the curves and is rotated about the x-
axis. Find the volume of the resulting solid.
Solution:
The curves y = x and y = x2 intersect at the points (0,0) and (1, 1). The region and solid of
rotation are shown in the fig. above. The cross-section in the plane has the shape of annular
ring with inner radius x2 and outer radius x. So, [ ] .
Therefore, we have
∫ ∫ [ ]|
Consider again the case where a region R is rotated about the y-axis: where, either R is a
region bounded by the curve x = u(y) and the y-axis, between the lines y = c & y = d or R is a
region between two curves v(y) and w(y) on [c, d] as in following figures:
312
Then the corresponding volume of revolution is given by:
∫ [ ] and ∫ [ ] [ ] respectively.
Example 3: Find the volume of the solid obtained by rotating the region bounded by y = x3, y
= 8, and x = 0 about the y-axis.
Solution: The rotation of the region is shown as:
∫ [ ] ∫ .
Exercises:
1. Find the volume of solid of revolution about the x-axis generated by revolving the
area between the lines y = x and y = 4 from x = 1 to x = 3.
2. Find the volume of the solid of revolution about the y-axis generated by revolving the
region enclosed by the curve x = √ and the y-axis from y = 0 to y = 4.
3. The area bounded by the graph of y = x2+1 and the line y = 4 rotates about the y –
axis, find the volume of the solid generated.
4. Find the volume of the solid obtained by rotating the region bounded by the curves y2
= x and x = 2y about y-axis.
313
References
1. James Stewart, Calculus early trancedental, 6th ed., Prentice Hall, 2008
2. Howard Anton,Calculus a new horizon, 6th ed., John wiley and Sons Inc
3. Robert Ellis and Denny Gulick, Calculus with analytic geometry, 6th ed,
Harcourt Brace Jovanovich, Publishers, 5th ed, 1993
4. Adams, Calculus: A complete course, 5th ed, Addison Wesley, 2003
5. R. Wrede and M. R. Spiegel, Theory of advanced calculus, 2nd ed., McGraw-
Hill, 2002.
6. A. E. Taylor and W. R. Mann, Advanced calculus, 3rd ed, John-Wiley and
Son, INC, 1995.
7. R. T. Smith and R. B. Minton, Calculus concepts and connections, McGram-
Hill book company, 2006
8. D. V. Widder, Advanced calculus, Prentice-Hall, 1979
9. Ross L. Finney et al, Calculus, Addison Wesley, 1995
10. E. J. Purcell and D. Varberg, Calculus with analytic geometry, Prentice-Hall
INC., 1987
11. Lynne, Garner: Calculus and Analytic Geometry, Dellen Publishing Company
12. John A. Tierney: Calculus and Analytic, 4th edition, Allyn and Bacon, Inc. –
Boston.
13. Earl W. Swokowski: Calculus with Analytic Geometry, 2nd edition, Prindle,
Weber and Schmidt.
14. Goldstein, L. J., Lay D. C. , and Schneider D. I (1987). Calculus and its
applications, 4th, edn, Prentice-Hall, Inc, London
15. Kresying, Advanced Engineering Mathematics.
16. Alan Jeffry , Advanced Engineering Mathematics
314