0% found this document useful (0 votes)
24 views547 pages

G12 Mathematics

Uploaded by

Rayhan MR
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
24 views547 pages

G12 Mathematics

Uploaded by

Rayhan MR
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 547

Chapter 1

RELATIONS AND FUNCTIONS

he
™There is no permanent place in the world for ugly mathematics ... . It may
be very hard to define mathematical beauty but that is just as true of
beauty of any kind, we may not know quite what we mean by a
beautiful poem, but that does not prevent us from recognising

is
one when we read it. — G. H. HARDY ™

1.1 Introduction

bl
Recall that the notion of relations and functions, domain,
co-domain and range have been introduced in Class XI
along with different types of specific real valued functions
pu
and their graphs. The concept of the term ‘relation’ in
mathematics has been drawn from the meaning of relation
be T

in English language, according to which two objects or


quantities are related if there is a recognisable connection
re
or link between the two objects or quantities. Let A be
o R

the set of students of Class XII of a school and B be the


set of students of Class XI of the same school. Then some
of the examples of relations from A to B are
tt E

(i) {(a, b) ∈ A × B: a is brother of b}, Lejeune Dirichlet


(ii) {(a, b) ∈ A × B: a is sister of b}, (1805-1859)
C

(iii) {(a, b) ∈ A × B: age of a is greater than age of b},


(iv) {(a, b) ∈ A × B: total marks obtained by a in the final examination is less than
the total marks obtained by b in the final examination},
no N

(v) {(a, b) ∈ A × B: a lives in the same locality as b}. However, abstracting from
this, we define mathematically a relation R from A to B as an arbitrary subset
of A × B.
©

If (a, b) ∈ R, we say that a is related to b under the relation R and we write as


a R b. In general, (a, b) ∈ R, we do not bother whether there is a recognisable
connection or link between a and b. As seen in Class XI, functions are special kind of
relations.
In this chapter, we will study different types of relations and functions, composition
of functions, invertible functions and binary operations.
2 MATHEMATICS

1.2 Types of Relations


In this section, we would like to study different types of relations. We know that a
relation in a set A is a subset of A × A. Thus, the empty set φ and A × A are two
extreme relations. For illustration, consider a relation R in the set A = {1, 2, 3, 4} given by
R = {(a, b): a – b = 10}. This is the empty set, as no pair (a, b) satisfies the condition
a – b = 10. Similarly, R′ = {(a, b) : | a – b | ≥ 0} is the whole set A × A, as all pairs

he
(a, b) in A × A satisfy | a – b | ≥ 0. These two extreme examples lead us to the
following definitions.
Definition 1 A relation R in a set A is called empty relation, if no element of A is
related to any element of A, i.e., R = φ ⊂ A × A.

is
Definition 2 A relation R in a set A is called universal relation, if each element of A
is related to every element of A, i.e., R = A × A.

bl
Both the empty relation and the universal relation are some times called trivial
relations.
Example 1 Let A be the set of all students of a boys school. Show that the relation R
pu
in A given by R = {(a, b) : a is sister of b} is the empty relation and R′ = {(a, b) : the
difference between heights of a and b is less than 3 meters} is the universal relation.
be T

Solution Since the school is boys school, no student of the school can be sister of any
student of the school. Hence, R = φ, showing that R is the empty relation. It is also
re
o R

obvious that the difference between heights of any two students of the school has to be
less than 3 meters. This shows that R′ = A × A is the universal relation.
tt E

Remark In Class XI, we have seen two ways of representing a relation, namely raster
method and set builder method. However, a relation R in the set {1, 2, 3, 4} defined by R
= {(a, b) : b = a + 1} is also expressed as a R b if and only if
C

b = a + 1 by many authors. We may also use this notation, as and when convenient.
If (a, b) ∈ R, we say that a is related to b and we denote it as a R b.
no N

One of the most important relation, which plays a significant role in Mathematics,
is an equivalence relation. To study equivalence relation, we first consider three
types of relations, namely reflexive, symmetric and transitive.
Definition 3 A relation R in a set A is called
©

(i) reflexive, if (a, a) ∈ R, for every a ∈ A,


(ii) symmetric, if (a1, a2) ∈ R implies that (a2, a1) ∈ R, for all a1, a2 ∈ A.
(iii) transitive, if (a1, a2) ∈ R and (a2, a3) ∈ R implies that (a1, a3) ∈ R, for all a1, a2,
a3 ∈ A.
RELATIONS AND FUNCTIONS 3

Definition 4 A relation R in a set A is said to be an equivalence relation if R is


reflexive, symmetric and transitive.
Example 2 Let T be the set of all triangles in a plane with R a relation in T given by
R = {(T1, T2) : T1 is congruent to T2}. Show that R is an equivalence relation.
Solution R is reflexive, since every triangle is congruent to itself. Further,
(T1, T2) ∈ R ⇒ T1 is congruent to T2 ⇒ T2 is congruent to T1 ⇒ (T2, T1) ∈ R. Hence,

he
R is symmetric. Moreover, (T1, T2), (T2, T3) ∈ R ⇒ T1 is congruent to T2 and T2 is
congruent to T3 ⇒ T1 is congruent to T3 ⇒ (T1, T3) ∈ R. Therefore, R is an equivalence
relation.

is
Example 3 Let L be the set of all lines in a plane and R be the relation in L defined as
R = {(L1, L2) : L1 is perpendicular to L2}. Show that R is symmetric but neither
reflexive nor transitive.

bl
Solution R is not reflexive, as a line L1 can not be perpendicular to itself, i.e., (L1, L1)
∉ R. R is symmetric as (L1, L2) ∈ R

pu L1 is perpendicular to L2
⇒ L2 is perpendicular to L1
⇒ (L2, L1) ∈ R.
be T

R is not transitive. Indeed, if L1 is perpendicular to L2 and


re
Fig 1.1
L2 is perpendicular to L3, then L1 can never be perpendicular to
o R

L3. In fact, L1 is parallel to L3, i.e., (L1, L2) ∈ R, (L2, L3) ∈ R but (L1, L3) ∉ R.
Example 4 Show that the relation R in the set {1, 2, 3} given by R = {(1, 1), (2, 2),
tt E

(3, 3), (1, 2), (2, 3)} is reflexive but neither symmetric nor transitive.
Solution R is reflexive, since (1, 1), (2, 2) and (3, 3) lie in R. Also, R is not symmetric,
C

as (1, 2) ∈ R but (2, 1) ∉ R. Similarly, R is not transitive, as (1, 2) ∈ R and (2, 3) ∈ R


but (1, 3) ∉ R.
no N

Example 5 Show that the relation R in the set Z of integers given by


R = {(a, b) : 2 divides a – b}
is an equivalence relation.
©

Solution R is reflexive, as 2 divides (a – a) for all a ∈ Z. Further, if (a, b) ∈ R, then


2 divides a – b. Therefore, 2 divides b – a. Hence, (b, a) ∈ R, which shows that R is
symmetric. Similarly, if (a, b) ∈ R and (b, c) ∈ R, then a – b and b – c are divisible by
2. Now, a – c = (a – b) + (b – c) is even (Why?). So, (a – c) is divisible by 2. This
shows that R is transitive. Thus, R is an equivalence relation in Z.
4 MATHEMATICS

In Example 5, note that all even integers are related to zero, as (0, ± 2), (0, ± 4)
etc., lie in R and no odd integer is related to 0, as (0, ± 1), (0, ± 3) etc., do not lie in R.
Similarly, all odd integers are related to one and no even integer is related to one.
Therefore, the set E of all even integers and the set O of all odd integers are subsets of
Z satisfying following conditions:
(i) All elements of E are related to each other and all elements of O are related to

he
each other.
(ii) No element of E is related to any element of O and vice-versa.
(iii) E and O are disjoint and Z = E ∪ O.
The subset E is called the equivalence class containing zero and is denoted by

is
[0]. Similarly, O is the equivalence class containing 1 and is denoted by [1]. Note that
[0] ≠ [1], [0] = [2r] and [1] = [2r + 1], r ∈ Z. Infact, what we have seen above is true

bl
for an arbitrary equivalence relation R in a set X. Given an arbitrary equivalence
relation R in an arbitrary set X, R divides X into mutually disjoint subsets Ai called
partitions or subdivisions of X satisfying:
pu
(i) all elements of Ai are related to each other, for all i.
(ii) no element of Ai is related to any element of Aj , i ≠ j.
be T

(iii) ∪ Aj = X and Ai ∩ Aj = φ, i ≠ j.
The subsets Ai are called equivalence classes. The interesting part of the situation
re
o R

is that we can go reverse also. For example, consider a subdivision of the set Z given
by three mutually disjoint subsets A1, A2 and A3 whose union is Z with
A1 = {x ∈ Z : x is a multiple of 3} = {..., – 6, – 3, 0, 3, 6, ...}
tt E

A2 = {x ∈ Z : x – 1 is a multiple of 3} = {..., – 5, – 2, 1, 4, 7, ...}


A3 = {x ∈ Z : x – 2 is a multiple of 3} = {..., – 4, – 1, 2, 5, 8, ...}
C

Define a relation R in Z given by R = {(a, b) : 3 divides a – b}. Following the


arguments similar to those used in Example 5, we can show that R is an equivalence
no N

relation. Also, A1 coincides with the set of all integers in Z which are related to zero, A2
coincides with the set of all integers which are related to 1 and A3 coincides with the
set of all integers in Z which are related to 2. Thus, A1 = [0], A2 = [1] and A3 = [2].
In fact, A1 = [3r], A2 = [3r + 1] and A3 = [3r + 2], for all r ∈ Z.
©

Example 6 Let R be the relation defined in the set A = {1, 2, 3, 4, 5, 6, 7} by


R = {(a, b) : both a and b are either odd or even}. Show that R is an equivalence
relation. Further, show that all the elements of the subset {1, 3, 5, 7} are related to each
other and all the elements of the subset {2, 4, 6} are related to each other, but no
element of the subset {1, 3, 5, 7} is related to any element of the subset {2, 4, 6}.
RELATIONS AND FUNCTIONS 5

Solution Given any element a in A, both a and a must be either odd or even, so
that (a, a) ∈ R. Further, (a, b) ∈ R ⇒ both a and b must be either odd or even
⇒ (b, a) ∈ R. Similarly, (a, b) ∈ R and (b, c) ∈ R ⇒ all elements a, b, c, must be
either even or odd simultaneously ⇒ (a, c) ∈ R. Hence, R is an equivalence relation.
Further, all the elements of {1, 3, 5, 7} are related to each other, as all the elements
of this subset are odd. Similarly, all the elements of the subset {2, 4, 6} are related to
each other, as all of them are even. Also, no element of the subset {1, 3, 5, 7} can be

he
related to any element of {2, 4, 6}, as elements of {1, 3, 5, 7} are odd, while elements
of {2, 4, 6} are even.

EXERCISE 1.1

is
1. Determine whether each of the following relations are reflexive, symmetric and
transitive:

bl
(i) Relation R in the set A = {1, 2, 3, ..., 13, 14} defined as
R = {(x, y) : 3x – y = 0}
(ii) Relation R in the set N of natural numbers defined as
pu R = {(x, y) : y = x + 5 and x < 4}
(iii) Relation R in the set A = {1, 2, 3, 4, 5, 6} as
be T

R = {(x, y) : y is divisible by x}
re
(iv) Relation R in the set Z of all integers defined as
o R

R = {(x, y) : x – y is an integer}
(v) Relation R in the set A of human beings in a town at a particular time given by
tt E

(a) R = {(x, y) : x and y work at the same place}


(b) R = {(x, y) : x and y live in the same locality}
C

(c) R = {(x, y) : x is exactly 7 cm taller than y}


(d) R = {(x, y) : x is wife of y}
(e) R = {(x, y) : x is father of y}
no N

2. Show that the relation R in the set R of real numbers, defined as


R = {(a, b) : a ≤ b2} is neither reflexive nor symmetric nor transitive.
3. Check whether the relation R defined in the set {1, 2, 3, 4, 5, 6} as
©

R = {(a, b) : b = a + 1} is reflexive, symmetric or transitive.


4. Show that the relation R in R defined as R = {(a, b) : a ≤ b}, is reflexive and
transitive but not symmetric.
5. Check whether the relation R in R defined by R = {(a, b) : a ≤ b3} is reflexive,
symmetric or transitive.
6 MATHEMATICS

6. Show that the relation R in the set {1, 2, 3} given by R = {(1, 2), (2, 1)} is
symmetric but neither reflexive nor transitive.
7. Show that the relation R in the set A of all the books in a library of a college,
given by R = {(x, y) : x and y have same number of pages} is an equivalence
relation.
8. Show that the relation R in the set A = {1, 2, 3, 4, 5} given by

he
R = {(a, b) : |a – b| is even}, is an equivalence relation. Show that all the
elements of {1, 3, 5} are related to each other and all the elements of {2, 4} are
related to each other. But no element of {1, 3, 5} is related to any element of {2, 4}.
9. Show that each of the relation R in the set A = {x ∈ Z : 0 ≤ x ≤ 12}, given by

is
(i) R = {(a, b) : |a – b| is a multiple of 4}
(ii) R = {(a, b) : a = b}

bl
is an equivalence relation. Find the set of all elements related to 1 in each case.
10. Give an example of a relation. Which is
pu(i) Symmetric but neither reflexive nor transitive.
(ii) Transitive but neither reflexive nor symmetric.
be T

(iii) Reflexive and symmetric but not transitive.


re
(iv) Reflexive and transitive but not symmetric.
o R

(v) Symmetric and transitive but not reflexive.


11. Show that the relation R in the set A of points in a plane given by
tt E

R = {(P, Q) : distance of the point P from the origin is same as the distance of the
point Q from the origin}, is an equivalence relation. Further, show that the set of
all points related to a point P ≠ (0, 0) is the circle passing through P with origin as
C

centre.
12. Show that the relation R defined in the set A of all triangles as R = {(T1, T2) : T1
no N

is similar to T2}, is equivalence relation. Consider three right angle triangles T1


with sides 3, 4, 5, T2 with sides 5, 12, 13 and T3 with sides 6, 8, 10. Which
triangles among T1, T2 and T3 are related?
13. Show that the relation R defined in the set A of all polygons as R = {(P1, P2) :
©

P1 and P2 have same number of sides}, is an equivalence relation. What is the


set of all elements in A related to the right angle triangle T with sides 3, 4 and 5?
14. Let L be the set of all lines in XY plane and R be the relation in L defined as
R = {(L1, L2) : L1 is parallel to L2}. Show that R is an equivalence relation. Find
the set of all lines related to the line y = 2x + 4.
RELATIONS AND FUNCTIONS 7

15. Let R be the relation in the set {1, 2, 3, 4} given by R = {(1, 2), (2, 2), (1, 1), (4,4),
(1, 3), (3, 3), (3, 2)}. Choose the correct answer.
(A) R is reflexive and symmetric but not transitive.
(B) R is reflexive and transitive but not symmetric.
(C) R is symmetric and transitive but not reflexive.

he
(D) R is an equivalence relation.
16. Let R be the relation in the set N given by R = {(a, b) : a = b – 2, b > 6}. Choose
the correct answer.
(A) (2, 4) ∈ R (B) (3, 8) ∈ R (C) (6, 8) ∈ R (D) (8, 7) ∈ R

is
1.3 Types of Functions
The notion of a function along with some special functions like identity function, constant

bl
function, polynomial function, rational function, modulus function, signum function etc.
along with their graphs have been given in Class XI.
Addition, subtraction, multiplication and division of two functions have also been
pu
studied. As the concept of function is of paramount importance in mathematics and
among other disciplines as well, we would like to extend our study about function from
be T

where we finished earlier. In this section, we would like to study different types of
functions.
re
o R

Consider the functions f1, f2, f3 and f4 given by the following diagrams.
In Fig 1.2, we observe that the images of distinct elements of X1 under the function
f1 are distinct, but the image of two distinct elements 1 and 2 of X1 under f2 is same,
tt E

namely b. Further, there are some elements like e and f in X2 which are not images of
any element of X1 under f1, while all elements of X3 are images of some elements of X1
C

under f3. The above observations lead to the following definitions:


Definition 5 A function f : X → Y is defined to be one-one (or injective), if the images
of distinct elements of X under f are distinct, i.e., for every x1, x2 ∈ X, f (x1) = f (x2)
no N

implies x1 = x2. Otherwise, f is called many-one.


The function f1 and f4 in Fig 1.2 (i) and (iv) are one-one and the function f2 and f3
in Fig 1.2 (ii) and (iii) are many-one.
©

Definition 6 A function f : X → Y is said to be onto (or surjective), if every element


of Y is the image of some element of X under f, i.e., for every y ∈ Y, there exists an
element x in X such that f (x) = y.
The function f3 and f4 in Fig 1.2 (iii), (iv) are onto and the function f1 in Fig 1.2 (i) is
not onto as elements e, f in X2 are not the image of any element in X1 under f1.
8 MATHEMATICS

he
is
bl
pu
Fig 1.2 (i) to (iv)
be T

Remark f : X → Y is onto if and only if Range of f = Y.


re
Definition 7 A function f : X → Y is said to be one-one and onto (or bijective), if f is
o R

both one-one and onto.


The function f4 in Fig 1.2 (iv) is one-one and onto.
tt E

Example 7 Let A be the set of all 50 students of Class X in a school. Let f : A → N be


function defined by f (x) = roll number of the student x. Show that f is one-one
C

but not onto.


Solution No two different students of the class can have same roll number. Therefore,
no N

f must be one-one. We can assume without any loss of generality that roll numbers of
students are from 1 to 50. This implies that 51 in N is not roll number of any student of
the class, so that 51 can not be image of any element of X under f. Hence, f is not onto.
©

Example 8 Show that the function f : N → N, given by f (x) = 2x, is one-one but not
onto.
Solution The function f is one-one, for f (x1) = f (x2) ⇒ 2x1 = 2x2 ⇒ x1 = x2. Further,
f is not onto, as for 1 ∈ N, there does not exist any x in N such that f (x) = 2x = 1.
RELATIONS AND FUNCTIONS 9

Example 9 Prove that the function f : R → R, given by f (x) = 2x, is one-one and onto.
Solution f is one-one, as f (x1) = f (x2) ⇒ 2x1 = 2x2 ⇒ x1 = x2. Also, given any real
y y y
number y in R, there exists in R such that f ( ) = 2 . ( ) = y. Hence, f is onto.
2 2 2

he
is
bl
pu
Fig 1.3
be T

Example 10 Show that the function f : N → N, given by f (1) = f (2) = 1 and f (x) = x – 1,
re
o R

for every x > 2, is onto but not one-one.


Solution f is not one-one, as f (1) = f (2) = 1. But f is onto, as given any y ∈ N, y ≠ 1,
we can choose x as y + 1 such that f (y + 1) = y + 1 – 1 = y. Also for 1 ∈ N, we
tt E

have f (1) = 1.
Example 11 Show that the function f : R → R,
C

defined as f (x) = x2, is neither one-one nor onto.


Solution Since f (– 1) = 1 = f (1), f is not one-
no N

one. Also, the element – 2 in the co-domain R is


not image of any element x in the domain R
(Why?). Therefore f is not onto.
©

Example 12 Show that f : N → N, given by


x 1,if x is odd,
f ( x)
x 1,if x is even
is both one-one and onto. Fig 1.4
10 MATHEMATICS

Solution Suppose f (x1) = f (x2). Note that if x1 is odd and x2 is even, then we will have
x1 + 1 = x2 – 1, i.e., x2 – x1 = 2 which is impossible. Similarly, the possibility of x1 being
even and x2 being odd can also be ruled out, using the similar argument. Therefore,
both x1 and x2 must be either odd or even. Suppose both x1 and x2 are odd. Then
f (x1) = f (x2) ⇒ x1 + 1 = x2 + 1 ⇒ x1 = x2. Similarly, if both x1 and x2 are even, then also
f (x1) = f (x2) ⇒ x1 – 1 = x2 – 1 ⇒ x1 = x2. Thus, f is one-one. Also, any odd number
2r + 1 in the co-domain N is the image of 2r + 2 in the domain N and any even number

he
2r in the co-domain N is the image of 2r – 1 in the domain N. Thus, f is onto.
Example 13 Show that an onto function f : {1, 2, 3} → {1, 2, 3} is always one-one.
Solution Suppose f is not one-one. Then there exists two elements, say 1 and 2 in the

is
domain whose image in the co-domain is same. Also, the image of 3 under f can be
only one element. Therefore, the range set can have at the most two elements of the
co-domain {1, 2, 3}, showing that f is not onto, a contradiction. Hence, f must be one-one.

bl
Example 14 Show that a one-one function f : {1, 2, 3} → {1, 2, 3} must be onto.
Solution Since f is one-one, three elements of {1, 2, 3} must be taken to 3 different
pu
elements of the co-domain {1, 2, 3} under f. Hence, f has to be onto.
Remark The results mentioned in Examples 13 and 14 are also true for an arbitrary
be T

finite set X, i.e., a one-one function f : X → X is necessarily onto and an onto map
f : X → X is necessarily one-one, for every finite set X. In contrast to this, Examples 8
re
o R

and 10 show that for an infinite set, this may not be true. In fact, this is a characteristic
difference between a finite and an infinite set.
tt E

EXERCISE 1.2
1
1. Show that the function f : R∗ → R∗ defined by f (x) = is one-one and onto,
C

x
where R∗ is the set of all non-zero real numbers. Is the result true, if the domain
R∗ is replaced by N with co-domain being same as R∗?
no N

2. Check the injectivity and surjectivity of the following functions:


(i) f : N → N given by f (x) = x2
(ii) f : Z → Z given by f (x) = x2
©

(iii) f : R → R given by f (x) = x2


(iv) f : N → N given by f (x) = x3
(v) f : Z → Z given by f (x) = x3
3. Prove that the Greatest Integer Function f : R → R, given by f (x) = [x], is neither
one-one nor onto, where [x] denotes the greatest integer less than or equal to x.
RELATIONS AND FUNCTIONS 11

4. Show that the Modulus Function f : R → R, given by f (x) = | x |, is neither one-


one nor onto, where | x | is x, if x is positive or 0 and | x | is – x, if x is negative.
5. Show that the Signum Function f : R → R, given by

1, if x 0
f ( x) 0, if x 0

he
–1, if x 0

is neither one-one nor onto.


6. Let A = {1, 2, 3}, B = {4, 5, 6, 7} and let f = {(1, 4), (2, 5), (3, 6)} be a function

is
from A to B. Show that f is one-one.
7. In each of the following cases, state whether the function is one-one, onto or

bl
bijective. Justify your answer.
pu(i) f : R → R defined by f (x) = 3 – 4x
(ii) f : R → R defined by f (x) = 1 + x2
8. Let A and B be sets. Show that f : A × B → B × A such that f (a, b) = (b, a) is
be T

bijective function.
re
n 1
o R

, if n is odd
2
9. Let f : N → N be defined by f (n) = for all n ∈ N.
n
, if n is even
tt E

2
State whether the function f is bijective. Justify your answer.
C

10. Let A = R – {3} and B = R – {1}. Consider the function f : A → B defined by


⎛ x−2⎞
f (x) = ⎜ ⎟ . Is f one-one and onto? Justify your answer.
no N

⎝ x−3⎠
11. Let f : R → R be defined as f(x) = x4. Choose the correct answer.
(A) f is one-one onto (B) f is many-one onto
©

(C) f is one-one but not onto (D) f is neither one-one nor onto.
12. Let f : R → R be defined as f (x) = 3x. Choose the correct answer.
(A) f is one-one onto (B) f is many-one onto
(C) f is one-one but not onto (D) f is neither one-one nor onto.
12 MATHEMATICS

1.4 Composition of Functions and Invertible Function


In this section, we will study composition of functions and the inverse of a bijective
function. Consider the set A of all students, who appeared in Class X of a Board
Examination in 2006. Each student appearing in the Board Examination is assigned a
roll number by the Board which is written by the students in the answer script at the
time of examination. In order to have confidentiality, the Board arranges to deface the

he
roll numbers of students in the answer scripts and assigns a fake code number to each
roll number. Let B ⊂ N be the set of all roll numbers and C ⊂ N be the set of all code
numbers. This gives rise to two functions f : A → B and g : B → C given by f (a) = the
roll number assigned to the student a and g (b) = the code number assigned to the roll

is
number b. In this process each student is assigned a roll number through the function f
and each roll number is assigned a code number through the function g. Thus, by the
combination of these two functions, each student is eventually attached a code number.

bl
This leads to the following definition:
Definition 8 Let f : A → B and g : B → C be two functions. Then the composition of
pu
f and g, denoted by gof, is defined as the function gof : A → C given by
gof (x) = g(f (x)), ∀ x ∈ A.
be T
re
o R
tt E

Fig 1.5
C

Example 15 Let f : {2, 3, 4, 5} → {3, 4, 5, 9} and g : {3, 4, 5, 9} → {7, 11, 15} be


functions defined as f (2) = 3, f (3) = 4, f (4) = f (5) = 5 and g (3) = g (4) = 7 and
no N

g (5) = g (9) = 11. Find gof.


Solution We have gof (2) = g (f (2)) = g (3) = 7, gof (3) = g (f (3)) = g (4) = 7,
gof (4) = g (f (4)) = g (5) = 11 and gof (5) = g (5) = 11.
©

Example 16 Find gof and fog, if f : R → R and g : R → R are given by f (x) = cos x
and g (x) = 3x2. Show that gof ≠ fog.
Solution We have gof (x) = g (f (x)) = g (cos x) = 3 (cos x)2 = 3 cos2 x. Similarly,
fog (x) = f (g (x)) = f (3x2) = cos (3x2). Note that 3cos2 x ≠ cos 3x2, for x = 0. Hence,
gof ≠ fog.
RELATIONS AND FUNCTIONS 13

⎧7 ⎫ ⎧3⎫ 3x + 4
Example 17 Show that if f : R − ⎨ ⎬ → R − ⎨ ⎬ is defined by f ( x) = and
⎩5⎭ ⎩5⎭ 5x − 7

⎧3⎫ ⎧7 ⎫ 7x + 4
g : R − ⎨ ⎬ → R − ⎨ ⎬ is defined by g ( x) = , then fog = IA and gof = IB, where,
⎩5 ⎭ ⎩5 ⎭ 5x − 3

he
⎧3⎫ ⎧7 ⎫
A = R – ⎨ ⎬ , B = R – ⎨ ⎬ ; IA (x) = x, ∀ x ∈ A, IB (x) = x, ∀ x ∈ B are called identity
⎩ ⎭
5 ⎩5⎭
functions on sets A and B, respectively.
Solution We have

is
⎛ (3 x + 4) ⎞
7⎜ +4
⎛ 3x + 4 ⎞ (5 x − 7) ⎟⎠ 21x + 28 + 20 x − 28 41x

bl
gof ( x) = g ⎜ ⎝ = =x
⎟= =
⎝ 5x − 7 ⎠ ⎛ (3 x + 4) ⎞ 15 x + 20 − 15 x + 21 41
5⎜ ⎟−3
⎝ (5 x − 7) ⎠
pu
⎛ (7 x + 4) ⎞
3⎜ +4
⎛ 7x + 4 ⎞ (5 x − 3) ⎟⎠ 21x + 12 + 20 x − 12 41x
be T

⎝ = =x
Similarly, fog ( x) = f ⎜ ⎟= =
⎝ 5x − 3 ⎠ ⎛ (7 x + 4) ⎞ 35 x + 20 − 35 x + 21 41
re
5⎜ ⎟−7
⎝ (5 x − 3) ⎠
o R

Thus, gof (x) = x, ∀ x ∈ B and fog (x) = x, ∀ x ∈ A, which implies that gof = IB
tt E

and fog = IA.

Example 18 Show that if f : A → B and g : B → C are one-one, then gof : A → C is


C

also one-one.
Solution Suppose gof (x1) = gof (x2)
no N

⇒ g (f (x1)) = g(f (x 2))


⇒ f (x1) = f (x2), as g is one-one
⇒ x1 = x2, as f is one-one
©

Hence, gof is one-one.

Example 19 Show that if f : A → B and g : B → C are onto, then gof : A → C is


also onto.
Solution Given an arbitrary element z ∈ C, there exists a pre-image y of z under g
such that g (y) = z, since g is onto. Further, for y ∈ B, there exists an element x in A
14 MATHEMATICS

with f (x) = y, since f is onto. Therefore, gof (x) = g (f (x)) = g (y) = z, showing that gof
is onto.
Example 20 Consider functions f and g such that composite gof is defined and is one-
one. Are f and g both necessarily one-one.
Solution Consider f : {1, 2, 3, 4} → {1, 2, 3, 4, 5, 6} defined as f (x) = x, ∀ x and
g : {1, 2, 3, 4, 5, 6} → {1, 2, 3, 4, 5, 6} as g (x) = x, for x = 1, 2, 3, 4 and g (5) = g (6) = 5.

he
Then, gof (x) = x ∀ x, which shows that gof is one-one. But g is clearly not one-one.
Example 21 Are f and g both necessarily onto, if gof is onto?
Solution Consider f : {1, 2, 3, 4} → {1, 2, 3, 4} and g : {1, 2, 3, 4} → {1, 2, 3} defined

is
as f (1) = 1, f (2) = 2, f (3) = f (4) = 3, g (1) = 1, g (2) = 2 and g (3) = g (4) = 3. It can be
seen that gof is onto but f is not onto.

bl
Remark It can be verified in general that gof is one-one implies that f is one-one.
Similarly, gof is onto implies that g is onto.
Now, we would like to have close look at the functions f and g described in the
pu
beginning of this section in reference to a Board Examination. Each student appearing
in Class X Examination of the Board is assigned a roll number under the function f and
each roll number is assigned a code number under g. After the answer scripts are
be T

examined, examiner enters the mark against each code number in a mark book and
submits to the office of the Board. The Board officials decode by assigning roll number
re
back to each code number through a process reverse to g and thus mark gets attached
o R

to roll number rather than code number. Further, the process reverse to f assigns a roll
number to the student having that roll number. This helps in assigning mark to the
tt E

student scoring that mark. We observe that while composing f and g, to get gof, first f
and then g was applied, while in the reverse process of the composite gof, first the
reverse process of g is applied and then the reverse process of f.
C

Example 22 Let f : {1, 2, 3} → {a, b, c} be one-one and onto function given by


f (1) = a, f (2) = b and f (3) = c. Show that there exists a function g : {a, b, c} → {1, 2, 3}
no N

such that gof = IX and fog = IY, where, X = {1, 2, 3} and Y = {a, b, c}.
Solution Consider g : {a, b, c} → {1, 2, 3} as g (a) = 1, g (b) = 2 and g (c) = 3. It is
easy to verify that the composite gof = IX is the identity function on X and the composite
fog = IY is the identity function on Y.
©

Remark The interesting fact is that the result mentioned in the above example is true
for an arbitrary one-one and onto function f : X → Y. Not only this, even the converse
is also true , i.e., if f : X → Y is a function such that there exists a function g : Y → X
such that gof = IX and fog = IY, then f must be one-one and onto.
The above discussion, Example 22 and Remark lead to the following definition:
RELATIONS AND FUNCTIONS 15

Definition 9 A function f : X → Y is defined to be invertible, if there exists a function


g : Y → X such that gof = IX and fog = IY. The function g is called the inverse of f and
is denoted by f –1.
Thus, if f is invertible, then f must be one-one and onto and conversely, if f is
one-one and onto, then f must be invertible. This fact significantly helps for proving a
function f to be invertible by showing that f is one-one and onto, specially when the

he
actual inverse of f is not to be determined.
Example 23 Let f : N → Y be a function defined as f (x) = 4x + 3, where,
Y = {y ∈ N : y = 4x + 3 for some x ∈ N }. Show that f is invertible. Find the inverse.

is
Solution Consider an arbitrary element y of Y. By the definition of Y, y = 4x + 3,
( y − 3)
for some x in the domain N . This shows that x = . Define g : Y → N by

bl
4
( y − 3) (4 x + 3 − 3)
g ( y) = . Now, gof (x) = g (f (x)) = g (4x + 3) = = x and
pu 4 4

⎛ ( y − 3) ⎞ 4 ( y − 3)
be T

fog (y) = f (g (y)) = f ⎜ ⎟= + 3 = y – 3 + 3 = y. This shows that gof = IN


⎝ 4 ⎠ 4
re
o R

and fog = IY, which implies that f is invertible and g is the inverse of f.
Example 24 Let Y = {n2 : n ∈ N } ⊂ N . Consider f : N → Y as f (n) = n2. Show that
tt E

f is invertible. Find the inverse of f.


Solution An arbitrary element y in Y is of the form n2, for some n ∈ N . This
C

implies that n = y . This gives a function g : Y → N , defined by g (y) = y . Now,

( y)=( y)
2
= y , which shows that
no N

gof (n) = g (n2) = n 2 = n and fog (y) = f


gof = IN and fog = IY. Hence, f is invertible with f –1 = g.
Example 25 Let f : N → R be a function defined as f (x) = 4x2 + 12x + 15. Show that
f : N → S, where, S is the range of f, is invertible. Find the inverse of f.
©

Solution Let y be an arbitrary element of range f. Then y = 4x2 + 12x + 15, for some

y 6 3
x in N, which implies that y = (2x + 3)2 + 6. This gives x , as y ≥ 6.
2
16 MATHEMATICS

y 6 3
Let us define g : S → N by g (y) = .
2
Now gof (x) = g (f (x)) = g (4x2 + 12x + 15) = g ((2x + 3)2 + 6)

=
(( (2 x + 3) 2 + 6 − 6 − 3 ) ) ( 2 x + 3 − 3)
= =x

he
2 2

(( )
y − 6) −3 ⎞ ⎛ 2 (( y − 6) −3 ) + 3 ⎞⎟
2

and fog (y) = f ⎜⎜ ⎟⎟ = ⎜⎜ ⎟ +6
⎝ 2 ⎠ ⎝ 2 ⎠

is
(( y − 6) −3+ 3 )) + 6 = ( y − 6 ) + 6 = y – 6 + 6 = y.
2 2
=

bl
Hence, gof = IN and fog =IS. This implies that f is invertible with f –1 = g.

Example 26 Consider f : N → N, g : N → N and h : N → R defined as f (x) = 2x,


pu
g (y) = 3y + 4 and h (z) = sin z, ∀ x, y and z in N. Show that ho(gof ) = (hog) of.
Solution We have
be T

ho(gof) (x) = h(gof (x)) = h(g (f (x))) = h (g (2x))


re
= h(3(2x) + 4) = h(6x + 4) = sin (6x + 4) x N.
o R

Also, ((hog) o f ) (x) = (hog) ( f (x)) = (hog) (2x) = h ( g (2x))


= h(3(2x) + 4) = h(6x + 4) = sin (6x + 4), ∀ x ∈ N.
tt E

This shows that ho(gof) = (hog) o f.


This result is true in general situation as well.
C

Theorem 1 If f : X → Y, g : Y → Z and h : Z → S are functions, then


ho(gof ) = (hog) o f.
no N

Proof We have
ho(gof ) (x) = h(gof (x)) = h(g (f (x))), ∀ x in X
and (hog) of (x) = hog(f (x)) = h(g (f (x))), ∀ x in X.
©

Hence, ho(gof) = (hog) o f.


Example 27 Consider f : {1, 2, 3} → {a, b, c} and g : {a, b, c} → {apple, ball, cat}
defined as f (1) = a, f (2) = b, f (3) = c, g(a) = apple, g(b) = ball and g(c) = cat.
Show that f, g and gof are invertible. Find out f –1, g–1 and (gof)–1 and show that
(gof) –1 = f –1o g–1.
RELATIONS AND FUNCTIONS 17

Solution Note that by definition, f and g are bijective functions. Let


f –1: {a, b, c} → (1, 2, 3} and g–1 : {apple, ball, cat} → {a, b, c} be defined as
f –1{a} = 1, f –1{b} = 2, f –1{c} = 3, g –1{apple} = a, g –1{ball} = b and g –1{cat} = c.
It is easy to verify that f –1 o f = I{1, 2, 3}, f o f –1 = I{a, b, c}, g –1og = I{a, b, c} and g o g–1 = ID,
where, D = {apple, ball, cat}. Now, gof : {1, 2, 3} → {apple, ball, cat} is given by
gof (1) = apple, gof (2) = ball, gof (3) = cat. We can define
(gof)–1 : {apple, ball, cat} → {1, 2, 3} by (gof)–1 (apple) = 1, (gof)–1 (ball) = 2 and

he
(g o f)–1 (cat) = 3. It is easy to see that (g o f)–1 o (g o f) = I{1, 2, 3} and
(gof) o (gof)–1 = ID. Thus, we have seen that f, g and gof are invertible.
Now, f –1og–1 (apple)= f –1(g–1(apple)) = f –1(a) = 1 = (gof)–1 (apple)
f –1og–1 (ball) = f –1(g–1(ball)) = f –1(b) = 2 = (gof)–1 (ball) and

is
f –1og–1 (cat) = f –1(g–1(cat)) = f –1(c) = 3 = (gof)–1 (cat).
Hence (gof)–1 = f –1og –1.

bl
The above result is true in general situation also.
Theorem 2 Let f : X → Y and g : Y → Z be two invertible functions. Then gof is also
pu
invertible with (gof)–1 = f –1og–1.
Proof To show that gof is invertible with (gof)–1 = f –1og–1, it is enough to show that
( f –1og–1)o(gof) = IX and (gof)o( f –1og–1) = IZ.
be T

Now, (f –1og–1) o (gof) = ((f –1og–1) og) of, by Theorem 1


re
= (f –1o(g–1og)) of, by Theorem 1
o R

= (f –1 o IY) of, by definition of g–1


= IX.
tt E

Similarly, it can be shown that (gof ) o (f –1 o g –1) = IZ.

Example 28 Let S = {1, 2, 3}. Determine whether the functions f : S → S defined as


C

below have inverses. Find f –1, if it exists.


(a) f = {(1, 1), (2, 2), (3, 3)}
(b) f = {(1, 2), (2, 1), (3, 1)}
no N

(c) f = {(1, 3), (3, 2), (2, 1)}

Solution
©

(a) It is easy to see that f is one-one and onto, so that f is invertible with the inverse
f –1 of f given by f –1 = {(1, 1), (2, 2), (3, 3)} = f.
(b) Since f (2) = f (3) = 1, f is not one-one, so that f is not invertible.
(c) It is easy to see that f is one-one and onto, so that f is invertible with
f –1 = {(3, 1), (2, 3), (1, 2)}.
18 MATHEMATICS

EXERCISE 1.3
1. Let f : {1, 3, 4} → {1, 2, 5} and g : {1, 2, 5} → {1, 3} be given by
f = {(1, 2), (3, 5), (4, 1)} and g = {(1, 3), (2, 3), (5, 1)}. Write down gof.
2. Let f, g and h be functions from R to R. Show that
(f + g) o h = foh + goh

he
(f . g) o h = (foh) . (goh)
3. Find gof and fog, if
(i) f (x) = | x | and g(x) = | 5x – 2 |

is
1

(ii) f (x) = 8x and g(x) = x 3 .


3

bl
(4 x + 3) 2 2
4. If f (x) = , x ≠ , show that fof (x) = x, for all x ≠ . What is the
(6 x − 4) 3 3
inverse of f ?
pu
5. State with reason whether following functions have inverse
(i) f : {1, 2, 3, 4} → {10} with
be T

f = {(1, 10), (2, 10), (3, 10), (4, 10)}


re
(ii) g : {5, 6, 7, 8} → {1, 2, 3, 4} with
o R

g = {(5, 4), (6, 3), (7, 4), (8, 2)}


(iii) h : {2, 3, 4, 5} → {7, 9, 11, 13} with
tt E

h = {(2, 7), (3, 9), (4, 11), (5, 13)}


C

x
6. Show that f : [–1, 1] → R, given by f (x) = is one-one. Find the inverse
( x + 2)
of the function f : [–1, 1] → Range f.
no N

x 2y
(Hint: For y ∈ Range f, y = f (x) = , for some x in [–1, 1], i.e., x = )
x+2 (1 − y )
©

7. Consider f : R → R given by f (x) = 4x + 3. Show that f is invertible. Find the


inverse of f.
8. Consider f : R+ → [4, ∞) given by f (x) = x2 + 4. Show that f is invertible with the
inverse f –1 of f given by f –1(y) = y − 4 , where R+ is the set of all non-negative
real numbers.
RELATIONS AND FUNCTIONS 19

9. Consider f : R+ → [– 5, ∞) given by f (x) = 9x2 + 6x – 5. Show that f is invertible


⎛ ( y + 6 ) −1⎞
with f –1(y) = ⎜ ⎟.
⎝ 3 ⎠
10. Let f : X → Y be an invertible function. Show that f has unique inverse.
(Hint: suppose g1 and g2 are two inverses of f. Then for all y ∈ Y,

he
fog1(y) = 1Y(y) = fog2(y). Use one-one ness of f).
11. Consider f : {1, 2, 3} → {a, b, c} given by f (1) = a, f (2) = b and f (3) = c. Find
f –1 and show that (f –1)–1 = f.
12. Let f : X → Y be an invertible function. Show that the inverse of f –1 is f, i.e.,

is
(f –1)–1 = f.
1

bl
13. If f : R → R be given by f (x) = (3 − 3 3
x ) , then fof (x) is
1
(A) x 3 (B) x 3 (C) x (D) (3 – x3).
pu
⎧ 4⎫ 4x
14. Let f : R – ⎨− ⎬ → R be a function defined as f (x) = . The inverse of
be T

⎩ 3⎭ 3x + 4
re
⎧ 4⎫
f is the map g : Range f → R – ⎨− ⎬ given by
o R

⎩ 3⎭

3y 4y
tt E

(A) g ( y) = (B) g ( y) =
3 − 4y 4 − 3y
C

4y 3y
(C) g ( y) = (D) g ( y) =
3 − 4y 4 − 3y
no N

1.5 Binary Operations


Right from the school days, you must have come across four fundamental operations
namely addition, subtraction, multiplication and division. The main feature of these
©

operations is that given any two numbers a and b, we associate another number a + b
a
or a – b or ab or , b ≠ 0. It is to be noted that only two numbers can be added or
b
multiplied at a time. When we need to add three numbers, we first add two numbers
and the result is then added to the third number. Thus, addition, multiplication, subtraction
20 MATHEMATICS

and division are examples of binary operation, as ‘binary’ means two. If we want to
have a general definition which can cover all these four operations, then the set of
numbers is to be replaced by an arbitrary set X and then general binary operation is
nothing but association of any pair of elements a, b from X to another element of X.
This gives rise to a general definition as follows:
Definition 10 A binary operation ∗ on a set A is a function ∗ : A × A → A. We denote

he
∗ (a, b) by a ∗ b.
Example 29 Show that addition, subtraction and multiplication are binary operations
on R, but division is not a binary operation on R. Further, show that division is a binary
operation on the set R∗ of nonzero real numbers.

is
Solution + : R × R → R is given by
(a, b) → a + b

bl
– : R × R → R is given by
(a, b) → a – b
× : R × R → R is given by
pu (a, b) → ab
Since ‘+’, ‘–’ and ‘×’ are functions, they are binary operations on R.
be T

a
But ÷: R × R → R, given by (a, b) → , is not a function and hence not a binary
re
b
o R

a
operation, as for b = 0, is not defined.
b
tt E

a
However, ÷ : R∗ × R∗ → R∗, given by (a, b) → is a function and hence a
b
C

binary operation on R∗.


Example 30 Show that subtraction and division are not binary operations on N.
no N

Solution – : N × N → N, given by (a, b) → a – b, is not binary operation, as the image


of (3, 5) under ‘–’ is 3 – 5 = – 2 ∉ N. Similarly, ÷ : N × N → N, given by (a, b) → a ÷ b
3
is not a binary operation, as the image of (3, 5) under ÷ is 3 ÷ 5 =
∉ N.
©

5
Example 31 Show that ∗ : R × R → R given by (a, b) → a + 4b2 is a binary
operation.
Solution Since ∗ carries each pair (a, b) to a unique element a + 4b2 in R, ∗ is a binary
operation on R.
RELATIONS AND FUNCTIONS 21

Example 32 Let P be the set of all subsets of a given set X. Show that ∪ : P × P → P
given by (A, B) → A ∪ B and ∩ : P × P → P given by (A, B) → A ∩ B are binary
operations on the set P.
Solution Since union operation ∪ carries each pair (A, B) in P × P to a unique element
A ∪ B in P, ∪ is binary operation on P. Similarly, the intersection operation ∩ carries
each pair (A, B) in P × P to a unique element A ∩ B in P, ∩ is a binary operation on P.

he
Example 33 Show that the ∨ : R × R → R given by (a, b) → max {a, b} and the
∧ : R × R → R given by (a, b) → min {a, b} are binary operations.
Solution Since ∨ carries each pair (a, b) in R × R to a unique element namely
maximum of a and b lying in R, ∨ is a binary operation. Using the similar argument,

is
one can say that ∧ is also a binary operation.
Remark ∨ (4, 7) = 7, ∨ (4, – 7) = 4, ∧ (4, 7) = 4 and ∧ (4, – 7) = – 7.

bl
When number of elements in a set A is small, we can express a binary operation ∗ on
the set A through a table called the operation table for the operation ∗. For example
consider A = {1, 2, 3}. Then, the operation ∨ on A defined in Example 33 can be expressed
pu
by the following operation table (Table 1.1) . Here, ∨ (1, 3) = 3, ∨ (2, 3) = 3, ∨ (1, 2) = 2.
Table 1.1
be T
re
o R
tt E

Here, we are having 3 rows and 3 columns in the operation table with (i, j) the
entry of the table being maximum of ith and jth elements of the set A. This can be
C

generalised for general operation ∗ : A × A → A. If A = {a1, a2, ..., an}. Then the
operation table will be having n rows and n columns with (i, j)th entry being ai ∗ aj.
Conversely, given any operation table having n rows and n columns with each entry
no N

being an element of A = {a1, a2, ..., an}, we can define a binary operation ∗ : A × A → A
given by ai ∗ aj = the entry in the ith row and jth column of the operation table.
One may note that 3 and 4 can be added in any order and the result is same, i.e.,
3 + 4 = 4 + 3, but subtraction of 3 and 4 in different order give different results, i.e.,
©

3 – 4 ≠ 4 – 3. Similarly, in case of multiplication of 3 and 4, order is immaterial, but


division of 3 and 4 in different order give different results. Thus, addition and
multiplication of 3 and 4 are meaningful, but subtraction and division of 3 and 4 are
meaningless. For subtraction and division we have to write ‘subtract 3 from 4’, ‘subtract
4 from 3’, ‘divide 3 by 4’ or ‘divide 4 by 3’.
22 MATHEMATICS

This leads to the following definition:


Definition 11 A binary operation ∗ on the set X is called commutative, if a ∗ b = b ∗ a,
for every a, b ∈ X.
Example 34 Show that + : R × R → R and × : R × R → R are commutative binary
operations, but – : R × R → R and ÷ : R∗ × R∗ → R∗ are not commutative.
Solution Since a + b = b + a and a × b = b × a, ∀ a, b ∈ R, ‘+’ and ‘×’ are

he
commutative binary operation. However, ‘–’ is not commutative, since 3 – 4 ≠ 4 – 3.
Similarly, 3 ÷ 4 ≠ 4 ÷ 3 shows that ‘÷’ is not commutative.
Example 35 Show that ∗ : R × R → R defined by a ∗ b = a + 2b is not commutative.

is
Solution Since 3 ∗ 4 = 3 + 8 = 11 and 4 ∗ 3 = 4 + 6 = 10, showing that the operation ∗
is not commutative.
If we want to associate three elements of a set X through a binary operation on X,

bl
we encounter a natural problem. The expression a ∗ b ∗ c may be interpreted as
(a ∗ b) ∗ c or a ∗ (b ∗ c) and these two expressions need not be same. For example,
(8 – 5) – 2 ≠ 8 – (5 – 2). Therefore, association of three numbers 8, 5 and 3 through
pu
the binary operation ‘subtraction’ is meaningless, unless bracket is used. But in case
of addition, 8 + 5 + 2 has the same value whether we look at it as ( 8 + 5) + 2 or as
8 + (5 + 2). Thus, association of 3 or even more than 3 numbers through addition is
be T

meaningful without using bracket. This leads to the following:


re
Definition 12 A binary operation ∗ : A × A → A is said to be associative if
o R

(a ∗ b) ∗ c = a ∗ (b ∗ c), ∀ a, b, c, ∈ A.
Example 36 Show that addition and multiplication are associative binary operation on
tt E

R. But subtraction is not associative on R. Division is not associative on R∗.


Solution Addition and multiplication are associative, since (a + b) + c = a + (b + c) and
C

(a × b) × c = a × (b × c) ∀ a, b, c ∈ R. However, subtraction and division are not


associative, as (8 – 5) – 3 ≠ 8 – (5 – 3) and (8 ÷ 5) ÷ 3 ≠ 8 ÷ (5 ÷ 3).
Example 37 Show that ∗ : R × R → R given by a ∗ b → a + 2b is not associative.
no N

Solution The operation ∗ is not associative, since


(8 ∗ 5) ∗ 3 = (8 + 10) ∗ 3 = (8 + 10) + 6 = 24,
while 8 ∗ (5 ∗ 3) = 8 ∗ (5 + 6) = 8 ∗ 11 = 8 + 22 = 30.
©

Remark Associative property of a binary operation is very important in the sense that
with this property of a binary operation, we can write a1 ∗ a2 ∗ ... ∗ an which is not
ambiguous. But in absence of this property, the expression a1 ∗ a2 ∗ ... ∗ an is ambiguous
unless brackets are used. Recall that in the earlier classes brackets were used whenever
subtraction or division operations or more than one operation occurred.
RELATIONS AND FUNCTIONS 23

For the binary operation ‘+’ on R, the interesting feature of the number zero is that
a + 0 = a = 0 + a, i.e., any number remains unaltered by adding zero. But in case of
multiplication, the number 1 plays this role, as a × 1 = a = 1 × a, ∀ a in R. This leads
to the following definition:
Definition 13 Given a binary operation ∗ : A × A → A, an element e ∈ A, if it exists,
is called identity for the operation ∗, if a ∗ e = a = e ∗ a, ∀ a ∈ A.

he
Example 38 Show that zero is the identity for addition on R and 1 is the identity for
multiplication on R. But there is no identity element for the operations
– : R × R → R and ÷ : R∗ × R∗ → R∗.

is
Solution a + 0 = 0 + a = a and a × 1 = a = 1 × a, ∀ a ∈ R implies that 0 and 1 are
identity elements for the operations ‘+’ and ‘×’ respectively. Further, there is no element

bl
e in R with a – e = e – a, ∀ a. Similarly, we can not find any element e in R∗ such that
a ÷ e = e ÷ a, ∀ a in R∗. Hence, ‘–’ and ‘÷’ do not have identity element.
Remark Zero is identity for the addition operation on R but it is not identity for the
pu
addition operation on N, as 0 ∉ N. In fact the addition operation on N does not have
any identity.
be T

One further notices that for the addition operation + : R × R → R, given any
a ∈ R, there exists – a in R such that a + (– a) = 0 (identity for ‘+’) = (– a) + a.
re
o R

1
Similarly, for the multiplication operation on R, given any a ≠ 0 in R, we can choose
a
tt E

1 1
in R such that a × = 1(identity for ‘×’) = × a. This leads to the following definition:
a a
C

Definition 14 Given a binary operation ∗ : A × A → A with the identity element e in A,


an element a ∈ A is said to be invertible with respect to the operation ∗, if there exists
an element b in A such that a ∗ b = e = b ∗ a and b is called the inverse of a and is
no N

denoted by a–1.

Example 39 Show that – a is the inverse of a for the addition operation ‘+’ on R and
1
©

is the inverse of a ≠ 0 for the multiplication operation ‘×’ on R.


a
Solution As a + (– a) = a – a = 0 and (– a) + a = 0, – a is the inverse of a for addition.
1 1 1
Similarly, for a ≠ 0, a × = 1 = × a implies that is the inverse of a for multiplication.
a a a
24 MATHEMATICS

Example 40 Show that – a is not the inverse of a ∈ N for the addition operation + on
1
N and is not the inverse of a ∈ N for multiplication operation × on N, for a ≠ 1.
a
Solution Since – a ∉ N, – a can not be inverse of a for addition operation on N,
although – a satisfies a + (– a) = 0 = (– a) + a.

he
1
Similarly, for a ≠ 1 in N, ∉ N, which implies that other than 1 no element of N
a
has inverse for multiplication operation on N.
Examples 34, 36, 38 and 39 show that addition on R is a commutative and associative

is
binary operation with 0 as the identity element and – a as the inverse of a in R ∀ a.

bl
EXERCISE 1.4
1. Determine whether or not each of the definition of ∗ given below gives a binary
operation. In the event that ∗ is not a binary operation, give justification for this.
pu
(i) On Z+, define ∗ by a ∗ b = a – b
(ii) On Z+, define ∗ by a ∗ b = ab
be T

(iii) On R, define ∗ by a ∗ b = ab2


re
(iv) On Z+, define ∗ by a ∗ b = | a – b |
o R

(v) On Z+, define ∗ by a ∗ b = a


2. For each operation ∗ defined below, determine whether ∗ is binary, commutative
tt E

or associative.
(i) On Z, define a ∗ b = a – b
C

(ii) On Q, define a ∗ b = ab + 1
ab
no N

(iii) On Q, define a ∗ b =
2
(iv) On Z+, define a ∗ b = 2ab
(v) On Z+, define a ∗ b = ab
©

a
(vi) On R – {– 1}, define a ∗ b =
b +1
3. Consider the binary operation ∧ on the set {1, 2, 3, 4, 5} defined by
a ∧ b = min {a, b}. Write the operation table of the operation ∧ .
RELATIONS AND FUNCTIONS 25

4. Consider a binary operation ∗ on the set {1, 2, 3, 4, 5} given by the following


multiplication table (Table 1.2).
(i) Compute (2 ∗ 3) ∗ 4 and 2 ∗ (3 ∗ 4)
(ii) Is ∗ commutative?
(iii) Compute (2 ∗ 3) ∗ (4 ∗ 5).
(Hint: use the following table)
Table 1.2

he
is
bl
pu
5. Let ∗′ be the binary operation on the set {1, 2, 3, 4, 5} defined by
a ∗′ b = H.C.F. of a and b. Is the operation ∗′ same as the operation ∗ defined
in Exercise 4 above? Justify your answer.
be T

6. Let ∗ be the binary operation on N given by a ∗ b = L.C.M. of a and b. Find


re
(i) 5 ∗ 7, 20 ∗ 16 (ii) Is ∗ commutative?
o R

(iii) Is ∗ associative? (iv) Find the identity of ∗ in N


(v) Which elements of N are invertible for the operation ∗?
7. Is ∗ defined on the set {1, 2, 3, 4, 5} by a ∗ b = L.C.M. of a and b a binary
tt E

operation? Justify your answer.


8. Let ∗ be the binary operation on N defined by a ∗ b = H.C.F. of a and b.
C

Is ∗ commutative? Is ∗ associative? Does there exist identity for this binary


operation on N?
9. Let ∗ be a binary operation on the set Q of rational numbers as follows:
no N

(i) a ∗ b = a – b (ii) a ∗ b = a2 + b2
(iii) a ∗ b = a + ab (iv) a ∗ b = (a – b)2
ab
(v) a ∗ b = (vi) a ∗ b = ab2
©

4
Find which of the binary operations are commutative and which are associative.
10. Find which of the operations given above has identity.
11. Let A = N × N and ∗ be the binary operation on A defined by
(a, b) ∗ (c, d) = (a + c, b + d)
26 MATHEMATICS

Show that ∗ is commutative and associative. Find the identity element for ∗ on
A, if any.
12. State whether the following statements are true or false. Justify.
(i) For an arbitrary binary operation ∗ on a set N, a ∗ a = a ∀ a ∈ N.
(ii) If ∗ is a commutative binary operation on N, then a ∗ (b ∗ c) = (c ∗ b) ∗ a
13. Consider a binary operation ∗ on N defined as a ∗ b = a3 + b3. Choose the

he
correct answer.
(A) Is ∗ both associative and commutative?
(B) Is ∗ commutative but not associative?

is
(C) Is ∗ associative but not commutative?
(D) Is ∗ neither commutative nor associative?

bl
Miscellaneous Examples
Example 41 If R1 and R2 are equivalence relations in a set A, show that R1 ∩ R2 is
pu
also an equivalence relation.
Solution Since R1 and R2 are equivalence relations, (a, a) ∈ R1, and (a, a) ∈ R2 ∀ a ∈ A.
be T

This implies that (a, a) ∈ R1 ∩ R2, ∀ a, showing R1 ∩ R2 is reflexive. Further,


(a, b) ∈ R1 ∩ R2 ⇒ (a, b) ∈ R1 and (a, b) ∈ R2 ⇒ (b, a) ∈ R1 and (b, a) ∈ R2 ⇒
re
(b, a) ∈ R1 ∩ R2, hence, R1 ∩ R2 is symmetric. Similarly, (a, b) ∈ R1 ∩ R2 and
o R

(b, c) ∈ R1 ∩ R2 ⇒ (a, c) ∈ R1 and (a, c) ∈ R2 ⇒ (a, c) ∈ R1 ∩ R2. This shows that


R1 ∩ R2 is transitive. Thus, R1 ∩ R2 is an equivalence relation.
tt E

Example 42 Let R be a relation on the set A of ordered pairs of positive integers


defined by (x, y) R (u, v) if and only if xv = yu. Show that R is an equivalence relation.
C

Solution Clearly, (x, y) R (x, y), ∀ (x, y) ∈ A, since xy = yx. This shows that R is
reflexive. Further, (x, y) R (u, v) ⇒ xv = yu ⇒ uy = vx and hence (u, v) R (x, y). This
no N

shows that R is symmetric. Similarly, (x, y) R (u, v) and (u, v) R (a, b) ⇒ xv = yu and
a a b a
ub = va ⇒ xv = yu ⇒ xv = yu ⇒ xb = ya and hence (x, y) R (a, b). Thus, R
u u v u
©

is transitive. Thus, R is an equivalence relation.


Example 43 Let X = {1, 2, 3, 4, 5, 6, 7, 8, 9}. Let R1 be a relation in X given
by R1 = {(x, y) : x – y is divisible by 3} and R2 be another relation on X given by
R2 = {(x, y): {x, y} ⊂ {1, 4, 7}} or {x, y} ⊂ {2, 5, 8} or {x, y} ⊂ {3, 6, 9}}. Show that
R1 = R2.
RELATIONS AND FUNCTIONS 27

Solution Note that the characteristic of sets {1, 4, 7}, {2, 5, 8} and {3, 6, 9} is
that difference between any two elements of these sets is a multiple of 3. Therefore,
(x, y) ∈ R1 ⇒ x – y is a multiple of 3 ⇒ {x, y} ⊂ {1, 4, 7} or {x, y} ⊂ {2, 5, 8}
or {x, y} ⊂ {3, 6, 9} ⇒ (x, y) ∈ R2. Hence, R1 ⊂ R2. Similarly, {x, y} ∈ R2 ⇒ {x, y}
⊂ {1, 4, 7} or {x, y} ⊂ {2, 5, 8} or {x, y} ⊂ {3, 6, 9} ⇒ x – y is divisible by
3 ⇒ {x, y} ∈ R1. This shows that R2 ⊂ R1. Hence, R1 = R2.

he
Example 44 Let f : X → Y be a function. Define a relation R in X given by
R = {(a, b): f(a) = f(b)}. Examine whether R is an equivalence relation or not.
Solution For every a ∈ X, (a, a) ∈ R, since f (a) = f (a), showing that R is reflexive.
Similarly, (a, b) ∈ R ⇒ f (a) = f (b) ⇒ f (b) = f (a) ⇒ (b, a) ∈ R. Therefore, R is

is
symmetric. Further, (a, b) ∈ R and (b, c) ∈ R ⇒ f (a) = f (b) and f (b) = f (c) ⇒ f (a)
= f (c) ⇒ (a, c) ∈ R, which implies that R is transitive. Hence, R is an equivalence
relation.

bl
Example 45 Determine which of the following binary operations on the set R are
associative and which are commutative.
pu ( a + b)
(a) a ∗ b = 1 ∀ a, b ∈ R (b) a ∗ b = ∀ a, b ∈ R
2
be T

Solution
re
(a) Clearly, by definition a ∗ b = b ∗ a = 1, ∀ a, b ∈ R. Also
o R

(a ∗ b) ∗ c = (1 ∗ c) =1 and a ∗ (b ∗ c) = a ∗ (1) = 1, ∀ a, b, c ∈ R. Hence


R is both associative and commutative.
tt E

a+b b+a
(b) a ∗ b = = = b ∗ a, shows that ∗ is commutative. Further,
2 2
C

⎛a+b⎞
(a ∗ b) ∗ c = ⎜ ⎟ ∗ c.
⎝ 2 ⎠
⎛a+b⎞
⎜ ⎟ + c a + b + 2c
no N

⎝ 2 ⎠ =
= .
2 4
⎛b+c⎞
But a ∗ (b ∗ c) = a ∗ ⎜ ⎟
©

⎝ 2 ⎠
b+c
a+
= 2 = 2a + b + c ≠ a + b + 2c in general.
2 4 4
Hence, ∗ is not associative.
28 MATHEMATICS

Example 46 Find the number of all one-one functions from set A = {1, 2, 3} to itself.
Solution One-one function from {1, 2, 3} to itself is simply a permutation on three
symbols 1, 2, 3. Therefore, total number of one-one maps from {1, 2, 3} to itself is
same as total number of permutations on three symbols 1, 2, 3 which is 3! = 6.
Example 47 Let A = {1, 2, 3}. Then show that the number of relations containing (1, 2)
and (2, 3) which are reflexive and transitive but not symmetric is three.

he
Solution The smallest relation R1 containing (1, 2) and (2, 3) which is reflexive and
transitive but not symmetric is {(1, 1), (2, 2), (3, 3), (1, 2), (2, 3), (1, 3)}. Now, if we add
the pair (2, 1) to R1 to get R2, then the relation R2 will be reflexive, transitive but not

is
symmetric. Similarly, we can obtain R3 by adding (3, 2) to R1 to get the desired relation.
However, we can not add two pairs (2, 1), (3, 2) or single pair (3, 1) to R1 at a time, as
by doing so, we will be forced to add the remaining pair in order to maintain transitivity

bl
and in the process, the relation will become symmetric also which is not required. Thus,
the total number of desired relations is three.
Example 48 Show that the number of equivalence relation in the set {1, 2, 3} containing
pu
(1, 2) and (2, 1) is two.
Solution The smallest equivalence relation R1 containing (1, 2) and (2, 1) is {(1, 1),
be T

(2, 2), (3, 3), (1, 2), (2, 1)}. Now we are left with only 4 pairs namely (2, 3), (3, 2),
(1, 3) and (3, 1). If we add any one, say (2, 3) to R1, then for symmetry we must add
re
o R

(3, 2) also and now for transitivity we are forced to add (1, 3) and (3, 1). Thus, the only
equivalence relation bigger than R1 is the universal relation. This shows that the total
number of equivalence relations containing (1, 2) and (2, 1) is two.
tt E

Example 49 Show that the number of binary operations on {1, 2} having 1 as identity
and having 2 as the inverse of 2 is exactly one.
C

Solution A binary operation ∗ on {1, 2} is a function from {1, 2} × {1, 2} to {1, 2}, i.e.,
a function from {(1, 1), (1, 2), (2, 1), (2, 2)} → {1, 2}. Since 1 is the identity for the
desired binary operation ∗, ∗ (1, 1) = 1, ∗ (1, 2) = 2, ∗ (2, 1) = 2 and the only choice
no N

left is for the pair (2, 2). Since 2 is the inverse of 2, i.e., ∗ (2, 2) must be equal to 1. Thus,
the number of desired binary operation is only one.
Example 50 Consider the identity function IN : N → N defined as IN (x) = x ∀ x ∈ N.
©

Show that although IN is onto but IN + IN : N → N defined as


(IN + IN) (x) = IN (x) + IN (x) = x + x = 2x is not onto.
Solution Clearly IN is onto. But IN + IN is not onto, as we can find an element 3
in the co-domain N such that there does not exist any x in the domain N with
(IN + IN) (x) = 2x = 3.
RELATIONS AND FUNCTIONS 29

⎡ π⎤
Example 51 Consider a function f : ⎢0, ⎥ → R given by f (x) = sin x and
⎣ 2⎦

π
g : ⎡⎢0, ⎤⎥ → R given by g(x) = cos x. Show that f and g are one-one, but f + g is not
⎣ 2⎦
one-one.

he
⎡ π⎤
Solution Since for any two distinct elements x1 and x2 in ⎢0, ⎥ , sin x1 ≠ sin x2 and
⎣ 2⎦
cos x1 ≠ cos x2, both f and g must be one-one. But (f + g) (0) = sin 0 + cos 0 = 1 and

is
⎛π⎞ π π
(f + g) ⎜ ⎟ = sin + cos = 1 . Therefore, f + g is not one-one.
⎝2⎠ 2 2

bl
pu Miscellaneous Exercise on Chapter 1
1. Let f : R → R be defined as f (x) = 10x + 7. Find the function g : R → R such
that g o f = f o g = 1R.
2. Let f : W → W be defined as f (n) = n – 1, if n is odd and f (n) = n + 1, if n is
be T

even. Show that f is invertible. Find the inverse of f. Here, W is the set of all
re
whole numbers.
o R

3. If f : R → R is defined by f(x) = x2 – 3x + 2, find f (f (x)).

x
tt E

4. Show that the function f : R → {x ∈ R : – 1 < x < 1} defined by f ( x) = ,


1+ | x |
x ∈ R is one one and onto function.
C

5. Show that the function f : R → R given by f (x) = x3 is injective.


6. Give examples of two functions f : N → Z and g : Z → Z such that g o f is
no N

injective but g is not injective.


(Hint : Consider f (x) = x and g (x) = | x |).
7. Give examples of two functions f : N → N and g : N → N such that g o f is onto
©

but f is not onto.

⎧ x − 1 if x > 1
(Hint : Consider f (x) = x + 1 and g ( x ) = ⎨
⎩ 1 if x = 1
8. Given a non empty set X, consider P(X) which is the set of all subsets of X.
30 MATHEMATICS

Define the relation R in P(X) as follows:


For subsets A, B in P(X), ARB if and only if A ⊂ B. Is R an equivalence relation
on P(X)? Justify your answer.
9. Given a non-empty set X, consider the binary operation ∗ : P(X) × P(X) → P(X)
given by A ∗ B = A ∩ B ∀ A, B in P(X), where P(X) is the power set of X.
Show that X is the identity element for this operation and X is the only invertible

he
element in P(X) with respect to the operation ∗.
10. Find the number of all onto functions from the set {1, 2, 3, ... , n} to itself.
11. Let S = {a, b, c} and T = {1, 2, 3}. Find F–1 of the following functions F from S
to T, if it exists.

is
(i) F = {(a, 3), (b, 2), (c, 1)} (ii) F = {(a, 2), (b, 1), (c, 1)}
12. Consider the binary operations ∗ : R × R → R and o : R × R → R defined as
a ∗b = |a – b| and a o b = a, ∀ a, b ∈ R. Show that ∗ is commutative but not

bl
associative, o is associative but not commutative. Further, show that ∀ a, b, c ∈ R,
a ∗ (b o c) = (a ∗ b) o (a ∗ c). [If it is so, we say that the operation ∗ distributes
over the operation o]. Does o distribute over ∗? Justify your answer.
pu
13. Given a non-empty set X, let ∗ : P(X) × P(X) → P(X) be defined as
A * B = (A – B) ∪ (B – A), ∀ A, B ∈ P(X). Show that the empty set φ is the
be T

identity for the operation ∗ and all the elements A of P(X) are invertible with
A–1 = A. (Hint : (A – φ) ∪ (φ – A) = A and (A – A) ∪ (A – A) = A ∗ A = φ).
re
o R

14. Define a binary operation ∗ on the set {0, 1, 2, 3, 4, 5} as


⎧ a + b, if a + b < 6
a ∗b = ⎨
tt E

⎩ a + b − 6 if a + b ≥ 6
Show that zero is the identity for this operation and each element a ≠ 0 of the set
is invertible with 6 – a being the inverse of a.
C

15. Let A = {– 1, 0, 1, 2}, B = {– 4, – 2, 0, 2} and f, g : A → B be functions defined


1
by f (x) = x2 – x, x ∈ A and g ( x) = 2 x − − 1, x ∈ A. Are f and g equal?
no N

2
Justify your answer. (Hint: One may note that two functions f : A → B and
g : A → B such that f (a) = g (a) ∀ a ∈ A, are called equal functions).
©

16. Let A = {1, 2, 3}. Then number of relations containing (1, 2) and (1, 3) which are
reflexive and symmetric but not transitive is
(A) 1 (B) 2 (C) 3 (D) 4
17. Let A = {1, 2, 3}. Then number of equivalence relations containing (1, 2) is
(A) 1 (B) 2 (C) 3 (D) 4
RELATIONS AND FUNCTIONS 31

18. Let f : R → R be the Signum Function defined as

⎧ 1, x > 0

f ( x) = ⎨ 0, x = 0
⎪−1, x < 0

and g : R → R be the Greatest Integer Function given by g (x) = [x], where [x] is

he
greatest integer less than or equal to x. Then, does fog and gof coincide in (0, 1]?
19. Number of binary operations on the set {a, b} are
(A) 10 (B) 16 (C) 20 (D ) 8

is
Summary
In this chapter, we studied different types of relations and equivalence relation,

bl
composition of functions, invertible functions and binary operations. The main features
of this chapter are as follows:
 Empty relation is the relation R in X given by R = φ ⊂ X × X.
pu
 Universal relation is the relation R in X given by R = X × X.
 Reflexive relation R in X is a relation with (a, a) ∈ R ∀ a ∈ X.
be T

 Symmetric relation R in X is a relation satisfying (a, b) ∈ R implies (b, a) ∈ R.


 Transitive relation R in X is a relation satisfying (a, b) ∈ R and (b, c) ∈ R
re
o R

implies that (a, c) ∈ R.


 Equivalence relation R in X is a relation which is reflexive, symmetric and
transitive.
tt E

 Equivalence class [a] containing a ∈ X for an equivalence relation R in X is


the subset of X containing all elements b related to a.
C

 A function f : X → Y is one-one (or injective) if


f (x1) = f (x2) ⇒ x1 = x2 ∀ x1, x2 ∈ X.
 A function f : X → Y is onto (or surjective) if given any y ∈ Y, ∃ x ∈ X such
no N

that f (x) = y.
 A function f : X → Y is one-one and onto (or bijective), if f is both one-one
and onto.
©

 The composition of functions f : A → B and g : B → C is the function


gof : A → C given by gof (x) = g(f (x)) ∀ x ∈ A.
 A function f : X → Y is invertible if ∃ g : Y → X such that gof = IX and
fog = IY.
 A function f : X → Y is invertible if and only if f is one-one and onto.
32 MATHEMATICS

 Given a finite set X, a function f : X → X is one-one (respectively onto) if and


only if f is onto (respectively one-one). This is the characteristic property of a
finite set. This is not true for infinite set
 A binary operation ∗ on a set A is a function ∗ from A × A to A.
 An element e ∈ X is the identity element for binary operation ∗ : X × X → X,
if a ∗ e = a = e ∗ a ∀ a ∈ X.

he
 An element a ∈ X is invertible for binary operation ∗ : X × X → X, if
there exists b ∈ X such that a ∗ b = e = b ∗ a where, e is the identity for the
binary operation ∗. The element b is called inverse of a and is denoted by a–1.
 An operation ∗ on X is commutative if a ∗ b = b ∗ a ∀ a, b in X.

is
 An operation ∗ on X is associative if (a ∗ b) ∗ c = a ∗ (b ∗ c) ∀ a, b, c in X.

bl
Historical Note
The concept of function has evolved over a long period of time starting from
R. Descartes (1596-1650), who used the word ‘function’ in his manuscript
pu
“Geometrie” in 1637 to mean some positive integral power xn of a variable x
while studying geometrical curves like hyperbola, parabola and ellipse. James
Gregory (1636-1675) in his work “ Vera Circuli et Hyperbolae Quadratura”
be T

(1667) considered function as a quantity obtained from other quantities by


re
successive use of algebraic operations or by any other operations. Later G. W.
o R

Leibnitz (1646-1716) in his manuscript “Methodus tangentium inversa, seu de


functionibus” written in 1673 used the word ‘function’ to mean a quantity varying
from point to point on a curve such as the coordinates of a point on the curve, the
tt E

slope of the curve, the tangent and the normal to the curve at a point. However,
in his manuscript “Historia” (1714), Leibnitz used the word ‘function’ to mean
quantities that depend on a variable. He was the first to use the phrase ‘function
C

of x’. John Bernoulli (1667-1748) used the notation φx for the first time in 1718 to
indicate a function of x. But the general adoption of symbols like f, F, φ, ψ ... to
represent functions was made by Leonhard Euler (1707-1783) in 1734 in the first
no N

part of his manuscript “Analysis Infinitorium”. Later on, Joeph Louis Lagrange
(1736-1813) published his manuscripts “Theorie des functions analytiques” in
1793, where he discussed about analytic function and used the notion f (x), F(x),
φ (x) etc. for different function of x. Subsequently, Lejeunne Dirichlet
©

(1805-1859) gave the definition of function which was being used till the set
theoretic definition of function presently used, was given after set theory was
developed by Georg Cantor (1845-1918). The set theoretic definition of function
known to us presently is simply an abstraction of the definition given by Dirichlet
in a rigorous manner.
—™ —
Chapter 2
INVERSE TRIGONOMETRIC
FUNCTIONS
v Mathematics, in general, is fundamentally the science of
self-evident things. — FELIX KLEIN v
2.1 Introduction
In Chapter 1, we have studied that the inverse of a function
f, denoted by f –1, exists if f is one-one and onto. There are
many functions which are not one-one, onto or both and
hence we can not talk of their inverses. In Class XI, we
studied that trigonometric functions are not one-one and
onto over their natural domains and ranges and hence their
inverses do not exist. In this chapter, we shall study about
the restrictions on domains and ranges of trigonometric
functions which ensure the existence of their inverses and
observe their behaviour through graphical representations.
Besides, some elementary properties will also be discussed.
The inverse trigonometric functions play an important Arya Bhatta
(476-550 A. D.)
role in calculus for they serve to define many integrals.
The concepts of inverse trigonometric functions is also used in science and engineering.
2.2 Basic Concepts
In Class XI, we have studied trigonometric functions, which are defined as follows:
sine function, i.e., sine : R → [– 1, 1]
cosine function, i.e., cos : R → [– 1, 1]
π
tangent function, i.e., tan : R – { x : x = (2n + 1) , n ∈ Z} → R
2
cotangent function, i.e., cot : R – { x : x = nπ, n ∈ Z} → R
π
secant function, i.e., sec : R – { x : x = (2n + 1) , n ∈ Z} → R – (– 1, 1)
2
cosecant function, i.e., cosec : R – { x : x = nπ, n ∈ Z} → R – (– 1, 1)
34 MATHEMATICS

We have also learnt in Chapter 1 that if f : X→Y such that f(x) = y is one-one and
onto, then we can define a unique function g : Y→X such that g (y) = x, where x ∈ X
and y = f (x), y ∈ Y. Here, the domain of g = range of f and the range of g = domain
of f. The function g is called the inverse of f and is denoted by f –1 . Further, g is also
one-one and onto and inverse of g is f. Thus, g –1 = (f –1) –1 = f. We also have
(f –1 o f ) (x) = f –1 (f (x)) = f –1 (y) = x
and (f o f –1) (y) = f (f –1(y)) = f (x) = y
Since the domain of sine function is the set of all real numbers and range is the

closed interval [–1, 1]. If we restrict its domain to 


−π π 
, , then it becomes one-one
 2 2 
and onto with range [– 1, 1]. Actually, sine function restricted to any of the intervals
 −3π – π  ,  −π π  ,  π , 3π  etc., is one-one and its range is [–1, 1]. We can,
 2 , 2   2 , 2   2 2 
   
therefore, define the inverse of sine function in each of these intervals. We denote the
inverse of sine function by sin–1 (arc sine function). Thus, sin –1 is a function whose
 −3π −π   −π π 
domain is [– 1, 1] and range could be any of the intervals  , , , or
 2 2   2 2 
 π 3π 
 2 , 2  , and so on. Corresponding to each such interval, we get a branch of the
 
 −π π 
function sin–1. The branch with range  ,  is called the principal value branch,
 2 2
whereas other intervals as range give different branches of sin–1 . When we refer
to the function sin–1, we take it as the function whose domain is [–1, 1] and range is
 −π π   −π π 
 2 , 2  . We write sin : [–1, 1] →  2 , 2 
–1
   
From the definition of the inverse functions, it follows that sin (sin–1 x) = x
π π
if – 1 ≤ x ≤ 1 and sin–1 (sin x) = x if − ≤ x ≤ . In other words, if y = sin–1 x, then
2 2
sin y = x.
Remarks
(i) We know from Chapter 1, that if y = f (x) is an invertible function, then x = f –1 (y).
Thus, the graph of sin–1 function can be obtained from the graph of original
function by interchanging x and y axes, i.e., if (a, b) is a point on the graph of
sine function, then (b, a) becomes the corresponding point on the graph of inverse
INVERSE TRIGONOMETRIC FUNCTIONS 35

of sine function. Thus, the graph of the function y = sin –1 x can be obtained from
the graph of y = sin x by interchanging x and y axes. The graphs of y = sin x and
y = sin–1 x are as given in Fig 2.1 (i), (ii), (iii). The dark portion of the graph of
y = sin–1 x represent the principal value branch.
(ii) It can be shown that the graph of an inverse function can be obtained from the
corresponding graph of original function as a mirror image (i.e., reflection) along
the line y = x. This can be visualised by looking the graphs of y = sin x and
y = sin–1 x as given in the same axes (Fig 2.1 (iii)).

Fig 2.1 (i)

Fig 2.1 (ii) Fig 2.1 (iii)

Like sine function, the cosine function is a function whose domain is the set of all
real numbers and range is the set [–1, 1]. If we restrict the domain of cosine function
to [0, π], then it becomes one-one and onto with range [–1, 1]. Actually, cosine function
36 MATHEMATICS

restricted to any of the intervals [– π, 0], [0,π], [π, 2π] etc., is bijective with range as
[–1, 1]. We can, therefore, define the inverse of cosine function in each of these
intervals. We denote the inverse of the cosine function by cos–1 (arc cosine function).
Thus, cos–1 is a function whose domain is [–1, 1] and range
could be any of the intervals [–π, 0], [0, π], [π, 2π] etc.
Corresponding to each such interval, we get a branch of the
function cos–1. The branch with range [0, π] is called the principal
value branch of the function cos–1. We write
cos–1 : [–1, 1] → [0, π].
The graph of the function given by y = cos –1 x can be drawn
in the same way as discussed about the graph of y = sin –1 x. The
graphs of y = cos x and y = cos–1 x are given in Fig 2.2 (i) and (ii).

Fig 2.2 (i) Fig 2.2 (ii)

Let us now discuss cosec–1x and sec–1x as follows:


1
Since, cosec x = , the domain of the cosec function is the set {x : x ∈ R and
sin x
x ≠ nπ, n ∈ Z} and the range is the set {y : y ∈ R, y ≥ 1 or y ≤ –1} i.e., the set
R – (–1, 1). It means that y = cosec x assumes all real values except –1 < y < 1 and is
not defined for integral multiple of π. If we restrict the domain of cosec function to
 π π
 − 2 , 2  – {0}, then it is one to one and onto with its range as the set R – (– 1, 1). Actually,
 
 −3π −π   −π π 
cosec function restricted to any of the intervals  ,  − {−π} ,  ,  – {0},
 2 2  2 2
 π 3π 
 2 , 2  − {π} etc., is bijective and its range is the set of all real numbers R – (–1, 1).
 
INVERSE TRIGONOMETRIC FUNCTIONS 37

Thus cosec–1 can be defined as a function whose domain is R – (–1, 1) and range could
 −π π   −3π −π 
,  − {−π} ,  ,  − {π} etc. The
π 3π
be any of the intervals  ,  − {0} ,   
 2 2  2 2 2 2 
 −π π 
function corresponding to the range  ,  − {0} is called the principal value branch
 2 2
of cosec–1. We thus have principal branch as
 −π π 
cosec–1 : R – (–1, 1) →  ,  − {0}
 2 2
The graphs of y = cosec x and y = cosec–1 x are given in Fig 2.3 (i), (ii).

Fig 2.3 (i) Fig 2.3 (ii)

1 π
Also, since sec x = , the domain of y = sec x is the set R – {x : x = (2n + 1) ,
cos x 2
n ∈ Z} and range is the set R – (–1, 1). It means that sec (secant function) assumes
π
all real values except –1 < y < 1 and is not defined for odd multiples of . If we
2
π
restrict the domain of secant function to [0, π] – { }, then it is one-one and onto with
2
38 MATHEMATICS

its range as the set R – (–1, 1). Actually, secant function restricted to any of the
−π π  3π
intervals [–π, 0] – { }, [0, π] –   , [π, 2π] – { } etc., is bijective and its range
2  2 2
is R – {–1, 1}. Thus sec–1 can be defined as a function whose domain is R– (–1, 1) and
−π π 3π
range could be any of the intervals [– π, 0] – { }, [0, π] – { }, [π, 2π] – { } etc.
2 2 2
Corresponding to each of these intervals, we get different branches of the function sec–1.
π
The branch with range [0, π] – { } is called the principal value branch of the
2
function sec–1. We thus have
π
sec–1 : R – (–1,1) → [0, π] – { }
2
The graphs of the functions y = sec x and y = sec-1 x are given in Fig 2.4 (i), (ii).

Fig 2.4 (i) Fig 2.4 (ii)

Finally, we now discuss tan –1 and cot–1


We know that the domain of the tan function (tangent function) is the set
π
{x : x ∈ R and x ≠ (2n +1) , n ∈ Z} and the range is R. It means that tan function
2
π
is not defined for odd multiples of . If we restrict the domain of tangent function to
2
INVERSE TRIGONOMETRIC FUNCTIONS 39

 −π π 
 ,  , then it is one-one and onto with its range as R. Actually, tangent function
 2 2 
 −3 π −π   −π π   π 3 π 
restricted to any of the intervals  ,  ,  , ,  ,  etc., is bijective
 2 2   2 2  2 2 
and its range is R. Thus tan–1 can be defined as a function whose domain is R and
 −3π −π   −π π   π 3π 
range could be any of the intervals  ,  ,  ,  ,  ,  and so on. These
 2 2   2 2  2 2 
 −π π 
intervals give different branches of the function tan–1 . The branch with range  , 
 2 2 
is called the principal value branch of the function tan–1 .
We thus have
 −π π 
tan –1 : R →  , 
 2 2
The graphs of the function y = tan x and y = tan –1x are given in Fig 2.5 (i), (ii).

Fig 2.5 (i) Fig 2.5 (ii)


We know that domain of the cot function (cotangent function) is the set
{x : x ∈ R and x ≠ nπ, n ∈ Z} and range is R. It means that cotangent function is not
defined for integral multiples of π. If we restrict the domain of cotangent function to
(0, π), then it is bijective with and its range as R. In fact, cotangent function restricted
to any of the intervals (–π, 0), (0, π), (π, 2π) etc., is bijective and its range is R. Thus
cot –1 can be defined as a function whose domain is the R and range as any of the
40 MATHEMATICS

intervals (–π, 0), (0, π), (π, 2π) etc. These intervals give different branches of the
function cot –1. The function with range (0, π) is called the principal value branch of
the function cot –1. We thus have
cot–1 : R → (0, π)
The graphs of y = cot x and y = cot–1 x are given in Fig 2.6 (i), (ii).

Fig 2.6 (i) Fig 2.6 (ii)


The following table gives the inverse trigonometric function (principal value
branches) along with their domains and ranges.
 π π
sin–1 : [–1, 1] → − 2 , 2 
 
cos –1 : [–1, 1] → [0, π]

 π π
cosec–1 : R – (–1,1) →  − 2 , 2  – {0}
 
π
sec –1 : R – (–1, 1) → [0, π] – { }
2
 −π π 
tan–1 : R →  , 
 2 2
cot–1 : R → (0, π)
INVERSE TRIGONOMETRIC FUNCTIONS 41

A Note 1
1. sin–1x should not be confused with (sin x) –1. In fact (sin x) –1 = and
sin x
similarly for other trigonometric functions.
2. Whenever no branch of an inverse trigonometric functions is mentioned, we
mean the principal value branch of that function.
3. The value of an inverse trigonometric functions which lies in the range of
principal branch is called the principal value of that inverse trigonometric
functions.
We now consider some examples:
 1 
Example 1 Find the principal value of sin–1  .
 2
 1  1
Solution Let sin–1   = y. Then, sin y = .
 2 2
 −π π 
We know that the range of the principal value branch of sin–1 is  ,  and
 2 2 
π 1  1  π
sin   = . Therefore, principal value of sin–1   is
 4 2  2 4
 −1 
Example 2 Find the principal value of cot–1  
 3
 −1 
Solution Let cot–1   = y. Then,
 3
−1 π  π  2π 
cot y = = − cot   = cot  π −  = cot  
3  3  3  3 
We know that the range of principal value branch of cot –1 is (0, π) and
 2 π  −1  −1  2π
cot   = . Hence, principal value of cot–1   is
 3  3  3 3

EXERCISE 2.1
Find the principal values of the following:

 1  3
1. sin–1  −  2. cos–1  2  3. cosec–1 (2)
 2  

 1
4. tan–1 ( − 3) 5. cos–1  −  6. tan–1 (–1)
 2
42 MATHEMATICS

 2   1 
7. sec –1   8. cot–1 ( 3) 9. cos–1  − 
 3  2
10. cosec–1 ( − 2 )
Find the values of the following:
 1  1  1  1
11. tan–1 (1) + cos–1  −  + sin–1  −  12. cos–1   + 2 sin–1  
 2   2  2  2
13. If sin–1 x = y, then
π π
(A) 0 ≤ y ≤ π (B) − ≤ y ≤
2 2
π π
(C) 0 < y < π (D) − < y <
2 2
14. tan–1 3 − sec−1 ( − 2 ) is equal to

π π 2π
(A) π (B) − (C) (D)
3 3 3
2.3 Properties of Inverse Trigonometric Functions
In this section, we shall prove some important properties of inverse trigonometric
functions. It may be mentioned here that these results are valid within the principal
value branches of the corresponding inverse trigonometric functions and wherever
they are defined. Some results may not be valid for all values of the domains of inverse
trigonometric functions. In fact, they will be valid only for some values of x for which
inverse trigonometric functions are defined. We will not go into the details of these
values of x in the domain as this discussion goes beyond the scope of this text book.
Let us recall that if y = sin–1x, then x = sin y and if x = sin y, then y = sin–1 x. This is
equivalent to
 π π
sin (sin–1 x) = x, x ∈ [– 1, 1] and sin–1 (sin x) = x, x ∈  − , 
 2 2
Same is true for other five inverse trigonometric functions as well. We now prove
some properties of inverse trigonometric functions.
1
1. (i) sin–1 = cosec–1 x, x ≥ 1 or x ≤ – 1
x
1
(ii) cos–1 = sec –1x, x ≥ 1 or x ≤ – 1
x
INVERSE TRIGONOMETRIC FUNCTIONS 43

1
(iii) tan–1 = cot–1 x, x > 0
x
To prove the first result, we put cosec–1 x = y, i.e., x = cosec y
1
Therefore = sin y
x
1
Hence sin–1 = y
x
1
or sin–1 = cosec–1 x
x
Similarly, we can prove the other parts.
2. (i) sin–1 (–x) = – sin–1 x, x ∈ [– 1, 1]
(ii) tan–1 (–x) = – tan–1 x, x ∈ R
(iii) cosec–1 (–x) = – cosec–1 x, | x | ≥ 1
Let sin–1 (–x) = y, i.e., –x = sin y so that x = – sin y, i.e., x = sin (–y).
Hence sin–1 x = – y = – sin–1 (–x)
Therefore sin–1 (–x) = – sin–1x
Similarly, we can prove the other parts.
3. (i) cos–1 (–x) = π – cos–1 x, x ∈ [– 1, 1]
(ii) sec–1 (–x) = π – sec–1 x, | x | ≥ 1
(iii) cot–1 (–x) = π – cot–1 x, x ∈ R
Let cos–1 (–x) = y i.e., – x = cos y so that x = – cos y = cos (π – y)
Therefore cos–1 x = π – y = π – cos–1 (–x)
Hence cos–1 (–x) = π – cos–1 x
Similarly, we can prove the other parts.

π
4. (i) sin–1 x + cos–1 x = , x ∈ [– 1, 1]
2
π
(ii) tan–1 x + cot–1 x = ,x∈ R
2
π
(iii) cosec–1 x + sec–1 x = , |x | ≥ 1
2
π 
Let sin–1 x = y. Then x = sin y = cos  − y 
 2 
π π
Therefore cos–1 x = −y = − sin –1 x
2 2
44 MATHEMATICS

π
Hence sin–1 x + cos–1 x =
2
Similarly, we can prove the other parts.
x+ y
5. (i) tan–1 x + tan–1 y = tan–1 , xy < 1
1 – xy

x–y
(ii) tan–1 x – tan–1 y = tan–1 , xy > – 1
1 + xy

Let tan–1 x = θ and tan–1 y = φ. Then x = tan θ, y = tan φ


tan θ + tan φ x + y
Now tan( θ +φ) = =
1 − tan θ tan φ 1 − xy

x+ y
This gives θ + φ = tan–1 1− xy

x+y
Hence tan–1 x + tan–1 y = tan–1 1− xy

In the above result, if we replace y by – y, we get the second result and by replacing
y by x, we get the third result as given below.

2x
6. (i) 2tan–1 x = sin–1 , |x| ≤ 1
1+ x2

1 – x2
(ii) 2tan–1 x = cos–1 ,x≥ 0
1+ x2

2x
(iii) 2 tan–1 x = tan–1 ,–1<x<1
1 – x2

Let tan–1 x = y, then x = tan y. Now


2x 2 tan y
sin–1 2 = sin
–1
1+ x 1 + tan 2 y
= sin–1 (sin 2y) = 2y = 2tan–1 x
INVERSE TRIGONOMETRIC FUNCTIONS 45

1 − x2 1 − tan 2 y
Also cos–1 = cos–1
= cos–1 (cos 2y) = 2y = 2tan–1 x
1 + x2 1 + tan 2 y
(iii) Can be worked out similarly.
We now consider some examples.

Example 3 Show that


1 1
( )
(i) sin–1 2x 1− x 2 = 2 sin–1 x, −
2
≤ x≤
2
1
( )
(ii) sin–1 2x 1 − x2 = 2 cos–1 x,
2
≤ x ≤1

Solution
(i) Let x = sin θ. Then sin–1 x = θ. We have

( ) (
sin–1 2x 1− x 2 = sin–1 2sin θ 1 − sin 2 θ )
= sin–1 (2sinθ cosθ) = sin–1 (sin2θ) = 2θ
= 2 sin–1 x
(ii) Take x = cos θ, then proceeding as above, we get, sin–1 ( 2 x 1 − x 2 ) = 2 cos–1 x
1 2 3
Example 4 Show that tan–1 + tan–1 = tan–1
2 11 4
Solution By property 5 (i), we have

1 2
+
–1 1 –1 2 –1 2 11 15 −1 3
L.H.S. = tan + tan = tan = tan −1 = tan = R.H.S.
2 11 1 2 20 4
1− ×
2 11

 cos x  − 3π π
Example 5 Express tan −1   , < x < in the simplest form.
 1 − sin x  2 2
Solution We write

 x x 
cos 2 − sin 2
−1  cos x  –1  2 2 
tan   = tan  x x x x 
 1 − sin x   cos2 + sin 2 − 2sin cos 
 2 2 2 2 
46 MATHEMATICS

 x x  x x 
  cos 2 + sin 2  cos 2 − sin 2  
–1   
= tan 
  x x 2 
  cos 2 − sin 2  
 

 x x  x
 cos + sin  1 + tan 
–1
= tan  2 2 = tan –1 2
x x  x
 cos − sin  1 − tan 
 2 2  2

–1   π x  π x
= tan  tan  +   = +
  4 2  4 2
Alternatively,

 π    π − 2x  
 sin  − x    sin  2  
 cos x  2    
tan –1   = tan   = tan –1 
–1

 1 − sin x   1 − cos  − x 
π  1 − cos  π − 2 x 
 2     2 
   

  π − 2x  π − 2x  
 2 sin  4  cos  4  
   
= tan 
–1

 π − 2x  
 2 sin 2   
 4 

–1   π − 2 x   = tan –1  tan  π − π − 2 x  
= tan cot    2 
4  
  4   

–1   π x  π x
= tan  tan  +   = +
  4 2  4 2

–1  1 
Example 6 Write cot  2  , x > 1 in the simplest form.
 x −1 

Solution Let x = sec θ, then x2 − 1 = sec2 θ − 1 = tan θ


INVERSE TRIGONOMETRIC FUNCTIONS 47

–1 1
Therefore, cot = cot–1 (cot θ) = θ = sec–1 x, which is the simplest form.
2
x −1

2x –1
 3 x − x3  1
Example 7 Prove that tan x + tan –1
2 = tan
–1  2 , | x|<
1− x  1 − 3x  3
Solution Let x = tan θ. Then θ = tan–1 x. We have

 3x − x 3  –1  3tan θ − tan θ 
3
R.H.S. = tan –1  2  = tan  2 
 1− 3x   1 − 3tan θ 
= tan–1 (tan3θ) = 3θ = 3tan–1 x = tan–1 x + 2 tan–1 x
2x
= tan–1 x + tan –1= L.H.S. (Why?)
1 − x2
Example 8 Find the value of cos (sec–1 x + cosec–1 x), | x | ≥ 1
π
Solution We have cos (sec–1 x + cosec–1 x) = cos   = 0
2 

EXERCISE 2.2
Prove the following:
 1 1
1. 3sin–1 x = sin–1 (3x – 4x3), x ∈  – , 
 2 2
1 
2. 3cos–1 x = cos–1 (4x3 – 3x), x ∈  , 1
2 
2 7 1
3. tan–1 + tan −1 = tan −1
11 24 2
1 1 31
4. 2 tan −1 + tan −1 = tan −1
2 7 17
Write the following functions in the simplest form:

1+ x 2 − 1 1
5. tan −1 ,x≠0 6. tan −1 , |x | > 1
x x2 −1
 1 − cos x   cos x − sin x 
7. tan −1   , 0 < x < π 8. tan −1   , 0 < x< π
 1 + cos x
   cos x + sin x 
48 MATHEMATICS

x
9. tan −1 , |x | < a
a − x2
2

 3a 2 x − x3  −a a
10. tan −1  3 2  , a > 0;
<x <
 a − 3ax  3 3
Find the values of each of the following:
  1 
11. tan –1  2 cos  2sin –1   12. cot (tan–1a + cot–1 a)
  2 

1 2x –1 1 − y 
2

13. tan sin –1 + cos  , | x | < 1, y > 0 and xy < 1


2 1+ x 2 1 + y2 

 –1 1 
14. If sin  sin + cos–1 x  =1 , then find the value of x
 5 
–1 x −1 x +1 π
15. If tan + tan –1 = , then find the value of x
x−2 x+2 4
Find the values of each of the expressions in Exercises 16 to 18.
 2π   3π 
16. sin–1  sin  17. tan–1  tan 
 3   4
 3 3
18. tan  sin –1 + cot –1 
 5 2
 7π 
19. cos −1  cos  is equal to
 6 
7π 5π π π
(A) (B) (C) (D)
6 6 3 6
π 1 
20. sin  − sin −1 ( − )  is equal to
3 2 
1 1 1
(A) (B) (C) (D) 1
2 3 4
21. tan −1 3 − cot −1 ( − 3 ) is equal to
π
(A) π (B) − (C) 0 (D) 2 3
2
INVERSE TRIGONOMETRIC FUNCTIONS 49

Miscellaneous Examples

−1 3π
Example 9 Find the value of sin (sin )
5

−1 3π 3 π
Solution We know that sin −1 (sin x) = x . Therefore, sin (sin )=
5 5
3π  π π 
∉ − , , which is the principal branch of sin –1 x
5  2 2 
But

3π 3π 2π 2π  π π 
However sin ( ) = sin( π − ) = sin and ∈ − ,
5 5 5 5  2 2 

3π 2π 2π
Therefore sin −1 (sin ) = sin −1 (sin ) =
5 5 5
−1 3 8 84
Example 10 Show that sin − sin −1 = cos−1
5 17 85
3 8
Solution Let sin −1 = x and sin−1 = y
5 17

3 8
Therefore sin x = and sin y =
5 17

9 4
Now cos x = 1 − sin 2 x = 1 − = (Why?)
25 5

64 15
and cos y = 1 − sin2 y = 1− =
289 17
We have cos (x–y) = cos x cos y + sin x sin y
4 15 3 8 84
= × + × =
5 17 5 17 85
 84 
Therefore x − y = cos−1  
 85 
3 8 84
Hence sin −1 − sin −1 = cos−1
5 17 85
50 MATHEMATICS

12 4 63
Example 11 Show that sin−1 + cos −1 + tan−1 = π
13 5 16

12 4 63
Solution Let sin −1 = x, cos−1 = y, tan −1 = z
13 5 16

12 4 63
Then sin x = , cos y = , tan z =
13 5 16

5 3 12 3
Therefore cos x = , sin y = , tan x = and tan y =
13 5 5 4
12 3
+
tan x + tan y 63
We have tan( x + y ) = = 5 4 =−
1 − tan x tan y 1 − 12 × 3 16
5 4
Hence tan( x + y ) = − tan z

i.e., tan (x + y) = tan (–z) or tan (x + y) = tan (π – z)


Therefore x + y = – z or x + y = π – z
Since x, y and z are positive, x + y ≠ – z (Why?)

–1 12 4 63
Hence x + y + z = π or sin + cos –1 + tan –1 = π
13 5 16

–1  a cos x − b sin x  a
Example 12 Simplify tan   , if tan x > –1
 b cos x + a sin x  b

Solution We have,

 a cos x − b sin x   a 
    − tan x 
–1 a cos x − b sin x –1 b cos x –1  b
tan   = tan  b cos x + a sin x  = tan  a 
 b cos x + a sin x    1 + tan x 
 b cos x   b 

–1 a a
= tan − tan –1 (tan x) = tan –1 − x
b b
INVERSE TRIGONOMETRIC FUNCTIONS 51

π
Example 13 Solve tan–1 2x + tan –1 3x =
4
π
Solution We have tan–1 2x + tan –1 3x =
4
 2x + 3x  π
or tan –1   =
 1 − 2 x × 3 x  4

 5x  π
i.e. tan–1  2  =
 1− 6x  4
5x π
Therefore 2 = tan =1
1− 6x 4
or 6x2 + 5x – 1 = 0 i.e., (6x – 1) (x + 1) = 0
1
which gives x= or x = – 1.
6
Since x = – 1 does not satisfy the equation, as the L.H.S. of the equation becomes
1
negative, x = is the only solution of the given equation.
6

Miscellaneous Exercise on Chapter 2


Find the value of the following:

 13π   7π 
1. cos –1  cos  2. tan–1  tan 
 6   6 
Prove that
3 24 8 3 77
3. 2sin –1 = tan –1 4. sin –1 + sin –1 = tan –1
5 7 17 5 36
4 12 33 12 3 56
5. cos –1 + cos –1 = cos –1 6. cos –1 + sin –1 = sin –1
5 13 65 13 5 65
63 5 3
7. tan–1 = sin –1 + cos –1
16 13 5
1 1 1 1 π
8. tan –1 + tan −1 + tan −1 + tan −1 =
5 7 3 8 4
52 MATHEMATICS

Prove that
1 1 − x 
tan –1 x = cos –1 
 1 + x 
9. , x ∈ [0, 1]
2

 1 + sin x + 1 − sin x  x  π
10. cot –1   = , x ∈  0, 4 

 1 + sin x − 1 − sin x  2

 1+x − 1−x  π 1 1
11. tan –1  –1
 = − cos x , − ≤ x ≤ 1 [Hint: Put x = cos 2θ]
 1+ x + 1 − x
  4 2 2

9π 9 1 9 2 2
12. − sin −1 = sin −1
8 4 3 4 3
Solve the following equations:
1− x 1
13. 2tan–1 (cos x) = tan–1 (2 cosec x) 14. tan –1 = tan –1 x, ( x > 0)
1+ x 2
15. sin (tan–1 x), | x | < 1 is equal to
x 1 1 x
(A) (B) (C) (D)
1 − x2 1 − x2 1 + x2 1 + x2

π
16. sin–1 (1 – x) – 2 sin –1 x = , then x is equal to
2

1 1 1
(A) 0, (B) 1, (C) 0 (D)
2 2 2

 x x− y
17. tan −1   − tan −1 is equal to
 y x+ y

π π π −3π
(A) (B) (C) (D)
2 3 4 4
INVERSE TRIGONOMETRIC FUNCTIONS 53

Summary
® The domains and ranges (principal value branches) of inverse trigonometric
functions are given in the following table:
Functions Domain Range
(Principal Value Branches)
 −π π 
y = sin–1 x [–1, 1]  2 , 2
 
y = cos–1 x [–1, 1] [0, π]
 −π π 
y = cosec–1 x R – (–1,1)  2 , 2  – {0}
 
π
y = sec–1 x R – (–1, 1) [0, π] – { }
2
 π π
y = tan–1 x R − , 
 2 2
y = cot–1 x R (0, π)
1
® sin–1x should not be confused with (sin x) –1. In fact (sin x) –1 =
sin x
and
similarly for other trigonometric functions.
® The value of an inverse trigonometric functions which lies in its principal
value branch is called the principal value of that inverse trigonometric
functions.
For suitable values of domain, we have
® y = sin–1 x ⇒ x = sin y ® x = sin y ⇒ y = sin–1 x
® sin (sin–1 x) = x ® sin–1 (sin x) = x
1
® sin–1
x
= cosec–1 x ® cos–1 (–x) = π – cos–1 x

1
® cos–1
x
= sec–1x ® cot–1 (–x) = π – cot–1 x

1
® tan–1
x
= cot–1 x ® sec–1 (–x) = π – sec–1 x
54 MATHEMATICS

® sin–1 (–x) = – sin–1 x ® tan–1 (–x) = – tan–1 x


π
® tan–1 x + cot–1 x =
2
® cosec–1 (–x) = – cosec–1 x

π π
® sin–1 x + cos–1 x =
2
® cosec–1 x + sec–1 x =
2
x+ y 2x
® tan–1 x + tan–1y = tan–1
1 − xy ® 2tan–1x = tan –1
1 − x2
x−y
® tan–1 x – tan–1 y = tan–1
1 + xy

2x 1 − x2
® 2tan–1 x = sin–1
1 + x2
= cos–1
1 + x2

Historical Note
The study of trigonometry was first started in India. The ancient Indian
Mathematicians, Aryabhatta (476A.D.), Brahmagupta (598 A.D.), Bhaskara I
(600 A.D.) and Bhaskara II (1114 A.D.) got important results of trigonometry. All
this knowledge went from India to Arabia and then from there to Europe. The
Greeks had also started the study of trigonometry but their approach was so
clumsy that when the Indian approach became known, it was immediately adopted
throughout the world.
In India, the predecessor of the modern trigonometric functions, known as
the sine of an angle, and the introduction of the sine function represents one of
the main contribution of the siddhantas (Sanskrit astronomical works) to
mathematics.
Bhaskara I (about 600 A.D.) gave formulae to find the values of sine functions
for angles more than 90°. A sixteenth century Malayalam work Yuktibhasa
contains a proof for the expansion of sin (A + B). Exact expression for sines or
cosines of 18°, 36°, 54°, 72°, etc., were given by Bhaskara II.
The symbols sin–1 x, cos–1 x, etc., for arc sin x, arc cos x, etc., were suggested
by the astronomer Sir John F.W. Hersehel (1813) The name of Thales
(about 600 B.C.) is invariably associated with height and distance problems. He
is credited with the determination of the height of a great pyramid in Egypt by
measuring shadows of the pyramid and an auxiliary staff (or gnomon) of known
INVERSE TRIGONOMETRIC FUNCTIONS 55

height, and comparing the ratios:

H h
= = tan (sun’s altitude)
S s
Thales is also said to have calculated the distance of a ship at sea through
the proportionality of sides of similar triangles. Problems on height and distance
using the similarity property are also found in ancient Indian works.

—v —
56 MATHEMATICS

Chapter 3
MATRICES

he
™ The essence of Mathematics lies in its freedom. — CANTOR ™

3.1 Introduction

is
The knowledge of matrices is necessary in various branches of mathematics. Matrices
are one of the most powerful tools in mathematics. This mathematical tool simplifies

bl
our work to a great extent when compared with other straight forward methods. The
evolution of concept of matrices is the result of an attempt to obtain compact and
simple methods of solving system of linear equations. Matrices are not only used as a
pu
representation of the coefficients in system of linear equations, but utility of matrices
far exceeds that use. Matrix notation and operations are used in electronic spreadsheet
be T

programs for personal computer, which in turn is used in different areas of business
re
and science like budgeting, sales projection, cost estimation, analysing the results of an
o R

experiment etc. Also, many physical operations such as magnification, rotation and
reflection through a plane can be represented mathematically by matrices. Matrices
tt E

are also used in cryptography. This mathematical tool is not only used in certain branches
of sciences, but also in genetics, economics, sociology, modern psychology and industrial
C

management.
In this chapter, we shall find it interesting to become acquainted with the
no N

fundamentals of matrix and matrix algebra.


3.2 Matrix
Suppose we wish to express the information that Radha has 15 notebooks. We may
©

express it as [15] with the understanding that the number inside [ ] is the number of
notebooks that Radha has. Now, if we have to express that Radha has 15 notebooks
and 6 pens. We may express it as [15 6] with the understanding that first number
inside [ ] is the number of notebooks while the other one is the number of pens possessed
by Radha. Let us now suppose that we wish to express the information of possession
MATRICES 57

of notebooks and pens by Radha and her two friends Fauzia and Simran which
is as follows:
Radha has 15 notebooks and 6 pens,
Fauzia has 10 notebooks and 2 pens,
Simran has 13 notebooks and 5 pens.

he
Now this could be arranged in the tabular form as follows:
Notebooks Pens
Radha 15 6
Fauzia 10 2

is
Simran 13 5
and this can be expressed as

bl
pu
be T
re
o R

or
Radha Fauzia Simran
tt E

Notebooks 15 10 13
Pens 6 2 5
C

which can be expressed as:


no N
©

In the first arrangement the entries in the first column represent the number of
note books possessed by Radha, Fauzia and Simran, respectively and the entries in the
second column represent the number of pens possessed by Radha, Fauzia and Simran,
58 MATHEMATICS

respectively. Similarly, in the second arrangement, the entries in the first row represent
the number of notebooks possessed by Radha, Fauzia and Simran, respectively. The
entries in the second row represent the number of pens possessed by Radha, Fauzia
and Simran, respectively. An arrangement or display of the above kind is called a
matrix. Formally, we define matrix as:
Definition 1 A matrix is an ordered rectangular array of numbers or functions. The

he
numbers or functions are called the elements or the entries of the matrix.
We denote matrices by capital letters. The following are some examples of matrices:

⎡ 1⎤
⎡– 2 5⎤ ⎢2 + i 3 − 2 ⎥

is
⎢ ⎥ ⎢ ⎥ ⎡1 + x x3 3 ⎤
A=⎢ 0 5 ⎥ , B = ⎢ 3.5 –1 2 ⎥ , C = ⎢ ⎥
⎢ ⎥ ⎣ cos x sin x + 2 tan x ⎦
⎢3 ⎥ 5
⎣ 6⎦

bl
⎢ 3 5 ⎥
⎣ 7 ⎦

In the above examples, the horizontal lines of elements are said to constitute, rows
pu
of the matrix and the vertical lines of elements are said to constitute, columns of the
matrix. Thus A has 3 rows and 2 columns, B has 3 rows and 3 columns while C has 2
be T

rows and 3 columns.


re
3.2.1 Order of a matrix
o R

A matrix having m rows and n columns is called a matrix of order m × n or simply m × n


matrix (read as an m by n matrix). So referring to the above examples of matrices, we
have A as 3 × 2 matrix, B as 3 × 3 matrix and C as 2 × 3 matrix. We observe that A has
tt E

3 × 2 = 6 elements, B and C have 9 and 6 elements, respectively.


In general, an m × n matrix has the following rectangular array:
C
no N
©

or A = [aij]m × n, 1≤ i ≤ m, 1≤ j ≤ n i, j ∈ N
Thus the ith row consists of the elements ai1, ai2, ai3,..., ain, while the jth column
consists of the elements a1j, a2j, a3j,..., amj ,
In general aij, is an element lying in the ith row and jth column. We can also call
it as the (i, j)th element of A. The number of elements in an m × n matrix will be
equal to mn.
MATRICES 59

$Note In this chapter


1. We shall follow the notation, namely A = [aij]m × n to indicate that A is a matrix
of order m × n.
2. We shall consider only those matrices whose elements are real numbers or
functions taking real values.

he
We can also represent any point (x, y) in a plane by a matrix (column or row) as

⎡x⎤
⎢ y ⎥ (or [x, y]). For example point P(0, 1) as a matrix representation may be given as
⎣ ⎦

is
⎡0 ⎤
P = ⎢ ⎥ or [0 1].

bl
⎣1 ⎦
Observe that in this way we can also express the vertices of a closed rectilinear
pu
figure in the form of a matrix. For example, consider a quadrilateral ABCD with vertices
A (1, 0), B (3, 2), C (1, 3), D (–1, 2).
Now, quadrilateral ABCD in the matrix form, can be represented as
be T

A⎡ 1 0⎤
re
A B C D
B ⎢⎢ 3 2 ⎥⎥
o R

⎡1 3 1 −1⎤
X=⎢ ⎥ or Y=
⎣ 0 2 3 2⎦ 2 × 4 C⎢ 1 3⎥
⎢ ⎥
D ⎣−1
tt E

2 ⎦ 4× 2

Thus, matrices can be used as representation of vertices of geometrical figures in


C

a plane.
Now, let us consider some examples.
no N

Example 1 Consider the following information regarding the number of men and women
workers in three factories I, II and III
Men workers Women workers
©

I 30 25
II 25 31
III 27 26
Represent the above information in the form of a 3 × 2 matrix. What does the entry
in the third row and second column represent?
60 MATHEMATICS

Solution The information is represented in the form of a 3 × 2 matrix as follows:


⎡ 30 25⎤
A = ⎢⎢ 25 31⎥⎥
⎢⎣ 27 26 ⎥⎦
The entry in the third row and second column represents the number of women

he
workers in factory III.
Example 2 If a matrix has 8 elements, what are the possible orders it can have?
Solution We know that if a matrix is of order m × n, it has mn elements. Thus, to find

is
all possible orders of a matrix with 8 elements, we will find all ordered pairs of natural
numbers, whose product is 8.

bl
Thus, all possible ordered pairs are (1, 8), (8, 1), (4, 2), (2, 4)
Hence, possible orders are 1 × 8, 8 ×1, 4 × 2, 2 × 4
1
pu
Example 3 Construct a 3 × 2 matrix whose elements are given by aij =
2
|i −3j |.

⎡ a11 a12 ⎤
be T

Solution In general a 3 × 2 matrix is given by A = ⎢ a21 a22 ⎥ .


⎢ ⎥
re
⎢⎣ a31 a32 ⎥⎦
o R

1
Now aij = | i − 3 j | , i = 1, 2, 3 and j = 1, 2.
2
tt E

1 1 5
Therefore a11 = |1 − 3 × 1| = 1 a12 = |1 − 3 × 2 | =
2 2 2
C

1 1 1
a21 = | 2 − 3 × 1| = a22 = | 2 − 3× 2 | = 2
2 2 2
no N

1 1 3
a31 = | 3 − 3 × 1| = 0 a32 = | 3 − 3× 2 | =
2 2 2
©

⎡1 5⎤
⎢ 2⎥
⎢1 ⎥
Hence the required matrix is given by A = ⎢ 2⎥ .
⎢2 3⎥
⎢0 ⎥
⎣ 2⎦
MATRICES 61

3.3 Types of Matrices


In this section, we shall discuss different types of matrices.
(i) Column matrix
A matrix is said to be a column matrix if it has only one column.

⎡ 0 ⎤

he
⎢ ⎥
⎢ 3⎥
For example, A = ⎢ −1 ⎥ is a column matrix of order 4 × 1.
⎢ ⎥
⎣⎢1/ 2 ⎦⎥

is
In general, A = [aij] m × 1 is a column matrix of order m × 1.
(ii) Row matrix

bl
A matrix is said to be a row matrix if it has only one row.

⎡ 1 ⎤
pu
For example, B = ⎢ −
⎣ 2
5 2 3⎥ is a row matrix.
⎦1× 4
In general, B = [bij] 1 × n is a row matrix of order 1 × n.
be T

(iii) Square matrix


re
A matrix in which the number of rows are equal to the number of columns, is
o R

said to be a square matrix. Thus an m × n matrix is said to be a square matrix if


m = n and is known as a square matrix of order ‘n’.
tt E

⎡ 3 −1 0⎤
⎢3 ⎥
= ⎢ 1 ⎥ is a square matrix of order 3.
C

For example A 3 2
⎢2 ⎥
⎢ −1⎦⎥
⎣4 3
no N

In general, A = [aij] m × m is a square matrix of order m.

$Note If A = [a ] is a square matrix of order n, then elements (entries) a , a , ..., a


ij 11 22 nn
©

⎡ 1 −3 1 ⎤
⎢ ⎥
are said to constitute the diagonal, of the matrix A. Thus, if A = ⎢ 2 4 −1⎥ .
⎣⎢ 3 5 6 ⎥⎦
Then the elements of the diagonal of A are 1, 4, 6.
62 MATHEMATICS

(iv) Diagonal matrix


A square matrix B = [bij] m × m is said to be a diagonal matrix if all its non
diagonal elements are zero, that is a matrix B = [bij] m × m is said to be a diagonal
matrix if bij = 0, when i ≠ j.
⎡ −1.1 0 0 ⎤
⎡ −1 0 ⎤ ⎢
For example, A = [4], B = ⎢ ⎥, C=⎢ 0 2 0 ⎥⎥ , are diagonal matrices

he
⎣ 0 2 ⎦ ⎢⎣ 0 0 3⎥⎦
of order 1, 2, 3, respectively.
(v) Scalar matrix

is
A diagonal matrix is said to be a scalar matrix if its diagonal elements are equal,
that is, a square matrix B = [bij] n × n is said to be a scalar matrix if
bij = 0, when i ≠ j

bl
bij = k, when i = j, for some constant k.
For example
pu ⎡ 3 0 0⎤
⎡ −1 0 ⎤ ⎢ ⎥
A = [3], B=⎢ ⎥, C=⎢ 0 3 0⎥
be T

⎣ 0 −1⎦ ⎢ ⎥
⎣0 0 3⎦
re
o R

are scalar matrices of order 1, 2 and 3, respectively.


(vi) Identity matrix
tt E

A square matrix in which elements in the diagonal are all 1 and rest are all zero
is called an identity matrix. In other words, the square matrix A = [aij] n × n is an
⎧1 if i = j
C

identity matrix, if aij = ⎨ .


⎩0 if i ≠ j
no N

We denote the identity matrix of order n by In. When order is clear from the
context, we simply write it as I.
⎡1 0 0 ⎤
⎡1 0 ⎤ ⎢0 1 0 ⎥
©

For example [1], ⎢ ⎥, ⎢ ⎥ are identity matrices of order 1, 2 and 3,


⎣ 0 1 ⎦ ⎢0 0 1 ⎥
⎣ ⎦
respectively.
Observe that a scalar matrix is an identity matrix when k = 1. But every identity
matrix is clearly a scalar matrix.
MATRICES 63

(vii) Zero matrix


A matrix is said to be zero matrix or null matrix if all its elements are zero.
⎡0 0 ⎤ ⎡0 0 0 ⎤
For example, [0], ⎢ ⎥, ⎢ ⎥ , [0, 0] are all zero matrices. We denote
⎣0 0 ⎦ ⎣0 0 0 ⎦
zero matrix by O. Its order will be clear from the context.

he
3.3.1 Equality of matrices
Definition 2 Two matrices A = [aij] and B = [bij] are said to be equal if
(i) they are of the same order

is
(ii) each element of A is equal to the corresponding element of B, that is aij = bij for
all i and j.

bl
⎡ 2 3⎤ ⎡ 2 3⎤ ⎡3 2⎤ ⎡ 2 3⎤
For example, ⎢ ⎥ and ⎢ ⎥ are equal matrices but ⎢ ⎥ and ⎢ ⎥ are
⎣ 0 1⎦ ⎣ 0 1⎦ ⎣0 1 ⎦ ⎣ 0 1⎦
not equal matrices. Symbolically, if two matrices A and B are equal, we write A = B.
pu
⎡ x y ⎤ ⎡ −1.5 0 ⎤
⎢ ⎥
If ⎢⎢ z a ⎥⎥ = ⎢ 2
be T

6 ⎥ , then x = – 1.5, y = 0, z = 2, a = 6 , b = 3, c = 2
⎣⎢b c ⎥⎦ ⎢⎣3 2 ⎥⎦
re
o R

⎡ x + 3 z + 4 2 y − 7⎤ ⎡ 0 6 3y − 2 ⎤
⎢ −6 a −1 ⎥ ⎢
0 ⎥ = ⎢− 6 −3 2c + 2 ⎥⎥
tt E

Example 4 If ⎢
⎢⎣b − 3 − 21 0 ⎥⎦ ⎢⎣ 2b + 4 − 21 0 ⎥⎦
C

Find the values of a, b, c, x, y and z.


Solution As the given matrices are equal, therefore, their corresponding elements
must be equal. Comparing the corresponding elements, we get
no N

x + 3 = 0, z + 4 = 6, 2y – 7 = 3y – 2
a – 1 = – 3, 0 = 2c + 2 b – 3 = 2b + 4,
Simplifying, we get
©

a = – 2, b = – 7, c = – 1, x = – 3, y = –5, z = 2
Example 5 Find the values of a, b, c, and d from the following equation:
⎡ 2a + b a − 2b ⎤ ⎡ 4 −3⎤
⎢5c − d 4c + 3d ⎥ = ⎢11 24 ⎥
⎣ ⎦ ⎣ ⎦
64 MATHEMATICS

Solution By equality of two matrices, equating the corresponding elements, we get


2a + b = 4 5c – d = 11
a – 2b = – 3 4c + 3d = 24
Solving these equations, we get
a = 1, b = 2, c = 3 and d = 4

he
EXERCISE 3.1

⎡2 5 19 −7 ⎤
⎢ ⎥

is
5
1. In the matrix A = ⎢ 35 −2 12 ⎥ , write:
⎢ 2 ⎥
⎢ ⎥
⎣ 3 1 −5 17 ⎦

bl
(i) The order of the matrix, (ii) The number of elements,
(iii) Write the elements a13, a21, a33, a24, a23.
pu
2. If a matrix has 24 elements, what are the possible orders it can have? What, if it
has 13 elements?
be T

3. If a matrix has 18 elements, what are the possible orders it can have? What, if it
has 5 elements?
re
o R

4. Construct a 2 × 2 matrix, A = [aij], whose elements are given by:

(i + j ) 2 i (i + 2 j ) 2
(i) aij = (ii) aij = (iii) aij =
tt E

2 j 2
5. Construct a 3 × 4 matrix, whose elements are given by:
C

1
(i) aij = | −3i + j | (ii) aij = 2i − j
2
no N

6. Find the values of x, y and z from the following equations:


⎡ x + y + z ⎤ ⎡9 ⎤
⎡ 4 3⎤ ⎡ y z ⎤ ⎡x + y 2 ⎤ ⎡6 2⎤ ⎢ x + z ⎥ = ⎢5⎥
(i) ⎢ ⎥=⎢ ⎥ (ii) ⎢ =
xy ⎥⎦ ⎢⎣ 5 8 ⎥⎦
(iii) ⎢ ⎥ ⎢ ⎥
⎣5 + z
©

⎣ x 5⎦ ⎣ 1 5 ⎦ ⎢⎣ y + z ⎥⎦ ⎢⎣7 ⎥⎦
7. Find the value of a, b, c and d from the equation:
⎡ a − b 2 a + c ⎤ ⎡ −1 5 ⎤
⎢ 2a − b 3c + d ⎥ = ⎢ 0 13⎥
⎣ ⎦ ⎣ ⎦
MATRICES 65

8. A = [aij]m × n\ is a square matrix, if


(A) m < n (B) m > n (C) m = n (D) None of these
9. Which of the given values of x and y make the following pair of matrices equal
⎡3 x + 7 5 ⎤ ⎡0 y − 2⎤
⎢ y + 1 2 − 3 x ⎥ , ⎢8 4 ⎥⎦
⎣ ⎦ ⎣
−1

he
(A) x = , y=7 (B) Not possible to find
3
−2 1 2
(C) y = 7, x = (D) x , y
3 3 3

is
10. The number of all possible matrices of order 3 × 3 with each entry 0 or 1 is:
(A) 27 (B) 18 (C) 81 (D) 512

bl
3.4 Operations on Matrices
In this section, we shall introduce certain operations on matrices, namely, addition of
matrices, multiplication of a matrix by a scalar, difference and multiplication of matrices.
pu
3.4.1 Addition of matrices
Suppose Fatima has two factories at places A and B. Each factory produces sport
be T

shoes for boys and girls in three different price categories labelled 1, 2 and 3. The
quantities produced by each factory are represented as matrices given below:
re
o R
tt E
C

Suppose Fatima wants to know the total production of sport shoes in each price
no N

category. Then the total production


In category 1 : for boys (80 + 90), for girls (60 + 50)
In category 2 : for boys (75 + 70), for girls (65 + 55)
©

In category 3 : for boys (90 + 75), for girls (85 + 75)


⎡80 + 90 60 + 50 ⎤
This can be represented in the matrix form as ⎢ 75 + 70 65 + 55 ⎥⎥ .

⎢⎣90 + 75 85 + 75 ⎥⎦
66 MATHEMATICS

This new matrix is the sum of the above two matrices. We observe that the sum of
two matrices is a matrix obtained by adding the corresponding elements of the given
matrices. Furthermore, the two matrices have to be of the same order.

⎡ a11 a12 a13 ⎤ ⎡b11 b12 b13 ⎤


Thus, if A = ⎢ ⎥ is a 2 × 3 matrix and B = ⎢ ⎥ is another
⎣ a21 a22 a23 ⎦ ⎣b21 b22 b23 ⎦

he
⎡ a11 + b11 a12 + b12 a13 + b13 ⎤
2×3 matrix. Then, we define A + B = ⎢ ⎥.
⎣ a21 + b21 a22 + b22 a23 + b23 ⎦

is
In general, if A = [aij] and B = [bij] are two matrices of the same order, say m × n.
Then, the sum of the two matrices A and B is defined as a matrix C = [cij]m × n, where
cij = aij + bij, for all possible values of i and j.

bl
⎡ 3 1 − 1⎤
⎡2 5 1⎤
Example 6 Given A = ⎢ ⎢
⎥ and B = ⎢ ⎥
1 ⎥ , find A + B
pu ⎣ 2 3 0 ⎦
⎢⎣
−2 3
2 ⎥⎦
be T

Since A, B are of the same order 2 × 3. Therefore, addition of A and B is defined


and is given by
re
o R

⎡ 2 + 3 1 + 5 1 − 1⎤ ⎡ 2 + 3 1 + 5 0 ⎤
A+B = ⎢ ⎥ ⎢
1⎥ = ⎢

⎢2 − 2 3+3 0+ 0 6
1⎥
tt E

⎢⎣ 2⎥⎦ ⎢⎣ 2 ⎥⎦

$ Note
C

1. We emphasise that if A and B are not of the same order, then A + B is not
no N

⎡ 2 3⎤ ⎡1 2 3⎤
defined. For example if A = ⎢ ⎥ , B=⎢ ⎥ , then A + B is not defined.
⎣1 0 ⎦ ⎣1 0 1 ⎦
2. We may observe that addition of matrices is an example of binary operation
©

on the set of matrices of the same order.

3.4.2 Multiplication of a matrix by a scalar


Now suppose that Fatima has doubled the production at a factory A in all categories
(refer to 3.4.1).
MATRICES 67

Previously quantities (in standard units) produced by factory A were

he
Revised quantities produced by factory A are as given below:

Boys Girls
1 ⎡ 2 × 80 2 × 60 ⎤

is
2 ⎢⎢ 2 × 75 2 × 65⎥⎥
3 ⎢⎣ 2 × 90 2 × 85 ⎥⎦

bl
⎡160 120 ⎤
pu
This can be represented in the matrix form as ⎢150
⎢ 130 ⎥⎥ . We observe that
⎢⎣180 170 ⎥⎦
be T

the new matrix is obtained by multiplying each element of the previous matrix by 2.
re
In general, we may define multiplication of a matrix by a scalar as follows: if
o R

A = [aij] m × n is a matrix and k is a scalar, then kA is another matrix which is obtained


by multiplying each element of A by the scalar k.
tt E

In other words, kA = k [aij] m × n = [k (aij)] m × n, that is, (i, j)th element of kA is kaij
for all possible values of i and j.
C

⎡ 3 1 1.5⎤
⎢ ⎥
For example, if A = ⎢ 5 7 −3 ⎥ , then
no N

⎢2 0 5⎥
⎣ ⎦

⎡ 3 1 1.5⎤ ⎡ 9 3 4.5⎤
©

⎢ ⎥ ⎢ ⎥
3A = 3 ⎢ 5 7 −3 ⎥ = ⎢3 5 21 −9 ⎥
⎢2 0 5⎥ ⎢ 6 0 15 ⎥⎦
⎣ ⎦ ⎣
Negative of a matrix The negative of a matrix is denoted by – A. We define
– A = (– 1) A.
68 MATHEMATICS

⎡ 3 1⎤
For example, let A= ⎢ ⎥ , then – A is given by
⎣ −5 x ⎦

⎡ 3 1 ⎤ ⎡ −3 − 1 ⎤
– A = (– 1) A = (−1) ⎢ ⎥=⎢ ⎥
⎣ −5 x ⎦ ⎣ 5 − x ⎦

he
Difference of matrices If A = [aij], B = [bij] are two matrices of the same order,
say m × n, then difference A – B is defined as a matrix D = [dij], where dij = aij – bij,
for all value of i and j. In other words, D = A – B = A + (–1) B, that is sum of the matrix
A and the matrix – B.

is
⎡1 2 3⎤ ⎡ 3 −1 3 ⎤
Example 7 If A = ⎢ ⎥ and B = ⎢ ⎥ , then find 2A – B.
⎣ 2 3 1⎦ ⎣ −1 0 2 ⎦

bl
Solution We have
1 2 3 3 1 3
2A – B = 2
pu 2 3 1 1 0 2
⎡ 2 4 6 ⎤ ⎡ −3 1 −3⎤
= ⎢ ⎥+⎢ ⎥
⎣ 4 6 2 ⎦ ⎣ 1 0 −2 ⎦
be T
re
⎡ 2 − 3 4 + 1 6 − 3 ⎤ ⎡ −1 5 3 ⎤
⎥=⎢
o R

= ⎢ ⎥
⎣ 4 + 1 6 + 0 2 − 2⎦ ⎣ 5 6 0⎦
3.4.3 Properties of matrix addition
tt E

The addition of matrices satisfy the following properties:


(i) Commutative Law If A = [aij], B = [bij] are matrices of the same order, say
C

m × n, then A + B = B + A.
Now A + B = [aij] + [bij] = [aij + bij]
no N

= [bij + aij] (addition of numbers is commutative)


= ([bij] + [aij]) = B + A
(ii) Associative Law For any three matrices A = [aij], B = [bij], C = [cij] of the
same order, say m × n, (A + B) + C = A + (B + C).
©

Now (A + B) + C = ([aij] + [bij]) + [cij]


= [aij + bij] + [cij] = [(aij + bij) + cij]
= [aij + (bij + cij)] (Why?)
= [aij] + [(bij + cij)] = [aij] + ([bij] + [cij]) = A + (B + C)
MATRICES 69

(iii) Existence of additive identity Let A = [a ij] be an m × n matrix and


O be an m × n zero matrix, then A + O = O + A = A. In other words, O is the
additive identity for matrix addition.
(iv) The existence of additive inverse Let A = [aij]m × n be any matrix, then we
have another matrix as – A = [– aij]m × n such that A + (– A) = (– A) + A= O. So
– A is the additive inverse of A or negative of A.

he
3.4.4 Properties of scalar multiplication of a matrix
If A = [aij] and B = [bij] be two matrices of the same order, say m × n, and k and l are
scalars, then

is
(i) k(A +B) = k A + kB, (ii) (k + l)A = k A + l A
(ii) k (A + B) = k ([aij] + [bij])

bl
= k [aij + bij] = [k (aij + bij)] = [(k aij) + (k bij)]
= [k aij] + [k bij] = k [aij] + k [bij] = kA + kB
(iii) ( k + l) A = (k + l) [aij]
pu
= [(k + l) aij] + [k aij] + [l aij] = k [aij] + l [aij] = k A + l A
be T

⎡8 0 ⎤ ⎡ 2 −2 ⎤
re
⎢ ⎥ ⎢ ⎥
Example 8 If A = ⎢ 4 −2 ⎥ and B = ⎢ 4 2 ⎥ , then find the matrix X, such that
o R

⎢⎣ 3 6 ⎥⎦ ⎢⎣ −5 1 ⎥⎦
2A + 3X = 5B.
tt E

Solution We have 2A + 3X = 5B
C

or 2A + 3X – 2A = 5B – 2A
or 2A – 2A + 3X = 5B – 2A (Matrix addition is commutative)
no N

or O + 3X = 5B – 2A (– 2A is the additive inverse of 2A)


or 3X = 5B – 2A (O is the additive identity)
1
or X= (5B – 2A)
©

⎛ ⎡ 2 −2 ⎤ ⎡8 0 ⎤ ⎞ ⎛ ⎡ 10 −10 ⎤ ⎡ −16 0 ⎤ ⎞
1⎜ ⎢ ⎥ ⎢ ⎥ ⎟ 1 ⎜⎢ ⎥ ⎢ −8 4 ⎥ ⎟
or X = ⎜ 5 ⎢ 4 2 ⎥ − 2 ⎢ 4 −2 ⎥ ⎟ = ⎜ ⎢ 20 10 ⎥ + ⎢ ⎥⎟
3⎜ 3 ⎜ ⎢ −25 5 ⎥
⎝ ⎢⎣ −5 1 ⎥⎦ ⎢⎣ 3 6 ⎥⎦ ⎟⎠ ⎝⎣ ⎦ ⎢⎣ −6 −12 ⎥⎦ ⎟⎠
70 MATHEMATICS

⎡ −10 ⎤
⎢ −2 3 ⎥
⎡ 10 − 16 −10 + 0 ⎤ ⎡ − 6 −10 ⎤ ⎢ ⎥
1⎢ ⎥ 1⎢ ⎥ ⎢ 4 14 ⎥
= ⎢ 20 − 8 10 + 4 ⎥ = 3⎢ 12 14 ⎥ = ⎢
3 3 ⎥
⎢⎣ −25 − 6 5 − 12 ⎥⎦ ⎢⎣ −31 −7 ⎥⎦ ⎢ ⎥
⎢ −31 −7 ⎥

he
⎢⎣ 3 3 ⎥⎦

⎡5 2⎤ ⎡3 6 ⎤
Example 9 Find X and Y, if X + Y = ⎢ ⎥ and X − Y = ⎢ ⎥.
⎣0 9 ⎦ ⎣ 0 −1⎦

is
⎡5 2⎤ ⎡3 6 ⎤
Solution We have ( X + Y ) + ( X − Y ) = ⎢ ⎥+⎢ ⎥.
⎣ 0 9 ⎦ ⎣0 −1⎦

bl
⎡8 8⎤ ⎡8 8⎤
or (X + X) + (Y – Y) = ⎢ ⎥ ⇒ 2X = ⎢ ⎥
pu ⎣ 0 8⎦ ⎣0 8⎦

1 ⎡8 8⎤ ⎡ 4 4 ⎤
or X= ⎢0 8⎥ = ⎢ 0 4 ⎥
be T

2 ⎣ ⎦ ⎣ ⎦
re
⎡5 2⎤ ⎡3 6 ⎤
o R

Also (X + Y) – (X – Y) = ⎢ ⎥−⎢ ⎥
⎣ 0 9 ⎦ ⎣0 −1⎦
tt E

⎡5 − 3 2 − 6 ⎤ ⎡ 2 −4 ⎤
or (X – X) + (Y + Y) = ⎢ ⎥ ⇒ 2Y = ⎢ ⎥
⎣ 0 9 + 1⎦ ⎣ 0 10 ⎦
C

1 ⎡ 2 − 4 ⎤ ⎡ 1 −2 ⎤
or Y= ⎢ 0 10 ⎥ = ⎢ 0 5 ⎥
2 ⎣ ⎦ ⎣ ⎦
no N

Example 10 Find the values of x and y from the following equation:

⎡x 5 ⎤ ⎡3 −4 ⎤ ⎡7 6⎤
2⎢ ⎥ +⎢ ⎥ = ⎢ ⎥
⎣7 y − 3 ⎦ ⎣1 2 ⎦ ⎣15 14 ⎦
©

Solution We have

⎡x 5 ⎤ ⎡3 −4 ⎤ ⎡7 6⎤ ⎡2 x 10 ⎤ ⎡3 − 4 ⎤ ⎡ 7 6 ⎤
+ ⇒ ⎢ ⎥+⎢ =
2⎢
y − 3⎥⎦ ⎢⎣1 2 ⎥⎦
= ⎢ ⎥
⎣7 ⎣15 14 ⎦ ⎣14 2 y − 6 ⎦ ⎣1 2 ⎥⎦ ⎢⎣15 14 ⎥⎦
MATRICES 71

⎡2x + 3 10 − 4 ⎤ ⎡7 6⎤ ⎡2 x + 3 6 ⎤ ⎡7 6⎤
⎢ 14 + 1 2 y − 6 + 2 ⎥ = ⎢ ⇒ ⎢ =
⎥ 2 y − 4 ⎥⎦ ⎢⎣15 14 ⎥⎦
or
⎣ ⎦ ⎣15 14 ⎦ ⎣ 15
or 2x + 3 = 7 and 2y – 4 = 14 (Why?)
or 2x = 7 – 3 and 2y = 18
4 18

he
or x= and y=
2 2
i.e. x =2 and y = 9.

Example 11 Two farmers Ramkishan and Gurcharan Singh cultivates only three

is
varieties of rice namely Basmati, Permal and Naura. The sale (in Rupees) of these
varieties of rice by both the farmers in the month of September and October are given
by the following matrices A and B.

bl
pu
be T
re
o R
tt E

(i) Find the combined sales in September and October for each farmer in each
C

variety.
(ii) Find the decrease in sales from September to October.
no N

(iii) If both farmers receive 2% profit on gross sales, compute the profit for each
farmer and for each variety sold in October.
Solution
(i) Combined sales in September and October for each farmer in each variety is
©

given by
72 MATHEMATICS

(ii) Change in sales from September to October is given by

he
2
(iii) 2% of B = × B = 0.02 × B
100

is
= 0.02

bl
=
pu
Thus, in October Ramkishan receives Rs 100, Rs 200 and Rs 120 as profit in the
sale of each variety of rice, respectively, and Grucharan Singh receives profit of Rs
be T

400, Rs 200 and Rs 200 in the sale of each variety of rice, respectively.
re
o R

3.4.5 Multiplication of matrices


Suppose Meera and Nadeem are two friends. Meera wants to buy 2 pens and 5 story
books, while Nadeem needs 8 pens and 10 story books. They both go to a shop to
tt E

enquire about the rates which are quoted as follows:


Pen – Rs 5 each, story book – Rs 50 each.
C

How much money does each need to spend? Clearly, Meera needs Rs (5 × 2 + 50 × 5)
that is Rs 260, while Nadeem needs (8 × 5 + 50 × 10) Rs, that is Rs 540. In terms of
matrix representation, we can write the above information as follows:
no N

Requirements Prices per piece (in Rupees) Money needed (in Rupees)

⎡2 5 ⎤ ⎡5⎤ ⎡ 5 × 2 + 5 × 50 ⎤ ⎡ 260 ⎤
⎢ 8 10 ⎥ ⎢50 ⎥ ⎢ 8 × 5 + 10 × 50⎥ = ⎢ 540 ⎥
⎣ ⎦ ⎣ ⎦ ⎣ ⎦ ⎣ ⎦
©

Suppose that they enquire about the rates from another shop, quoted as follows:
pen – Rs 4 each, story book – Rs 40 each.
Now, the money required by Meera and Nadeem to make purchases will be
respectively Rs (4 × 2 + 40 × 5) = Rs 208 and Rs (8 × 4 + 10 × 40) = Rs 432
MATRICES 73

Again, the above information can be represented as follows:


Requirements Prices per piece (in Rupees) Money needed (in Rupees)

⎡2 5 ⎤ ⎡4⎤ ⎡ 4 × 2 + 40 × 5 ⎤ ⎡ 208 ⎤
⎢ 8 10 ⎥ ⎢ 40 ⎥ ⎢ 8 × 4 + 10 × 4 0⎥ = ⎢ 432 ⎥
⎣ ⎦ ⎣ ⎦ ⎣ ⎦ ⎣ ⎦
Now, the information in both the cases can be combined and expressed in terms of

he
matrices as follows:
Requirements Prices per piece (in Rupees) Money needed (in Rupees)

⎡2 5 ⎤ ⎡5 4⎤ ⎡ 5 × 2 + 5 × 50 4 × 2 + 40 × 5 ⎤
⎢50 40 ⎥ ⎢ 8 × 5 + 10 × 5 0 8 × 4 + 10 × 4 0⎥

is
⎢ 8 10 ⎥
⎣ ⎦ ⎣ ⎦ ⎣ ⎦

⎡ 260 208 ⎤

bl
= ⎢ ⎥
⎣ 540 432 ⎦
The above is an example of multiplication of matrices. We observe that, for
pu
multiplication of two matrices A and B, the number of columns in A should be equal to
the number of rows in B. Furthermore for getting the elements of the product matrix,
be T

we take rows of A and columns of B, multiply them element-wise and take the sum.
Formally, we define multiplication of matrices as follows:
re
The product of two matrices A and B is defined if the number of columns of A is
o R

equal to the number of rows of B. Let A = [aij] be an m × n matrix and B = [bjk] be an


n × p matrix. Then the product of the matrices A and B is the matrix C of order m × p.
tt E

To get the (i, k)th element cik of the matrix C, we take the ith row of A and kth column
of B, multiply them elementwise and take the sum of all these products. In other words,
if A = [aij]m × n, B = [bjk]n × p, then the ith row of A is [ai1 ai2 ... ain] and the kth column of
C

⎡ b1k ⎤
⎢b ⎥ n
⎢ 2k ⎥
no N

B is ⎢ .. ⎥ , then cik = ai1 b1k + ai2 b2k + ai3 b3k + ... + ain bnk = ∑ aij b jk .
j =1
⎢ . ⎥
⎢b ⎥
⎣ nk ⎦
©

The matrix C = [cik]m × p is the product of A and B.

⎡ 2 7⎤
⎡1 −1 2 ⎤ ⎢ −1 1 ⎥ , then the product CD is defined
For example, if C = ⎢ ⎥ and D = ⎢ ⎥
⎣0 3 4 ⎦ ⎢⎣ 5 − 4 ⎥⎦
74 MATHEMATICS

⎡ 2 7⎤
⎡1 −1 2⎤ ⎢ ⎥
and is given by CD = ⎢ ⎥ ⎢ −1 1 ⎥ . This is a 2 × 2 matrix in which each
⎣ 0 3 4 ⎦ ⎢ 5 − 4⎥
⎣ ⎦
entry is the sum of the products across some row of C with the corresponding entries

he
down some column of D. These four computations are

is
bl
pu
be T
re
o R

⎡13 −2 ⎤
tt E

Thus CD = ⎢ ⎥
⎣17 −13⎦
C

⎡6 9⎤ ⎡ 2 6 0⎤
Example 12 Find AB, if A = ⎢ ⎥ and B = ⎢ ⎥.
⎣ 2 3⎦ ⎣7 9 8 ⎦
no N

Solution The matrix A has 2 columns which is equal to the number of rows of B.
Hence AB is defined. Now

⎡ 6(2) + 9(7) 6(6) + 9(9) 6(0) + 9(8) ⎤


©

AB = ⎢ ⎥
⎣ 2(2) + 3(7) 2(6) + 3(9) 2(0) + 3(8) ⎦

⎡12 + 63 36 + 81 0 + 72 ⎤ ⎡ 75 117 72 ⎤
=⎢ ⎥ = ⎢ ⎥
⎣ 4 + 21 12 + 27 0 + 24⎦ ⎣ 25 39 24 ⎦
MATRICES 75

Remark If AB is defined, then BA need not be defined. In the above example, AB is


defined but BA is not defined because B has 3 column while A has only 2 (and not 3)
rows. If A, B are, respectively m × n, k × l matrices, then both AB and BA are defined
if and only if n = k and l = m. In particular, if both A and B are square matrices of the
same order, then both AB and BA are defined.
Non-commutativity of multiplication of matrices

he
Now, we shall see by an example that even if AB and BA are both defined, it is not
necessary that AB = BA.

⎡ 2 3⎤
⎡ 1 −2 3⎤
and B = ⎢ 4 5⎥⎥ , then find AB, BA. Show that

is
Example 13 If A = ⎢ ⎥
⎣− 4 2 5 ⎦ ⎢⎣ 2 1⎥⎦

bl
AB ≠ BA.
Solution Since A is a 2 × 3 matrix and B is 3 × 2 matrix. Hence AB and BA are both
defined and are matrices of order 2 × 2 and 3 × 3, respectively. Note that
pu
⎡ 2 3⎤
⎡ 1 −2 3⎤ ⎢ ⎥ ⎡ 2−8+ 6 3 − 10 + 3 ⎤ ⎡ 0 − 4 ⎤
AB = ⎢ ⎢ ⎥=⎢ 3 ⎥⎦
be T


5⎦ ⎢
4 5 ⎥ =
⎣− 4 2 ⎣ −8 + 8 + 10 −12 + 10 + 5⎦ ⎣10
⎢⎣ 2 1⎥⎦
re
o R

⎡2 3⎤ ⎡ 2 − 12 − 4 + 6 6 + 15 ⎤ ⎡ −10 2 21⎤
⎡ 1 −2 3 ⎤ ⎢ ⎢ ⎥
and BA = ⎢⎢ 4 ⎥
5⎥ ⎢ ⎥ = ⎢ 4 − 20 −8 + 10 12 + 25⎥⎥ = ⎢ −16 2 37 ⎥
−4 2 5 ⎦
tt E

⎢⎣ 2 1⎥⎦ ⎣ ⎢⎣ 2 − 4 − 4 + 2 6 + 5 ⎥⎦ ⎢⎣ −2 −2 11 ⎥⎦
Clearly AB ≠ BA
C

In the above example both AB and BA are of different order and so AB ≠ BA. But
one may think that perhaps AB and BA could be the same if they were of the same
no N

order. But it is not so, here we give an example to show that even if AB and BA are of
same order they may not be same.

⎡1 0 ⎤ ⎡0 1 ⎤ ⎡ 0 1⎤
Example 14 If A = ⎢ ⎥ and B = ⎢ ⎥ , then AB = ⎢ ⎥.
©

⎣ 0 −1⎦ ⎣1 0⎦ ⎣ −1 0 ⎦

⎡ 0 −1⎤
and BA = ⎢ ⎥ . Clearly AB ≠ BA.
⎣1 0 ⎦
Thus matrix multiplication is not commutative.
76 MATHEMATICS

$ Note This does not mean that AB ≠ BA for every pair of matrices A, B for
which AB and BA, are defined. For instance,
⎡1 0 ⎤ ⎡3 0⎤ ⎡3 0⎤
If A = ⎢ ⎥ , B=⎢ ⎥ , then AB = BA = ⎢ ⎥
⎣0 2⎦ ⎣0 4⎦ ⎣0 8 ⎦
Observe that multiplication of diagonal matrices of same order will be commutative.

he
Zero matrix as the product of two non zero matrices
We know that, for real numbers a, b if ab = 0, then either a = 0 or b = 0. This need
not be true for matrices, we will observe this through an example.

is
⎡ 0 −1⎤ ⎡3 5⎤
Example 15 Find AB, if A = ⎢ ⎥ and B = ⎢ ⎥.
⎣0 2⎦ ⎣0 0⎦

bl
⎡ 0 −1 ⎤ ⎡ 3 5 ⎤ ⎡ 0 0 ⎤
Solution We have AB = ⎢ ⎥⎢ ⎥=⎢ ⎥.
⎣0 2 ⎦ ⎣0 0⎦ ⎣0 0 ⎦
pu
Thus, if the product of two matrices is a zero matrix, it is not necessary that one of
the matrices is a zero matrix.
be T

3.4.6 Properties of multiplication of matrices


re
The multiplication of matrices possesses the following properties, which we state without
o R

proof.
1. The associative law For any three matrices A, B and C. We have
tt E

(AB) C = A (BC), whenever both sides of the equality are defined.


2. The distributive law For three matrices A, B and C.
C

(i) A (B+C) = AB + AC
(ii) (A+B) C = AC + BC, whenever both sides of equality are defined.
no N

3. The existence of multiplicative identity For every square matrix A, there


exist an identity matrix of same order such that IA = AI = A.
Now, we shall verify these properties by examples.
©

⎡1 1 −1⎤ ⎡ 1 3⎤
⎢ ⎡1 2 3 − 4 ⎤
Example 16 If A = ⎢ 2 0 3 ⎥ , B = ⎢⎢ 0 2⎥⎥ and C = ⎢

⎥ , find
⎢⎣ 3 −1 2 ⎥⎦ ⎢⎣−1 4⎥⎦ ⎣ 2 0 −2 1 ⎦

A(BC), (AB)C and show that (AB)C = A(BC).


MATRICES 77

⎡1 1 −1⎤ ⎡ 1 3 ⎤ ⎡ 1 + 0 + 1 3 + 2 − 4 ⎤ ⎡ 2 1 ⎤
Solution We have AB = ⎢ 2 0 3 ⎥⎥ ⎢⎢ 0 2 ⎥⎥ = ⎢⎢ 2 + 0 − 3 6 + 0 + 12 ⎥⎥ = ⎢⎢−1 18⎥⎥

⎣⎢ 3 −1 2 ⎥⎦ ⎢⎣−1 4 ⎦⎥ ⎣⎢3 + 0 − 2 9 − 2 + 8 ⎦⎥ ⎣⎢ 1 15⎦⎥

⎡2 1⎤ ⎡ 2+2 4 + 0 6 − 2 − 8 +1 ⎤
⎡1 2 3 − 4⎤ ⎢

he
⎢ ⎥
(AB) (C) = ⎢−1 18⎥ ⎢ ⎥ = ⎢ −1 + 36 −2 + 0 −3 − 36 4 + 18⎥⎥

⎢⎣ 1 15⎥⎦ ⎣ ⎦ ⎢ +
2 0 2 1
⎣ 1 30 2 + 0 3 − 30 − 4 + 15⎦⎥

⎡4 4 4 −7 ⎤

is
⎢35 −2 −39 22 ⎥
= ⎢ ⎥
⎢⎣31 2 −27 11⎥⎦

bl
⎡ 1 3⎤ ⎡ 1 + 6 2 + 0 3 − 6 −4 + 3 ⎤
⎡1 2 3 −4 ⎤ ⎢
BC = ⎢ 0 ⎥ ⎥
Now
pu ⎢ 2⎥ ⎢
− ⎥ = ⎢ 0 + 4 0 + 0 0 − 4 0 + 2⎥
4 ⎦⎥ ⎣ ⎦ ⎢− +
2 0 2 1
⎣⎢ −1 ⎣ 1 8 −2 + 0 −3 − 8 4 + 4 ⎥⎦
be T

⎡7 2 −3 −1 ⎤
re
⎢ ⎥
o R

= ⎢ 4 0 −4 2 ⎥
⎢⎣7 −2 −11 8 ⎥⎦
tt E

⎡1 1 −1 ⎤ ⎡7 2 −3 −1 ⎤
A(BC) = ⎢⎢ 2 0 3 ⎥⎥ ⎢⎢ 4 0 −4 2 ⎥⎥
C

Therefore
⎢⎣ 3 −1 2 ⎥⎦ ⎢⎣7 −2 −11 8 ⎥⎦
no N

⎡ 7 + 4 − 7 2 + 0 + 2 −3 − 4 + 11 −1 + 2 − 8 ⎤
⎢ ⎥
= ⎢14 + 0 + 21 4 + 0 − 6 −6 + 0 − 33 −2 + 0 + 24 ⎥
⎢⎣ 21 − 4 + 14 6 + 0 − 4 −9 + 4 − 22 −3 − 2 + 16 ⎥⎦
©

⎡4 4 4 −7 ⎤
⎢35 −2 −39 22 ⎥
= ⎢ ⎥ . Clearly, (AB) C = A (BC)
⎢⎣31 2 −27 11⎥⎦
78 MATHEMATICS

⎡ 0 6 7⎤ ⎡0 1 1 ⎤ ⎡2⎤
Example 17 If A = ⎢⎢ − 6 0 8 ⎥ , B = ⎢1 0 2 ⎥ , C = ⎢⎢−2 ⎥⎥
⎥ ⎢ ⎥
⎢⎣ 7 −8 0 ⎥⎦ ⎢⎣1 2 0 ⎥⎦ ⎢⎣ 3 ⎥⎦
Calculate AC, BC and (A + B)C. Also, verify that (A + B)C = AC + BC
⎡ 0 7 8⎤

he
⎢ ⎥
Solution Now, A + B = ⎢−5 0 10 ⎥
⎣⎢ 8 − 6 0 ⎥⎦

is
⎡ 0 7 8⎤ ⎡2 ⎤ ⎡ 0 − 14 + 24 ⎤ ⎡10 ⎤
⎢ 10 ⎥⎥ ⎢−2 ⎥ = ⎢−10 + 0 + 30 ⎥ = ⎢ 20 ⎥
So (A + B) C = ⎢−5 0 ⎢ ⎥ ⎢ ⎥ ⎢ ⎥
⎢⎣ 8 − 6 0 ⎥⎦ ⎢⎣ 3 ⎥⎦ ⎢⎣ 16 + 12 + 0 ⎥⎦ ⎢⎣ 28⎥⎦

bl
⎡ 0 6 7 ⎤ ⎡ 2 ⎤ ⎡ 0 − 12 + 21 ⎤ ⎡9⎤
⎢ ⎥ ⎢ ⎥ ⎢ ⎥ = ⎢12 ⎥
pu
Further AC = ⎢ −6 0 8 ⎥ ⎢−2 ⎥ = ⎢−12 + 0 + 24 ⎥ ⎢ ⎥
⎢⎣ 7 − 8 0 ⎥⎦ ⎢⎣ 3 ⎥⎦ ⎢⎣ 14 + 16 + 0 ⎥⎦ ⎢⎣30 ⎥⎦
be T

⎡0 1 1 ⎤ ⎡ 2 ⎤ ⎡ 0 − 2 + 3⎤ ⎡ 1 ⎤
re
⎢ ⎥ ⎢ ⎥ = ⎢ 2 + 0 + 6⎥ = ⎢ 8 ⎥
BC = ⎢1 0 2 ⎥ ⎢−2
o R

and ⎥ ⎢ ⎥ ⎢ ⎥
⎢⎣1 2 0 ⎥⎦ ⎢⎣ 3 ⎥⎦ ⎢⎣ 2 − 4 + 0 ⎥⎦ ⎢⎣− 2 ⎥⎦
tt E

⎡9 ⎤ ⎡ 1 ⎤ ⎡10 ⎤
⎢ ⎥ ⎢ ⎥ ⎢ ⎥
So AC + BC = ⎢12 ⎥ + ⎢ 8 ⎥ = ⎢ 20 ⎥
C

⎢⎣30 ⎥⎦ ⎢⎣−2 ⎥⎦ ⎢⎣ 28⎥⎦


Clearly, (A + B) C = AC + BC
no N

⎡1 2 3⎤
⎢ 1⎥⎥ , then show that A3 – 23A – 40 I = O
Example 18 If A = ⎢ 3 −2
⎢⎣ 4 2 1⎥⎦
©

⎡1 2 3⎤ ⎡ 1 2 3⎤ ⎡19 4 8 ⎤
Solution We have A = A.A = ⎢ 3 −2
2
1⎥⎥ ⎢⎢ 3 −2 1⎥⎥ = ⎢⎢1 12 8 ⎥⎥

⎢⎣ 4 2 1⎥⎦ ⎢⎣ 4 2 1⎥⎦ ⎢⎣14 6 15 ⎥⎦
MATRICES 79

⎡1 2 3⎤ ⎡19 4 8 ⎤ ⎡ 63 46 69 ⎤
So A = A A = ⎢⎢ 3 −2
3 2
1⎥⎥ ⎢⎢1 12 8 ⎥⎥ = ⎢⎢ 69 −6 23⎥⎥
⎢⎣ 4 2 1⎥⎦ ⎢⎣14 6 15 ⎥⎦ ⎢⎣92 46 63⎥⎦
Now
⎡ 63 46 69 ⎤ ⎡1 2 3⎤ ⎡1 0 0⎤

he
3 ⎢ 69 −6 23⎥ – 23 ⎢ 3 −2 1⎥ – 40 ⎢⎢ 0 1 0⎥⎥

A – 23A – 40I = ⎢ ⎥ ⎢
⎢⎣92 46 63⎥⎦ ⎢⎣ 4 2 1⎥⎦ ⎢⎣ 0 0 1 ⎥⎦

is
⎡ 63 46 69 ⎤ ⎡ −23 −46 −69 ⎤ ⎡−40 0 0 ⎤
⎢69 −6 23⎥ + ⎢ −69 46 −23⎥ + ⎢ 0 −40 0 ⎥⎥
= ⎢ ⎥ ⎢ ⎥ ⎢
⎢⎣92 46 63⎥⎦ ⎢⎣ −92 −46 −23⎥⎦ ⎢⎣ 0 0 −40 ⎥⎦

bl
⎡63 − 23 − 40 46 − 46 + 0 69 − 69 + 0 ⎤
⎢ ⎥
pu = ⎢69 − 69 + 0 −6 + 46 − 40 23 − 23 + 0 ⎥
⎢⎣ 92 − 92 + 0 46 − 46 + 0 63 − 23 − 40 ⎥⎦
be T

⎡0 0 0⎤
re
⎢ ⎥
= ⎢0 0 0⎥ = O
o R

⎢⎣0 0 0 ⎥⎦
tt E

Example 19 In a legislative assembly election, a political group hired a public relations


firm to promote its candidate in three ways: telephone, house calls, and letters. The
cost per contact (in paise) is given in matrix A as
C

Cost per contact


⎡ 40 ⎤ Telephone
⎢ ⎥ Housecall
no N

A= ⎢ 100 ⎥
⎣⎢ 50 ⎦⎥ Letter
The number of contacts of each type made in two cities X and Y is given by
©

Telephone Housecall Letter


⎡1000 500 5000 ⎤ → X
B=⎢ . Find the total amount spent by the group in the two
⎣3000 1000 10,000⎥⎦ → Y
cities X and Y.
80 MATHEMATICS

Solution We have
⎡ 40,000 + 50,000 + 250,000 ⎤ → X
BA = ⎢ ⎥
⎣120,000 + 100,000 +500,000 ⎦ → Y
⎡340,000 ⎤ → X
= ⎢ ⎥
⎣ 720,000 ⎦ → Y

he
So the total amount spent by the group in the two cities is 340,000 paise and
720,000 paise, i.e., Rs 3400 and Rs 7200, respectively.

EXERCISE 3.2

is
⎡2 4⎤ ⎡ 1 3⎤ ⎡−2 5⎤
1. Let A = ⎢ ⎥ ,B=⎢ ⎥ ,C=⎢
⎣3 2⎦ ⎣−2 5⎦ ⎣3 4 ⎥⎦

bl
Find each of the following:
(i) A + B (ii) A – B (iii) 3A – C
pu
(iv) AB (v) BA
2. Compute the following:
⎡a b ⎤ ⎡a b ⎤ ⎡a 2 + b2 b 2 + c 2 ⎤ ⎡ 2ab 2bc ⎤
be T

(i) ⎢ + (ii) ⎢ 2 ⎥ + ⎢ ⎥
⎣−b a ⎥⎦ ⎢⎣ b a ⎥⎦ ⎢⎣ a + c
2
a 2 + b 2 ⎥⎦ ⎣−2ac −2ab ⎦
re
o R

⎡−1
4 −6⎤ ⎡12 7 6 ⎤
⎢ ⎡cos 2 x sin 2 x ⎤ ⎡ sin 2 x cos 2 x ⎤
5 16 ⎥⎥ + ⎢⎢ 8 0 5 ⎥⎥ (iv) ⎢
(iii) ⎢ 8 ⎥+⎢ 2 ⎥
⎢⎣ sin x cos x ⎥⎦ ⎢⎣cos x sin x ⎥⎦
2 2 2
⎢⎣ 2
8 5 ⎥⎦ ⎢⎣ 3 2 4 ⎥⎦
tt E

3. Compute the indicated products.


C

⎡1 ⎤
⎡a b ⎤ ⎡ a −b ⎤ ⎢ ⎥ ⎡ 1 −2 ⎤ ⎡1 2 3⎤
(i) ⎢ (ii) ⎢ 2⎥ [2 3 4]
a ⎥⎦ ⎢⎣ b a ⎥⎦
(iii) ⎢ ⎥⎢ ⎥
⎣−b ⎢⎣ 3⎥⎦ ⎣ 2 3 ⎦ ⎣ 2 3 1⎦
no N

⎡ 2 3 4 ⎤ ⎡1 −3 5⎤ ⎡2 1⎤
⎢ ⎡ 1 0 1⎤
(iv) ⎢ 3 4 5 ⎥ ⎢ 0 2 4⎥⎥ 2 ⎥⎥ ⎢
−1 2 1⎥⎦
(v) ⎢ 3
⎢ ⎥ ⎢ ⎣
©

⎣⎢ 4 5 6 ⎦⎥ ⎢⎣ 3 0 5 ⎥⎦ ⎢⎣−1 1⎦ ⎥

⎡ 2 −3⎤
⎡ 3 −1 3 ⎤ ⎢ ⎥
(vi) ⎢ ⎥ ⎢1 0⎥
⎣−1 0 2 ⎦ ⎢3 1⎥
⎣ ⎦
MATRICES 81

⎡1 2 −3⎤ ⎡ 3 −1 2⎤ ⎡4 1 2⎤
⎢ ⎥ ⎢ 5 ⎥ and C = ⎢⎢ 0 3
⎥ 2⎥⎥ , then compute
4. If A = ⎢5 0 2 ⎥ , B = ⎢ 4 2
⎢⎣1 −1 1 ⎥⎦ ⎢⎣ 2 0 3⎥⎦ ⎢⎣ 1 −2 3⎥⎦
(A+B) and (B – C). Also, verify that A + (B – C) = (A + B) – C.

he
⎡2 5⎤ ⎡2 3 ⎤
⎢3 1 1⎥
3⎥ ⎢5 5
⎢ ⎥ ⎢ ⎥
5. If A = ⎢⎢
4⎥
and B = ⎢
1 2 1 2 4⎥
⎥ ⎢ , then compute 3A – 5B.
3 3 3 5 5 5⎥
⎢ ⎥ ⎢ ⎥

is
⎢ 7 2⎥ ⎢ 7 6 2⎥
2
⎣⎢ 3 3 ⎦⎥ ⎢⎣ 5 5 5 ⎦⎥

bl
⎡ cos θ sin θ ⎤ ⎡ sin θ − cos θ ⎤
6. Simplify cosθ ⎢ ⎥ + sinθ ⎢
⎣ − sin θ cos θ ⎦ ⎣ cos θ sin θ⎥⎦
pu
7. Find X and Y, if
⎡7 0 ⎤ ⎡3 0⎤
(i) X + Y = ⎢ and X – Y = ⎢
be T

⎥ ⎥
⎣ 2 5⎦ ⎣0 3⎦
re
o R

⎡ 2 3⎤ ⎡ 2 −2 ⎤
(ii) 2X + 3Y = ⎢ ⎥ and 3X + 2Y = ⎢ ⎥
⎣ 4 0⎦ ⎣ −1 5 ⎦
tt E

⎡3 2 ⎤ ⎡ 1 0⎤
8. Find X, if Y = ⎢ ⎥ and 2X + Y = ⎢ −3 2 ⎥
⎣1 4 ⎦ ⎣ ⎦
C

⎡ 1 3 ⎤ ⎡ y 0 ⎤ ⎡5 6 ⎤
9. Find x and y, if 2 ⎢ ⎥+⎢ ⎥=⎢ ⎥
⎣ 0 x ⎦ ⎣ 1 2 ⎦ ⎣1 8 ⎦
no N

⎡x z⎤ ⎡ 1 −1 ⎤ ⎡ 3 5⎤
10. Solve the equation for x, y, z and t, if 2 ⎢ ⎥ +3⎢ ⎥ =3⎢ ⎥
⎣y t⎦ ⎣0 2 ⎦ ⎣ 4 6⎦
©

⎡ 2⎤ ⎡−1 ⎤ ⎡10 ⎤
11. If x ⎢ ⎥ + y ⎢ ⎥ = ⎢ ⎥ , find the values of x and y.
⎣ 3⎦ ⎣ 1 ⎦ ⎣5 ⎦

⎡x y ⎤ ⎡ x 6 ⎤ ⎡ 4 x + y⎤
12. Given 3 ⎢ ⎥ =⎢ ⎥ +⎢
3 ⎥⎦
, find the values of x, y, z and w.
⎣ z w ⎦ ⎣ −1 2 w ⎦ ⎣ z + w
82 MATHEMATICS

⎡cos x − sin x 0 ⎤
13. If F ( x) = ⎢ sin x cos x 0 ⎥ , show that F(x) F(y) = F(x + y).
⎢ ⎥
⎢⎣ 0 0 1 ⎥⎦
14. Show that
⎡ 5 −1⎤ ⎡ 2 1 ⎤ ⎡ 2 1 ⎤ ⎡ 5 −1⎤

he
(i) ⎢ ⎥⎢ ⎥≠⎢ ⎥⎢ ⎥
⎣6 7 ⎦ ⎣ 3 4⎦ ⎣ 3 4⎦ ⎣6 7 ⎦

⎡1 2 3⎤ ⎡−1 1 0 ⎤ ⎡ −1 1 0 ⎤ ⎡1 2 3⎤
⎢ ⎥⎢ 1 ⎥⎥ ≠ ⎢⎢ 0 −1 1 ⎥⎥ ⎢⎢ 0 1 0⎥⎥
(ii) ⎢0 1 0 ⎥ ⎢ 0 −1

is
⎢⎣1 1 0 ⎥⎦ ⎢⎣ 2 3 4⎥⎦ ⎢⎣ 2 3 4 ⎥⎦ ⎢⎣1 1 0⎥⎦

bl
⎡2 0 1⎤
15. Find A – 5A + 6I, if A = ⎢⎢ 2 1
2
3⎥⎥
⎢⎣1 −1 0 ⎥⎦
pu
⎡1 0 2⎤
be T

16. If A = ⎢ 0 2 1 ⎥ , prove that A3 – 6A2 + 7A + 2I = 0


⎢ ⎥
re
⎢⎣ 2 0 3 ⎥⎦
o R

⎡ 3 −2 ⎤ ⎡1 0 ⎤
17. If A = ⎢ ⎥ and I= ⎢ ⎥ , find k so that A = kA – 2I
2

⎣ 4 −2 ⎦
tt E

⎣0 1 ⎦

⎡ α⎤
− tan ⎥
C

⎢ 0 2
18. If A = ⎢ ⎥ and I is the identity matrix of order 2, show that
⎢ tan α 0 ⎥
⎢⎣ ⎥⎦
no N

⎡cos α − sin α ⎤
I + A = (I – A) ⎢
⎣ sin α cos α ⎥⎦
©

19. A trust fund has Rs 30,000 that must be invested in two different types of bonds.
The first bond pays 5% interest per year, and the second bond pays 7% interest
per year. Using matrix multiplication, determine how to divide Rs 30,000 among
the two types of bonds. If the trust fund must obtain an annual total interest of:
(a) Rs 1800 (b) Rs 2000
MATRICES 83

20. The bookshop of a particular school has 10 dozen chemistry books, 8 dozen
physics books, 10 dozen economics books. Their selling prices are Rs 80, Rs 60
and Rs 40 each respectively. Find the total amount the bookshop will receive
from selling all the books using matrix algebra.
Assume X, Y, Z, W and P are matrices of order 2 × n, 3 × k, 2 × p, n × 3 and p × k,
respectively. Choose the correct answer in Exercises 21 and 22.

he
21. The restriction on n, k and p so that PY + WY will be defined are:
(A) k = 3, p = n (B) k is arbitrary, p = 2
(C) p is arbitrary, k = 3 (D) k = 2, p = 3
22. If n = p, then the order of the matrix 7X – 5Z is:

is
(A) p × 2 (B) 2 × n (C) n × 3 (D) p × n
3.5. Transpose of a Matrix

bl
In this section, we shall learn about transpose of a matrix and special types of matrices
such as symmetric and skew symmetric matrices.
pu
Definition 3 If A = [aij] be an m × n matrix, then the matrix obtained by interchanging
the rows and columns of A is called the transpose of A. Transpose of the matrix A is
denoted by A′ or (AT). In other words, if A = [aij]m × n, then A′ = [aji]n × m. For example,
be T
re
⎡ 3 5⎤ ⎡3 3 0⎤
o R

⎢ ⎥ ⎢ ⎥
if A = ⎢ 3 1 ⎥ , then A′ =
⎢5 1 −1 ⎥
⎢ 0 −1 ⎥ ⎢⎣ 5 ⎥⎦ 2 × 3
⎢ ⎥
tt E

⎣ 5 ⎦3 × 2
3.5.1 Properties of transpose of the matrices
C

We now state the following properties of transpose of matrices without proof. These
may be verified by taking suitable examples.
no N

For any matrices A and B of suitable orders, we have


(i) (A′)′ = A, (ii) (kA)′ = kA′ (where k is any constant)
(iii) (A + B)′ = A′ + B′ (iv) (A B)′ = B′ A′
©

⎡3 3 2⎤ ⎡ 2 −1 2 ⎤
Example 20 If A = ⎢ ⎥ and B = ⎢
2 4 ⎥⎦
, verify that
⎣4 2 0⎦ ⎣1
(i) (A′)′ = A, (ii) (A + B)′ = A′ + B′,
(iii) (kB)′ = kB′, where k is any constant.
84 MATHEMATICS

Solution
(i) We have

⎡ 3 4⎤
⎡3 3 2⎤ ⎢ ⎥ ′ ⎡3 3 2⎤
A= ⎢ ⎥ ⇒ A′ = ⎢ 3 2 ⎥ ⇒ ( A′ ) = ⎢ ⎥=A
⎣ 4 2 0 ⎦ ⎢ 2 0⎥ ⎣ 4 2 0 ⎦
⎣ ⎦

he
Thus (A′)′ = A
(ii) We have
⎡3 3 2⎤ ⎡ 2 −1 2 ⎤ ⎡5 3 − 1 4⎤
⇒A+B=⎢

is
A= ⎢ ⎥, B = ⎢1 ⎥ ⎥
⎣4 2 0⎦ ⎣ 2 4⎦ ⎣5 4 4⎦

bl
⎡ 5 5⎤
⎢ ⎥
Therefore (A + B)′ = ⎢ 3 − 1 4 ⎥
⎢ 4 4 ⎥⎦
pu ⎣

⎡ 3 4⎤ ⎡ 2 1⎤
be T

⎢ ⎥ ⎢ ⎥
Now A′ = ⎢ 3 2 ⎥ , B′ = ⎢−1 2 ⎥ ,
re
⎢ 2 0⎥ ⎢⎣ 2 4 ⎦⎥
⎣ ⎦
o R

⎡ 5 5⎤
⎢ ⎥
tt E

So A′ + B′ = ⎢ 3 −1 4 ⎥
⎢ 4 4 ⎥⎦

C

Thus (A + B)′ = A′ + B′
(iii) We have
no N

⎡ 2 −1 2 ⎤ ⎡ 2 k −k 2k ⎤
kB = k ⎢ =
⎣1 2 4 ⎥⎦ ⎢⎣ k 2k 4k ⎥⎦
©

⎡ 2k k ⎤ ⎡ 2 1⎤
⎢ −k 2k ⎥ = k ⎢−1 2⎥ = kB′
Then (kB)′ = ⎢ ⎥ ⎢ ⎥
⎢⎣ 2k 4k ⎥⎦ ⎢⎣ 2 4⎥⎦

Thus (kB)′ = kB′


MATRICES 85

⎡ −2⎤
⎢ ⎥
Example 21 If A = ⎢ 4 ⎥ , B = [1 3 −6] , verify that (AB)′ = B′A′.
⎢⎣ 5 ⎥⎦
Solution We have
⎡ −2 ⎤

he
⎢ ⎥
A = ⎢ 4 ⎥ , B = [1 3 − 6]
⎣⎢ 5 ⎦⎥

is
⎡ −2 ⎤ ⎡ −2 −6 12 ⎤
then
⎢ 4 ⎥ 1 3 −6
AB = ⎢ ⎥ [ ] = ⎢⎢ 4 12 −24⎥⎥
⎢⎣ 5 ⎥⎦ ⎢⎣ 5 15 −30 ⎥⎦

bl
⎡ 1⎤
A′ = [–2 4 5] , B′ = ⎢ 3 ⎥
Now
pu ⎢ ⎥
⎢⎣− 6 ⎥⎦
be T

⎡ 1⎤ ⎡ −2 4 5⎤
re
⎢ 3 ⎥ −2 4 5 = ⎢ −6 12 15 ⎥⎥ = (AB)′
B′A′ = ⎢ ⎥ [ ] ⎢
o R

⎢⎣− 6 ⎥⎦ ⎢⎣12 −24 −30 ⎥⎦


tt E

Clearly (AB)′ = B′A′

3.6 Symmetric and Skew Symmetric Matrices


C

Definition 4 A square matrix A = [aij] is said to be symmetric if A′ = A, that is,


[aij] = [aji] for all possible values of i and j.
no N

⎡ 3 2 3⎤
⎢ ⎥
For example A = ⎢ 2 −1.5 −1 ⎥ is a symmetric matrix as A′ = A
⎢ 3 −1 1 ⎥⎦

©

Definition 5 A square matrix A = [aij] is said to be skew symmetric matrix if


A′ = – A, that is aji = – aij for all possible values of i and j. Now, if we put i = j, we
have aii = – aii. Therefore 2aii = 0 or aii = 0 for all i’s.
This means that all the diagonal elements of a skew symmetric matrix are zero.
86 MATHEMATICS

⎡ 0 e f⎤
For example, the matrix B = ⎢ −e 0 g ⎥⎥ is a skew symmetric matrix as B′= –B

⎢⎣ − f −g 0 ⎥⎦

Now, we are going to prove some results of symmetric and skew-symmetric

he
matrices.
Theorem 1 For any square matrix A with real number entries, A + A′ is a symmetric
matrix and A – A′ is a skew symmetric matrix.
Proof Let B = A + A′, then

is
B′ = (A + A′)′
= A′ + (A′)′ (as (A + B)′ = A′ + B′)

bl
= A′ + A (as (A′)′ = A)
= A + A′ (as A + B = B + A)
pu = B
Therefore B = A + A′ is a symmetric matrix
be T

Now let C = A – A′
re
C′ = (A – A′)′ = A′ – (A′)′ (Why?)
o R

= A′ – A (Why?)
= – (A – A′) = – C
tt E

Therefore C = A – A′ is a skew symmetric matrix.


Theorem 2 Any square matrix can be expressed as the sum of a symmetric and a
C

skew symmetric matrix.


Proof Let A be a square matrix, then we can write
no N

1 1
A = (A + A′) + (A − A′)
2 2
From the Theorem 1, we know that (A + A′) is a symmetric matrix and (A – A′) is
©

1
a skew symmetric matrix. Since for any matrix A, (kA)′ = kA′, it follows that (A + A′)
2
1
is symmetric matrix and (A − A′) is skew symmetric matrix. Thus, any square
2
matrix can be expressed as the sum of a symmetric and a skew symmetric matrix.
MATRICES 87

⎡ 2 −2 −4 ⎤
Example 22 Express the matrix B = ⎢−1 3 4 ⎥ as the sum of a symmetric and a
⎢ ⎥
⎢⎣ 1 −2 −3⎥⎦
skew symmetric matrix.
Solution Here

he
⎡ 2 −1 1 ⎤
B′ = ⎢⎢− 2 3 −2 ⎥⎥
⎢⎣ −4 4 −3⎥⎦

is
⎡ −3 −3 ⎤
⎢ 2 2 2⎥
⎡ 4 −3 −3⎤ ⎢ ⎥
1⎢ ⎢ −3 1 ⎥,
2⎥⎥ = ⎢
1

bl
Let P = (B + B′) = − 3 6 3

2 2⎢ 2
⎢⎣ −3 2 −6 ⎥⎦ ⎢ −3 ⎥
⎢ 1 −3 ⎥
⎣⎢ 2 ⎦⎥
pu ⎡ −3 −3 ⎤
⎢ 2 2 2⎥
⎢ ⎥
−3
be T

Now P′ = ⎢ 3 1 ⎥= P
⎢2 ⎥
re
⎢ ⎥
⎢ −3 1
o R

−3 ⎥
⎣⎢ 2 ⎦⎥
1
P = (B + B′) is a symmetric matrix.
tt E

Thus
2
⎡ −1 −5 ⎤
⎢0 2 2⎥
C

⎡0 −1 −5⎤ ⎢ ⎥
1⎢ ⎥
Q = (B – B′) = ⎢1 0 6 ⎥ = ⎢⎢ 3⎥
1 1
Also, let 0
2 2 2 ⎥
no N

⎢⎣5 −6 0 ⎥⎦ ⎢ ⎥
⎢ 5 −3 0⎥
⎣⎢ 2 ⎦⎥
⎡ 1 5⎤
⎢ 0 2 3⎥
©

⎢ ⎥
⎢ −1
Then Q′ = 0 −3⎥ = − Q
⎢2 ⎥
⎢ ⎥
⎢ −5 3 0 ⎥
⎣⎢ 2 ⎦⎥
88 MATHEMATICS

1
Thus Q= (B – B′) is a skew symmetric matrix.
2

⎡ −3 −3 ⎤ ⎡ −1 −5 ⎤
⎢2 2 2⎥ ⎢
0
2 2 ⎥ ⎡2
⎢ ⎥ ⎢ ⎥ −2 −4 ⎤
−3
P+Q=⎢ 1 ⎥+⎢ 3 ⎥ = ⎢⎢−1 3 4 ⎥⎥ = B
1

he
Now 3 0
⎢2 ⎥ ⎢2 ⎥
⎢ ⎥ ⎢ ⎥ ⎢⎣ 1 −2 −3⎥⎦
⎢ −3 1 −3 ⎥ ⎢
5
−3 0⎥
⎢⎣ 2 ⎥⎦ ⎢⎣ 2 ⎥⎦

is
Thus, B is represented as the sum of a symmetric and a skew symmetric matrix.

EXERCISE 3.3

bl
1. Find the transpose of each of the following matrices:
⎡5⎤
⎡ −1 5 6 ⎤
pu ⎢1⎥ ⎡ 1 −1 ⎤ ⎢ ⎥
(i) ⎢ ⎥ (ii) ⎢ ⎥ (iii) ⎢ 3 5 6 ⎥
⎢2⎥ ⎣2 3⎦
⎢−1 ⎥ ⎢ 2 3 −1⎥
be T

⎣ ⎦ ⎣ ⎦
re
⎡ −1 2 3⎤ ⎡ −4 1 −5⎤
o R

2. If A = 5 7 9 and B = ⎢ 1 2 0 ⎥ , then verify that


⎢ ⎥
⎢ ⎥ ⎢ ⎥
⎢⎣ −2 1 1 ⎥⎦ ⎢⎣ 1 3 1⎥⎦
tt E

(i) (A + B)′ = A′ + B′, (ii) (A – B)′ = A′ – B′

⎡ 3 4⎤
C

⎡ −1 2 1⎤
3. If A′ = ⎢−1 2 ⎥ and B = ⎢
1 2 3⎥⎦
, then verify that
⎢ ⎥
⎢⎣ 0 1 ⎥⎦ ⎣
no N

(i) (A + B)′ = A′ + B′ (ii) (A – B)′ = A′ – B′


⎡ −2 3 ⎤ ⎡ −1 0 ⎤
4. If A′ = ⎢ ⎥ and B = ⎢ ⎥ , then find (A + 2B)′
⎣ 1 2⎦ ⎣ 1 2⎦
©

5. For the matrices A and B, verify that (AB)′ = B′A′, where


⎡1 ⎤ ⎡0⎤
⎢ ⎥ , B = −1 2 1
(i) A = ⎢−4 ⎥ [ ] (ii) A = ⎢⎢1 ⎥⎥ , B = [1 5 7]
⎢⎣ 3 ⎥⎦ ⎢⎣ 2 ⎥⎦
MATRICES 89

⎡ cos α sin α ⎤
6. If (i) A = ⎢ ⎥ , then verify that A′ A = I
⎣ − sin α cos α ⎦

⎡ sin α cos α ⎤
(ii) If A = ⎢ ⎥ , then verify that A′ A = I
⎣ − cos α sin α ⎦

he
⎡1 −1 5 ⎤
7. (i) Show that the matrix A = ⎢⎢−1 2 1 ⎥⎥ is a symmetric matrix.
⎢⎣ 5 1 3⎥⎦

is
⎡ 0 1 −1 ⎤

bl
(ii) Show that the matrix A = ⎢ −1 0 1 ⎥ is a skew symmetric matrix.
⎢ ⎥
pu ⎢⎣ 1 −1 0 ⎥⎦

⎡1 5 ⎤
8. For the matrix A = ⎢ ⎥ , verify that
⎣6 7 ⎦
be T

(i) (A + A′) is a symmetric matrix


re
o R

(ii) (A – A′) is a skew symmetric matrix

⎡ 0 a b⎤
tt E

A − A′ , when A = ⎢ −a 0 c ⎥⎥
1( ) 1( )
9. Find A + A′ and
2 2 ⎢
⎢⎣ −b −c 0 ⎥⎦
C

10. Express the following matrices as the sum of a symmetric and a skew symmetric
matrix:
no N

⎡ 6 −2 2 ⎤
⎡ 3 5⎤ ⎢
(i) ⎢ ⎥ (ii) ⎢−2 3 −1 ⎥⎥
⎣1 −1⎦ ⎢⎣ 2 −1 3 ⎥⎦
©

⎡ 3 3 −1⎤
⎢−2 −2 1⎥ ⎡ 1 5⎤
(iii) ⎢ ⎥ (iv) ⎢ ⎥
⎣ −1 2 ⎦
⎢⎣ −4 −5 2⎥⎦
90 MATHEMATICS

Choose the correct answer in the Exercises 11 and 12.


11. If A, B are symmetric matrices of same order, then AB – BA is a
(A) Skew symmetric matrix (B) Symmetric matrix
(C) Zero matrix (D) Identity matrix
⎡ cos α − sin α ⎤
12. If A = ⎢ , then A + A′ = I, if the value of α is
⎣sin α cos α ⎥⎦

he
π π
(A) (B)
6 3

is
(C) π (D)
2
3.7 Elementary Operation (Transformation) of a Matrix

bl
There are six operations (transformations) on a matrix, three of which are due to rows
and three due to columns, which are known as elementary operations or
pu
transformations.
(i) The interchange of any two rows or two columns. Symbolically the interchange
of ith and jth rows is denoted by Ri ↔ Rj and interchange of ith and jth column is
be T

denoted by Ci ↔ Cj.
re
⎡1 1⎤ ⎡−1 3 1⎤
o R

2
⎢ ⎥ ⎢ ⎥
For example, applying R1 ↔ R2 to A = ⎢−1 3 1 ⎥ , we get ⎢1 2 1⎥ .
⎢5 6 7 ⎥⎦ ⎢5 6 7 ⎥⎦
⎣ ⎣
tt E

(ii) The multiplication of the elements of any row or column by a non zero
number. Symbolically, the multiplication of each element of the ith row by k,
C

where k ≠ 0 is denoted by Ri → k Ri.


The corresponding column operation is denoted by Ci → kCi
no N

⎡ 1⎤
1 ⎡ 1 2 1⎤ ⎢ 1 2
7⎥
For example, applying C3 → C3 , to B = ⎢ ⎥ , we get ⎢ ⎥
7 ⎣ −1 3 1⎦ ⎢ −1 3
1⎥
⎢⎣ 7 ⎥⎦
©

(iii) The addition to the elements of any row or column, the corresponding
elements of any other row or column multiplied by any non zero number.
Symbolically, the addition to the elements of ith row, the corresponding elements
of jth row multiplied by k is denoted by Ri → Ri + kRj.
MATRICES 91

The corresponding column operation is denoted by Ci → Ci + kCj.


⎡ 1 2⎤ ⎡1 2⎤
For example, applying R2 → R2 – 2R1, to C = ⎢ ⎥ , we get ⎢ ⎥.
⎣ 2 −1⎦ ⎣ 0 −5 ⎦
3.8 Invertible Matrices
Definition 6 If A is a square matrix of order m, and if there exists another square

he
matrix B of the same order m, such that AB = BA = I, then B is called the inverse
matrix of A and it is denoted by A– 1. In that case A is said to be invertible.
⎡2 3⎤ ⎡ 2 −3 ⎤
For example, let A= ⎢ ⎥ and B = ⎢ ⎥ be two matrices.
⎣1 2 ⎦ ⎣ −1 2 ⎦

is
⎡ 2 3 ⎤ ⎡ 2 −3 ⎤
Now AB = ⎢ ⎥⎢ ⎥
⎣ 1 2 ⎦ ⎣ −1 2 ⎦

bl
⎡ 4 − 3 −6 + 6 ⎤ ⎡ 1 0 ⎤
= ⎢ ⎥=⎢ ⎥=I
⎣ 2 − 2 −3 + 4 ⎦ ⎣ 0 1 ⎦
pu
⎡1 0 ⎤
Also BA = ⎢ ⎥ = I . Thus B is the inverse of A, in other
be T

⎣0 1 ⎦
words B = A– 1 and A is inverse of B, i.e., A = B–1
re
o R

$ Note
1. A rectangular matrix does not possess inverse matrix, since for products BA
tt E

and AB to be defined and to be equal, it is necessary that matrices A and B


should be square matrices of the same order.
C

2. If B is the inverse of A, then A is also the inverse of B.

Theorem 3 (Uniqueness of inverse) Inverse of a square matrix, if it exists, is unique.


no N

Proof Let A = [aij] be a square matrix of order m. If possible, let B and C be two
inverses of A. We shall show that B = C.
Since B is the inverse of A
©

AB = BA = I ... (1)
Since C is also the inverse of A
AC = CA = I ... (2)
Thus B = BI = B (AC) = (BA) C = IC = C
Theorem 4 If A and B are invertible matrices of the same order, then (AB)–1 = B–1 A–1.
92 MATHEMATICS

Proof From the definition of inverse of a matrix, we have


(AB) (AB)–1 = 1
or A–1 (AB) (AB)–1 = A –1 I (Pre multiplying both sides by A–1)
or (A–1A) B (AB)–1 = A –1 (Since A–1 I = A–1)
or IB (AB)–1 = A –1

he
or B (AB)–1 = A –1
or B–1 B (AB)–1 = B –1 A –1
or I (AB)–1 = B –1 A –1

is
Hence (AB)–1 = B –1 A –1

3.8.1 Inverse of a matrix by elementary operations

bl
Let X, A and B be matrices of, the same order such that X = AB. In order to apply a
sequence of elementary row operations on the matrix equation X = AB, we will apply
these row operations simultaneously on X and on the first matrix A of the product AB
pu
on RHS.
Similarly, in order to apply a sequence of elementary column operations on the
be T

matrix equation X = AB, we will apply, these operations simultaneously on X and on the
second matrix B of the product AB on RHS.
re
o R

In view of the above discussion, we conclude that if A is a matrix such that A–1
exists, then to find A–1 using elementary row operations, write A = IA and apply a
sequence of row operation on A = IA till we get, I = BA. The matrix B will be the
tt E

inverse of A. Similarly, if we wish to find A–1 using column operations, then, write
A = AI and apply a sequence of column operations on A = AI till we get, I = AB.
C

Remark In case, after applying one or more elementary row (column) operations on
A = IA (A = AI), if we obtain all zeros in one or more rows of the matrix A on L.H.S.,
then A–1 does not exist.
no N

Example 23 By using elementary operations, find the inverse of the matrix


⎡1 2 ⎤
A=⎢ ⎥.
©

⎣ 2 −1⎦

Solution In order to use elementary row operations we may write A = IA.


⎡ 1 2 ⎤ ⎡1 0 ⎤ ⎡1 2 ⎤ ⎡ 1 0 ⎤
or ⎢ 2 −1⎥ = ⎢ 0 1 ⎥ A, then ⎢ 0 −5⎥ = ⎢ −2 1 ⎥ A (applying R2 → R2 – 2R1)
⎣ ⎦ ⎣ ⎦ ⎣ ⎦ ⎣ ⎦
MATRICES 93

⎡1 0⎤
⎡1 2 ⎤ ⎢ −1 ⎥⎥ A (applying R2 → – R2)
1
or ⎢0 1 ⎥ = ⎢ 2
⎣ ⎦ 5
⎣5 5⎦
⎡1 2⎤
⎡1 0 ⎤ ⎢5 5⎥
or ⎢0 1 ⎥ = ⎢ ⎥ A (applying R1 → R1 – 2R2)

he
⎣ ⎦ ⎢2 −1 ⎥
⎢⎣ 5 5 ⎥⎦
⎡1 2 ⎤
⎢5 5 ⎥

is
Thus A–1 = ⎢ ⎥
⎢ 2 −1 ⎥
⎢⎣ 5 5 ⎥⎦

bl
Alternatively, in order to use elementary column operations, we write A = AI, i.e.,
⎡1 2 ⎤ ⎡1 0 ⎤
⎢ 2 −1⎥ = A ⎢ 0 1 ⎥
pu ⎣ ⎦ ⎣ ⎦
Applying C2 → C2 – 2C1, we get
⎡1 0 ⎤ ⎡ 1 −2 ⎤
be T

⎢ 2 −5 ⎥ = A ⎢ 0 1 ⎥
⎣ ⎦ ⎣ ⎦
re
o R

1
Now applying C2 → − C2 , we have
5
⎡ 2⎤
tt E

⎡1 0 ⎤ ⎢1 5⎥
⎢2 1⎥ = A ⎢ ⎥
−1 ⎥
⎣ ⎦ ⎢0
C

⎢⎣ 5 ⎥⎦
Finally, applying C1 → C1 – 2C2, we obtain
no N

⎡1 2⎤
⎡1 0 ⎤ ⎢5 5⎥
⎢0 1 ⎥ = A ⎢ 2 ⎥
−1 ⎥
⎣ ⎦ ⎢
⎢⎣ 5 5 ⎥⎦
©

⎡1 2⎤
⎢5 5⎥
Hence A–1 = ⎢ ⎥
⎢2 −1 ⎥
⎣⎢ 5 5 ⎦⎥
94 MATHEMATICS

Example 24 Obtain the inverse of the following matrix using elementary operations
⎡0 1 2⎤
A = ⎢⎢1 2 3 ⎥⎥ .
⎢⎣ 3 1 1 ⎥⎦

⎡0 1 2⎤ ⎡1 0 0 ⎤

he
⎢ ⎥ ⎢ ⎥
Solution Write A = I A, i.e., ⎢1 2 3 ⎥ = ⎢0 1 0 ⎥ A
⎢⎣ 3 1 1 ⎥⎦ ⎣⎢0 0 1 ⎦⎥

⎡1 2 3 ⎤ ⎡ 0 1 0 ⎤

is
⎢ 0 1 2 ⎥ ⎢1 0 0 ⎥ A
or ⎢ ⎥ = ⎢ ⎥ (applying R1 ↔ R2)
⎢⎣ 3 1 1 ⎥⎦ ⎢⎣0 0 1 ⎥⎦

bl
⎡1 2 3 ⎤ ⎡ 0 1 0⎤
or ⎢0 1 2 ⎥ = ⎢1 0 0 ⎥⎥ A (applying R3 → R3 – 3R1)
⎢ ⎥ ⎢
pu ⎢⎣0 −5 −8 ⎥⎦ ⎣⎢0 −3 1 ⎥⎦
be T

⎡1 0 −1 ⎤ ⎡ −2 1 0⎤
⎢0 1 2 ⎥ = ⎢
or
⎥ ⎢1 0 0 ⎥⎥ A (applying R1 → R1 – 2R2)
re

o R

⎢⎣0 −5 −8 ⎥⎦ ⎢⎣ 0 −3 1 ⎥⎦

⎡1 0 −1 ⎤ ⎡ −2 1 0⎤
tt E

⎢0 1 2 ⎥ = ⎢
or
⎢ ⎥ ⎢1 0 0 ⎥⎥ A (applying R3 → R3 + 5R2)
⎣⎢0 0 2 ⎥⎦ ⎢⎣ 5 −3 1 ⎥⎦
C

⎡ −2 1 0⎤
⎡1 0 −1 ⎤ ⎢1
⎢0 1 2 ⎥ ⎢ 0 0 ⎥⎥ A 1
no N

or ⎢ ⎥ = (applying R3 → R)
⎢ 5 −3 1⎥ 2 3
⎢⎣0 0 1 ⎥⎦ ⎢ ⎥
⎣2 2 2⎦
©

⎡ 1 −1 1⎤
⎡1 0 0 ⎤ ⎢ 2 2 2⎥
⎢0 1 2 ⎥ = ⎢ 1 0 ⎥
0 ⎥ A (applying R1 → R1 + R3)
⎥ ⎢
or

⎢⎣0 0 1 ⎥⎦ ⎢ 5 −3 1⎥
⎢ ⎥
⎣2 2 2⎦
MATRICES 95

⎡ 1 −1 1⎤
⎡1 0 0 ⎤ ⎢2
⎢0 1 0 ⎥ 2 2⎥
⎢ ⎥
or ⎢ ⎥ = ⎢− 4 3 −1⎥ A (applying R2 → R2 – 2R3)
⎢⎣0 0 1 ⎥⎦ ⎢ 5 −3 1⎥
⎢ ⎥
⎣2 2⎦

he
2
⎡ 1 −1 1⎤
⎢2 2 2⎥
⎢ ⎥
Hence A = ⎢
–1 −4 3 −1 ⎥
⎢ 5 −3 1⎥

is
⎢ ⎥
⎣2 2 2⎦

bl
Alternatively, write A = AI, i.e.,
⎡0 1 2 ⎤ ⎡1 0 0 ⎤
⎢1 2 3 ⎥ A ⎢0 1 0 ⎥
pu ⎢ ⎥ = ⎢ ⎥
⎢⎣3 1 1 ⎥⎦ ⎣⎢0 0 1⎦⎥
be T

⎡1 0 2 ⎤ ⎡0 1 0⎤
⎢2 1 3⎥ ⎢ ⎥
re
⎢ ⎥ = A ⎢1 0 0 ⎥ (C1 ↔ C2)
o R

or
⎣⎢1 3 1 ⎥⎦ ⎣⎢0 0 1⎦⎥

⎡1 0 0 ⎤ ⎡0 1 0 ⎤
tt E

⎢ 2 1 −1⎥ A ⎢1 0 −2 ⎥
or ⎢ ⎥ = ⎢ ⎥ (C3 → C3 – 2C1)
⎢⎣1 3 −1 ⎥⎦ ⎢⎣0 0 1 ⎥⎦
C

⎡1 0 0 ⎤ ⎡0 1 1 ⎤
⎢ 2 1 0 ⎥ A ⎢1 0 −2 ⎥
no N

or ⎢ ⎥ = ⎢ ⎥ (C3 → C3 + C2)
⎢⎣1 3 2 ⎥⎦ ⎢⎣0 0 1 ⎥⎦
©

⎡ 1⎤
⎢ 0 1
⎡1 0 0 ⎤ 2⎥
⎢ 2 1 0⎥ ⎢ ⎥ 1
or ⎢ ⎥ = A ⎢1 0 −1⎥ (C3 → C)
2 3
⎢⎣1 3 1 ⎥⎦ ⎢ 1⎥
⎢0 0 ⎥
⎣ 2⎦
96 MATHEMATICS

⎡ 1⎤
⎡ 1 0 0⎤ ⎢ −2 1 2 ⎥
⎢ 0 1 0⎥ ⎢ ⎥
or ⎢ ⎥ = A ⎢ 1 0 −1⎥ (C1 → C1 – 2C2)
⎢⎣ −5 3 1 ⎥⎦ ⎢ 1⎥
⎢0 0 ⎥
⎣ 2⎦

he
⎡ 1 1⎤
⎢ 1
⎡1 0 0 ⎤ 2 2⎥
⎢ 0 1 0 ⎥ A ⎢ − 4 0 −1 ⎥
or ⎢ ⎥ = ⎢ ⎥ (C1 → C1 + 5C3)

is
⎢ 1⎥
⎣⎢0 3 1⎦⎥ ⎢
5
0 ⎥
⎣ 2 2⎦

bl
⎡ 1 −1 1 ⎤
⎡1 0 0 ⎤ ⎢ 2 2⎥
pu ⎢0 1 0⎥ A ⎢ − 4
2

3 −1 ⎥ (C2 → C2 – 3C3)
or ⎢ ⎥ = ⎢
⎢⎣0 0 1⎥⎦ ⎢ 5 −3 1 ⎥
be T

⎢ ⎥
⎣ 2 2 2⎦
re
o R

⎡1 −1 1⎤
⎢2 2 2⎥
⎢ ⎥
tt E

Hence A–1 = ⎢ − 4 3 −1 ⎥
⎢ 5 −3 1⎥
⎢ ⎥
C

⎣ 2 2 2⎦

⎡10 −2 ⎤
no N

Example 25 Find P – 1, if it exists, given P = ⎢ ⎥.


⎣ −5 1 ⎦

⎡10 −2 ⎤ ⎡1 0 ⎤
Solution We have P = I P, i.e., ⎢ ⎥=⎢ ⎥ P.
©

⎣ −5 1 ⎦ ⎣0 1⎦

⎡ −1 ⎤ ⎡1 ⎤
or ⎢1 5 = ⎢10
⎥ 0⎥
P (applying R1 →
1
R)
⎢ ⎥ ⎢ ⎥ 10 1
⎣ −5 1 ⎦ ⎣0 1⎦
MATRICES 97

⎡ −1 ⎤ ⎡1 ⎤
⎢ 0⎥
or ⎢1 5 ⎥ = ⎢10 ⎥ P (applying R2 → R2 + 5R1)
⎢ ⎥ ⎢ 1⎥
1
⎣0 0⎦
⎣⎢ 2 ⎥⎦

We have all zeros in the second row of the left hand side matrix of the above

he
equation. Therefore, P–1 does not exist.

EXERCISE 3.4

is
Using elementary transformations, find the inverse of each of the matrices, if it exists
in Exercises 1 to 17.

⎡1 −1⎤ ⎡ 2 1⎤ ⎡1 3⎤

bl
1. ⎢ ⎥ 2. ⎢ ⎥ 3. ⎢ ⎥
⎣ 2 3⎦
pu ⎣ 1 1⎦ ⎣2 7⎦

⎡2 3⎤ ⎡2 1⎤ ⎡ 2 5⎤
4. ⎢5 7 ⎥ 5. ⎢ ⎥ 6. ⎢ ⎥
⎣ ⎦ ⎣7 4 ⎦ ⎣ 1 3⎦
be T

⎡3 1 ⎤ ⎡4 5⎤ ⎡ 3 10 ⎤
re
7. ⎢5 2 ⎥ 8. ⎢ ⎥ 9. ⎢ ⎥
⎣ ⎦ ⎣3 4⎦ ⎣2 7 ⎦
o R

⎡ 3 −1⎤ ⎡ 2 −6 ⎤ ⎡ 6 −3 ⎤
10. ⎢ ⎥ 11. ⎢ ⎥ 12. ⎢ ⎥
tt E

⎣ −4 2 ⎦ ⎣ 1 −2 ⎦ ⎣ −2 1 ⎦

⎡ 2 −3 3 ⎤
C

⎡ 2 −3⎤ ⎡2 1⎤ ⎢ ⎥
13. ⎢ −1 2 ⎥ 14. ⎢ ⎥. 15. ⎢ 2 2 3 ⎥
⎣ ⎦ ⎣4 2⎦ ⎢⎣ 3 −2 2 ⎥⎦
no N

⎡1 3 −2 ⎤ ⎡ 2 0 −1 ⎤
⎢ 0 −5⎥⎥ ⎢ ⎥
16. ⎢−3 17. ⎢ 5 1 0 ⎥
©

⎣⎢ 2 5 0 ⎥⎦ ⎢⎣ 0 1 3 ⎥⎦
18. Matrices A and B will be inverse of each other only if
(A) AB = BA (B) AB = BA = 0
(C) AB = 0, BA = I (D) AB = BA = I
98 MATHEMATICS

Miscellaneous Examples

⎡ cos θ sin θ ⎤ ⎡ cos nθ sin nθ ⎤


Example 26 If A = ⎢ ⎥ , then prove that A n = ⎢ ⎥ , n ∈ N.
⎣ − sin θ cos θ ⎦ ⎣ − sin nθ cos nθ ⎦

Solution We shall prove the result by using principle of mathematical induction.

he
⎡ cos θ sin θ ⎤ ⎡ cos nθ sin nθ ⎤
We have P(n) : If A = ⎢ ⎥ , then A n = ⎢ ⎥,n∈N
⎣ − sin θ cos θ ⎦ ⎣ − sin nθ cos nθ ⎦

⎡ cos θ sin θ ⎤ ⎡ cos θ sin θ ⎤

is
P(1) : A = ⎢ ⎥ , so A1 = ⎢ ⎥
⎣ − sin θ cos θ ⎦ ⎣ − sin θ cos θ ⎦
Therefore, the result is true for n = 1.

bl
Let the result be true for n = k. So

⎡ cos θ sin θ ⎤ ⎡ cos k θ sin k θ ⎤


pu P(k) : A = ⎢ ⎥ , then A = ⎢
k

⎣ − sin k θ cos k θ ⎦
⎣ − sin θ cos θ ⎦
Now, we prove that the result holds for n = k +1
be T

⎡ cos θ sin θ ⎤ ⎡ cos k θ sin k θ ⎤


re
Ak + 1 = A ⋅ A = ⎢
k
Now ⎥⎢ ⎥
o R

⎣ − sin θ cos θ ⎦ ⎣ − sin k θ cos k θ ⎦

⎡ cos θ cos k θ – sin θ sin k θ cos θ sin k θ + sin θ cos k θ ⎤


tt E

= ⎢ ⎥
⎣ − sin θ cos k θ + cos θ sin k θ − sin θ sin k θ + cos θ cos k θ ⎦
C

⎡ cos (θ + k θ) sin (θ + k θ) ⎤ ⎡ cos ( k + 1)θ sin ( k + 1)θ ⎤


= ⎢ ⎥=⎢ ⎥
⎣ − sin (θ + k θ) cos (θ + k θ) ⎦ ⎣ − sin ( k + 1)θ cos ( k + 1)θ ⎦
no N

Therefore, the result is true for n = k + 1. Thus by principle of mathematical induction,


⎡ cos n θ sin n θ ⎤
we have A n = ⎢
cos n θ ⎥⎦
, holds for all natural numbers.
⎣− sin n θ
©

Example 27 If A and B are symmetric matrices of the same order, then show that AB
is symmetric if and only if A and B commute, that is AB = BA.
Solution Since A and B are both symmetric matrices, therefore A′ = A and B′ = B.
Let AB be symmetric, then (AB)′ = AB
MATRICES 99

But (AB)′ = B′A′= BA (Why?)


Therefore BA = AB
Conversely, if AB = BA, then we shall show that AB is symmetric.
Now (AB)′ = B′A′
= B A (as A and B are symmetric)

he
= AB
Hence AB is symmetric.

⎡ 2 −1⎤ ⎡5 2⎤ ⎡ 2 5⎤
Example 28 Let A = ⎢ ⎥ ,B=⎢ ⎥ ,C=⎢ ⎥ . Find a matrix D such that

is
⎣ 3 4⎦ ⎣7 4⎦ ⎣ 3 8⎦
CD – AB = O.

bl
Solution Since A, B, C are all square matrices of order 2, and CD – AB is well
defined, D must be a square matrix of order 2.
⎡a b ⎤
Let
pu D= ⎢ ⎥ . Then CD – AB = 0 gives
⎣c d ⎦
be T

⎡ 2 5⎤ ⎡ a b ⎤ ⎡ 2 −1⎤ ⎡ 5 2 ⎤
or ⎢ 3 8⎥ ⎢ c d ⎥ − ⎢ 3 4⎥ ⎢7 4⎥ = O
⎣ ⎦⎣ ⎦ ⎣ ⎦⎣ ⎦
re
o R

⎡ 2a + 5c 2b + 5d ⎤ ⎡ 3 0 ⎤ ⎡ 0 0 ⎤
or ⎢ 3a + 8c 3b + 8d ⎥ − ⎢ 43 22 ⎥ = ⎢ ⎥
⎣ ⎦ ⎣ ⎦ ⎣0 0 ⎦
tt E

⎡ 2a + 5c − 3 2b + 5d ⎤ ⎡ 0 0 ⎤
or ⎢3a + 8c − 43 3b + 8d − 22 ⎥ = ⎢ 0 0 ⎥
C

⎣ ⎦ ⎣ ⎦
By equality of matrices, we get
no N

2a + 5c – 3 = 0 ... (1)
3a + 8c – 43 = 0 ... (2)
2b + 5d = 0 ... (3)
©

and 3b + 8d – 22 = 0 ... (4)


Solving (1) and (2), we get a = –191, c = 77. Solving (3) and (4), we get b = – 110,
d = 44.
⎡ a b ⎤ ⎡ −191 −110 ⎤
Therefore D= ⎢ ⎥=⎢ 44 ⎥⎦
⎣ c d ⎦ ⎣ 77
100 MATHEMATICS

Miscellaneous Exercise on Chapter 3

⎡0 1 ⎤
1. Let A = ⎢ ⎥ , show that (aI + bA) = a I + na bA, where I is the identity
n n n–1

⎣ 0 0 ⎦
matrix of order 2 and n ∈ N.

he
⎡1 1 1⎤ 3n 1
3n 1
3n 1

2. If A = ⎢1 1 1⎥ , prove that A n 3n 1
3n 1
3n 1
, n N.
⎢ ⎥
⎢⎣1 1 1⎥⎦ 3n 1
3n 1
3n 1

is
⎡ 3 −4 ⎤ ⎡1 + 2n −4n ⎤
3. If A = ⎢ ⎥ , then prove that A n = ⎢
1 − 2n ⎥⎦
, where n is any positive

bl
⎣1 −1⎦ ⎣ n
integer.
pu
4. If A and B are symmetric matrices, prove that AB – BA is a skew symmetric
matrix.
5. Show that the matrix B′AB is symmetric or skew symmetric according as A is
be T

symmetric or skew symmetric.


re
o R

⎡0 2 y z⎤
6. Find the values of x, y, z if the matrix A = ⎢ x y − z ⎥⎥ satisfy the equation

⎢⎣ x − y z ⎥⎦
tt E

A′A = I.
C

⎡1 2 0 ⎤ ⎡ 0 ⎤
7. For what values of x : [1 2 1] ⎢⎢ 2 0 1 ⎥⎥ ⎢⎢ 2 ⎥⎥ = O?
no N

⎢⎣1 0 2 ⎥⎦ ⎢⎣ x ⎥⎦

⎡3 1⎤
8. If A = ⎢ , show that A2 – 5A + 7I = 0.
⎣−1 2 ⎥⎦
©

⎡1 0 2 ⎤ ⎡ x ⎤
9. Find x, if [ x −5 −1] ⎢ 0 2 1 ⎥ ⎢ 4 ⎥ = O
⎢ ⎥⎢ ⎥
⎢⎣ 2 0 3 ⎥⎦ ⎢⎣ 1 ⎥⎦
MATRICES 101

10. A manufacturer produces three products x, y, z which he sells in two markets.


Annual sales are indicated below:
Market Products
I 10,000 2,000 18,000
II 6,000 20,000 8,000
(a) If unit sale prices of x, y and z are Rs 2.50, Rs 1.50 and Rs 1.00, respectively,

he
find the total revenue in each market with the help of matrix algebra.
(b) If the unit costs of the above three commodities are Rs 2.00, Rs 1.00 and 50
paise respectively. Find the gross profit.

is
⎡1 2 3⎤ ⎡ −7 −8 −9 ⎤
11. Find the matrix X so that X ⎢ ⎥=⎢ 4 6 ⎥⎦
⎣ 4 5 6⎦ ⎣ 2

bl
12. If A and B are square matrices of the same order such that AB = BA, then prove
by induction that ABn = BnA. Further, prove that (AB)n = AnBn for all n ∈ N.
Choose the correct answer in the following questions:
pu
13. If A = is such that A² = I, then
be T

(A) 1 + α² + βγ = 0 (B) 1 – α² + βγ = 0
re
o R

(C) 1 – α² – βγ = 0 (D) 1 + α² – βγ = 0
14. If the matrix A is both symmetric and skew symmetric, then
tt E

(A) A is a diagonal matrix (B) A is a zero matrix


(C) A is a square matrix (D) None of these
C

2
15. If A is square matrix such that A = A, then (I + A)³ – 7 A is equal to
(A) A (B) I – A (C) I (D) 3A
no N

Summary
 A matrix is an ordered rectangular array of numbers or functions.
 A matrix having m rows and n columns is called a matrix of order m × n.
©

 [aij]m × 1 is a column matrix.


 [aij]1 × n is a row matrix.
 An m × n matrix is a square matrix if m = n.
 A = [aij]m × m is a diagonal matrix if aij = 0, when i ≠ j.
102 MATHEMATICS

 A = [aij]n × n is a scalar matrix if aij = 0, when i ≠ j, aij = k, (k is some


constant), when i = j.
 A = [aij]n × n is an identity matrix, if aij = 1, when i = j, aij = 0, when i ≠ j.
 A zero matrix has all its elements as zero.
 A = [aij] = [bij] = B if (i) A and B are of same order, (ii) aij = bij for all

he
possible values of i and j.
 kA = k[aij]m × n = [k(aij)]m × n
 – A = (–1)A
 A – B = A + (–1) B

is
 A+B=B+A
 (A + B) + C = A + (B + C), where A, B and C are of same order.

bl
 k(A + B) = kA + kB, where A and B are of same order, k is constant.
 (k + l ) A = kA + lA, where k and l are constant.
pu n
 If A = [aij]m × n and B = [bjk]n × p , then AB = C = [cik]m × p, where cik = ∑ aij b jk
j =1
be T

 (i) A(BC) = (AB)C, (ii) A(B + C) = AB + AC, (iii) (A + B)C = AC + BC


re

o R

If A = [aij]m × n, then A′ or AT = [aji]n × m


 (i) (A′)′ = A, (ii) (kA)′ = kA′, (iii) (A + B)′ = A′ + B′, (iv) (AB)′ = B′A′
 A is a symmetric matrix if A′ = A.
tt E

 A is a skew symmetric matrix if A′ = – A.


 Any square matrix can be represented as the sum of a symmetric and a
C

skew symmetric matrix.


 Elementary operations of a matrix are as follows:
no N

(i) Ri ↔ Rj or Ci ↔ Cj
(ii) Ri → kRi or Ci → kCi
(iii) Ri → Ri + kRj or Ci → Ci + kCj

©

If A and B are two square matrices such that AB = BA = I, then B is the


inverse matrix of A and is denoted by A–1 and A is the inverse of B.
 Inverse of a square matrix, if it exists, is unique.

—™—
Chapter 4
DETERMINANTS

he
™ All Mathematical truths are relative and conditional. — C.P. STEINMETZ ™
4.1 Introduction

is
In the previous chapter, we have studied about matrices
and algebra of matrices. We have also learnt that a system
of algebraic equations can be expressed in the form of

bl
matrices. This means, a system of linear equations like
a1 x + b1 y = c 1
pu a2 x + b2 y = c 2
⎡a b ⎤ ⎡ x⎤ ⎡c ⎤
can be represented as ⎢ 1 1 ⎥ ⎢ ⎥ = ⎢ 1 ⎥ . Now, this
be T

⎣ a2 b2 ⎦ ⎣ y ⎦ ⎣ c2 ⎦
re
system of equations has a unique solution or not, is
o R

determined by the number a1 b2 – a2 b1. (Recall that if


a1 b1 P.S. Laplace
≠ or, a1 b2 – a2 b1 ≠ 0, then the system of linear
tt E

a2 b2 (1749-1827)
equations has a unique solution). The number a1 b2 – a2 b1
⎡a b ⎤
C

which determines uniqueness of solution is associated with the matrix A = ⎢ 1 1 ⎥


⎣ a2 b2 ⎦
and is called the determinant of A or det A. Determinants have wide applications in
no N

Engineering, Science, Economics, Social Science, etc.


In this chapter, we shall study determinants up to order three only with real entries.
Also, we will study various properties of determinants, minors, cofactors and applications
of determinants in finding the area of a triangle, adjoint and inverse of a square matrix,
©

consistency and inconsistency of system of linear equations and solution of linear


equations in two or three variables using inverse of a matrix.
4.2 Determinant
To every square matrix A = [aij] of order n, we can associate a number (real or
complex) called determinant of the square matrix A, where aij = (i, j)th element of A.
104 MATHEMATICS

This may be thought of as a function which associates each square matrix with a
unique number (real or complex). If M is the set of square matrices, K is the set of
numbers (real or complex) and f : M → K is defined by f (A) = k, where A ∈ M and
k ∈ K, then f (A) is called the determinant of A. It is also denoted by | A | or det A or Δ.
⎡a b ⎤ a b
If A = ⎢ ⎥ , then determinant of A is written as |A | = = det (A)
⎣c d ⎦ c d

he
Remarks
(i) For matrix A, | A | is read as determinant of A and not modulus of A.
(ii) Only square matrices have determinants.

is
4.2.1 Determinant of a matrix of order one
Let A = [a ] be the matrix of order 1, then determinant of A is defined to be equal to a

bl
4.2.2 Determinant of a matrix of order two
⎡ a11 a12 ⎤
Let A= ⎢ ⎥ be a matrix of order 2 × 2,
⎣ a21 a22 ⎦
pu
then the determinant of A is defined as:
be T

det (A) = |A| = Δ = = a11a22 – a21a12


re
o R

2 4
Example 1 Evaluate .
–1 2
tt E

2 4
Solution We have = 2 (2) – 4(–1) = 4 + 4 = 8.
–1 2
C

x x +1
Example 2 Evaluate
no N

x –1 x
Solution We have
x x +1
= x (x) – (x + 1) (x – 1) = x2 – (x2 – 1) = x2 – x2 + 1 = 1
©

x –1 x

4.2.3 Determinant of a matrix of order 3 × 3


Determinant of a matrix of order three can be determined by expressing it in terms of
second order determinants. This is known as expansion of a determinant along
a row (or a column). There are six ways of expanding a determinant of order
DETERMINANTS 105

3 corresponding to each of three rows (R1, R2 and R3) and three columns (C1, C2 and
C3) giving the same value as shown below.
Consider the determinant of square matrix A = [aij]3 × 3

a11 a12 a13


i.e., | A | = a21 a22 a23

he
a31 a32 a33
Expansion along first Row (R1)
Step 1 Multiply first element a11 of R1 by (–1)(1 + 1) [(–1)sum of suffixes in a11] and with the

is
second order determinant obtained by deleting the elements of first row (R1) and first
column (C1) of | A | as a11 lies in R1 and C1,

bl
a22 a23
i.e., (–1)1 + 1 a11
a32 a33
Step 2 Multiply 2nd element a12 of R1 by (–1)1 + 2 [(–1)sum of suffixes in a12] and the second
pu
order determinant obtained by deleting elements of first row (R1) and 2nd column (C2)
of | A | as a12 lies in R1 and C2,
be T

a21 a23
i.e., (–1)1 + 2 a12
re
a31 a33
o R

Step 3 Multiply third element a13 of R1 by (–1)1 + 3 [(–1)sum of suffixes in a ] and the second
13

order determinant obtained by deleting elements of first row (R1) and third column (C3)
tt E

of | A | as a13 lies in R1 and C3,


a21 a22
C

i.e., (–1)1 + 3 a13 a a32


31

Step 4 Now the expansion of determinant of A, that is, | A | written as sum of all three
no N

terms obtained in steps 1, 2 and 3 above is given by


a22 a23 a a
det A = |A| = (–1)1 + 1 a11 a + (–1)1 + 2 a12 21 23
32 a33 a31 a33
©

1+ 3 a21 a22
+ (–1) a13
a31 a32
or |A| = a11 (a22 a33 – a32 a23) – a12 (a21 a33 – a31 a23)
+ a13 (a21 a32 – a31 a22)
106 MATHEMATICS

= a11 a22 a33 – a11 a32 a23 – a12 a21 a33 + a12 a31 a23 + a13 a21 a32
– a13 a31 a22 ... (1)

$Note We shall apply all four steps together.


Expansion along second row (R2)

he
a 11 a 12 a 13
| A | = a 21 a 22 a 23
a 31 a 32 a 33

is
Expanding along R2, we get

2 +1 a12 a13 a a
| A | = (–1) a21 + (–1)2 + 2 a22 11 13

bl
a32 a33 a31 a33

a11 a12
+ (–1) 2 + 3 a23
pu a31 a32
= – a21 (a12 a33 – a32 a13) + a22 (a11 a33 – a31 a13)
be T

– a23 (a11 a32 – a31 a12)


| A | = – a21 a12 a33 + a21 a32 a13 + a22 a11 a33 – a22 a31 a13 – a23 a11 a32
re
o R

+ a23 a31 a12


= a11 a22 a33 – a11 a23 a32 – a12 a21 a33 + a12 a23 a31 + a13 a21 a32
tt E

– a13 a31 a22 ... (2)


Expansion along first Column (C1)
C

a11 a12 a13


| A | = a21 a22 a23
a31 a32 a33
no N

By expanding along C1, we get

1 + 1 a22 a23 a a13


| A | = a11 (–1) + a21 (−1) 2 + 1 12
©

a32 a33 a32 a33

3 + 1 a 12 a13
+ a31 (–1) a22 a23
= a11 (a22 a33 – a23 a32) – a21 (a12 a33 – a13 a32) + a31 (a12 a23 – a13 a22)
DETERMINANTS 107

| A | = a11 a22 a33 – a11 a23 a32 – a21 a12 a33 + a21 a13 a32 + a31 a12 a23
– a31 a13 a22
= a11 a22 a33 – a11 a23 a32 – a12 a21 a33 + a12 a23 a31 + a13 a21 a32
– a13 a31 a22 ... (3)
Clearly, values of | A | in (1), (2) and (3) are equal. It is left as an exercise to the
reader to verify that the values of |A| by expanding along R3, C2 and C3 are equal to the

he
value of | A | obtained in (1), (2) or (3).
Hence, expanding a determinant along any row or column gives same value.
Remarks
(i) For easier calculations, we shall expand the determinant along that row or column

is
which contains maximum number of zeros.
(ii) While expanding, instead of multiplying by (–1)i + j, we can multiply by +1 or –1

bl
according as (i + j) is even or odd.

⎡2 2⎤ ⎡1 1⎤
(iii) Let A = ⎢ ⎥ and B = ⎢ ⎥ . Then, it is easy to verify that A = 2B. Also
pu ⎣4 0⎦ ⎣2 0⎦
| A | = 0 – 8 = – 8 and | B | = 0 – 2 = – 2.
be T

Observe that, | A | = 4 (– 2) = 22 | B | or | A | = 2n | B |, where n = 2 is the order of


square matrices A and B.
re
o R

In general, if A = kB where A and B are square matrices of order n, then | A| = kn


| B |, where n = 1, 2, 3
1 2 4
tt E

Example 3 Evaluate the determinant Δ = –1 3 0 .


4 1 0
C

Solution Note that in the third column, two entries are zero. So expanding along third
column (C3), we get
no N

–1 3 1 2 1 2
Δ= 4 –0 +0
4 1 4 1 –1 3
©

= 4 (–1 – 12) – 0 + 0 = – 52

0 sin α – cos α
Example 4 Evaluate Δ = – sin α 0 sin β .
cos α – sin β 0
108 MATHEMATICS

Solution Expanding along R1, we get

0 sin β – sin α sin β – sin α 0


Δ= 0 – sin α – cos α
– sin β 0 cos α 0 cos α – sin β
= 0 – sin α (0 – sin β cos α) – cos α (sin α sin β – 0)
= sin α sin β cos α – cos α sin α sin β = 0

he
3 x 3 2
Example 5 Find values of x for which = .
x 1 4 1

3 x 3 2
=

is
Solution We have
x 1 4 1
i.e. 3 – x2 = 3 – 8

bl
i.e. x2 = 8

Hence x= ±2 2
pu
EXERCISE 4.1
be T

Evaluate the determinants in Exercises 1 and 2.


re
2 4
o R

1.
–5 –1

cos θ – sin θ x2 – x + 1 x – 1
tt E

2. (i) (ii)
sin θ cos θ x +1 x +1
C

⎡1 2⎤
3. If A = ⎢ ⎥ , then show that | 2A | = 4 | A |
⎣4 2⎦
no N

⎡1 0 1⎤
⎢ ⎥
4. If A = ⎢ 0 1 2 ⎥ , then show that | 3 A | = 27 | A |
⎢⎣ 0 0 4 ⎥⎦
©

5. Evaluate the determinants


3 –1 –2 3 –4 5
(i) 0 0 –1 (ii) 1 1 –2
3 –5 0 2 3 1
DETERMINANTS 109

0 1 2 2 –1 –2
(iii) –1 0 –3 (iv) 0 2 –1
–2 3 0 3 –5 0

⎡ 1 1 –2 ⎤
⎢ ⎥
6. If A = ⎢ 2 1 –3 ⎥ , find | A |

he
⎢⎣ 5 4 –9 ⎥⎦
7. Find values of x, if
2 4 2x 4 2 3 x 3
= =

is
(i) (ii)
5 1 6 x 4 5 2x 5
x 2 6 2
=

bl
8. If , then x is equal to
18 x 18 6
(A) 6
pu (B) ± 6 (C) – 6 (D) 0
4.3 Properties of Determinants
In the previous section, we have learnt how to expand the determinants. In this section,
we will study some properties of determinants which simplifies its evaluation by obtaining
be T

maximum number of zeros in a row or a column. These properties are true for
determinants of any order. However, we shall restrict ourselves upto determinants of
re
o R

order 3 only.
Property 1 The value of the determinant remains unchanged if its rows and columns
are interchanged.
tt E

a1 a2 a3
Verification Let Δ = b1 b2 b3
C

c1 c2 c3
Expanding along first row, we get
no N

b2 b3 b b b b
Δ = a1 − a2 1 3 + a3 1 2
c2 c3 c1 c3 c1 c2
= a1 (b2 c3 – b3 c2) – a2 (b1 c3 – b3 c1) + a3 (b1 c2 – b2 c1)
©

By interchanging the rows and columns of Δ, we get the determinant


a1 b1 c1
Δ1 = a2 b2 c2
a3 b3 c3
110 MATHEMATICS

Expanding Δ1 along first column, we get


Δ1 = a1 (b2 c3 – c2 b3) – a2 (b1 c3 – b3 c1) + a3 (b1 c2 – b2 c1)
Hence Δ = Δ 1
Remark It follows from above property that if A is a square matrix, then
det (A) = det (A′), where A′ = transpose of A.

he
$ Note If R = ith row and C = ith column, then for interchange of row and
i i
columns, we will symbolically write C ↔ R i i

Let us verify the above property by example.

is
2 –3 5
Example 6 Verify Property 1 for Δ = 6 0 4

bl
1 5 –7
Solution Expanding the determinant along first row, we have
0 4 6 4 6 0
pu Δ= 2
5 –7
– (–3)
1 –7
+5
1 5
= 2 (0 – 20) + 3 (– 42 – 4) + 5 (30 – 0)
be T

= – 40 – 138 + 150 = – 28
re
By interchanging rows and columns, we get
o R

2 6 1
Δ1 = –3 0 5 (Expanding along first column)
tt E

5 4 –7
0 5 6 1 6 1
C

= 2 – (–3) +5
4 –7 4 –7 0 5
= 2 (0 – 20) + 3 (– 42 – 4) + 5 (30 – 0)
no N

= – 40 – 138 + 150 = – 28
Clearly Δ = Δ1
Hence, Property 1 is verified.
©

Property 2 If any two rows (or columns) of a determinant are interchanged, then sign
of determinant changes.
a1 a2 a3
Verification Let Δ = b1 b2 b3
c1 c2 c3
DETERMINANTS 111

Expanding along first row, we get


Δ = a1 (b2 c3 – b3 c2) – a2 (b1 c3 – b3 c1) + a3 (b1 c2 – b2 c1)
Interchanging first and third rows, the new determinant obtained is given by
c1 c2 c3
Δ1 = b1 b2 b3

he
a1 a2 a3

Expanding along third row, we get


Δ1 = a1 (c2 b3 – b2 c3) – a2 (c1 b3 – c3 b1) + a3 (b2 c1 – b1 c2)

is
= – [a1 (b2 c3 – b3 c2) – a2 (b1 c3 – b3 c1) + a3 (b1 c2 – b2 c1)]
Clearly Δ1 = – Δ

bl
Similarly, we can verify the result by interchanging any two columns.

$ Note We can denote the interchange of rows by R ↔ R and interchange of


columns by C ↔ C .
pu i j

i j

2 –3 5
be T

Example 7 Verify Property 2 for Δ = 6 0 4 .


re
1 5 –7
o R

2 –3 5
tt E

Solution Δ = 6 0 4 = – 28 (See Example 6)


1 5 –7
C

Interchanging rows R2 and R3 i.e., R2 ↔ R3, we have


2 –3 5
no N

Δ1 = 1 5 –7
6 0 4
Expanding the determinant Δ1 along first row, we have
©

5 –7 1 –7 1 5
Δ1 = 2 – (–3) +5
0 4 6 4 6 0
= 2 (20 – 0) + 3 (4 + 42) + 5 (0 – 30)
= 40 + 138 – 150 = 28
112 MATHEMATICS

Clearly Δ1 = – Δ
Hence, Property 2 is verified.
Property 3 If any two rows (or columns) of a determinant are identical (all corresponding
elements are same), then value of determinant is zero.
Proof If we interchange the identical rows (or columns) of the determinant Δ, then Δ
does not change. However, by Property 2, it follows that Δ has changed its sign

he
Therefore Δ=–Δ
or Δ=0
Let us verify the above property by an example.

is
3 2 3
Example 8 Evaluate Δ = 2 2 3

bl
3 2 3
Solution Expanding along first row, we get
pu Δ = 3 (6 – 6) – 2 (6 – 9) + 3 (4 – 6)
= 0 – 2 (–3) + 3 (–2) = 6 – 6 = 0
be T

Here R1 and R3 are identical.


Property 4 If each element of a row (or a column) of a determinant is multiplied by a
re
o R

constant k, then its value gets multiplied by k.


a1 b1 c1
tt E

Verification Let Δ = a2 b2 c2
a3 b3 c3
C

and Δ1 be the determinant obtained by multiplying the elements of the first row by k.
Then
no N

k a1 k b1 k c1
Δ1 = a2 b2 c2
a3 b3 c3
©

Expanding along first row, we get


Δ1 = k a1 (b2 c3 – b3 c2) – k b1 (a2 c3 – c2 a3) + k c1 (a2 b3 – b2 a3)
= k [a1 (b2 c3 – b3 c2) – b1 (a2 c3 – c2 a3) + c1 (a2 b3 – b2 a3)]
=k Δ
DETERMINANTS 113

k a1 k b1 k c1 a1 b1 c1
Hence a2 b2 c2 = k a2 b2 c2
a3 b3 c3 a3 b3 c3

Remarks

he
(i) By this property, we can take out any common factor from any one row or any
one column of a given determinant.
(ii) If corresponding elements of any two rows (or columns) of a determinant are
proportional (in the same ratio), then its value is zero. For example

is
a1 a2 a3
Δ = b1 b2 b3 = 0 (rows R and R are proportional)

bl
1 2
k a1 k a2 k a3
pu 102 18 36
Example 9 Evaluate 1 3 4
17 3 6
be T
re
102 18 36 6(17) 6(3) 6(6) 17 3 6
o R

Solution Note that 1 3 4 = 1 3 4 =6 1 3 4 =0


17 3 6 17 3 6 17 3 6
tt E

(Using Properties 3 and 4)


C

Property 5 If some or all elements of a row or column of a determinant are expressed


as sum of two (or more) terms, then the determinant can be expressed as sum of two
(or more) determinants.
no N

a1 + λ1 a2 + λ 2 a3 + λ 3 a1 a2 a3 λ1 λ 2 λ3
For example, b1 b2 b3 = b1 b2 b3 + b1 b2 b3
©

c1 c2 c3 c1 c2 c3 c1 c2 c3

a1 + λ1 a2 + λ 2 a3 + λ 3
Verification L.H.S. = b1 b2 b3
c1 c2 c3
114 MATHEMATICS

Expanding the determinants along the first row, we get


Δ = (a1 + λ1) (b2 c3 – c2 b3) – (a2 + λ2) (b1 c3 – b3 c1)
+ (a3 + λ3) (b1 c2 – b2 c1)
= a1 (b2 c3 – c2 b3) – a2 (b1 c3 – b3 c1) + a3 (b1 c2 – b2 c1)
+ λ1 (b2 c3 – c2 b3) – λ2 (b1 c3 – b3 c1) + λ3 (b1 c2 – b2 c1)

he
(by rearranging terms)

a1 a2 a3 λ1 λ 2 λ3
= b1 b2 b3 + b1 b2 b3 = R.H.S.

is
c1 c2 c3 c1 c2 c3
Similarly, we may verify Property 5 for other rows or columns.

bl
a b c
Example 10 Show that a + 2 x b + 2 y c + 2 z = 0
pu x y z

a b c a b c a b c
be T

Solution We have a + 2 x b + 2 y c + 2 z = a b c + 2x 2 y 2z
re
x y z x y z x y z
o R

(by Property 5)
=0+0=0 (Using Property 3 and Property 4)
tt E

Property 6 If, to each element of any row or column of a determinant, the equimultiples
of corresponding elements of other row (or column) are added, then value of determinant
C

remains the same, i.e., the value of determinant remain same if we apply the operation
Ri → Ri + kRj or Ci → Ci + k Cj .
no N

Verification
a1 a2 a3 a1 + k c1 a2 + k c2 a3 + k c3
Let Δ = b1 b2 b3 and Δ =
1
b1 b2 b3 ,
©

c1 c2 c3 c1 c2 c3

where Δ1 is obtained by the operation R1 → R1 + kR3 .


Here, we have multiplied the elements of the third row (R3) by a constant k and
added them to the corresponding elements of the first row (R1).
Symbolically, we write this operation as R1 → R1 + k R3.
DETERMINANTS 115

Now, again
a1 a2 a3 k c1 k c2 k c3
Δ1 = b1 b2 b3 + b1 b2 b3 (Using Property 5)
c1 c2 c3 c1 c2 c3

=Δ+0 (since R1 and R3 are proportional)

he
Hence Δ = Δ1
Remarks
(i) If Δ1 is the determinant obtained by applying Ri → kRi or Ci → kCi to the

is
determinant Δ, then Δ1 = kΔ.
(ii) If more than one operation like Ri → Ri + kRj is done in one step, care should be
taken to see that a row that is affected in one operation should not be used in

bl
another operation. A similar remark applies to column operations.

a a+b a+b+c
pu
Example 11 Prove that 2a 3a + 2b 4a + 3b + 2c = a 3 .
3a 6a + 3b 10a + 6b + 3c
be T

Solution Applying operations R2 → R2 – 2R1 and R3 → R3 – 3R1 to the given


re
o R

determinant Δ, we have

a a+b a+b+c
tt E

Δ= 0 a 2a + b
0 3a 7 a + 3b
C

Now applying R3 → R3 – 3R2 , we get


a a+b a+b+c
no N

Δ= 0 a 2a + b
0 0 a
©

Expanding along C1, we obtain

a 2a + b
Δ= a +0+0
0 a

= a (a2 – 0) = a (a2) = a3
116 MATHEMATICS

Example 12 Without expanding, prove that


x+ y y+z z+x
Δ= z x y =0
1 1 1
Solution Applying R1 → R1 + R2 to Δ, we get

he
x+ y+z x+ y+z x+ y+z
Δ= z x y
1 1 1

is
Since the elements of R1 and R3 are proportional, Δ = 0.
Example 13 Evaluate

bl
1 a bc
Δ = 1 b ca
1 c ab
pu
Solution Applying R2 → R2 – R1 and R3 → R3 – R1, we get
be T

1 a bc
Δ = 0 b − a c ( a − b)
re
o R

0 c − a b (a − c)
Taking factors (b – a) and (c – a) common from R2 and R3, respectively, we get
tt E

1 a bc
Δ = (b − a ) (c − a) 0 1 –c
C

0 1 –b
= (b – a) (c – a) [(– b + c)] (Expanding along first column)
no N

= (a – b) (b – c) (c – a)
b+c a a
Example 14 Prove that b c+a b = 4 abc
©

c c a+b

b+c a a
Solution Let Δ = b c+a b
c c a+b
DETERMINANTS 117

Applying R1 → R1 – R2 – R3 to Δ, we get
0 –2c –2b
Δ= b c+a b
c c a+b
Expanding along R1, we obtain

he
c+a b b b b c+a
Δ= 0 – (–2 c ) + (–2b)
c a+b c a+b c c
= 2 c (a b + b2 – bc) – 2 b (b c – c2 – ac)

is
= 2 a b c + 2 cb2 – 2 bc2 – 2 b2c + 2 bc2 + 2 abc
= 4 abc
x2 1 + x3

bl
x
Example 15 If x, y, z are different and Δ = y y 2 1 + y 3 = 0 , then
z z2 1 + z3
pu
show that 1 + xyz = 0
Solution We have
be T

x x2 1 + x3
re
o R

Δ= y y 2 1 + y3
z z2 1 + z3
tt E

x x2 1 x x2 x3
y2 1 + y y2 y 3 (Using Property 5)
C

= y
z z2 1 z z2 z3
no N

1 x x2 1 x x2
= (−1) 1 y y 2 + xyz 1 y
2
y2 (Using C3 ↔ C2 and then C1 ↔ C2)
1 z z2 1 z z2
©

1 x x2
= 1 y y 2 (1+ xyz )
1 z z2
118 MATHEMATICS

1 x x2
= (1 + xyz ) 0 y−x y 2 − x2 (Using R2 → R2–R1 and R3 → R3– R1)
0 z−x z 2 − x2
Taking out common factor (y – x) from R2 and (z – x) from R3, we get

he
1 x x2
Δ = (1+xyz ) (y –x) (z –x) 0 1 y+x
0 1 z+x

is
= (1 + xyz) (y – x) (z – x) (z – y) (on expanding along C1)
Since Δ = 0 and x, y, z are all different, i.e., x – y ≠ 0, y – z ≠ 0, z – x ≠ 0, we get

bl
1 + xyz = 0
Example 16 Show that
pu
1+ a 1 1
⎛ 1 1 1⎞
1 1+ b 1 = abc ⎜1 + + + ⎟ = abc + bc + ca + ab
⎝ a b c⎠
be T

1 1 1+ c
re
Solution Taking out factors a,b,c common from R1, R2 and R3, we get
o R

1 1 1
+1
tt E

a a a
1 1 1
L.H.S. = abc +1
b b b
C

1 1 1
+1
c c c
no N

Applying R1→ R1 + R2 + R3, we have

1 1 1 1 1 1 1 1 1
1+ + + 1+ + + 1+ + +
©

a b c a b c a b c
1 1 1
Δ = abc +1
b b b
1 1 1
+1
c c c
DETERMINANTS 119

1 1 1
⎛ 1 1 1⎞ 1 1 1
= abc ⎜ 1+ + + ⎟ +1
⎝ a b c⎠ b b b
1 1 1
+1
c c c

he
Now applying C2 → C2 – C1, C3 → C3 – C1, we get
1 0 0
⎛ 1 1 1⎞ 1
Δ = abc ⎜ 1+ + + ⎟ 1 0
⎝ a b c⎠

is
b
1
0 1
c

bl
⎛ 1 1 1⎞
= abc ⎜1 + + + ⎟ ⎡⎣1(1 – 0 ) ⎤⎦
⎝ a b c⎠
pu ⎛ 1 1 1⎞
= abc ⎜1+ + + ⎟ = abc + bc + ca + ab = R.H.S.
⎝ a b c⎠
be T

$ Note Alternately try by applying C → C – C and C → C – C , then apply


re
1 1 2 3 3 2
C →C –aC.
o R

1 1 3

EXERCISE 4.2
tt E

Using the property of determinants and without expanding in Exercises 1 to 7, prove


that:
C

x a x+a a−b b−c c−a


1. y b y +b = 0 2. b−c c−a a−b = 0
z+c c−a a−b b−c
no N

z c

2 7 65 1 bc a ( b + c )
3. 3 8 75 = 0 4. 1 ca b ( c + a ) = 0
©

5 9 86 1 ab c ( a + b )

b+c q+r y+z a p x


5. c+a r+ p z+x = 2 b q y
a+b p+q x+ y c r z
120 MATHEMATICS

0 a −b −a 2 ab ac
6. − a 0 −c = 0 7. ba −b 2 bc = 4 a 2 b 2 c 2
b c 0 ca cb −c 2
By using properties of determinants, in Exercises 8 to 14, show that:

he
1 a a2
8. (i) 1 b b = ( a − b )( b − c )( c − a )
2

1 c c2

is
1 1 1
a b c = ( a − b )( b − c )( c − a )( a + b + c )

bl
(ii)
pu a3 b3 c3

x x2 yz
9. y y2 zx = (x – y) (y – z) (z – x) (xy + yz + zx)
be T

2
z z xy
re
o R

x + 4 2x 2x
10. (i) 2 x x + 4 2x = ( 5 x + 4 )( 4 − x )
2
tt E

2x 2x x + 4

y+k y y
C

(ii) y y+k y = k 2 (3 y + k )
y y y+k
no N

a−b−c 2a 2a
2b = ( a + b + c )
3
11. (i) 2b b−c−a
c−a−b
©

2c 2c

x + y + 2z x y
= 2( x + y + z)
3
(ii) z y + z + 2x y
z x z + x + 2y
DETERMINANTS 121

1 x x2
( )
2
12. x2 1 x = 1 − x3
x x2 1

1 + a 2 − b2 −2b

he
2ab
( )
3
13. 2ab 1− a + b
2 2
2a = 1 + a2 + b2
2b −2a 1 − a2 − b2

is
a2 + 1 ab ac
ab b2 + 1 bc =1 + a 2 + b 2 + c 2

bl
14.
puca cb c2 + 1

Choose the correct answer in Exercises 15 and 16.


15. Let A be a square matrix of order 3 × 3, then | kA | is equal to
(A) k| A | (B) k 2 | A | (C) k 3 | A | (D) 3k | A |
be T

16. Which of the following is correct


re
(A) Determinant is a square matrix.
o R

(B) Determinant is a number associated to a matrix.


(C) Determinant is a number associated to a square matrix.
tt E

(D) None of these


4.4 Area of a Triangle
C

In earlier classes, we have studied that the area of a triangle whose vertices are
1
(x1, y1), (x2, y2) and (x3, y3), is given by the expression [x (y –y ) + x2 (y3–y1) +
no N

2 1 2 3
x3 (y1–y2)]. Now this expression can be written in the form of a determinant as
x1 y1 1
1
©

Δ= x2 y2 1 ... (1)
2
x3 y3 1
Remarks
(i) Since area is a positive quantity, we always take the absolute value of the
determinant in (1).
122 MATHEMATICS

(ii) If area is given, use both positive and negative values of the determinant for
calculation.
(iii) The area of the triangle formed by three collinear points is zero.
Example 17 Find the area of the triangle whose vertices are (3, 8), (– 4, 2) and (5, 1).
Solution The area of triangle is given by
3 8 1

he
1
Δ= –4 2 1
2
5 1 1
1
⎡3 ( 2 – 1) – 8 ( – 4 – 5 ) + 1( – 4 – 10 ) ⎤⎦

is
2⎣
=

1 61
( 3 + 72 – 14 ) =

bl
=
2 2
Example 18 Find the equation of the line joining A(1, 3) and B (0, 0) using determinants
and find k if D(k, 0) is a point such that area of triangle ABD is 3sq units.
pu
Solution Let P (x, y) be any point on AB. Then, area of triangle ABP is zero (Why?). So
0 0 1
be T

1
1 3 1 =0
re
2
o R

x y 1
1
This gives ( y – 3 x ) = 0 or y = 3x,
tt E

2
which is the equation of required line AB.
Also, since the area of the triangle ABD is 3 sq. units, we have
C

1 3 1
1
0 0 1 =±3
no N

2
k 0 1
− 3k
This gives, = ± 3 , i.e., k = m 2.
2
©

EXERCISE 4.3
1. Find area of the triangle with vertices at the point given in each of the following :
(i) (1, 0), (6, 0), (4, 3) (ii) (2, 7), (1, 1), (10, 8)
(iii) (–2, –3), (3, 2), (–1, –8)
DETERMINANTS 123

2. Show that points


A (a, b + c), B (b, c + a), C (c, a + b) are collinear.
3. Find values of k if area of triangle is 4 sq. units and vertices are
(i) (k, 0), (4, 0), (0, 2) (ii) (–2, 0), (0, 4), (0, k)
4. (i) Find equation of line joining (1, 2) and (3, 6) using determinants.

he
(ii) Find equation of line joining (3, 1) and (9, 3) using determinants.
5. If area of triangle is 35 sq units with vertices (2, – 6), (5, 4) and (k, 4). Then k is
(A) 12 (B) –2 (C) –12, –2 (D) 12, –2
4.5 Minors and Cofactors

is
In this section, we will learn to write the expansion of a determinant in compact form
using minors and cofactors.

bl
Definition 1 Minor of an element aij of a determinant is the determinant obtained by
deleting its ith row and jth column in which element aij lies. Minor of an element aij is
denoted by Mij.
pu
Remark Minor of an element of a determinant of order n(n ≥ 2) is a determinant of
order n – 1.
be T

1 2 3
re
Example 19 Find the minor of element 6 in the determinant Δ = 4 5 6
o R

7 8 9
tt E

Solution Since 6 lies in the second row and third column, its minor M23 is given by

1 2
= 8 – 14 = – 6 (obtained by deleting R2 and C3 in Δ).
C

M23 =
7 8
Definition 2 Cofactor of an element aij , denoted by Aij is defined by
no N

Aij = (–1)i + j Mij , where Mij is minor of aij .

1 –2
Example 20 Find minors and cofactors of all the elements of the determinant
4 3
©

Solution Minor of the element aij is Mij


Here a11 = 1. So M11 = Minor of a11= 3
M12 = Minor of the element a12 = 4
M21 = Minor of the element a21 = –2
124 MATHEMATICS

M22 = Minor of the element a22 = 1


Now, cofactor of aij is Aij. So
A11 = (–1)1 + 1 M11 = (–1)2 (3) = 3
A12 = (–1)1 + 2 M12 = (–1)3 (4) = – 4
A21 = (–1)2 + 1 M21 = (–1)3 (–2) = 2

he
A22 = (–1)2 + 2 M22 = (–1)4 (1) = 1
Example 21 Find minors and cofactors of the elements a11, a21 in the determinant

a11 a12 a13

is
Δ = a21 a22 a23
a31 a32 a33

bl
Solution By definition of minors and cofactors, we have
pu a22 a23
Minor of a11 = M11 = = a22 a33– a23 a32
a32 a33
be T

Cofactor of a11 = A11 = (–1)1+1 M11 = a22 a33 – a23 a32


re
a12 a13
o R

Minor of a21 = M21 = = a12 a33 – a13 a32


a32 a33

Cofactor of a21 = A21 = (–1)2+1 M21 = (–1) (a12 a33 – a13 a32) = – a12 a33 + a13 a32
tt E

Remark Expanding the determinant Δ, in Example 21, along R1, we have


a22 a23 a21 a23 a21 a22
C

Δ = (–1) a11 a
1+1 1+2
a + (–1) a12 a
1+3
a + (–1) a13 a31 a32
32 33 31 33

= a11 A11 + a12 A12 + a13 A13, where Aij is cofactor of aij
no N

= sum of product of elements of R1 with their corresponding cofactors


Similarly, Δ can be calculated by other five ways of expansion that is along R2, R3,
C1, C2 and C3.
©

Hence Δ = sum of the product of elements of any row (or column) with their
corresponding cofactors.

$ Note If elements of a row (or column) are multiplied with cofactors of any
other row (or column), then their sum is zero. For example,
DETERMINANTS 125

Δ = a11 A21 + a12 A22 + a13 A23

a12 a13 a11 a13 a a


= a11 (–1)1+1 + a12 (–1)1+2 + a13 (–1)1+3 11 12
a32 a33 a31 a33 a31 a32
a11 a12 a13

he
= a11 a12 a13 = 0 (since R1 and R2 are identical)
a31 a32 a33
Similarly, we can try for other rows and columns.

is
Example 22 Find minors and cofactors of the elements of the determinant

2 –3 5

bl
6 0 4 and verify that a11 A31 + a12 A32 + a13 A33= 0
1 5 –7
pu 0 4
Solution We have M11 = = 0 –20 = –20; A11 = (–1)1+1 (–20) = –20
5 –7
be T
re
6 4
o R

M12 = = – 42 – 4 = – 46; A12 = (–1)1+2 (– 46) = 46


1 –7

6 0
tt E

M13 = = 30 – 0 = 30; A13 = (–1)1+3 (30) = 30


1 5
C

–3 5
M21 = = 21 – 25 = – 4; A21 = (–1)2+1 (– 4) = 4
5 –7
no N

2 5
M22 = = –14 – 5 = –19; A22 = (–1)2+2 (–19) = –19
1 –7
©

2 –3
M23 = = 10 + 3 = 13; A23 = (–1)2+3 (13) = –13
1 5

–3 5
M31 = = –12 – 0 = –12; A31 = (–1)3+1 (–12) = –12
0 4
126 MATHEMATICS

2 5
M32 = = 8 – 30 = –22; A32 = (–1)3+2 (–22) = 22
6 4
2 –3
and M33 = = 0 + 18 = 18; A33 = (–1)3+3 (18) = 18
6 0
Now a11 = 2, a12 = –3, a13 = 5; A31 = –12, A32 = 22, A33 = 18

he
So a11 A31 + a12 A32 + a13 A33
= 2 (–12) + (–3) (22) + 5 (18) = –24 – 66 + 90 = 0

is
EXERCISE 4.4
Write Minors and Cofactors of the elements of following determinants:

bl
2 –4 a c
1. (i) (ii)
pu 0 3 b d

1 0 0 1 0 4
2. (i) 0 1 0 (ii) 3 5 –1
0 0 1 0 1 2
be T
re
5 3 8
o R

3. Using Cofactors of elements of second row, evaluate Δ = 2 0 1 .


1 2 3
tt E

1 x yz
4. Using Cofactors of elements of third column, evaluate Δ = 1 y zx .
C

1 z xy

a11 a12 a13


no N

5. If Δ = a21 a22 a23 and Aij is Cofactors of aij , then value of Δ is given by
a31 a32 a33
(A) a11 A31+ a12 A32 + a13 A33 (B) a11 A11+ a12 A21 + a13 A31
©

(C) a21 A11+ a22 A12 + a23 A13 (D) a11 A11+ a21 A21 + a31 A31
4.6 Adjoint and Inverse of a Matrix
In the previous chapter, we have studied inverse of a matrix. In this section, we shall
discuss the condition for existence of inverse of a matrix.
To find inverse of a matrix A, i.e., A–1 we shall first define adjoint of a matrix.
DETERMINANTS 127

4.6.1 Adjoint of a matrix


Definition 3 The adjoint of a square matrix A = [aij]n × n is defined as the transpose of
the matrix [Aij]n × n, where Aij is the cofactor of the element aij . Adjoint of the matrix A
is denoted by adj A.

⎡ a11 a12 a13 ⎤


A = ⎢⎢ a21 a23 ⎥⎥

he
Let a22
⎢⎣ a31 a32 a33 ⎥⎦

⎡ A11 A12 A13 ⎤ ⎡ A11 A 21 A 31 ⎤


adj A = Transpose of ⎢⎢ A 21 A 23 ⎥⎥ = ⎢⎢ A12 A32 ⎥⎥

is
Then A 22 A 22
⎢⎣ A 31 A 32 A 33 ⎥⎦ ⎢⎣ A13 A 23 A 33 ⎥⎦

bl
⎡ 2 3⎤
Example 23 Find adj A for A = ⎢ ⎥
⎣1 4⎦
pu
Solution We have A11 = 4, A12 = –1, A21 = –3, A22 = 2
⎡ A11 A 21 ⎤ ⎡ 4 –3⎤
be T

Hence adj A = ⎢ ⎥ =⎢ ⎥
⎣ A12 A 22 ⎦ ⎣ –1 2 ⎦
re
Remark For a square matrix of order 2, given by
o R

⎡ a11 a12 ⎤
A= ⎢ ⎥
⎣ a21 a22 ⎦
tt E

The adj A can also be obtained by interchanging a11 and a22 and by changing signs
of a12 and a21, i.e.,
C
no N
©

We state the following theorem without proof.


Theorem 1 If A be any given square matrix of order n, then

A(adj A) = (adj A) A = A I ,
where I is the identity matrix of order n
128 MATHEMATICS

Verification

⎡ a11 a12 a13 ⎤ ⎡ A11 A 21 A 31 ⎤


⎢a ⎥ ⎢
Let A = ⎢ 21 a22 a23 ⎥ , then adj A = ⎢ A12 A 22 A 32 ⎥⎥
⎢⎣ a31 a32 a33 ⎥⎦ ⎢⎣ A13 A 23 A33 ⎥⎦
Since sum of product of elements of a row (or a column) with corresponding

he
cofactors is equal to | A | and otherwise zero, we have

⎡A 0 0⎤ ⎡1 0 0 ⎤
⎢ ⎥
A (adj A) = ⎢ 0 A 0 ⎥ = A ⎢0 1 0 ⎥ = A I
⎢ ⎥

is
⎢⎣ 0 0 A ⎥⎦ ⎢⎣0 0 1 ⎥⎦

Similarly, we can show (adj A) A = A I

bl
Hence A (adj A) = (adj A) A = A I
pu
Definition 4 A square matrix A is said to be singular if A = 0.
1 2
be T

For example, the determinant of matrix A = is zero


4 8
re
Hence A is a singular matrix.
o R

Definition 5 A square matrix A is said to be non-singular if A ≠ 0

⎡1 2 ⎤
tt E

1 2
Let A= ⎢ ⎥ . Then A = = 4 – 6 = – 2 ≠ 0.
⎣3 4 ⎦ 3 4
Hence A is a nonsingular matrix
C

We state the following theorems without proof.


Theorem 2 If A and B are nonsingular matrices of the same order, then AB and BA
no N

are also nonsingular matrices of the same order.


Theorem 3 The determinant of the product of matrices is equal to product of their
respective determinants, that is, AB = A B , where A and B are square matrices of
©

the same order


A 0 0
Remark We know that (adj A) A = A I = 0 A 0
0 0 A
DETERMINANTS 129

Writing determinants of matrices on both sides, we have

A 0 0
( adj A) A = 0 A 0
0 0 A

he
1 0 0
3
i.e. |(adj A)| |A| = A 0 1 0 (Why?)
0 0 1

is
i.e. |(adj A)| |A| = | A |3 (1)
i.e. |(adj A)| = | A | 2
In general, if A is a square matrix of order n, then | adj (A) | = | A |n – 1.

bl
Theorem 4 A square matrix A is invertible if and only if A is nonsingular matrix.
Proof Let A be invertible matrix of order n and I be the identity matrix of order n.
pu
Then, there exists a square matrix B of order n such that AB = BA = I
Now AB = I. So AB = I or A B =1 (since I =1, AB = A B )
be T

This gives A ≠ 0. Hence A is nonsingular.


re
o R

Conversely, let A be nonsingular. Then A ≠ 0

Now A (adj A) = (adj A) A = A I (Theorem 1)


tt E

⎛ 1 ⎞ ⎛ 1 ⎞
or A⎜ adj A ⎟ = ⎜ adj A ⎟ A = I
⎝ | A | ⎠ ⎝ | A | ⎠
C

1
or AB = BA = I, where B = adj A
|A|
no N

1
Thus A is invertible and A–1 = adj A
|A|
©

1 3 3
Example 24 If A = 1 4 3 , then verify that A adj A = | A | I. Also find A–1.
1 3 4

Solution We have A = 1 (16 – 9) –3 (4 – 3) + 3 (3 – 4) = 1 ≠ 0


130 MATHEMATICS

Now A11 = 7, A12 = –1, A13 = –1, A21 = –3, A22 = 1,A23 = 0, A31 = –3, A32 = 0,
A33 = 1
⎡ 7 −3 −3⎤
⎢ ⎥
Therefore adj A = ⎢ −1 1 0 ⎥
⎢⎣ −1 0 1 ⎥⎦

he
⎡1 3 3 ⎤ ⎡ 7 −3 −3⎤
⎢ ⎥⎢ ⎥
Now A (adj A) = ⎢1 4 3 ⎥ ⎢ −1 1 0 ⎥
⎢⎣1 3 4 ⎥⎦ ⎢⎣ −1 0 1 ⎥⎦

is
⎡ 7 − 3 − 3 −3 + 3 + 0 −3 + 0 + 3 ⎤
⎢ ⎥
= ⎢ 7 − 4 − 3 −3 + 4 + 0 − 3 + 0 + 3 ⎥

bl
⎢⎣ 7 − 3 − 4 −3 + 3 + 0 −3 + 0 + 4 ⎥⎦

⎡1 0 0 ⎤ ⎡1 0 0 ⎤
pu ⎢ ⎥
= ⎢0 1 0 ⎥ = (1)
⎢0 1 0 ⎥
⎢ ⎥ = A .I
⎢⎣0 0 1 ⎥⎦ ⎢⎣0 0 1 ⎥⎦
be T

⎡ 7 −3 −3⎤ ⎡ 7 −3 −3⎤
re
1 1⎢ ⎥ ⎢ ⎥
o R

Also A–1 a d j A = ⎢ −1 1 0 ⎥ = ⎢ −1 1 0 ⎥
A 1
⎢⎣ −1 0 1 ⎥⎦ ⎢⎣ −1 0 1 ⎥⎦
tt E

⎡2 3 ⎤ ⎡ 1 −2 ⎤
Example 25 If A = ⎢ ⎥ and B = ⎢ ⎥ , then verify that (AB) = B A .
–1 –1 –1

⎣1 − 4 ⎦ ⎣ −1 3 ⎦
C

⎡2 3⎤ ⎡ 1 −2 ⎤ ⎡ − 1 5 ⎤
Solution We have AB = ⎢ ⎥ ⎢ −1 3 ⎥ = ⎢ 5 −14 ⎥
⎣1 − 4 ⎦ ⎣ ⎦ ⎣ ⎦
no N

Since, AB = –11 ≠ 0, (AB)–1 exists and is given by

1 1 ⎡ −14 −5⎤ = 1 ⎡14 5⎤


adj (AB) = − ⎢ ⎢ ⎥
11 ⎣ −5 −1⎥⎦ 11 ⎣ 5 1 ⎦
(AB)–1 =
©

AB
Further, A = –11 ≠ 0 and B = 1 ≠ 0. Therefore, A–1 and B–1 both exist and are given by

1 4 3 3 2
A–1 = ,B 1
11 1 2 1 1
DETERMINANTS 131

1 3 2 4 3 1 14 5 1 ⎡14 5⎤
Therefore B 1A 1 =
11 1 1 1 2 11 5 1 11 ⎢⎣ 5 1 ⎥⎦
Hence (AB)–1 = B–1 A–1

2 3
Example 26 Show that the matrix A = satisfies the equation A2 – 4A + I = O,

he
1 2
where I is 2 × 2 identity matrix and O is 2 × 2 zero matrix. Using this equation, find A–1.
⎡ 2 3 ⎤ ⎡ 2 3 ⎤ ⎡ 7 12 ⎤
Solution We have A 2 = A.A = ⎢ ⎥⎢ ⎥ =⎢ ⎥
⎣1 2⎦ ⎣1 2 ⎦ ⎣ 4 7 ⎦

is
⎡ 7 12 ⎤ ⎡8 12 ⎤ ⎡1 0 ⎤ ⎡0 0 ⎤
Hence A 2 − 4A + I = ⎢ ⎥− ⎢ ⎥+⎢ ⎥ =⎢ ⎥=O
⎣ 4 7 ⎦ ⎣ 4 8 ⎦ ⎣0 1 ⎦ ⎣0 0 ⎦

bl
Now A2 – 4A + I = O
Therefore A A – 4A = – I
or
pu A A (A–1) – 4 A A–1 = – I A–1 (Post multiplying by A–1 because |A| ≠ 0)
or A (A A–1) – 4I = – A–1
be T

or AI – 4I = – A–1
re
⎡ 4 0 ⎤ ⎡2 3⎤ ⎡ 2 −3 ⎤
o R

or A–1 = 4I – A = ⎢ ⎥ −⎢ ⎥ = ⎢ ⎥
⎣ 0 4⎦ ⎣1 2⎦ ⎣ −1 2 ⎦
⎡ 2 −3 ⎤
tt E

Hence A −1 = ⎢ ⎥
⎣ −1 2 ⎦
C

EXERCISE 4.5
Find adjoint of each of the matrices in Exercises 1 and 2.
no N

1 1 2
1 2
1. 2. 2 3 5
3 4
2 0 1
©

Verify A (adj A) = (adj A) A = | A | I in Exercises 3 and 4


1 1 2
2 3 3 0 2
3. 4.
4 6
1 0 3
132 MATHEMATICS

Find the inverse of each of the matrices (if it exists) given in Exercises 5 to 11.
1 2 3
2 2 1 5
5. 6. 7. 0 2 4
4 3 3 2
0 0 5
1 0 0 2 1 3 1 1 2

he
8. 3 3 0 9. 4 1 0 10. 0 2 3
5 2 1 7 2 1 3 2 4

⎡1 0 0 ⎤

is
⎢ 0 cos α sin α ⎥⎥
11. ⎢
⎢⎣ 0 sin α − cos α ⎥⎦

bl
3 7 6 8
12. Let A = and B = . Verify that (AB)–1 = B–1 A–1.
2 5 7 9
pu 3 1
13. If A = , show that A2 – 5A + 7I = O. Hence find A–1.
1 2
be T

3 2
re
14. For the matrix A = , find the numbers a and b such that A2 + aA + bI = O.
o R

1 1

1 1 1
tt E

15. For the matrix A = 1 2 3


2 1 3
C

Show that A3– 6A2 + 5A + 11 I = O. Hence, find A–1.


2 1 1
no N

16. If A = 1 2 1
1 1 2
Verify that A3 – 6A2 + 9A – 4I = O and hence find A–1
©

17. Let A be a nonsingular square matrix of order 3 × 3. Then | adj A | is equal to


(A) | A | (B) | A | 2 (C) | A | 3 (D) 3 | A |
–1
18. If A is an invertible matrix of order 2, then det (A ) is equal to
1
(A) det (A) (B) det (A) (C) 1 (D) 0
DETERMINANTS 133

4.7 Applications of Determinants and Matrices


In this section, we shall discuss application of determinants and matrices for solving the
system of linear equations in two or three variables and for checking the consistency of
the system of linear equations.
Consistent system A system of equations is said to be consistent if its solution (one
or more) exists.

he
Inconsistent system A system of equations is said to be inconsistent if its solution
does not exist.

$ Note In this chapter, we restrict ourselves to the system of linear equations

is
having unique solutions only.
4.7.1 Solution of system of linear equations using inverse of a matrix
Let us express the system of linear equations as matrix equations and solve them using

bl
inverse of the coefficient matrix.
Consider the system of equations
pu a1 x + b1 y + c1 z = d1
a2 x + b2 y + c2 z = d 2
a3 x + b3 y + c3 z = d 3
be T

⎡ a1 b1 c1 ⎤ ⎡x⎤ ⎡ d1 ⎤
re
⎢ a b c ⎥ , X = ⎢ y ⎥ and B = ⎢ d ⎥
o R

Let A = ⎢ 2 2 2⎥ ⎢ ⎥ ⎢ 2⎥
⎢⎣ a3 b3 c3 ⎥⎦ ⎢⎣ z ⎥⎦ ⎢⎣ d3 ⎥⎦
tt E

Then, the system of equations can be written as, AX = B, i.e.,


⎡ a1 b1 c1 ⎤ ⎡ x⎤ ⎡ d1 ⎤
⎢a c2 ⎥⎥ ⎢ y⎥ ⎢ ⎥
C

⎢ 2 b2 ⎢ ⎥ = ⎢d2 ⎥
⎢⎣ a3 b3 c3 ⎥⎦ ⎢⎣ z ⎥⎦ ⎢⎣ d3 ⎥⎦
no N

Case I If A is a nonsingular matrix, then its inverse exists. Now


AX = B
or A (AX) = A–1 B
–1
(premultiplying by A–1)
©

or (A–1A) X = A–1 B (by associative property)


–1
or IX=A B
or X = A–1 B
This matrix equation provides unique solution for the given system of equations as
inverse of a matrix is unique. This method of solving system of equations is known as
Matrix Method.
134 MATHEMATICS

Case II If A is a singular matrix, then | A | = 0.


In this case, we calculate (adj A) B.
If (adj A) B ≠ O, (O being zero matrix), then solution does not exist and the
system of equations is called inconsistent.
If (adj A) B = O, then system may be either consistent or inconsistent according
as the system have either infinitely many solutions or no solution.

he
Example 27 Solve the system of equations
2x + 5y = 1
3x + 2y = 7

is
Solution The system of equations can be written in the form AX = B, where
⎡2 5⎤ ⎡ x⎤ ⎡1 ⎤
A= ⎢ ⎥ , X = ⎢ ⎥ and B = ⎢ ⎥

bl
⎣ 3 2⎦ ⎣ y⎦ ⎣7 ⎦
Now, A = –11 ≠ 0, Hence, A is nonsingular matrix and so has a unique solution.
pu 1 2 5
Note that A–1 =
11 3 2
be T

1 2 5 1
Therefore X = A–1B = –
re
11 3 2 7
o R

⎡x⎤ 1 33 3
i.e. ⎢ y⎥ =
⎣ ⎦ 11 11 1
tt E

Hence x = 3, y = – 1
Example 28 Solve the following system of equations by matrix method.
C

3x – 2y + 3z = 8
2x + y – z = 1
no N

4x – 3y + 2z = 4
Solution The system of equations can be written in the form AX = B, where
⎡ 3 −2 3 ⎤ ⎡ x⎤ ⎡8 ⎤
©

A = ⎢ 2 1 −1⎥ , X = ⎢⎢ y ⎥⎥ and B =
⎢ ⎥ ⎢1 ⎥
⎢ ⎥
⎢⎣ 4 −3 2 ⎥⎦ ⎢⎣ z ⎥⎦ ⎢⎣ 4⎥⎦
We see that
A = 3 (2 – 3) + 2(4 + 4) + 3 (– 6 – 4) = – 17 ≠ 0
DETERMINANTS 135

Hence, A is nonsingular and so its inverse exists. Now


A11 = –1, A12 = – 8, A13 = –10
A21 = –5, A22 = – 6, A23 = 1
A31 = –1, A32 = 9, A33 = 7

⎡ −1 − 5 −1⎤
1 ⎢ ⎥

he
Therefore A = − ⎢ −8 − 6 9 ⎥
–1
17
⎢⎣ −10 1 7 ⎥⎦

⎡ −1 − 5 −1⎤ ⎡8 ⎤
1 ⎢

is
⎥ ⎢ ⎥
X = A B = − ⎢ −8 − 6 9 ⎥ ⎢1 ⎥
–1
So
17
⎢⎣ −10 1 7 ⎥⎦ ⎢⎣ 4 ⎥⎦

bl
⎡x⎤ ⎡ −17 ⎤ ⎡ 1 ⎤
⎢ y⎥ 1 ⎢ ⎥ ⎢ ⎥
i.e. ⎢ ⎥ = − 17 ⎢ −34 ⎥ = ⎢ 2 ⎥
pu ⎢⎣ z ⎥⎦ ⎢⎣ −51⎥⎦ ⎢⎣ 3 ⎥⎦
Hence x = 1, y = 2 and z = 3.
be T

Example 29 The sum of three numbers is 6. If we multiply third number by 3 and add
re
second number to it, we get 11. By adding first and third numbers, we get double of the
o R

second number. Represent it algebraically and find the numbers using matrix method.
Solution Let first, second and third numbers be denoted by x, y and z, respectively.
tt E

Then, according to given conditions, we have


x+y+z=6
y + 3z = 11
C

x + z = 2y or x – 2y + z = 0
This system can be written as A X = B, where
no N

1 1 1 x 6
A= 0 1 3 , X = y and B = 11
1 –2 1 z 0
©

Here A 1 1 6 – (0 – 3) 0 –1 9 0 . Now we find adj A


A11 = 1 (1 + 6) = 7, A12 = – (0 – 3) = 3, A13 = – 1
A21 = – (1 + 2) = – 3, A22 = 0, A23 = – (– 2 – 1) = 3
A31 = (3 – 1) = 2, A32 = – (3 – 0) = – 3, A33 = (1 – 0) = 1
136 MATHEMATICS

⎡ 7 –3 2 ⎤
⎢ ⎥
Hence adj A = ⎢ 3 0 –3⎥
⎢⎣ –1 3 1 ⎥⎦

⎡ 7 –3 2 ⎤
1 1 ⎢ ⎥

he
Thus A –1 = adj (A) = ⎢ 3 0 –3⎥
A 9
⎢⎣ –1 3 1 ⎥⎦
Since X = A–1 B
⎡ 7 –3 2 ⎤ ⎡ 6 ⎤

is
1⎢ ⎥⎢ ⎥
X = ⎢ 3 0 –3⎥ ⎢11⎥
9
⎢⎣ –1 3 1 ⎥⎦ ⎢⎣ 0 ⎥⎦

bl
x ⎡ 42 − 33 + 0 ⎤ 9 1
y = 1 ⎢ 18 + 0 + 0 ⎥ 1 18 = 2
⎢ ⎥ =
or
pu z 9 ⎢ −6 + 33 + 0 ⎥ 9
⎣ ⎦ 27 3
Thus x = 1, y = 2, z = 3
be T

EXERCISE 4.6
re
o R

Examine the consistency of the system of equations in Exercises 1 to 6.


1. x + 2y = 2 2. 2x – y = 5 3. x + 3y = 5
tt E

2x + 3y = 3 x+y=4 2x + 6y = 8
4. x + y + z = 1 5. 3x–y – 2z = 2 6. 5x – y + 4z = 5
2x + 3y + 2z = 2 2y – z = –1 2x + 3y + 5z = 2
C

ax + ay + 2az = 4 3x – 5y = 3 5x – 2y + 6z = –1
Solve system of linear equations, using matrix method, in Exercises 7 to 14.
no N

7. 5x + 2y = 4 8. 2x – y = –2 9. 4x – 3y = 3
7x + 3y = 5 3x + 4y = 3 3x – 5y = 7
10. 5x + 2y = 3 11. 2x + y + z = 1 12. x – y + z = 4
3
©

3x + 2y = 5 x – 2y – z = 2x + y – 3z = 0
2
3y – 5z = 9 x+y+z=2
13. 2x + 3y +3 z = 5 14. x – y + 2z = 7
x – 2y + z = – 4 3x + 4y – 5z = – 5
3x – y – 2z = 3 2x – y + 3z = 12
DETERMINANTS 137

⎡ 2 –3 5⎤
⎢ 3 2 – 4⎥ –1 –1
15. If A = ⎢ ⎥ , find A . Using A solve the system of equations
⎢⎣1 1 –2 ⎥⎦

2x – 3y + 5z = 11

he
3x + 2y – 4z = – 5
x + y – 2z = – 3
16. The cost of 4 kg onion, 3 kg wheat and 2 kg rice is Rs 60. The cost of 2 kg onion,
4 kg wheat and 6 kg rice is Rs 90. The cost of 6 kg onion 2 kg wheat and 3 kg

is
rice is Rs 70. Find cost of each item per kg by matrix method.

Miscellaneous Examples

bl
Example 30 If a, b, c are positive and unequal, show that value of the determinant
a b c
pu Δ = b c a is negative.
c a b
be T

Solution Applying C1 → C1 + C2 + C3 to the given determinant, we get


re
a+b+c b c 1 b c
o R

Δ = a + b + c c a = (a + b + c) 1 c a
a+b+c a b 1 a b
tt E

1 b c
C

= (a + b + c) 0 c – b a – c (Applying R2→ R2–R1,and R3 →R3 –R1)


0 a–b b– c
no N

= (a + b + c) [(c – b) (b – c) – (a – c) (a – b)] (Expanding along C1)


= (a + b + c)(– a2 – b2 – c2 + ab + bc + ca)
–1
©

= (a + b + c) (2a2 + 2b2 + 2c2 – 2ab – 2bc – 2ca)


2

–1
= (a + b + c) [(a – b)2 + (b – c)2 + (c – a)2]
2
which is negative (since a + b + c > 0 and (a – b)2 + (b – c)2 + (c – a)2 > 0)
138 MATHEMATICS

Example 31 If a, b, c, are in A.P, find value of

2y + 4 5y + 7 8y + a
3y + 5 6 y + 8 9 y + b
4 y + 6 7 y + 9 10 y + c

he
Solution Applying R1 → R1 + R3 – 2R2 to the given determinant, we obtain

0 0 0
3y + 5 6 y + 8 9 y + b = 0 (Since 2b = a + c)

is
4 y + 6 7 y + 9 10 y + c
Example 32 Show that

bl
( y+ z )
2
xy zx
( x+ z )
2
Δ= xy yz = 2xyz (x + y + z)3
pu xz yz ( x+ y )
2

Solution Applying R1 → xR1, R2 → yR2 , R3 → z R3 to Δ and dividing by xyz, we get


be T
re
2
x y z x2 y x2 z
o R

1 2
Δ= xy 2 y x z y2 z
xyz 2
xz 2 yz 2 z x y
tt E

Taking common factors x, y, z from C1 C2 and C3, respectively, we get


C

( y + z)
2
x2 x2
xyz
( x + z)
2

Δ= y2 y2
no N

xyz
( x + y)
2
z2 z2

Applying C2 → C2– C1, C3 → C3– C1, we have


©

( y + z )2 x2 – ( y + z )
2
x2 − ( y + z )
2

Δ= y2 ( x + z )2 − y 2 0
z2 0 ( x + y )2 – z 2
DETERMINANTS 139

Taking common factor (x + y + z) from C2 and C3, we have

(y + z) x – ( y + z) x – ( y + z)
2

Δ = (x + y + z) 2 y 2
( x + z) – y 0
z2 0 ( x + y) – z
Applying R1 → R1 – (R2 + R3), we have

he
2 yz –2z –2y
Δ = (x + y + z)2
y2 x− y+z 0

is
z2 0 x+ y –z

bl
1 1
Applying C2 → (C2 + C ) and C3 C3 C1 , we get
pu y 1 z

2 yz 0 0
be T

y2
Δ = (x + y + z)2 y2 x z
z
re
o R

z2
z2 x y
y
Finally expanding along R1, we have
tt E

Δ = (x + y + z)2 (2yz) [(x + z) (x + y) – yz] = (x + y + z)2 (2yz) (x2 + xy + xz)


= (x + y + z)3 (2xyz)
C

1 –1 2 –2 0 1
Example 33 Use product 0 2 –3 9 2 –3 to solve the system of equations
no N

3 –2 4 6 1 –2
x – y + 2z = 1
2y – 3z = 1
©

3x – 2y + 4z = 2
⎡1 –1 2⎤ ⎡ –2 0 1 ⎤
⎢ – 3 ⎥⎥ ⎢9 – 3 ⎥⎥
Solution Consider the product ⎢ 0 2 ⎢ 2
⎢⎣ 3 –2 4 ⎥⎦ ⎢⎣ 6 1 – 2 ⎥⎦
140 MATHEMATICS

⎡ − 2 − 9 + 12 0 − 2 + 2 1 + 3 − 4⎤ 1 0 0
⎢ 0 + 18 − 18 0 + 4 − 3 0 − 6 + 6⎥
= ⎢ ⎥ = 0 1 0
⎢⎣ − 6 − 18 + 24 0 − 4 + 4 3 + 6 − 8⎥⎦ 0 0 1
–1
1 –1 2 –2 0 1
Hence

he
0 2 –3 9 2 –3
3 –2 4 6 1 –2
Now, given system of equations can be written, in matrix form, as follows
⎡1 –1 2 ⎤ ⎡ x ⎤ ⎡1 ⎤

is
⎢ 0 2 –3⎥ ⎢ y ⎥ ⎢ ⎥
⎢ ⎥ ⎢ ⎥ = ⎢1 ⎥
⎢⎣ 3 –2 4 ⎦⎥ ⎢⎣ z ⎥⎦ ⎢⎣ 2⎥⎦

bl
−1
x ⎡1 −1 2 ⎤ ⎡ 1 ⎤ –2 0 1 1

y = 0 ⎥ ⎢ ⎥
2 −3⎥ ⎢ 1 ⎥ = 9 2 –3 1
or
pu z

⎢⎣ 3 −2 4 ⎥⎦ ⎢⎣ 2 ⎥⎦ 6 1 –2 2

⎡ −2 + 0 + 2 ⎤ ⎡0 ⎤
be T

⎢ ⎥ ⎢ ⎥
= ⎢ 9 + 2 − 6 ⎥ = ⎢5 ⎥
re
o R

⎢⎣ 6 + 1 − 4 ⎦⎥ ⎢⎣ 3⎥⎦
Hence x = 0, y = 5 and z = 3
Example 34 Prove that
tt E

a + bx c + dx p + qx a c p
Δ = ax + b cx + d px + q = (1 − x ) b d q
2
C

u v w u v w
no N

Solution Applying R1 → R1 – x R2 to Δ, we get

a (1 − x 2 ) c (1 − x 2 ) p (1 − x 2 )
Δ= ax + b cx + d px + q
©

u v w

a c p
= (1 − x ) ax + b cx + d px + q
2

u v w
DETERMINANTS 141

Applying R2 → R2 – x R1, we get


a c p
Δ = (1 − x ) b d q
2

u v w

he
Miscellaneous Exercises on Chapter 4
x sin θ cos θ
1. Prove that the determinant – sin θ – x 1 is independent of θ.

is
cos θ 1 x
a a2 bc 1 a2 a3

bl
2. Without expanding the determinant, prove that b b 2 ca 1 b2 b3 .
pu c c2 ab 1 c2 c3

cos α cos β cos α sin β – sin α


3. Evaluate – sin β cos β 0 .
be T

sin α cos β sin α sin β cos α


re
4. If a, b and c are real numbers, and
o R

b+ c c+ a a+ b
Δ = c + a a + b b + c = 0,
tt E

a+ b b+ c c+ a
Show that either a + b + c = 0 or a = b = c.
C

x+a x x
5. Solve the equation x x+a x = 0, a ≠ 0
no N

x x x+a
a2 bc ac c 2
2
6. Prove that a ab b2 ac = 4a2b2c2
©

2
ab b bc c2
⎡ 3 –1 1 ⎤ ⎡ 1 2 –2 ⎤
⎢ ⎥ ⎢ ⎥
7. If A–1 = ⎢ –15 6 –5⎥ and B = ⎢ –1 3 0 ⎥ , find ( AB )
–1

⎢⎣ 5 –2 2 ⎥⎦ ⎢⎣ 0 –2 1 ⎥⎦
142 MATHEMATICS

1 –2 1
8. Let A = –2 3 1 . Verify that
1 1 5
(i) [adj A]–1 = adj (A–1) (ii) (A–1)–1 = A
x y x+ y

he
9. Evaluate y x+ y x
x+ y x y

is
1 x y
10. Evaluate 1 x + y y

bl
1 x x+ y
Using properties of determinants in Exercises 11 to 15, prove that:
pu
α α2 β+γ
11. β β2 γ + α = (β – γ) (γ – α) (α – β) (α + β + γ)
γ γ2 α +β
be T
re
o R

x x 2 1 px 3
12. y y 2 1 py 3 = (1 + pxyz) (x – y) (y – z) (z – x), where p is any scalar.
tt E

z z 2 1 pz 3

3a – a+ b – a+ c
C

13. –b+ a 3b – b + c = 3(a + b + c) (ab + bc + ca)


– c + a – c+ b 3c
no N

1 1+ p 1+ p + q sin α cos α cos ( α + δ )


14. 2 3 + 2 p 4 + 3 p+ 2q = 1 15. sin β cos β cos ( β + δ ) = 0
©

3 6 + 3 p 10 + 6 p + 3 q sin γ cos γ cos ( γ + δ )


16. Solve the system of equations
2 3 10
4
x y z
DETERMINANTS 143

4 6 5
– 1
x y z

6 9 20
– 2
x y z

he
Choose the correct answer in Exercise 17 to 19.
17. If a, b, c, are in A.P, then the determinant

x + 2 x + 3 x + 2a

is
x + 3 x + 4 x + 2b is
x + 4 x + 5 x + 2c

bl
(A) 0 (B) 1 (C) x (D) 2x

⎡ x 0 0⎤
pu
18. If x, y, z are nonzero real numbers, then the inverse of matrix A = ⎢⎢ 0 y 0 ⎥⎥ is
⎢⎣ 0 0 z ⎥⎦
be T
re
⎡ x −1 0 ⎤ ⎡ x −1 0 ⎤
o R

0 0
⎢ ⎥ ⎢ ⎥
(A) ⎢ 0 y −1 0 ⎥ (B) xyz ⎢ 0 y −1 0 ⎥
⎢ ⎥ ⎢ ⎥
z −1 ⎦ z −1 ⎦
tt E

⎣ 0 0 ⎣ 0 0
C

⎡ x 0 0⎤ ⎡1 0 0 ⎤
1 ⎢
0 y 0 ⎥⎥
1 ⎢
(C) (D) 0 1 0 ⎥⎥
xyz ⎢ xyz ⎢
⎢⎣ 0 0 z ⎥⎦ ⎢⎣0 0 1 ⎥⎦
no N

⎡ 1 sin θ 1 ⎤
⎢ − sin θ 1 sin θ ⎥⎥ , where 0 ≤ θ ≤ 2π. Then
©

19. Let A = ⎢
⎢⎣ −1 − sin θ 1 ⎥⎦

(A) Det (A) = 0 (B) Det (A) ∈ (2, ∞)


(C) Det (A) ∈ (2, 4) (D) Det (A) ∈ [2, 4]
144 MATHEMATICS

Summary
 Determinant of a matrix A = [a11]1 × 1 is given by | a11| = a11

a11 a12
 Determinant of a matrix A
a21 a22
is given by

he
a11 a12
A = = a11 a22 – a12 a21
a21 a22

a1 b1 c1

is
 Determinant of a matrix A a2 b2 c2 is given by (expanding along R1)
a3 b3 c3

bl
a1 b1 c1
b c2 a2 c2 a2 b2
pu A = a2 b2 c2 = a1 2
b3 c3
− b1
a3 c3
+ c1
a3 b3
a3 b3 c3
be T

For any square matrix A, the |A| satisfy following properties.



re
|A′| = |A|, where A′ = transpose of A.
o R

 If we interchange any two rows (or columns), then sign of determinant


changes.

tt E

If any two rows or any two columns are identical or proportional, then value
of determinant is zero.
 If we multiply each element of a row or a column of a determinant by constant
C

k, then value of determinant is multiplied by k.


 Multiplying a determinant by k means multiply elements of only one row
no N

(or one column) by k.


 If A = [aij ]3×3 , then k .A = k 3 A
 If elements of a row or a column in a determinant can be expressed as sum
©

of two or more elements, then the given determinant can be expressed as


sum of two or more determinants.
 If to each element of a row or a column of a determinant the equimultiples of
corresponding elements of other rows or columns are added, then value of
determinant remains same.
DETERMINANTS 145

 Area of a triangle with vertices (x1, y1), (x2, y2) and (x3, y3) is given by
x1 y1 1
1
Δ= x2 y2 1
2
x3 y3 1
 Minor of an element aij of the determinant of matrix A is the determinant

he
obtained by deleting ith row and jth column and denoted by Mij.
 Cofactor of aij of given by Aij = (– 1)i + j Mij
 Value of determinant of a matrix A is obtained by sum of product of elements
of a row (or a column) with corresponding cofactors. For example,

is
A = a11 A11 + a12 A12 + a13 A13.
 If elements of one row (or column) are multiplied with cofactors of elements

bl
of any other row (or column), then their sum is zero. For example, a11 A21 + a12
A22 + a13 A23 = 0
⎡ a11 a12 a13 ⎤ ⎡ A11 A 21 A 31 ⎤
pu ⎢a ⎥,
 If A = ⎢ 21 a a23 ⎥ then adj A = ⎢ A12 A 22 A 32 ⎥ , where A ij is
22
⎢ ⎥
⎢⎣ a31 a32 a33 ⎥⎦ ⎢⎣ A13 A 23 A 33 ⎥⎦
be T

cofactor of aij
re
 A (adj A) = (adj A) A = | A | I, where A is square matrix of order n.
o R

 A square matrix A is said to be singular or non-singular according as


| A | = 0 or | A | ≠ 0.
tt E

 If AB = BA = I, where B is square matrix, then B is called inverse of A.


Also A–1 = B or B–1 = A and hence (A–1)–1 = A.

C

A square matrix A has inverse if and only if A is non-singular.


1
 A –1 = (adj A)
A
no N

 If a 1 x + b1 y + c 1 z = d1
a2 x + b2 y + c 2 z = d2
a3 x + b3 y + c3 z = d3,
©

then these equations can be written as A X = B, where


⎡ a1 b1 c1 ⎤ ⎡ x⎤ ⎡ d1 ⎤
A = ⎢⎢ a2 b2 c2 ⎥ , X = ⎢ y ⎥ and B= ⎢⎢ d 2 ⎥⎥
⎥ ⎢ ⎥
⎢⎣ a3 b3 c3 ⎥⎦ ⎣⎢ z ⎦⎥ ⎢⎣ d3 ⎥⎦
146 MATHEMATICS

 Unique solution of equation AX = B is given by X = A–1 B, where A ≠ 0 .


 A system of equation is consistent or inconsistent according as its solution
exists or not.
 For a square matrix A in matrix equation AX = B
(i) | A | ≠ 0, there exists unique solution

he
(ii) | A | = 0 and (adj A) B ≠ 0, then there exists no solution
(iii) | A | = 0 and (adj A) B = 0, then system may or may not be consistent.

Historical Note

is
The Chinese method of representing the coefficients of the unknowns of
several linear equations by using rods on a calculating board naturally led to the

bl
discovery of simple method of elimination. The arrangement of rods was precisely
that of the numbers in a determinant. The Chinese, therefore, early developed the
idea of subtracting columns and rows as in simplification of a determinant
pu
‘Mikami, China, pp 30, 93.
Seki Kowa, the greatest of the Japanese Mathematicians of seventeenth
be T

century in his work ‘Kai Fukudai no Ho’ in 1683 showed that he had the idea of
determinants and of their expansion. But he used this device only in eliminating a
re
quantity from two equations and not directly in the solution of a set of simultaneous
o R

linear equations. ‘T. Hayashi, “The Fakudoi and Determinants in Japanese


Mathematics,” in the proc. of the Tokyo Math. Soc., V.
tt E

Vendermonde was the first to recognise determinants as independent functions.


He may be called the formal founder. Laplace (1772), gave general method of
expanding a determinant in terms of its complementary minors. In 1773 Lagrange
C

treated determinants of the second and third orders and used them for purpose
other than the solution of equations. In 1801, Gauss used determinants in his
theory of numbers.
no N

The next great contributor was Jacques - Philippe - Marie Binet, (1812) who
stated the theorem relating to the product of two matrices of m-columns and n-
rows, which for the special case of m = n reduces to the multiplication theorem.
©

Also on the same day, Cauchy (1812) presented one on the same subject. He
used the word ‘determinant’ in its present sense. He gave the proof of multiplication
theorem more satisfactory than Binet’s.
The greatest contributor to the theory was Carl Gustav Jacob Jacobi, after
this the word determinant received its final acceptance.
Chapter 5
CONTINUITY AND
DIFFERENTIABILITY
v The whole of science is nothing more than a refinement
of everyday thinking.” — ALBERT EINSTEIN v

5.1 Introduction
This chapter is essentially a continuation of our study of
differentiation of functions in Class XI. We had learnt to
differentiate certain functions like polynomial functions and
trigonometric functions. In this chapter, we introduce the
very important concepts of continuity, differentiability and
relations between them. We will also learn differentiation
of inverse trigonometric functions. Further, we introduce a
new class of functions called exponential and logarithmic
functions. These functions lead to powerful techniques of
differentiation. We illustrate certain geometrically obvious
conditions through differential calculus. In the process, we
will learn some fundamental theorems in this area.
Sir Issac Newton
5.2 Continuity (1642-1727)
We start the section with two informal examples to get a feel of continuity. Consider
the function
1, if x ≤ 0
f ( x) = 
 2, if x > 0
This function is of course defined at every
point of the real line. Graph of this function is
given in the Fig 5.1. One can deduce from the
graph that the value of the function at nearby
points on x-axis remain close to each other
except at x = 0. At the points near and to the
left of 0, i.e., at points like – 0.1, – 0.01, – 0.001,
the value of the function is 1. At the points near
and to the right of 0, i.e., at points like 0.1, 0.01, Fig 5.1
148 MATHEMATICS

0.001, the value of the function is 2. Using the language of left and right hand limits, we
may say that the left (respectively right) hand limit of f at 0 is 1 (respectively 2). In
particular the left and right hand limits do not coincide. We also observe that the value
of the function at x = 0 concides with the left hand limit. Note that when we try to draw
the graph, we cannot draw it in one stroke, i.e., without lifting pen from the plane of the
paper, we can not draw the graph of this function. In fact, we need to lift the pen when
we come to 0 from left. This is one instance of function being not continuous at x = 0.
Now, consider the function defined as
1, if x ≠ 0
f ( x) = 
2, if x = 0
This function is also defined at every point. Left and the right hand limits at x = 0
are both equal to 1. But the value of the
function at x = 0 equals 2 which does not
coincide with the common value of the left
and right hand limits. Again, we note that we
cannot draw the graph of the function without
lifting the pen. This is yet another instance of
a function being not continuous at x = 0.
Naively, we may say that a function is
continuous at a fixed point if we can draw the
graph of the function around that point without
Fig 5.2
lifting the pen from the plane of the paper.
Mathematically, it may be phrased precisely as follows:
Definition 1 Suppose f is a real function on a subset of the real numbers and let c be
a point in the domain of f. Then f is continuous at c if

lim f ( x ) = f ( c )
x →c

More elaborately, if the left hand limit, right hand limit and the value of the function
at x = c exist and equal to each other, then f is said to be continuous at x = c. Recall that
if the right hand and left hand limits at x = c coincide, then we say that the common
value is the limit of the function at x = c. Hence we may also rephrase the definition of
continuity as follows: a function is continuous at x = c if the function is defined at
x = c and if the value of the function at x = c equals the limit of the function at
x = c. If f is not continuous at c, we say f is discontinuous at c and c is called a point
of discontinuity of f.
CONTINUITY AND DIFFERENTIABILITY 149

Example 1 Check the continuity of the function f given by f (x) = 2x + 3 at x = 1.


Solution First note that the function is defined at the given point x = 1 and its value is 5.
Then find the limit of the function at x = 1. Clearly
lim f ( x) = lim (2 x + 3) = 2 (1) + 3 = 5
x→1 x→1

Thus lim f ( x ) = 5 = f (1)


x→ 1

Hence, f is continuous at x = 1.
Example 2 Examine whether the function f given by f (x) = x2 is continuous at x = 0.

Solution First note that the function is defined at the given point x = 0 and its value is 0.
Then find the limit of the function at x = 0. Clearly

lim f ( x) = lim x2 = 0 2 = 0
x →0 x→ 0

Thus lim f ( x ) = 0 = f (0)


x →0

Hence, f is continuous at x = 0.
Example 3 Discuss the continuity of the function f given by f(x) = | x | at x = 0.
Solution By definition

 − x, if x < 0
f (x) = 
 x, if x ≥ 0
Clearly the function is defined at 0 and f(0) = 0. Left hand limit of f at 0 is
lim f (x) = lim− (– x) = 0
x →0 − x→ 0

Similarly, the right hand limit of f at 0 is


lim f ( x) = lim+ x = 0
x →0 + x→ 0

Thus, the left hand limit, right hand limit and the value of the function coincide at
x = 0. Hence, f is continuous at x = 0.
Example 4 Show that the function f given by

 x3 + 3, if x ≠ 0
f (x) = 
1, if x = 0

is not continuous at x = 0.
150 MATHEMATICS

Solution The function is defined at x = 0 and its value at x = 0 is 1. When x ≠ 0, the


function is given by a polynomial. Hence,
lim f ( x ) = lim ( x3 + 3) = 0 3 + 3 = 3
x →0 x →0
Since the limit of f at x = 0 does not coincide with f (0), the function is not continuous
at x = 0. It may be noted that x = 0 is the only point of discontinuity for this function.
Example 5 Check the points where the constant function f (x) = k is continuous.
Solution The function is defined at all real numbers and by definition, its value at any
real number equals k. Let c be any real number. Then
lim f ( x) = lim k = k
x →c x →c

Since f(c) = k = lim


x →c f (x) for any real number c, the function f is continuous at
every real number.
Example 6 Prove that the identity function on real numbers given by f (x) = x is
continuous at every real number.
Solution The function is clearly defined at every point and f (c) = c for every real
number c. Also,
lim f ( x ) = lim x = c
x →c x →c

Thus, lim f (x) = c = f(c) and hence the function is continuous at every real number.
x→ c
Having defined continuity of a function at a given point, now we make a natural
extension of this definition to discuss continuity of a function.
Definition 2 A real function f is said to be continuous if it is continuous at every point
in the domain of f.
This definition requires a bit of elaboration. Suppose f is a function defined on a
closed interval [a, b], then for f to be continuous, it needs to be continuous at every
point in [a, b] including the end points a and b. Continuity of f at a means
lim f ( x) = f (a)
x → a+

and continuity of f at b means


lim f ( x) = f( b)
x →b –

Observe that lim− f ( x ) and lim+ f (x) do not make sense. As a consequence
x→a x →b

of this definition, if f is defined only at one point, it is continuous there, i.e., if the
domain of f is a singleton, f is a continuous function.
CONTINUITY AND DIFFERENTIABILITY 151

Example 7 Is the function defined by f (x) = | x |, a continuous function?


Solution We may rewrite f as

 − x, if x < 0
f (x) = 
 x, if x ≥ 0
By Example 3, we know that f is continuous at x = 0.
Let c be a real number such that c < 0. Then f (c) = – c. Also
lim f ( x) = lim ( − x) = – c (Why?)
x →c x →c

Since lim f ( x ) = f ( c ) , f is continuous at all negative real numbers.


x →c

Now, let c be a real number such that c > 0. Then f(c) = c. Also

lim f ( x ) = lim x = c (Why?)


x →c x →c

Since lim f ( x) = f ( c) , f is continuous at all positive real numbers. Hence, f


x →c

is continuous at all points.


Example 8 Discuss the continuity of the function f given by f (x) = x3 + x2 – 1.
Solution Clearly f is defined at every real number c and its value at c is c3 + c2 – 1. We
also know that
lim f ( x) = lim ( x 3 + x2 − 1) = c3 + c2 − 1
x →c x →c

Thus lim f ( x ) = f ( c ) , and hence f is continuous at every real number. This means
x →c

f is a continuous function.
1
Example 9 Discuss the continuity of the function f defined by f (x) = , x ≠ 0.
x
Solution Fix any non zero real number c, we have
1 1
lim f ( x) = lim =
x →c x→c x c
1
Also, since for c ≠ 0, f (c ) = , we have lim f ( x) = f ( c) and hence, f is continuous
c x →c

at every point in the domain of f. Thus f is a continuous function.


152 MATHEMATICS

We take this opportunity to explain the concept of infinity. This we do by analysing


1
the function f (x) = near x = 0. To carry out this analysis we follow the usual trick of
x
finding the value of the function at real numbers close to 0. Essentially we are trying to
find the right hand limit of f at 0. We tabulate this in the following (Table 5.1).
Table 5.1

x 1 0.3 0.2 0.1 = 10–1 0.01 = 10–2 0.001 = 10–3 10– n


f (x) 1 3.333... 5 10 100 = 102 1000 = 103 10n
We observe that as x gets closer to 0 from the right, the value of f (x) shoots up
higher. This may be rephrased as: the value of f (x) may be made larger than any given
number by choosing a positive real number very close to 0. In symbols, we write
lim f ( x) = + ∞
x →0 +

(to be read as: the right hand limit of f (x) at 0 is plus infinity). We wish to emphasise
that + ∞ is NOT a real number and hence the right hand limit of f at 0 does not exist (as
a real number).
Similarly, the left hand limit of f at 0 may be found. The following table is self
explanatory.
Table 5.2

x –1 – 0.3 – 0.2 – 10–1 – 10–2 – 10–3 – 10–n


f (x) – 1 – 3.333... –5 – 10 – 102 – 103 – 10n

From the Table 5.2, we deduce that the


value of f (x) may be made smaller than any
given number by choosing a negative real
number very close to 0. In symbols,
we write
lim f (x) = − ∞
x →0 −

(to be read as: the left hand limit of f (x) at 0 is


minus infinity). Again, we wish to emphasise
that – ∞ is NOT a real number and hence the
left hand limit of f at 0 does not exist (as a real
number). The graph of the reciprocal function
given in Fig 5.3 is a geometric representation
of the above mentioned facts. Fig 5.3
CONTINUITY AND DIFFERENTIABILITY 153

Example 10 Discuss the continuity of the function f defined by


 x + 2, if x ≤ 1
f (x) = 
 x − 2, if x > 1
Solution The function f is defined at all points of the real line.
Case 1 If c < 1, then f(c) = c + 2. Therefore, lim f ( x) = lim ( x + 2) = c + 2
x→c x→ c
Thus, f is continuous at all real numbers less than 1.
Case 2 If c > 1, then f (c) = c – 2. Therefore,
lim f ( x) = lim (x – 2) = c – 2 = f (c)
x →c x →c

Thus, f is continuous at all points x > 1.


Case 3 If c = 1, then the left hand limit of f at
x = 1 is
lim f ( x) = lim– ( x + 2) = 1 + 2 = 3
x →1 – x→1

The right hand limit of f at x = 1 is


lim f ( x) = lim+ ( x − 2) = 1 − 2 = −1
x →1 + x →1
Since the left and right hand limits of f at x = 1 Fig 5.4
do not coincide, f is not continuous at x = 1. Hence
x = 1 is the only point of discontinuity of f. The graph of the function is given in Fig 5.4.
Example 11 Find all the points of discontinuity of the function f defined by
 x + 2, if x < 1

f (x) =  0, if x = 1
 x − 2, if x > 1

Solution As in the previous example we find that f
is continuous at all real numbers x ≠ 1. The left
hand limit of f at x = 1 is
lim− f ( x) = lim– ( x + 2) = 1 + 2 = 3
x →1 x→1
The right hand limit of f at x = 1 is
lim+ f ( x) = lim+ ( x − 2) = 1 − 2 = −1
x →1 x →1

Since, the left and right hand limits of f at x = 1


do not coincide, f is not continuous at x = 1. Hence
x = 1 is the only point of discontinuity of f. The
graph of the function is given in the Fig 5.5. Fig 5.5
154 MATHEMATICS

Example 12 Discuss the continuity of the function defined by

 x + 2, if x < 0
f (x) = 
 − x + 2, if x > 0

Solution Observe that the function is defined at all real numbers except at 0. Domain
of definition of this function is
D1 ∪ D2 where D 1 = {x ∈ R : x < 0} and
D2 = {x ∈ R : x > 0}
Case 1 If c ∈ D1 , then lim f ( x) = lim (x + 2)
x →c x →c
= c + 2 = f (c) and hence f is continuous in D1.
Case 2 If c ∈ D2, then lim f ( x) = lim (– x + 2)
x →c x →c
= – c + 2 = f (c) and hence f is continuous in D2 .
Since f is continuous at all points in the domain of f,
we deduce that f is continuous. Graph of this
function is given in the Fig 5.6. Note that to graph Fig 5.6
this function we need to lift the pen from the plane
of the paper, but we need to do that only for those points where the function is not
defined.
Example 13 Discuss the continuity of the function f given by

 x, if x ≥ 0
f (x) =  2
 x , if x < 0
Solution Clearly the function is defined at
every real number. Graph of the function is
given in Fig 5.7. By inspection, it seems prudent
to partition the domain of definition of f into
three disjoint subsets of the real line.
Let D1 = {x ∈ R : x < 0}, D2 = {0} and
Fig 5.7
D3 = {x ∈ R : x > 0}
Case 1 At any point in D1 , we have f (x) = x2 and it is easy to see that it is continuous
there (see Example 2).
Case 2 At any point in D3, we have f (x) = x and it is easy to see that it is continuous
there (see Example 6).
CONTINUITY AND DIFFERENTIABILITY 155

Case 3 Now we analyse the function at x = 0. The value of the function at 0 is f(0) = 0.
The left hand limit of f at 0 is
lim f ( x) = lim− x2 = 02 = 0
x →0 – x→ 0
The right hand limit of f at 0 is
lim f ( x) = lim+ x = 0
x →0 + x→ 0

Thus lim f ( x) = 0 = f(0) and hence f is continuous at 0. This means that f is


x →0
continuous at every point in its domain and hence, f is a continuous function.
Example 14 Show that every polynomial function is continuous.
Solution Recall that a function p is a polynomial function if it is defined by
p(x) = a 0 + a 1 x + ... + an xn for some natural number n, an ≠ 0 and a i ∈ R. Clearly this
function is defined for every real number. For a fixed real number c, we have
lim p ( x ) = p ( c)
x →c
By definition, p is continuous at c. Since c is any real number, p is continuous at
every real number and hence p is a continuous function.
Example 15 Find all the points of discontinuity of the greatest integer function defined
by f (x) = [x], where [x] denotes the greatest integer less than or equal to x.
Solution First observe that f is defined for all real numbers. Graph of the function is
given in Fig 5.8. From the graph it looks like that f is discontinuous at every integral
point. Below we explore, if this is true.

Fig 5.8
156 MATHEMATICS

Case 1 Let c be a real number which is not equal to any integer. It is evident from the
graph that for all real numbers close to c the value of the function is equal to [c]; i.e.,
lim f ( x) = lim [ x] = [ c] . Also f (c) = [c] and hence the function is continuous at all real
x →c x →c

numbers not equal to integers.


Case 2 Let c be an integer. Then we can find a sufficiently small real number
r > 0 such that [c – r] = c – 1 whereas [c + r] = c.
This, in terms of limits mean that
lim f (x) = c – 1, lim+ f (x) = c
x →c− x →c

Since these limits cannot be equal to each other for any c, the function is
discontinuous at every integral point.
5.2.1 Algebra of continuous functions
In the previous class, after having understood the concept of limits, we learnt some
algebra of limits. Analogously, now we will study some algebra of continuous functions.
Since continuity of a function at a point is entirely dictated by the limit of the function at
that point, it is reasonable to expect results analogous to the case of limits.
Theorem 1 Suppose f and g be two real functions continuous at a real number c.
Then
(1) f + g is continuous at x = c.
(2) f – g is continuous at x = c.
(3) f . g is continuous at x = c.

(4)   is continuous at x = c, (provided g (c) ≠ 0).


f
g
Proof We are investigating continuity of (f + g) at x = c. Clearly it is defined at
x = c. We have

lim( f + g ) ( x) = lim [ f ( x) + g ( x)] (by definition of f + g)


x →c x →c

= lim f ( x) + lim g ( x ) (by the theorem on limits)


x →c x→c

= f (c) + g(c) (as f and g are continuous)


= (f + g) (c) (by definition of f + g)
Hence, f + g is continuous at x = c.
Proofs for the remaining parts are similar and left as an exercise to the reader.
CONTINUITY AND DIFFERENTIABILITY 157

Remarks
(i) As a special case of (3) above, if f is a constant function, i.e., f (x) = λ for some
real number λ, then the function (λ . g) defined by (λ . g) (x) = λ . g(x) is also
continuous. In particular if λ = – 1, the continuity of f implies continuity of – f.
(ii) As a special case of (4) above, if f is the constant function f (x) = λ, then the
λ λ λ
function defined by (x) = is also continuous wherever g (x) ≠ 0. In
g g g( x)
1
particular, the continuity of g implies continuity of .
g
The above theorem can be exploited to generate many continuous functions. They
also aid in deciding if certain functions are continuous or not. The following examples
illustrate this:
Example 16 Prove that every rational function is continuous.
Solution Recall that every rational function f is given by
p( x)
f (x) = , q( x) ≠ 0
q(x)
where p and q are polynomial functions. The domain of f is all real numbers except
points at which q is zero. Since polynomial functions are continuous (Example 14), f is
continuous by (4) of Theorem 1.
Example 17 Discuss the continuity of sine function.
Solution To see this we use the following facts
lim sin x = 0
x →0
We have not proved it, but is intuitively clear from the graph of sin x near 0.
Now, observe that f (x) = sin x is defined for every real number. Let c be a real
number. Put x = c + h. If x → c we know that h → 0. Therefore
lim f ( x) = lim sin x
x →c x →c

= lim sin(c + h)
h →0

= lim [sin c cos h + cos c sin h]


h →0

= lim [sin c cos h ] + lim [cos c sin h ]


h →0 h→ 0
= sin c + 0 = sin c = f (c)
Thus lim f (x) = f (c) and hence f is a continuous function.
x →c
158 MATHEMATICS

Remark A similar proof may be given for the continuity of cosine function.
Example 18 Prove that the function defined by f (x) = tan x is a continuous function.

sin x
Solution The function f (x) = tan x = . This is defined for all real numbers such
cos x
π
that cos x ≠ 0, i.e., x ≠ (2n +1) . We have just proved that both sine and cosine
2
functions are continuous. Thus tan x being a quotient of two continuous functions is
continuous wherever it is defined.
An interesting fact is the behaviour of continuous functions with respect to
composition of functions. Recall that if f and g are two real functions, then
(f o g) (x) = f (g (x))
is defined whenever the range of g is a subset of domain of f. The following theorem
(stated without proof) captures the continuity of composite functions.
Theorem 2 Suppose f and g are real valued functions such that (f o g) is defined at c.
If g is continuous at c and if f is continuous at g (c), then (f o g) is continuous at c.
The following examples illustrate this theorem.
Example 19 Show that the function defined by f(x) = sin (x2) is a continuous function.
Solution Observe that the function is defined for every real number. The function
f may be thought of as a composition g o h of the two functions g and h, where
g (x) = sin x and h (x) = x2. Since both g and h are continuous functions, by Theorem 2,
it can be deduced that f is a continuous function.
Example 20 Show that the function f defined by
f (x) = |1 – x + | x | |,

where x is any real number, is a continuous function.


Solution Define g by g (x) = 1 – x + | x | and h by h (x) = | x | for all real x. Then
(h o g) (x) = h (g (x))
= h (1– x + | x |)
= | 1– x + | x | | = f (x)
In Example 7, we have seen that h is a continuous function. Hence g being a sum
of a polynomial function and the modulus function is continuous. But then f being a
composite of two continuous functions is continuous.
CONTINUITY AND DIFFERENTIABILITY 159

EXERCISE 5.1
1. Prove that the function f(x) = 5x – 3 is continuous at x = 0, at x = – 3 and at x = 5.
2. Examine the continuity of the function f(x) = 2x2 – 1 at x = 3.
3. Examine the following functions for continuity.
1
(a) f (x) = x – 5 (b) f (x) = ,x≠5
x−5
x 2 − 25
(c) f (x) = , x ≠ –5 (d) f (x) = | x – 5 |
x+5
4. Prove that the function f (x) = xn is continuous at x = n, where n is a positive
integer.
5. Is the function f defined by
 x, if x ≤ 1
f ( x) = 
 5, if x > 1
continuous at x = 0? At x = 1? At x = 2?
Find all points of discontinuity of f, where f is defined by

| x | +3, if x ≤ − 3
 2 x + 3, if x ≤ 2 
6. f ( x) =  7. f ( x) =  −2 x, if − 3 < x < 3
 2 x − 3, if x > 2  6 x + 2, if x ≥ 3

| x |  x
 , if x ≠ 0  , if x < 0
8. f ( x) =  x 9. f ( x) =  | x |
 0, if x = 0  −1, if x ≥ 0
 

 x + 1, if x ≥ 1  x3 − 3, if x ≤ 2
10. f ( x) =  2 11. f ( x) = 
 x + 1, if x < 1  x2 + 1, if x > 2

 x10 − 1, if x ≤ 1
12. f ( x) = 
 x2 , if x > 1
13. Is the function defined by
 x + 5, if x ≤ 1
f ( x) = 
 x − 5, if x > 1
a continuous function?
160 MATHEMATICS

Discuss the continuity of the function f, where f is defined by


3, if 0 ≤ x ≤ 1  2 x, if x < 0
 
14. f ( x) =  4, if 1 < x < 3 15. f ( x) = 0, if 0 ≤ x ≤ 1
5, if 3 ≤ x ≤ 10  4 x, if x > 1
 

 −2, if x ≤ − 1

16. f ( x) =  2 x, if − 1 < x ≤ 1
 2, if x > 1

17. Find the relationship between a and b so that the function f defined by
 ax + 1, if x ≤ 3
f ( x) = 
bx + 3, if x > 3
is continuous at x = 3.
18. For what value of λ is the function defined by

 λ ( x − 2 x ), if x ≤ 0
2
f ( x) = 
 4 x + 1, if x > 0
continuous at x = 0? What about continuity at x = 1?
19. Show that the function defined by g (x) = x – [x] is discontinuous at all integral
points. Here [x] denotes the greatest integer less than or equal to x.
20. Is the function defined by f(x) = x2 – sin x + 5 continuous at x = π?
21. Discuss the continuity of the following functions:
(a) f (x) = sin x + cos x (b) f (x) = sin x – cos x
(c) f (x) = sin x . cos x
22. Discuss the continuity of the cosine, cosecant, secant and cotangent functions.
23. Find all points of discontinuity of f, where
 sin x
 , if x < 0
f ( x) =  x
 x + 1, if x ≥ 0

24. Determine if f defined by
 2 1
 x sin , if x ≠ 0
f ( x) =  x
 0, if x = 0

is a continuous function?
CONTINUITY AND DIFFERENTIABILITY 161

25. Examine the continuity of f, where f is defined by


sin x − cos x, if x ≠ 0
f ( x) = 
 −1, if x = 0
Find the values of k so that the function f is continuous at the indicated point in Exercises
26 to 29.
 k cos x π
 π − 2 x , if x ≠ 2 π
26. f ( x) =  at x =
3, π 2
if x =
 2

 kx 2 , if x ≤ 2
27. f ( x) =  at x = 2
3, if x > 2

 kx + 1, if x ≤ π
28. f ( x) =  at x = π
cos x, if x > π
 kx + 1, if x ≤ 5
29. f ( x) =  at x = 5
3 x − 5, if x > 5
30. Find the values of a and b such that the function defined by
 5, if x ≤ 2

f ( x) =  ax + b, if 2 < x < 10
 21,
 if x ≥ 10
is a continuous function.
31. Show that the function defined by f (x) = cos (x2 ) is a continuous function.
32. Show that the function defined by f(x) = | cos x | is a continuous function.
33. Examine that sin | x | is a continuous function.
34. Find all the points of discontinuity of f defined by f (x) = | x | – |x + 1 |.
5.3. Differentiability
Recall the following facts from previous class. We had defined the derivative of a real
function as follows:
Suppose f is a real function and c is a point in its domain. The derivative of f at c is
defined by
f (c + h) − f (c)
lim
h →0 h
162 MATHEMATICS

d
provided this limit exists. Derivative of f at c is denoted by f ′(c) or ( f ( x )) | c . The
dx
function defined by
f (x + h) − f ( x)
f ′( x) = lim
h →0 h
wherever the limit exists is defined to be the derivative of f. The derivative of f is
d dy
denoted by f ′ (x) or ( f ( x )) or if y = f (x) by or y ′. The process of finding
dx dx
derivative of a function is called differentiation. We also use the phrase differentiate
f (x) with respect to x to mean find f ′(x).
The following rules were established as a part of algebra of derivatives:
(1) (u ± v)′ = u′ ± v′
(2) (uv)′ = u′v + uv′ (Leibnitz or product rule)

(3)  u  = u ′v − uv′ , wherever v ≠ 0 (Quotient rule).
v  v2
The following table gives a list of derivatives of certain standard functions:
Table 5.3
n
f (x ) x sin x cos x tan x

f ′(x ) nx n – 1 cos x – sin x sec2 x

Whenever we defined derivative, we had put a caution provided the limit exists.
Now the natural question is; what if it doesn’t? The question is quite pertinent and so is
f (c + h) − f ( c)
its answer. If lim does not exist, we say that f is not differentiable at c.
h →0 h
In other words, we say that a function f is differentiable at a point c in its domain if both
f (c + h) − f (c ) f (c + h) − f (c)
lim– and lim+ are finite and equal. A function is said
h →0 h h →0 h
to be differentiable in an interval [a, b] if it is differentiable at every point of [a, b]. As
in case of continuity, at the end points a and b, we take the right hand limit and left hand
limit, which are nothing but left hand derivative and right hand derivative of the function
at a and b respectively. Similarly, a function is said to be differentiable in an interval
(a, b) if it is differentiable at every point of (a, b).
CONTINUITY AND DIFFERENTIABILITY 163

Theorem 3 If a function f is differentiable at a point c, then it is also continuous at that


point.
Proof Since f is differentiable at c, we have
f ( x) − f (c )
lim = f ′ ( c)
x →c x−c
But for x ≠ c, we have
f ( x) − f (c )
f (x) – f (c) = . ( x − c)
x−c
 f ( x) − f ( c) 
Therefore lim [ f ( x) − f ( c)] = lim  . ( x − c) 
x →c x →c  x−c 

lim [ f ( x)] − lim [ f ( c)] = lim 


f ( x) − f ( c) 
. lim [( x − c)]
x − c  x→c
or x →c x→c x →c 

= f ′(c) . 0 = 0
or lim f ( x) = f (c)
x →c
Hence f is continuous at x = c.
Corollary 1 Every differentiable function is continuous.
We remark that the converse of the above statement is not true. Indeed we have
seen that the function defined by f(x) = | x | is a continuous function. Consider the left
hand limit
f (0 + h ) − f (0) −h
lim– = = −1
h →0 h h
The right hand limit
f (0 + h ) − f (0) h
lim = =1
h →0 + h h
f (0 + h ) − f (0)
Since the above left and right hand limits at 0 are not equal, lim
h →0 h
does not exist and hence f is not differentiable at 0. Thus f is not a differentiable
function.
5.3.1 Derivatives of composite functions
To study derivative of composite functions, we start with an illustrative example. Say,
we want to find the derivative of f, where
f (x) = (2x + 1)3
164 MATHEMATICS

One way is to expand (2x + 1)3 using binomial theorem and find the derivative as
a polynomial function as illustrated below.
d d
f ( x) = (2 x + 1) 3 
dx dx
d
= (8 x3 + 12 x 2 + 6 x + 1)
dx
= 24x2 + 24x + 6
= 6 (2x + 1)2
Now, observe that f (x) = (h o g) (x)
where g(x) = 2x + 1 and h(x) = x3. Put t = g(x) = 2x + 1. Then f(x) = h(t) = t3. Thus
df dh dt
= 6 (2x + 1) 2 = 3(2x + 1)2 . 2 = 3t2 . 2 = ⋅
dx dt dx
The advantage with such observation is that it simplifies the calculation in finding
the derivative of, say, (2x + 1)100. We may formalise this observation in the following
theorem called the chain rule.
Theorem 4 (Chain Rule) Let f be a real valued function which is a composite of two
dt dv
functions u and v ; i.e., f = v o u. Suppose t = u (x) and if both and exist, we have
dx dt
df dv dt
= ⋅
dx dt dx
We skip the proof of this theorem. Chain rule may be extended as follows. Suppose
f is a real valued function which is a composite of three functions u, v and w ; i.e.,
f = (w o u) o v. If t = v (x) and s = u (t), then

df d ( w o u ) dt dw ds dt
= ⋅ = ⋅ ⋅
dx dt dx ds dt dx
provided all the derivatives in the statement exist. Reader is invited to formulate chain
rule for composite of more functions.

Example 21 Find the derivative of the function given by f (x) = sin (x2).

Solution Observe that the given function is a composite of two functions. Indeed, if
t = u(x) = x2 and v(t) = sin t, then
f (x) = (v o u) (x) = v(u(x)) = v(x2) = sin x2
CONTINUITY AND DIFFERENTIABILITY 165

dv dt
Put t = u(x) = x2. Observe that = cos t and = 2 x exist. Hence, by chain rule
dt dx
df dv dt
= ⋅ = cos t ⋅ 2 x
dx dt dx
It is normal practice to express the final result only in terms of x. Thus
df
= cos t ⋅ 2 x = 2 x cos x2
dx
Alternatively, We can also directly proceed as follows:
dy d
y = sin (x2) ⇒ = (sin x2)
dx dx
d 2
= cos x2 (x ) = 2x cos x2
dx
Example 22 Find the derivative of tan (2x + 3).
Solution Let f (x) = tan (2x + 3), u (x) = 2x + 3 and v(t) = tan t. Then
(v o u) (x) = v(u(x)) = v(2x + 3) = tan (2x + 3) = f (x)
dv
Thus f is a composite of two functions. Put t = u(x) = 2x + 3. Then = sec2 t and
dt
dt
= 2 exist. Hence, by chain rule
dx
df dv dt
= ⋅ = 2sec 2 (2 x + 3)
dx dt dx
Example 23 Differentiate sin (cos (x2 )) with respect to x.
Solution The function f(x) = sin (cos (x2)) is a composition f (x) = (w o v o u) (x) of the
three functions u, v and w, where u(x) = x2, v(t) = cos t and w(s) = sin s. Put
dw ds dt
t = u(x) = x2 and s = v (t) = cos t. Observe that = cos s , = − sin t and = 2x
ds dt dx
exist for all real x. Hence by a generalisation of chain rule, we have

df dw ds dt
= ⋅ ⋅ = (cos s) . (– sin t) . (2x) = – 2x sin x2 . cos (cos x2 )
dx ds dt dx
166 MATHEMATICS

Alternatively, we can proceed as follows:


y = sin (cos x2)
dy d d
Therefore = sin (cos x2) = cos (cos x2) (cos x2)
dx dx dx
d
= cos (cos x2) (– sin x2) (x2 )
dx
= – sin x2 cos (cos x2 ) (2x)
= – 2x sin x2 cos (cos x2)

EXERCISE 5.2
Differentiate the functions with respect to x in Exercises 1 to 8.
1. sin (x2 + 5) 2. cos (sin x) 3. sin (ax + b)
sin (ax + b)
4. sec (tan ( x )) 5. cos (cx + d) 6. cos x3 . sin2 (x5 )

7. 2 cot ( x2 ) 8. cos ( x )
9. Prove that the function f given by
f (x) = |x – 1|, x ∈ R
is not differentiable at x = 1.
10. Prove that the greatest integer function defined by
f (x) = [x], 0 < x < 3
is not differentiable at x = 1 and x = 2.

5.3.2 Derivatives of implicit functions


Until now we have been differentiating various functions given in the form y = f (x).
But it is not necessary that functions are always expressed in this form. For example,
consider one of the following relationships between x and y:
x–y–π=0
x + sin xy – y = 0
In the first case, we can solve for y and rewrite the relationship as y = x – π. In
the second case, it does not seem that there is an easy way to solve for y. Nevertheless,
there is no doubt about the dependence of y on x in either of the cases. When a
relationship between x and y is expressed in a way that it is easy to solve for y and
write y = f (x), we say that y is given as an explicit function of x. In the latter case it
CONTINUITY AND DIFFERENTIABILITY 167

is implicit that y is a function of x and we say that the relationship of the second type,
above, gives function implicitly. In this subsection, we learn to differentiate implicit
functions.

dy
Example 24 Find if x – y = π.
dx
Solution One way is to solve for y and rewrite the above as
y=x –π
dy
But then =1
dx
Alternatively, directly differentiating the relationship w.r.t., x, we have
d dπ
( x − y) =
dx dx

Recall that means to differentiate the constant function taking value π
dx
everywhere w.r.t., x. Thus
d d
( x) − ( y ) = 0
dx dx
which implies that
dy dx
= =1
dx dx

dy
Example 25 Find , if y + sin y = cos x.
dx
Solution We differentiate the relationship directly with respect to x, i.e.,
dy d d
+ (sin y) = (cos x )
dx dx dx
which implies using chain rule
dy dy
+ cos y ⋅ = – sin x
dx dx
dy sin x
This gives = −
dx 1 + cos y
where y ≠ (2n + 1) π
168 MATHEMATICS

5.3.3 Derivatives of inverse trigonometric functions


We remark that inverse trigonometric functions are continuous functions, but we will
not prove this. Now we use chain rule to find derivatives of these functions.
Example 26 Find the derivative of f given by f (x) = sin–1 x assuming it exists.
Solution Let y = sin–1 x. Then, x = sin y.
Differentiating both sides w.r.t. x, we get

dy
1 = cos y
dx

dy 1 1
which implies that = =
dx cos y cos(sin−1 x)

π π
Observe that this is defined only for cos y ≠ 0, i.e., sin–1 x ≠ − , , i.e., x ≠ – 1, 1,
2 2
i.e., x ∈ (– 1, 1).
To make this result a bit more attractive, we carry out the following manipulation.
Recall that for x ∈ (– 1, 1), sin (sin–1 x) = x and hence
cos2 y = 1 – (sin y) 2 = 1 – (sin (sin–1 x))2 = 1 – x2

 π π
Also, since y ∈  − ,  , cos y is positive and hence cos y = 1 − x 2
 2 2
Thus, for x ∈ (– 1, 1),
dy 1 1
= =
dx cos y 1 − x2

Example 27 Find the derivative of f given by f (x) = tan–1 x assuming it exists.


Solution Let y = tan–1 x. Then, x = tan y.
Differentiating both sides w.r.t. x, we get

dy
1 = sec2 y
dx
which implies that
dy 1 1 1 1
= = = −1
=
dx sec y 1 + tan y 1 + (tan (tan x)) 1 + x 2
2 2 2
CONTINUITY AND DIFFERENTIABILITY 169

Finding of the derivatives of other inverse trigonometric functions is left as exercise.


The following table gives the derivatives of the remaining inverse trigonometric functions
(Table 5.4):
Table 5.4

f (x) cos–1x cot–1x sec –1x cosec–1x

−1 −1 1 −1
f ′(x)
1 − x2 1 + x2 x x2 − 1 x x2 − 1
Domain of f ′ (–1, 1) R (–∞, –1) ∪ (1, ∞) (–∞, –1) ∪ (1, ∞)

EXERCISE 5.3
dy
Find in the following:
dx
1. 2x + 3y = sin x 2. 2x + 3y = sin y 3. ax + by2 = cos y
4. xy + y2 = tan x + y 5. x2 + xy + y2 = 100 6. x3 + x2y + xy2 + y3 = 81

 2x 
7. sin2 y + cos xy = κ 8. sin2 x + cos2 y = 1 9. y = sin–1  
1 + x 2 

 3x − x 3  1 1
10. y = tan–1  2 
, − <x<
 1− 3x  3 3

 1 − x2 
11. y = cos −1  , 0 < x <1
 1 + x2 

 1− x 2 
12. y = sin −1   ,0 < x < 1
 1+ x 2 

 2x 
13. y = cos −1  , −1 < x <1
 1 + x2 

14. (
y = sin −1 2 x 1 − x2 , − )
1
< x<
1
2 2

 1 , 1
15. y = sec−1  2  0< x <
 2x − 1  2
170 MATHEMATICS

5.4 Exponential and Logarithmic Functions


Till now we have learnt some aspects of different classes of functions like polynomial
functions, rational functions and trigonometric functions. In this section, we shall
learn about a new class of (related)
functions called exponential functions and
logarithmic functions. It needs to be
emphasized that many statements made
in this section are motivational and precise
proofs of these are well beyond the scope
of this text.
The Fig 5.9 gives a sketch of
y = f1 (x) = x, y = f 2(x) = x2 , y = f3(x) = x3
and y = f4(x) = x4. Observe that the curves
get steeper as the power of x increases.
Steeper the curve, faster is the rate of
growth. What this means is that for a fixed
increment in the value of x (> 1), the Fig 5.9
increment in the value of y = fn (x) increases as n increases for n = 1, 2, 3, 4. It is
conceivable that such a statement is true for all positive values of n, where fn (x) = xn.
Essentially, this means that the graph of y = fn (x) leans more towards the y-axis as n
increases. For example, consider f10(x) = x10 and f15 (x) = x15. If x increases from 1 to
2, f10 increases from 1 to 210 whereas f15 increases from 1 to 215. Thus, for the same
increment in x, f15 grow faster than f10.
Upshot of the above discussion is that the growth of polynomial functions is dependent
on the degree of the polynomial function – higher the degree, greater is the growth.
The next natural question is: Is there a function which grows faster than any polynomial
function. The answer is in affirmative and an example of such a function is
y = f (x) = 10x.
Our claim is that this function f grows faster than fn (x) = xn for any positive integer n.
For example, we can prove that 10x grows faster than f100 (x) = x100. For large values
of x like x = 10 3, note that f100 (x) = (103) 100 = 10300 whereas f (103) = 10 10 3 = 101000.
Clearly f(x) is much greater than f100 (x). It is not difficult to prove that for all
x > 103 , f(x) > f100 (x). But we will not attempt to give a proof of this here. Similarly, by
choosing large values of x, one can verify that f(x) grows faster than fn (x) for any
positive integer n.
CONTINUITY AND DIFFERENTIABILITY 171

Definition 3 The exponential function with positive base b > 1 is the function
y = f (x) = bx
The graph of y = 10x is given in the Fig 5.9.
It is advised that the reader plots this graph for particular values of b like 2, 3 and 4.
Following are some of the salient features of the exponential functions:
(1) Domain of the exponential function is R, the set of all real numbers.
(2) Range of the exponential function is the set of all positive real numbers.
(3) The point (0, 1) is always on the graph of the exponential function (this is a
restatement of the fact that b 0 = 1 for any real b > 1).
(4) Exponential function is ever increasing; i.e., as we move from left to right, the
graph rises above.
(5) For very large negative values of x, the exponential function is very close to 0. In
other words, in the second quadrant, the graph approaches x-axis (but never
meets it).
Exponential function with base 10 is called the common exponential function. In
the Appendix A.1.4 of Class XI, it was observed that the sum of the series
1 1
1+ + + ...
1! 2!
is a number between 2 and 3 and is denoted by e. Using this e as the base we obtain an
extremely important exponential function y = ex.
This is called natural exponential function.
It would be interesting to know if the inverse of the exponential function exists and
has nice interpretation. This search motivates the following definition.
Definition 4 Let b > 1 be a real number. Then we say logarithm of a to base b is x if
b x = a.
Logarithm of a to base b is denoted by logb a. Thus logb a = x if bx = a. Let us
work with a few explicit examples to get a feel for this. We know 23 = 8. In terms of
logarithms, we may rewrite this as log2 8 = 3. Similarly, 104 = 10000 is equivalent to
saying log10 10000 = 4. Also, 625 = 54 = 252 is equivalent to saying log5 625 = 4 or
log25 625 = 2.
On a slightly more mature note, fixing a base b > 1, we may look at logarithm as
a function from positive real numbers to all real numbers. This function, called the
logarithmic function, is defined by
logb : R+ → R
x → logb x = y if by = x
172 MATHEMATICS

As before if the base b = 10, we say it


is common logarithms and if b = e, then
we say it is natural logarithms. Often
natural logarithm is denoted by ln. In this
chapter, log x denotes the logarithm
function to base e, i.e., ln x will be written
as simply log x. The Fig 5.10 gives the plots
of logarithm function to base 2, e and 10.
Some of the important observations
about the logarithm function to any base
b > 1 are listed below: Fig 5.10
(1) We cannot make a meaningful definition of logarithm of non-positive numbers
and hence the domain of log function is R+ .
(2) The range of log function is the set of all real numbers.
(3) The point (1, 0) is always on the graph of the log function.
(4) The log function is ever increasing,
i.e., as we move from left to right
the graph rises above.
(5) For x very near to zero, the value
of log x can be made lesser than
any given real number. In other
words in the fourth quadrant the
graph approaches y-axis (but never
meets it).
(6) Fig 5.11 gives the plot of y = ex and
y = ln x. It is of interest to observe
that the two curves are the mirror Fig 5.11
images of each other reflected in the line y = x.
Two properties of ‘log’ functions are proved below:
(1) There is a standard change of base rule to obtain loga p in terms of log b p. Let
loga p = α, logb p = β and logb a = γ. This means a α = p, b β = p and b γ = a.
Substituting the third equation in the first one, we have
(bγ ) α = bγα = p
Using this in the second equation, we get
b β = p = b γα
CONTINUITY AND DIFFERENTIABILITY 173

β
which implies β = αγ or α = . But then
γ
log b p
loga p =
log b a
(2) Another interesting property of the log function is its effect on products. Let
logb pq = α. Then bα = pq. If logb p = β and logb q = γ, then bβ = p and b γ = q.
But then bα = pq = bβ bγ = b β + γ
which implies α = β + γ, i.e.,
logb pq = logb p + logb q
A particularly interesting and important consequence of this is when p = q. In
this case the above may be rewritten as
logb p2 = logb p + logb p = 2 log p
An easy generalisation of this (left as an exercise!) is
logb pn = n log p
for any positive integer n. In fact this is true for any real number n, but we will
not attempt to prove this. On the similar lines the reader is invited to verify
x
log b = logb x – logb y
y
Example 28 Is it true that x = elog x for all real x?
Solution First, observe that the domain of log function is set of all positive real numbers.
So the above equation is not true for non-positive real numbers. Now, let y = elog x. If
y > 0, we may take logarithm which gives us log y = log (elog x) = log x . log e = log x. Thus
y = x. Hence x = elog x is true only for positive values of x.
One of the striking properties of the natural exponential function in differential
calculus is that it doesn’t change during the process of differentiation. This is captured
in the following theorem whose proof we skip.
Theorem 5*
d x
(1) The derivative of ex w.r.t., x is ex; i.e., (e ) = ex.
dx
1 d 1
(2) The derivative of log x w.r.t., x is ; i.e., (log x) = .
x dx x

* Please see supplementary material on Page 286.


174 MATHEMATICS

Example 29 Differentiate the following w.r.t. x:


(i) e –x (ii) sin (log x), x > 0 (iii) cos–1 (ex) (iv) ecos x

Solution
(i) Let y = e– x. Using chain rule, we have
dy −x d
= e ⋅ (– x) = – e– x
dx dx
(ii) Let y = sin (log x). Using chain rule, we have
dy d cos (log x)
= cos (log x) ⋅ (log x) =
dx dx x
(iii) Let y = cos–1 (ex). Using chain rule, we have

dy −1 d x −ex
= ⋅ (e ) =
dx 1 − (e x ) 2 dx 1 − e2 x
(iv) Let y = ecos x. Using chain rule, we have
dy
= ecos x ⋅ ( − sin x) = − (sin x ) ecos x
dx

EXERCISE 5.4
Differentiate the following w.r.t. x:

ex −1 3
1. 2. esin x
3. ex
sin x
2 5
4. sin (tan–1 e–x) 5. log (cos ex) 6. ex + e x + ... + e x
cos x
7. e x
, x >0 8. log (log x), x > 1 9. , x >0
log x
10. cos (log x + ex), x > 0
5.5. Logarithmic Differentiation
In this section, we will learn to differentiate certain special class of functions given in
the form
y = f (x) = [u(x)]v (x)
By taking logarithm (to base e) the above may be rewritten as
log y = v(x) log [u(x)]
CONTINUITY AND DIFFERENTIABILITY 175

Using chain rule we may differentiate this to get


1 dy 1 .
⋅ = v(x) ⋅ u′(x) + v′(x) . log [u(x)]
y dx u( x)
which implies that
dy  v( x ) 
= y ⋅ u′( x ) + v′( x) ⋅ log [ u ( x) ] 
dx  u ( x) 
The main point to be noted in this method is that f (x) and u(x) must always be
positive as otherwise their logarithms are not defined. This process of differentiation is
known as logarithms differentiation and is illustrated by the following examples:

( x − 3) ( x2 + 4)
Example 30 Differentiate w.r.t. x.
3 x2 + 4 x + 5

( x − 3) ( x2 + 4)
Solution Let y =
(3 x2 + 4 x + 5)
Taking logarithm on both sides, we have
1
log y = [log (x – 3) + log (x2 + 4) – log (3x2 + 4x + 5)]
2
Now, differentiating both sides w.r.t. x, we get
1 dy 1 1 2x 6x + 4 
=  + 2 − 2
2  ( x − 3) x + 4 3 x + 4 x + 5 

y dx
dy y 1 2x 6x + 4 
=  + 2 − 2
2  ( x − 3) x + 4 3 x + 4 x + 5 
or
dx

1 ( x − 3) ( x 2 + 4)  1 2x 6x + 4 
= + 2 − 2
2 3 x + 4 x + 5  ( x − 3) x + 4 3 x + 4 x + 5 
2 

Example 31 Differentiate a x w.r.t. x, where a is a positive constant.


Solution Let y = a x. Then
log y = x log a
Differentiating both sides w.r.t. x, we have
1 dy
y dx = log a
176 MATHEMATICS

dy
or = y log a
dx
d x
Thus ( a ) = ax log a
dx
d x d x loga d
Alternatively (a ) = (e ) = e x log a ( x log a)
dx dx dx
= ex log a . log a = a x log a.
Example 32 Differentiate xsin x, x > 0 w.r.t. x.
Solution Let y = xsin x. Taking logarithm on both sides, we have
log y = sin x log x
1 dy d d
Therefore . = sin x (log x) + log x (sin x)
y dx dx dx
1 dy 1
or (sin x ) + log x cos x
y dx = x
dy  sin x 
or = y + cos x log x 
dx  x 

sin x  sin x 
= x  + cos x log x 
 x 
= x sin x −1 ⋅ sin x + xsin x ⋅ cos x log x

dy
Example 33 Find , if yx + xy + xx = ab.
dx
Solution Given that yx + xy + xx = ab.
Putting u = yx, v = xy and w = xx, we get u + v + w = a b
du dv dw
Therefore + + =0 ... (1)
dx dx dx
Now, u = yx. Taking logarithm on both sides, we have
log u = x log y
Differentiating both sides w.r.t. x, we have
CONTINUITY AND DIFFERENTIABILITY 177

1 du d d
⋅ = x (log y ) + log y ( x)
u dx dx dx
1 dy
= x ⋅ + log y ⋅ 1
y dx

du  x dy   x dy 
So = u + log y  = y x  + log y  ... (2)
dx  y dx   y dx 
Also v = xy
Taking logarithm on both sides, we have
log v = y log x
Differentiating both sides w.r.t. x, we have
1 dv d dy
⋅ = y (log x ) + log x
v dx dx dx
1 dy
= y ⋅ + log x ⋅
x dx
dv y dy 
So = v  + log x 
dx x dx 

y y dy 
= x  + log x  ... (3)
x dx 
Again w = xx
Taking logarithm on both sides, we have
log w = x log x.
Differentiating both sides w.r.t. x, we have

1 dw d d
⋅ = x (log x) + log x ⋅ ( x )
w dx dx dx

1
= x⋅ + log x ⋅1
x

dw
i.e. = w (1 + log x)
dx
= xx (1 + log x) ... (4)
178 MATHEMATICS

From (1), (2), (3), (4), we have


 x dy  y dy 
yx  + log y  + x y  + log x  + xx (1 + log x) = 0
 y dx  x dx 
dy
or (x . yx – 1 + xy . log x) = – xx (1 + log x) – y . xy–1 – yx log y
dx
dy − [ y x log y + y . x y −1 + x x (1 + log x)]
Therefore =
dx x . y x −1 + x y log x

EXERCISE 5.5
Differentiate the functions given in Exercises 1 to 11 w.r.t. x.

(x − 1) ( x − 2)
1. cos x . cos 2x . cos 3x 2.
( x − 3) ( x − 4) (x − 5)
3. (log x)cos x 4. xx – 2sin x
x  1
 1  1+ 
5. (x + 3) . (x + 4) . (x + 5)
2 3 4 6.  x +  + x x 
 x
7. (log x)x + xlog x 8. (sin x) x + sin–1 x
x2 + 1
9. xsin x + (sin x)cos x 10. x x cos x +
x2 − 1
1
11. (x cos x)x + ( x sin x) x
dy
Find of the functions given in Exercises 12 to 15.
dx
12. xy + yx = 1 13. yx = xy
14. y
(cos x) = (cos y) x
15. xy = e(x – y)
16. Find the derivative of the function given by f(x) = (1 + x) (1 + x2) (1 + x4 ) (1 + x8)
and hence find f ′(1).
17. Differentiate (x2 – 5x + 8) (x3 + 7x + 9) in three ways mentioned below:
(i) by using product rule
(ii) by expanding the product to obtain a single polynomial.
(iii) by logarithmic differentiation.
Do they all give the same answer?
CONTINUITY AND DIFFERENTIABILITY 179

18. If u, v and w are functions of x, then show that


d du dv dw
(u. v. w) = v. w + u . . w +u . v
dx dx dx dx
in two ways - first by repeated application of product rule, second by logarithmic
differentiation.
5.6 Derivatives of Functions in Parametric Forms
Sometimes the relation between two variables is neither explicit nor implicit, but some
link of a third variable with each of the two variables, separately, establishes a relation
between the first two variables. In such a situation, we say that the relation between
them is expressed via a third variable. The third variable is called the parameter. More
precisely, a relation expressed between two variables x and y in the form
x = f (t), y = g (t) is said to be parametric form with t as a parameter.
In order to find derivative of function in such form, we have by chain rule.
dy dy dx
= ⋅
dt dx dt
dy
dy  dx 
or = dt  whenever ≠ 0
dx dx  dt 
dt

dy g ′( t )  dy dx 
Thus =  as = g ′( t ) and = f ′( t )  [provided f ′(t) ≠ 0]
dx f ′(t )  dt dt 

dy
Example 34 Find , if x = a cos θ, y = a sin θ.
dx
Solution Given that
x = a cos θ, y = a sin θ
dx dy
Therefore = – a sin θ, = a cos θ
dθ dθ
dy
dy d θ a cos θ = − cot θ
Hence = dx =
dx − a sin θ

180 MATHEMATICS

dy
Example 35 Find , if x = at2, y = 2at.
dx
Solution Given that x = at2, y = 2at
dx dy
So = 2at and = 2a
dt dt

dy
dy dt = 2a = 1
Therefore =
dx dx 2 at t
dt
dy
Example 36 Find , if x = a (θ + sin θ), y = a (1 – cos θ).
dx
dx dy
Solution We have = a(1 + cos θ), = a (sin θ)
dθ dθ

dy
dy d θ = a sin θ = tan θ
Therefore =
dx dx a (1 + cos θ) 2

dy
ANote It may be noted here that dx
is expressed in terms of parameter only
without directly involving the main variables x and y.
2 2 2
Example 37 Find dy , if x 3 + y 3 = a 3 .
dx
Solution Let x = a cos3 θ, y = a sin3 θ. Then
2 2 2 2
x 3 + y 3 = (a cos3 θ)3 + (asin3 θ) 3
2 2
2 2
= a 3 (cos θ + (sin θ) = a 3
2 2 2
3 3
Hence, x = a cos θ, y = a sin θ is parametric equation of x3 + y3 = a3
dx dy
Now = – 3a cos2 θ sin θ and = 3a sin2 θ cos θ
dθ dθ
CONTINUITY AND DIFFERENTIABILITY 181

dy
dy 3a sin 2 θ cos θ y
Therefore = dθ = 2
= − tan θ = − 3
dx dx − 3 a cos θ sin θ x

ANote Had we proceeded in implicit way, it would have been quite tedious.
EXERCISE 5.6
If x and y are connected parametrically by the equations given in Exercises 1 to 10,
dy
without eliminating the parameter, Find .
dx
1. x = 2at2, y = at4 2. x = a cos θ, y = b cos θ
4
3. x = sin t, y = cos 2t 4. x = 4t, y =
t
5. x = cos θ – cos 2θ, y = sin θ – sin 2θ
sin 3 t cos3 t
6. x = a (θ – sin θ), y = a (1 + cos θ) 7. x = , y=
cos 2t cos2 t

 t
8.x = a  cos t + log tan  y = a sin t 9. x = a sec θ, y = b tan θ
 2
10. x = a (cos θ + θ sin θ), y = a (sin θ – θ cos θ)
−1 −1 dy y
11. If x = a sin t , y = ac os t , show that =−
dx x
5.7 Second Order Derivative
Let y = f (x). Then
dy
= f ′(x) ... (1)
dx
If f′(x) is differentiable, we may differentiate (1) again w.r.t. x. Then, the left hand
d  dy 
side becomes   which is called the second order derivative of y w.r.t. x and
dx  dx 

d2 y
is denoted by . The second order derivative of f (x) is denoted by f ″(x). It is also
dx2
182 MATHEMATICS

denoted by D2 y or y″ or y2 if y = f (x). We remark that higher order derivatives may be


defined similarly.
d2y
Example 38 Find , if y = x3 + tan x.
dx 2
Solution Given that y = x3 + tan x. Then
dy
= 3x2 + sec2 x
dx
d2y d ( 2
Therefore 2 =
3 x + sec 2 x )
dx dx
= 6x + 2 sec x . sec x tan x = 6x + 2 sec2 x tan x
d2y
Example 39 If y = A sin x + B cos x, then prove that + y =0.
dx 2
Solution We have
dy
= A cos x – B sin x
dx
d2y d
and 2 = (A cos x – B sin x)
dx dx
= – A sin x – B cos x = – y
d2 y
Hence +y=0
dx2
d2y dy
Example 40 If y = 3e2x + 2e3 x, prove that 2
− 5 + 6y = 0 .
dx dx
Solution Given that y = 3e2x + 2e3 x. Then
dy
= 6e2 x + 6e3x = 6 (e2 x + e3x)
dx
d2y
Therefore = 12e2x + 18e3x = 6 (2e2x + 3e3x)
dx 2
d2y dy
Hence 2
−5 + 6y = 6 (2e2x + 3e3x)
dx dx
– 30 (e2x + e3x) + 6 (3e2 x + 2e3x) = 0
CONTINUITY AND DIFFERENTIABILITY 183

d2y dy
Example 41 If y = sin–1 x, show that (1 – x2) 2
− x =0.
dx dx
Solution We have y = sin– 1 x. Then
dy 1
=
dx (1 − x2 )

dy
or (1 − x 2 ) =1
dx
d  2 dy 
So  (1 − x ) .  = 0
dx  dx 

d 2 y dy d
or (1 − x 2 ) ⋅ + ⋅
dx 2 dx dx
( )
(1 − x2 ) = 0

d 2 y dy 2x
or (1 − x 2 ) ⋅ 2
− ⋅ =0
dx dx 2 1 − x 2

d2y dy
Hence (1 − x2 ) 2
− x =0
dx dx
Alternatively, Given that y = sin–1 x, we have
1
y1 = , i.e., (1 − x 2 ) y2 = 1
2 1
1− x
So (1 − x2 ) . 2 y1 y2 + y12 (0 − 2 x) = 0
Hence (1 – x2) y2 – xy1 = 0

EXERCISE 5.7
Find the second order derivatives of the functions given in Exercises 1 to 10.
1. x2 + 3x + 2 2. x 20 3. x . cos x
3
4. log x 5. x log x 6. ex sin 5x
7. e6x cos 3x 8. tan–1 x 9. log (log x)
10. sin (log x)
d2y
11. If y = 5 cos x – 3 sin x, prove that + y =0
dx 2
184 MATHEMATICS

d2y
12. If y = cos–1 x, Find in terms of y alone.
dx 2
13. If y = 3 cos (log x) + 4 sin (log x), show that x2 y2 + xy1 + y = 0

d2y dy
14. If y = Aemx + Benx, show that 2
− (m + n) + mny = 0
dx dx
d2y
7x – 7x
15. If y = 500e + 600e , show that = 49 y
dx 2
2
d 2 y  dy 
y
16. If e (x + 1) = 1, show that = 
dx 2  dx 
17. If y = (tan–1 x)2, show that (x2 + 1)2 y2 + 2x (x2 + 1) y1 = 2

5.8 Mean Value Theorem


In this section, we will state two fundamental results in Calculus without proof. We
shall also learn the geometric interpretation of these theorems.
Theorem 6 (Rolle’s Theorem) Let f : [a, b] → R be continuous on [a, b] and
differentiable on (a, b), such that f(a) = f(b), where a and b are some real numbers.
Then there exists some c in (a, b) such that f ′(c) = 0.
In Fig 5.12 and 5.13, graphs of a few typical differentiable functions satisfying the
hypothesis of Rolle’s theorem are given.

Fig 5.12 Fig 5.13


Observe what happens to the slope of the tangent to the curve at various points
between a and b. In each of the graphs, the slope becomes zero at least at one point.
That is precisely the claim of the Rolle’s theorem as the slope of the tangent at any
point on the graph of y = f (x) is nothing but the derivative of f (x) at that point.
CONTINUITY AND DIFFERENTIABILITY 185

Theorem 7 (Mean Value Theorem) Let f : [a, b] → R be a continuous function on


[a, b] and differentiable on (a, b). Then there exists some c in (a, b) such that
f (b ) − f (a )
f ′(c ) =
b− a
Observe that the Mean Value Theorem (MVT) is an extension of Rolle’s theorem.
Let us now understand a geometric interpretation of the MVT. The graph of a function
y = f(x) is given in the Fig 5.14. We have already interpreted f′(c) as the slope of the
f (b ) − f (a )
tangent to the curve y = f (x) at (c, f (c)). From the Fig 5.14 it is clear that
b−a
is the slope of the secant drawn between (a, f (a)) and (b, f(b)). The MVT states that
there is a point c in (a, b) such that the slope of the tangent at (c, f(c)) is same as the
slope of the secant between (a, f (a)) and (b, f(b)). In other words, there is a point c in
(a, b) such that the tangent at (c, f(c)) is parallel to the secant between (a, f(a)) and
(b, f (b)).

Fig 5.14

Example 42 Verify Rolle’s theorem for the function y = x2 + 2, a = – 2 and b = 2.

Solution The function y = x2 + 2 is continuous in [– 2, 2] and differentiable in (– 2, 2).


Also f(– 2) = f( 2) = 6 and hence the value of f(x) at – 2 and 2 coincide. Rolle’s
theorem states that there is a point c ∈ (– 2, 2), where f ′ (c) = 0. Since f′ (x) = 2x, we
get c = 0. Thus at c = 0, we have f′ (c) = 0 and c = 0 ∈ (– 2, 2).
Example 43 Verify Mean Value Theorem for the function f (x) = x2 in the interval [2, 4].

Solution The function f(x) = x2 is continuous in [2, 4] and differentiable in (2, 4) as its
derivative f ′(x) = 2x is defined in (2, 4).
186 MATHEMATICS

Now, f(2) = 4 and f (4) = 16. Hence


f ( b ) − f ( a ) 16 − 4
= =6
b−a 4− 2
MVT states that there is a point c ∈ (2, 4) such that f ′(c) = 6. But f′ (x) = 2x which
implies c = 3. Thus at c = 3 ∈ (2, 4), we have f ′ (c) = 6.

EXERCISE 5.8
1. Verify Rolle’s theorem for the function f (x) = x2 + 2x – 8, x ∈ [– 4, 2].
2. Examine if Rolle’s theorem is applicable to any of the following functions. Can
you say some thing about the converse of Rolle’s theorem from these example?
(i) f(x) = [x] for x ∈ [5, 9] (ii) f(x) = [x] for x ∈ [– 2, 2]
2
(iii) f(x) = x – 1 for x ∈ [1, 2]
3. If f : [– 5, 5] → R is a differentiable function and if f ′(x) does not vanish
anywhere, then prove that f(– 5) ≠ f(5).
4. Verify Mean Value Theorem, if f (x) = x2 – 4x – 3 in the interval [a, b], where
a = 1 and b = 4.
5. Verify Mean Value Theorem, if f(x) = x3 – 5x2 – 3x in the interval [a, b], where
a = 1 and b = 3. Find all c ∈ (1, 3) for which f′(c) = 0.
6. Examine the applicability of Mean Value Theorem for all three functions given in
the above exercise 2.

Miscellaneous Examples
Example 44 Differentiate w.r.t. x, the following function:
1 2
(i) 3x + 2 + (ii) esec x + 3cos –1 x (iii) log7 (log x)
2x 2 + 4
Solution
1 1
1 2

(i) Let y = 3x + 2 + = (3 x + 2) 2 + (2 x + 4) 2
2x 2 + 4
2
Note that this function is defined at all real numbers x > − . Therefore
3
1 1
dy 1 −1 d  1 − −1 d
= (3 x + 2) 2 ⋅ (3 x + 2) +  −  (2 x2 + 4) 2 ⋅ (2 x2 + 4)
dx 2 dx  2 dx
CONTINUITY AND DIFFERENTIABILITY 187

1 3
1 − 1 2

= (3 x + 2) 2 ⋅ (3) −   (2 x + 4) 2 ⋅ 4 x
2 2
3 2x
= − 3
2 3x + 2 (2x 2 + 4) 2
2
This is defined for all real numbers x > − .
3
2
(ii) Let y = esec x + 3cos −1 x
This is defined at every real number in [ −1, 1] . Therefore

dy sec2 x d  1 
= e ⋅ (sec2 x ) + 3  −
dx dx  1 − x2 

sec2 x  d   1 
= e ⋅  2sec x (sec x ) + 3  − 
 dx   1 − x2 

sec2 x  1 
= 2sec x (sec x tan x) e +3 −
2 
 1− x 

2 sec2 x  1 
= 2sec x tan x e + 3 −
2 
 1− x 
Observe that the derivative of the given function is valid only in [ −1, 1] − {0} as
the derivative of cos– 1 x exists only in (– 1, 1) and the function itself is not
defined at 0.
log (log x )
(iii) Let y = log 7 (log x) = (by change of base formula).
log 7
The function is defined for all real numbers x > 1. Therefore
dy 1 d
= (log (log x))
dx log 7 dx
1 1 d
= ⋅ (log x)
log 7 log x dx
1
=
x log 7 log x
188 MATHEMATICS

Example 45 Differentiate the following w.r.t. x.

 sin x   2 x +1 
(i) cos – 1 (sin x) (ii) tan −1   (iii) sin −1  
 1 + cos x   1+ 4x 
Solution
(i) Let f (x) = cos – 1 (sin x). Observe that this function is defined for all real numbers.
We may rewrite this function as
f(x) = cos – 1 (sin x)
−1  π 
= cos  cos  − x 
 2 
π
= −x
2
Thus f ′(x) = – 1.
(ii) Let f (x) = tan – 1 
sin x 
 . Observe that this function is defined for all real
 1 + cos x 
numbers, where cos x ≠ – 1; i.e., at all odd multiplies of π. We may rewrite this
function as

− 1  sin x 
f(x) = tan  
 1 + cos x 
  x  x 
 2 sin  2  cos  2  
−1
= tan  x

 2 cos 2 
 2 

−1   x  x
= tan  tan   =
  2  2
 x
Observe that we could cancel cos   in both numerator and denominator as it
2
1.
is not equal to zero. Thus f′(x) =
2
 2x + 1 
(iii) Let f(x) = sin – 1   . To find the domain of this function we need to find all
1 + 4 x 

2 x +1
x such that −1 ≤ ≤ 1 . Since the quantity in the middle is always positive,
1+ 4x
CONTINUITY AND DIFFERENTIABILITY 189

2 x +1
we need to find all x such that ≤ 1 , i.e., all x such that 2x + 1 ≤ 1 + 4x. We
1+ 4x
1
may rewrite this as 2 ≤ + 2x which is true for all x. Hence the function
2x
is defined at every real number. By putting 2x = tan θ, this function may be
rewritten as

−1  2 
x+1
f(x) = sin  x
1 + 4 

−1  2 ⋅ 2 
x
= sin  2
1 + ( 2 x ) 

−1  2 tan θ 
= sin 
1 + tan 2 θ 
= sin – 1 [sin 2θ]
= 2θ = 2 tan – 1 (2x)
1 d
Thus f ′(x) = 2 ⋅ ⋅ (2 x )
x 2
1+ ( 2 ) dx

2
= x
⋅ (2 x ) log 2
1+ 4
2 x + 1 log 2
=
1+ 4x
Example 46 Find f ′(x) if f(x) = (sin x) sin x for all 0 < x < π.
Solution The function y = (sin x) sin x is defined for all positive real numbers. Taking
logarithms, we have
log y = log (sin x) sin x = sin x log (sin x)
1 dy d
Then = (sin x log (sin x))
y dx dx

1 d
= cos x log (sin x) + sin x . ⋅ (sin x )
sin x dx
= cos x log (sin x) + cos x
= (1 + log (sin x)) cos x
190 MATHEMATICS

dy
Thus = y ((1 + log (sin x)) cos x) = (1 + log (sin x)) ( sin x)sin x cos x
dx
dy
Example 47 For a positive constant a find , where
dx
1 a
t+  1
y =a and x =  t + 
t ,
 t
Solution Observe that both y and x are defined for all real t ≠ 0. Clearly
1
t+ d  1
dy
dt
=
dt a ( )
d t +1
t = a t  t +  ⋅ log a
dt  t 
1
t 1 − 
t+ 1
= a   log a
 t2 
a −1
dx  1 d  1
Similarly = a t +  ⋅ t + 
dt  t dt  t
a −1
 1  1
= a t +  ⋅ 1 − 2 
 t  t 
dx
≠ 0 only if t ≠ ± 1. Thus for t ≠ ± 1,
dt
1
t+  1
dy a t  1 − 2  log a
dy dt  t 
= = a − 1
dx dx  1  1
a t +  ⋅  1 − 2 
dt  t  t 
1
t+
a t log a
= a −1
 1
a t + 
 t
Example 48 Differentiate sin x w.r.t. e x.
2 cos

Solution Let u (x) = sin2 x and v (x) = e cos x. We want to find du = du / dx . Clearly
dv dv / dx
du dv
= 2 sin x cos x and = e cos x (– sin x) = – (sin x) e cos x
dx dx
CONTINUITY AND DIFFERENTIABILITY 191

du 2sin x cos x 2 cos x


Thus = cos x
= − cos x
dv − sin x e e

Miscellaneous Exercise on Chapter 5


Differentiate w.r.t. x the function in Exercises 1 to 11.
1. (3x2 – 9x + 5)9 2. sin3 x + cos 6 x
3. (5x) 3 cos 2 x 4. sin–1(x x ), 0 ≤ x ≤ 1
x
cos −1
5. 2 ,–2<x<2
2x + 7
 1 + sin x + 1 − sin x  π
6. cot −1   , 0 < x<
 1 + sin x − 1 − sin x  2
7. (log x) x, x > 1
log

8. cos (a cos x + b sin x), for some constant a and b.


π 3π
(sin x – cos x)
9. (sin x – cos x) < x< ,
4 4
10. xx + xa + a x + a a, for some fixed a > 0 and x > 0
2 2
−3 x
11. xx + ( x − 3) , for x > 3
dy π π
12. Find , if y = 12 (1 – cos t), x = 10 (t – sin t), − < t <
dx 2 2
dy
13. Find , if y = sin–1 x + sin–1 1 − x 2 , 0 < x < 1
dx
14. If x 1 + y + y 1 + x = 0 , for , – 1 < x < 1, prove that
dy 1
=−
dx (1+ x )2
15. If (x – a) 2 + (y – b) 2 = c2 , for some c > 0, prove that
3
  dy  2  2
1 +   
  dx  
d2y
dx2
is a constant independent of a and b.
192 MATHEMATICS

dy cos 2 ( a + y)
16. If cos y = x cos (a + y), with cos a ≠ ± 1, prove that = .
dx sin a

d2y
17. If x = a (cos t + t sin t) and y = a (sin t – t cos t), find .
dx 2
18. If f (x) = | x |3, show that f ″(x) exists for all real x and find it.
d ( n)
19. Using mathematical induction prove that x = nxn −1 for all positive
dx
integers n.
20. Using the fact that sin (A + B) = sin A cos B + cos A sin B and the differentiation,
obtain the sum formula for cosines.
21. Does there exist a function which is continuous everywhere but not differentiable
at exactly two points? Justify your answer.

f ( x) g ( x) h ( x) f ′( x) g ′(x ) h′(x)
dy
22. If y = l m n , prove that = l m n
dx
a b c a b c

2
23. If y = ea c os− 1 x , – 1 ≤ x ≤ 1, show that (1 − x2 ) d y − x dy − a 2 y = 0 .
dx 2 dx

Summary
® A real valued function is continuous at a point in its domain if the limit of the
function at that point equals the value of the function at that point. A function
is continuous if it is continuous on the whole of its domain.
® Sum, difference, product and quotient of continuous functions are continuous.
i.e., if f and g are continuous functions, then
(f ± g) (x) = f (x) ± g(x) is continuous.
(f . g) (x) = f (x) . g(x) is continuous.

f  f (x )
g  ( x) = g ( x) (wherever g (x) ≠ 0) is continuous.
 
® Every differentiable function is continuous, but the converse is not true.
CONTINUITY AND DIFFERENTIABILITY 193

® Chain rule is rule to differentiate composites of functions. If f = v o u, t = u (x)


dt dv
and if both and exist then
dx dt
df dv dt
= ⋅
dx dt dx
® Following are some of the standard derivatives (in appropriate domains):
d ( −1 ) 1 d ( −1 ) 1
sin x = cos x = −
dx 1 − x2 dx 1 − x2
d ( −1 ) 1 d ( −1 ) −1
tan x = cot x =
dx 1 + x2 dx 1 + x2
d ( −1 ) 1 d ( −1
sec x = cosec−1 x ) =
dx x 1 − x2 dx x 1 − x2

d ( x) x d 1
e =e ( log x ) =
dx dx x
® Logarithmic differentiation is a powerful technique to differentiate functions
of the form f (x) = [u (x)]v (x). Here both f(x) and u (x) need to be positive for
this technique to make sense.
® Rolle’s Theorem: If f : [a, b] → R is continuous on [a, b] and differentiable
on (a, b) such that f (a) = f (b), then there exists some c in (a, b) such that
f ′(c) = 0.
® Mean Value Theorem: If f : [a, b] → R is continuous on [a, b] and
differentiable on (a, b). Then there exists some c in (a, b) such that

f (b ) − f (a )
f ′(c ) =
b− a

—v —
194 MATHEMATICS

Chapter 6
APPLICATION OF
DERIVATIVES
v With the Calculus as a key, Mathematics can be successfully applied
to the explanation of the course of Nature.” — WHITEHEAD v

6.1 Introduction
In Chapter 5, we have learnt how to find derivative of composite functions, inverse
trigonometric functions, implicit functions, exponential functions and logarithmic functions.
In this chapter, we will study applications of the derivative in various disciplines, e.g., in
engineering, science, social science, and many other fields. For instance, we will learn
how the derivative can be used (i) to determine rate of change of quantities, (ii) to find
the equations of tangent and normal to a curve at a point, (iii) to find turning points on
the graph of a function which in turn will help us to locate points at which largest or
smallest value (locally) of a function occurs. We will also use derivative to find intervals
on which a function is increasing or decreasing. Finally, we use the derivative to find
approximate value of certain quantities.
6.2 Rate of Change of Quantities
ds
Recall that by the derivative , we mean the rate of change of distance s with
dt
respect to the time t. In a similar fashion, whenever one quantity y varies with another
dy
quantity x, satisfying some rule y = f ( x) , then (or f′(x)) represents the rate of
dx
dy 
change of y with respect to x and dx  (or f′(x0)) represents the rate of change
 x =x0
of y with respect to x at x = x0 .
Further, if two variables x and y are varying with respect to another variable t, i.e.,
if x = f ( t ) and y = g ( t ) , then by Chain Rule
dy dy dx dx
= , if ≠0
dx dt dt dt

2015-16 (11-11-2014)
APPLICATION OF DERIVATIVES 195

Thus, the rate of change of y with respect to x can be calculated using the rate of
change of y and that of x both with respect to t.
Let us consider some examples.
Example 1 Find the rate of change of the area of a circle per second with respect to
its radius r when r = 5 cm.
Solution The area A of a circle with radius r is given by A = πr2. Therefore, the rate
dA d
of change of the area A with respect to its radius r is given by = ( π r 2 ) = 2π r .
dr dr
dA
When r = 5 cm, = 10π . Thus, the area of the circle is changing at the rate of
dr
10π cm2/s.
Example 2 The volume of a cube is increasing at a rate of 9 cubic centimetres per
second. How fast is the surface area increasing when the length of an edge is 10
centimetres ?
Solution Let x be the length of a side, V be the volume and S be the surface area of
the cube. Then, V = x3 and S = 6x2, where x is a function of time t.
dV
Now = 9cm3/s (Given)
dt
dV d 3 d dx
Therefore 9= = ( x ) = ( x3 ) ⋅ (By Chain Rule)
dt dt dx dt
dx
= 3x ⋅
2
dt
dx 3
or = 2 ... (1)
dt x
dS d d dx
Now = (6x2 ) = (6x 2) ⋅ (By Chain Rule)
dt dt dx dt
 3  36
= 12x ⋅  2 = (Using (1))
x  x

dS
Hence, when x = 10 cm, = 3.6 cm 2 /s
dt

2015-16 (11-11-2014)
196 MATHEMATICS

Example 3 A stone is dropped into a quiet lake and waves move in circles at a speed
of 4cm per second. At the instant, when the radius of the circular wave is 10 cm, how
fast is the enclosed area increasing?
Solution The area A of a circle with radius r is given by A = πr2. Therefore, the rate
of change of area A with respect to time t is
dA d d dr dr
= (π r 2 ) = (π r 2) ⋅ = 2π r (By Chain Rule)
dt dt dr dt dt
dr
It is given that = 4cm/s
dt
dA
Therefore, when r = 10 cm, = 2π (10) (4) = 80π
dt
Thus, the enclosed area is increasing at the rate of 80π cm2/s, when r = 10 cm.

dy
ANote dx
is positive if y increases as x increases and is negative if y decreases

as x increases.

Example 4 The length x of a rectangle is decreasing at the rate of 3 cm/minute and


the width y is increasing at the rate of 2cm/minute. When x =10cm and y = 6cm, find
the rates of change of (a) the perimeter and (b) the area of the rectangle.
Solution Since the length x is decreasing and the width y is increasing with respect to
time, we have
dx dy
= −3 cm/min and = 2 cm/min
dt dt
(a) The perimeter P of a rectangle is given by
P = 2 (x + y)
dP  dx dy 
Therefore = 2  +  = 2 ( −3 + 2) = −2 cm/min
dt  dt dt 
(b) The area A of the rectangle is given by
A=x.y
dA dx dy
Therefore = ⋅y+ x⋅
dt dt dt
= – 3(6) + 10(2) (as x = 10 cm and y = 6 cm)
= 2 cm2/min

2015-16 (11-11-2014)
APPLICATION OF DERIVATIVES 197

Example 5 The total cost C(x) in Rupees, associated with the production of x units of
an item is given by
C (x) = 0.005 x3 – 0.02 x2 + 30x + 5000
Find the marginal cost when 3 units are produced, where by marginal cost we
mean the instantaneous rate of change of total cost at any level of output.
Solution Since marginal cost is the rate of change of total cost with respect to the
output, we have
dC
Marginal cost (MC) = = 0.005(3 x 2 ) − 0.02(2 x) + 30
dx
x = 3, MC = 0.015(3 ) − 0.04(3) + 30
2
When
= 0.135 – 0.12 + 30 = 30.015
Hence, the required marginal cost is Rs 30.02 (nearly).
Example 6 The total revenue in Rupees received from the sale of x units of a product
is given by R(x) = 3x2 + 36x + 5. Find the marginal revenue, when x = 5, where by
marginal revenue we mean the rate of change of total revenue with respect to the
number of items sold at an instant.
Solution Since marginal revenue is the rate of change of total revenue with respect to
the number of units sold, we have
dR
Marginal Revenue (MR) = = 6x + 36
dx
When x = 5, MR = 6(5) + 36 = 66
Hence, the required marginal revenue is Rs 66.

EXERCISE 6.1
1. Find the rate of change of the area of a circle with respect to its radius r when
(a) r = 3 cm (b) r = 4 cm
2. The volume of a cube is increasing at the rate of 8 cm3/s. How fast is the
surface area increasing when the length of an edge is 12 cm?
3. The radius of a circle is increasing uniformly at the rate of 3 cm/s. Find the rate
at which the area of the circle is increasing when the radius is 10 cm.
4. An edge of a variable cube is increasing at the rate of 3 cm/s. How fast is the
volume of the cube increasing when the edge is 10 cm long?
5. A stone is dropped into a quiet lake and waves move in circles at the speed of
5 cm/s. At the instant when the radius of the circular wave is 8 cm, how fast is
the enclosed area increasing?

2015-16 (11-11-2014)
198 MATHEMATICS

6. The radius of a circle is increasing at the rate of 0.7 cm/s. What is the rate of
increase of its circumference?
7. The length x of a rectangle is decreasing at the rate of 5 cm/minute and the
width y is increasing at the rate of 4 cm/minute. When x = 8cm and y = 6cm, find
the rates of change of (a) the perimeter, and (b) the area of the rectangle.
8. A balloon, which always remains spherical on inflation, is being inflated by pumping
in 900 cubic centimetres of gas per second. Find the rate at which the radius of
the balloon increases when the radius is 15 cm.
9. A balloon, which always remains spherical has a variable radius. Find the rate at
which its volume is increasing with the radius when the later is 10 cm.
10. A ladder 5 m long is leaning against a wall. The bottom of the ladder is pulled
along the ground, away from the wall, at the rate of 2cm/s. How fast is its height
on the wall decreasing when the foot of the ladder is 4 m away from the wall ?
11. A particle moves along the curve 6y = x3 +2. Find the points on the curve at
which the y-coordinate is changing 8 times as fast as the x-coordinate.
1
12. The radius of an air bubble is increasing at the rate of cm/s. At what rate is the
2
volume of the bubble increasing when the radius is 1 cm?
3
13. A balloon, which always remains spherical, has a variable diameter (2 x + 1) .
2
Find the rate of change of its volume with respect to x.
14. Sand is pouring from a pipe at the rate of 12 cm3 /s. The falling sand forms a cone
on the ground in such a way that the height of the cone is always one-sixth of the
radius of the base. How fast is the height of the sand cone increasing when the
height is 4 cm?
15. The total cost C (x) in Rupees associated with the production of x units of an
item is given by
C (x) = 0.007x3 – 0.003x2 + 15x + 4000.
Find the marginal cost when 17 units are produced.
16. The total revenue in Rupees received from the sale of x units of a product is
given by
R (x) = 13x2 + 26x + 15.
Find the marginal revenue when x = 7.
Choose the correct answer in the Exercises 17 and 18.
17. The rate of change of the area of a circle with respect to its radius r at r = 6 cm is
(A) 10π (B) 12π (C) 8π (D) 11π

2015-16 (11-11-2014)
APPLICATION OF DERIVATIVES 199

18. The total revenue in Rupees received from the sale of x units of a product is
given by
R(x) = 3x2 + 36x + 5. The marginal revenue, when x = 15 is
(A) 116 (B) 96 (C) 90 (D) 126
6.3 Increasing and Decreasing Functions
In this section, we will use differentiation to find out whether a function is increasing or
decreasing or none.
Consider the function f given by f (x) = x2, x ∈ R. The graph of this function is a
parabola as given in Fig 6.1.
Values left to origin Values right to origin
x f (x) = x2 x f (x) = x2

–2 4 0 0
3 9 1 1

2 4 2 4
–1 1 1 1
1 1 3 9

2 4 2 4
0 0 2 4
as we move from left to right, the as we move from left to right, the
height of the graph decreases height of the graph increases
Fig 6.1
First consider the graph (Fig 6.1) to the right of the origin. Observe that as we
move from left to right along the graph, the height of the graph continuously increases.
For this reason, the function is said to be increasing for the real numbers x > 0.
Now consider the graph to the left of the origin and observe here that as we move
from left to right along the graph, the height of the graph continuously decreases.
Consequently, the function is said to be decreasing for the real numbers x < 0.
We shall now give the following analytical definitions for a function which is
increasing or decreasing on an interval.
Definition 1 Let I be an interval contained in the domain of a real valued function f.
Then f is said to be
(i) increasing on I if x1 < x2 in I ⇒ f(x1) ≤ f (x2 ) for all x1 , x2 ∈ I.
(ii) strictly increasing on I if x1 < x2 in I ⇒ f (x1) < f (x2) for all x1, x2 ∈ I.

2015-16 (11-11-2014)
200 MATHEMATICS

(iii) decreasing on I if x1 < x2 in I ⇒ f (x1) ≥ f (x2) for all x1, x2 ∈ I.


(iv) strictly decreasing on I if x1 < x2 in I ⇒ f (x1) > f (x2) for all x1, x2 ∈ I.
For graphical representation of such functions see Fig 6.2.

Fig 6.2
We shall now define when a function is increasing or decreasing at a point.
Definition 2 Let x0 be a point in the domain of definition of a real valued function f.
Then f is said to be increasing, strictly increasing, decreasing or strictly decreasing at
x0 if there exists an open interval I containing x0 such that f is increasing, strictly
increasing, decreasing or strictly decreasing, respectively, in I.
Let us clarify this definition for the case of increasing function.
A function f is said to be increasing at x0 if there exists an interval I = (x0 – h, x0 + h),
h > 0 such that for x1, x2 ∈ I
x1 < x2 in I ⇒ f (x1 ) ≤ f (x2)
Similarly, the other cases can be clarified.
Example 7 Show that the function given by f(x) = 7x – 3 is strictly increasing on R.
Solution Let x1 and x2 be any two numbers in R. Then
x1 < x2 ⇒ 7x1 < 7x2 ⇒ 7x1 – 3 < 7x2 – 3 ⇒ f(x1) < f (x2)

2015-16 (11-11-2014)
APPLICATION OF DERIVATIVES 201

Thus, by Definition 1, it follows that f is strictly increasing on R.


We shall now give the first derivative test for increasing and decreasing functions.
The proof of this test requires the Mean Value Theorem studied in Chapter 5.
Theorem 1 Let f be continuous on [a, b] and differentiable on the open interval
(a,b). Then
(a) f is strictly increasing in [a,b] if f′(x) > 0 for each x ∈ (a, b)
(b) f is strictly decreasing in [a,b] if f ′(x) < 0 for each x ∈ (a, b)
(c) f is a constant function in [a,b] if f ′(x) = 0 for each x ∈ (a, b)

Proof (a) Let x1, x2 ∈ [a, b] be such that x1 < x2.


Then, by Mean Value Theorem (Theorem 8 in Chapter 5), there exists a point c
between x1 and x2 such that
f (x2) – f(x1) = f ′(c) (x2 – x1)
i.e. f (x2) – f(x1) > 0 (as f ′(c) > 0 (given))
i.e. f (x2 ) > f (x 1)
Thus, we have
x1 < x2 ⇒ f ( x1 ) < f ( x2 ), for all x1 , x2 ∈[ a , b]
Hence, f is an increasing function in [a,b].
The proofs of part (b) and (c) are similar. It is left as an exercise to the reader.

Remarks
(i) There is a more generalised theorem, which states that if f ′(x) > 0 for x in an
interval excluding the end points and f is continuous in the interval, then f is
strictly increasing. Similarly, if f ′(x) < 0 for x in an interval excluding the end
points and f is continuous in the interval, then f is strictly decreasing.
(ii) If a function is strictly increasing or strictly decreasing in an interval I, then it is
necessarily increasing or decreasing in I. However, the converse need not
be true.
Example 8 Show that the function f given by
f (x) = x3 – 3x2 + 4x, x ∈ R
is strictly increasing on R.
Solution Note that
f ′(x) = 3x2 – 6x + 4

2015-16 (11-11-2014)
202 MATHEMATICS

= 3(x2 – 2x + 1) + 1
= 3(x – 1)2 + 1 > 0, in every interval of R
Therefore, the function f is strictly increasing on R.

Example 9 Prove that the function given by f (x) = cos x is


(a) strictly decreasing in (0, π)
(b) strictly increasing in (π, 2π), and
(c) neither increasing nor decreasing in (0, 2π).
Solution Note that f ′(x) = – sin x
(a) Since for each x ∈ (0, π), sin x > 0, we have f ′(x) < 0 and so f is strictly
decreasing in (0, π).
(b) Since for each x ∈ (π, 2π), sin x < 0, we have f ′(x) > 0 and so f is strictly
increasing in (π, 2π).
(c) Clearly by (a) and (b) above, f is neither increasing nor decreasing in (0, 2π).

Example 10 Find the intervals in which the function f given by f (x) = x2 – 4x + 6 is


(a) strictly increasing (b) strictly decreasing
Solution We have
f (x) = x2 – 4x + 6
or f ′(x) = 2x – 4
Therefore, f ′(x) = 0 gives x = 2. Now
the point x = 2 divides the real line into two
disjoint intervals namely, (– ∞, 2) and (2, Fig 6.3
∞) (Fig 6.3). In the interval (– ∞, 2), f ′(x)
= 2x – 4 < 0.
Therefore, f is strictly decreasing in this interval. Also, in the interval (2, ∞) ,
f ′( x ) > 0 and so the function f is strictly increasing in this interval.

Example 11 Find the intervals in which the function f given by f (x) = 4x3 – 6x2 – 72x + 30
is (a) strictly increasing (b) strictly decreasing.
Solution We have
f(x) = 4x3 – 6x2 – 72x + 30

2015-16 (11-11-2014)
APPLICATION OF DERIVATIVES 203

or f ′(x) = 12x2 – 12x – 72


= 12(x2 – x – 6)
= 12(x – 3) (x + 2)
Therefore, f ′(x) = 0 gives x = – 2, 3. The
points x = – 2 and x = 3 divides the real line into
three disjoint intervals, namely, (– ∞, – 2), (– 2, 3) Fig 6.4
and (3, ∞).
In the intervals (– ∞, – 2) and (3, ∞), f ′(x) is positive while in the interval (– 2, 3),
f ′(x) is negative. Consequently, the function f is strictly increasing in the intervals
(– ∞, – 2) and (3, ∞) while the function is strictly decreasing in the interval (– 2, 3).
However, f is neither increasing nor decreasing in R.

Interval Sign of f ′(x) Nature of function f

(– ∞, – 2) (–) (–) > 0 f is strictly increasing

(– 2, 3) (–) (+) < 0 f is strictly decreasing

(3, ∞) (+) (+) > 0 f is strictly increasing

 π
Example 12 Find intervals in which the function given by f (x) = sin 3x, x ∈ 0,  is
 2
(a) increasing (b) decreasing.
Solution We have
f (x) = sin 3x
or f ′(x) = 3cos 3x
π 3π  π
Therefore, f ′(x) = 0 gives cos 3x = 0 which in turn gives 3x = , (as x ∈ 0, 
2 2  2
 3π  π π π  π
implies 3x ∈  0,  ). So x = and . The point x = divides the interval  0, 2 
 2 6 2 6  
 π π π
into two disjoint intervals  0,  and  ,  .
 6  6 2
Fig 6.5
 π π π
Now, f ′( x ) > 0 for all x ∈  0,  as 0 ≤ x < ⇒ 0 ≤ 3 x < and f ′( x ) < 0 for
 6 6 2
 π π π π π 3π
all x ∈ ,  as < x < ⇒ < 3 x < .
6 2 6 2 2 2

2015-16 (11-11-2014)
204 MATHEMATICS

 π π π
Therefore, f is strictly increasing in  0,  and strictly decreasing in  ,  .
 6  6 2 

π
Also, the given function is continuous at x = 0 and x = . Therefore, by Theorem 1,
6

 π  π π
f is increasing on  0,  and decreasing on  ,  .
 6 6 2

Example 13 Find the intervals in which the function f given by


f (x) = sin x + cos x, 0 ≤ x ≤ 2π
is strictly increasing or strictly decreasing.

Solution We have
f(x) = sin x + cos x,
or f ′(x) = cos x – sin x
π 5π
Now f ′( x ) = 0 gives sin x = cos x which gives that x = , as 0 ≤ x ≤ 2π
4 4
π 5π
The points x = and x = divide the interval [0, 2π] into three disjoint intervals,
4 4

 π   π 5π   5π 
namely,  0,  ,  ,  and  , 2π .
 4 4 4  4 
Fig 6.6
 π   5π 
Note that f ′( x) > 0 if x ∈ 0,  ∪  , 2 π
 4   4 

 π  5π 
or f is strictly increasing in the intervals  0,  and  , 2 π
 4  4 

 π 5π 
Also f ′( x ) < 0 if x ∈ , 
4 4

 π 5π 
or f is strictly decreasing in  , 
4 4

2015-16 (11-11-2014)
APPLICATION OF DERIVATIVES 205

Interval Sign of f ′ ( x ) Nature of function

 π
0, 4  >0 f is strictly increasing
 
 π 5π 
 ,  <0 f is strictly decreasing
4 4

 5π 
 , 2 π >0 f is strictly increasing
 4 

EXERCISE 6.2
1. Show that the function given by f (x) = 3x + 17 is strictly increasing on R.
2. Show that the function given by f (x) = e2x is strictly increasing on R.
3. Show that the function given by f (x) = sin x is

 π π 
(a) strictly increasing in  0,  (b) strictly decreasing in  , π 
 2 2 
(c) neither increasing nor decreasing in (0, π)
4. Find the intervals in which the function f given by f (x) = 2x2 – 3x is
(a) strictly increasing (b) strictly decreasing
5. Find the intervals in which the function f given by f (x) = 2x3 – 3x2 – 36x + 7 is
(a) strictly increasing (b) strictly decreasing
6. Find the intervals in which the following functions are strictly increasing or
decreasing:
(a) x2 + 2x – 5 (b) 10 – 6x – 2x2
(c) –2x3 – 9x2 – 12x + 1 (d) 6 – 9x – x2
(e) (x + 1)3 (x – 3)3
2x
7. Show that y = log(1+ x ) − , x > – 1, is an increasing function of x
2+ x
throughout its domain.
8. Find the values of x for which y = [x (x – 2)]2 is an increasing function.

4 sin θ  π
9. Prove that y = − θ is an increasing function of θ in  0,  .
(2 + cos θ)  2

2015-16 (11-11-2014)
206 MATHEMATICS

10. Prove that the logarithmic function is strictly increasing on (0, ∞).
11. Prove that the function f given by f (x) = x2 – x + 1 is neither strictly increasing
nor strictly decreasing on (– 1, 1).
 π
12. Which of the following functions are strictly decreasing on  0,  ?
 2
(A) cos x (B) cos 2x (C) cos 3x (D) tan x
13. On which of the following intervals is the function f given by f(x) = x100 + sin x –1
strictly decreasing ?
π   π
(A) (0,1) (B)  , π  (C)  0,  (D) None of these
2   2
14. Find the least value of a such that the function f given by f (x) = x2 + ax + 1 is
strictly increasing on [1, 2].
15. Let I be any interval disjoint from [–1, 1]. Prove that the function f given by
1
f (x) = x + is strictly increasing on I.
x
 π
16. Prove that the function f given by f (x) = log sin x is strictly increasing on  0, 
 2

π 
and strictly decreasing on  , π  .
2 
17. Prove that the function f given by f (x) = log |cos x| is strictly decreasing on
 π  3π 
 0,  and strictly increasing on  , 2π  .
 2  2 
18. Prove that the function given by f (x) = x – 3x2 + 3x – 100 is increasing in R.
3

19. The interval in which y = x2 e–x is increasing is


(A) (– ∞, ∞) (B) (– 2, 0) (C) (2, ∞) (D) (0, 2)
6.4 Tangents and Normals
In this section, we shall use differentiation to find the equation of the tangent line and
the normal line to a curve at a given point.
Recall that the equation of a straight line passing through a given point (x0, y0)
having finite slope m is given by
y – y0 = m (x – x0)

2015-16 (11-11-2014)
APPLICATION OF DERIVATIVES 207

Note that the slope of the tangent to the curve y = f (x)


dy 
at the point (x0, y0) is given by ( = f ′( x0 )) . So
dx  ( x0 , y0 )
the equation of the tangent at (x0, y0) to the curve y = f (x)
is given by
y – y0 = f ′(x0)(x – x0)
Also, since the normal is perpendicular to the tangent,
the slope of the normal to the curve y = f (x) at (x0, y0) is
−1 Fig 6.7
, if f ′( x0 ) ≠ 0 . Therefore, the equation of the
f ′ ( x0 )
normal to the curve y = f(x) at (x0, y0) is given by

−1
y – y0 = ( x − x0 )

f ( x0 )

i.e. ( y − y0 ) f ′ ( x0 ) + ( x − x0 ) = 0

ANote If a tangent line to the curve y = f (x) makes an angle θ with x-axis in the
dy
positive direction, then = slope of the tangent = tan θ .
dx
Particular cases
(i) If slope of the tangent line is zero, then tan θ = 0 and so θ = 0 which means the
tangent line is parallel to the x-axis. In this case, the equation of the tangent at
the point (x0 , y0) is given by y = y0.

π
(ii) If θ → , then tan θ → ∞, which means the tangent line is perpendicular to the
2
x-axis, i.e., parallel to the y-axis. In this case, the equation of the tangent at
(x0, y0 ) is given by x = x0 (Why?).

Example 14 Find the slope of the tangent to the curve y = x3 – x at x = 2.


Solution The slope of the tangent at x = 2 is given by

dy 
= 3x − 1 x= 2 = 11.
2
dx  x =2

2015-16 (11-11-2014)
208 MATHEMATICS

Example 15 Find the point at which the tangent to the curve y = 4 x − 3 − 1 has its
2
slope .
3
Solution Slope of tangent to the given curve at (x, y) is
−1
dy 1 2
= (4 x − 3) 2 4 =
dx 2 4x − 3
2
The slope is given to be .
3
2 2
So =
4x − 3 3
or 4x – 3 = 9
or x=3
Now y = 4 x − 3 − 1 . So when x = 3, y = 4(3) − 3 − 1 = 2 .
Therefore, the required point is (3, 2).
Example 16 Find the equation of all lines having slope 2 and being tangent to the curve
2
y+ =0.
x−3
Solution Slope of the tangent to the given curve at any point (x,y) is given by
dy 2
=
dx ( x − 3) 2
But the slope is given to be 2. Therefore
2
=2
(x − 3)2
or (x – 3)2 = 1
or x – 3 =±1
or x = 2, 4
Now x = 2 gives y = 2 and x = 4 gives y = – 2. Thus, there are two tangents to the
given curve with slope 2 and passing through the points (2, 2) and (4, – 2). The equation
of tangent through (2, 2) is given by
y – 2 = 2(x – 2)
or y – 2x + 2 = 0
and the equation of the tangent through (4, – 2) is given by
y – (– 2) = 2(x – 4)
or y – 2x + 10 = 0

2015-16 (11-11-2014)
APPLICATION OF DERIVATIVES 209

x 2 y2
Example 17 Find points on the curve + = 1 at which the tangents are (i) parallel
4 25
to x-axis (ii) parallel to y-axis.
x 2 y2
Solution Differentiating + = 1 with respect to x, we get
4 25
x 2 y dy
+ =0
2 25 dx

dy −25 x
or =
dx 4 y
(i) Now, the tangent is parallel to the x-axis if the slope of the tangent is zero which
−25 x x 2 y2
gives = 0 . This is possible if x = 0. Then + = 1 for x = 0 gives
4 y 4 25
y2 = 25, i.e., y = ± 5.
Thus, the points at which the tangents are parallel to the x-axis are (0, 5) and
(0, – 5).
(ii) The tangent line is parallel to y-axis if the slope of the normal is 0 which gives

4y x 2 y2
= 0 , i.e., y = 0. Therefore, + = 1 for y = 0 gives x = ± 2. Hence, the
25 x 4 25
points at which the tangents are parallel to the y-axis are (2, 0) and (–2, 0).

x−7
Example 18 Find the equation of the tangent to the curve y = at the
( x − 2)( x − 3)
point where it cuts the x-axis.

Solution Note that on x-axis, y = 0. So the equation of the curve, when y = 0, gives
x = 7. Thus, the curve cuts the x-axis at (7, 0). Now differentiating the equation of the
curve with respect to x, we obtain

dy 1 − y(2 x − 5)
= (Why?)
dx ( x − 2)( x − 3)

dy  1− 0 1
or  = =
dx  (7,0) (5)(4) 20

2015-16 (11-11-2014)
210 MATHEMATICS

1
Therefore, the slope of the tangent at (7, 0) is . Hence, the equation of the
20
tangent at (7, 0) is
1
y−0= ( x − 7) or 20 y − x + 7 = 0
20
2 2
Example 19 Find the equations of the tangent and normal to the curve x 3 + y 3 = 2
at (1, 1).
2 2
Solution Differentiating x 3 + y 3 = 2 with respect to x, we get
−1 −1
2 3 2 3 dy
x + y =0
3 3 dx
1
dy  y 3
or = − 
dx x
dy 
Therefore, the slope of the tangent at (1, 1) is = −1 .
dx (1,1)
So the equation of the tangent at (1, 1) is
y – 1 = – 1 (x – 1) or y+x–2=0
Also, the slope of the normal at (1, 1) is given by
−1
=1
slope of the tangent at (1,1)
Therefore, the equation of the normal at (1, 1) is
y – 1 = 1 (x – 1) or y–x=0
Example 20 Find the equation of tangent to the curve given by
x = a sin3 t , y = b cos3 t ... (1)
π
at a point where t = .
2
Solution Differentiating (1) with respect to t, we get
dx dy
= 3a sin 2 t cos t and = −3b cos 2 t sin t
dt dt

2015-16 (11-11-2014)
APPLICATION OF DERIVATIVES 211

dy
dy dt −3bcos2 t sin t −b cos t
or = = =
dx dx 3asin2 t cos t a sin t
dt

π
Therefore, slope of the tangent at t = is
2
π
dy  −b cos
2=0
dx  t= π = π
2 a sin
2

π
Also, when t = , x = a and y = 0. Hence, the equation of tangent to the given
2
π
curve at t = , i.e., at (a, 0) is
2
y – 0 = 0 (x – a), i.e., y = 0.

EXERCISE 6.3
1. Find the slope of the tangent to the curve y = 3x4 – 4x at x = 4.
x −1
2. Find the slope of the tangent to the curve y = , x ≠ 2 at x = 10.
x−2
3. Find the slope of the tangent to curve y = x3 – x + 1 at the point whose
x-coordinate is 2.
4. Find the slope of the tangent to the curve y = x3 –3x + 2 at the point whose
x-coordinate is 3.
π
5. Find the slope of the normal to the curve x = a cos3 θ, y = a sin 3 θ at θ = .
4
π
6. Find the slope of the normal to the curve x = 1 − a sin θ, y = b cos 2 θ at θ = .
2
7. Find points at which the tangent to the curve y = x3 – 3x2 – 9x + 7 is parallel to
the x-axis.
8. Find a point on the curve y = (x – 2)2 at which the tangent is parallel to the chord
joining the points (2, 0) and (4, 4).

2015-16 (11-11-2014)
212 MATHEMATICS

9. Find the point on the curve y = x3 – 11x + 5 at which the tangent is y = x – 11.
10. Find the equation of all lines having slope – 1 that are tangents to the curve
1
y= , x ≠ 1.
x −1
11. Find the equation of all lines having slope 2 which are tangents to the curve
1
y= , x ≠ 3.
x−3
12. Find the equations of all lines having slope 0 which are tangent to the curve
1
y= .
x − 2x + 3
2

x 2 y2
13. Find points on the curve + = 1 at which the tangents are
9 16
(i) parallel to x-axis (ii) parallel to y-axis.
14. Find the equations of the tangent and normal to the given curves at the indicated
points:
(i) y = x4 – 6x3 + 13x2 – 10x + 5 at (0, 5)
(ii) y = x4 – 6x3 + 13x2 – 10x + 5 at (1, 3)
(iii) y = x3 at (1, 1)
(iv) y = x2 at (0, 0)
π
(v) x = cos t, y = sin t at t =
4
15. Find the equation of the tangent line to the curve y = x2 – 2x +7 which is
(a) parallel to the line 2x – y + 9 = 0
(b) perpendicular to the line 5y – 15x = 13.
16. Show that the tangents to the curve y = 7x3 + 11 at the points where x = 2 and
x = – 2 are parallel.
17. Find the points on the curve y = x3 at which the slope of the tangent is equal to
the y-coordinate of the point.
18. For the curve y = 4x3 – 2x5, find all the points at which the tangent passes
through the origin.
19. Find the points on the curve x2 + y2 – 2x – 3 = 0 at which the tangents are parallel
to the x-axis.
20. Find the equation of the normal at the point (am2 ,am 3) for the curve ay2 = x3.

2015-16 (11-11-2014)
APPLICATION OF DERIVATIVES 213

21. Find the equation of the normals to the curve y = x3 + 2x + 6 which are parallel
to the line x + 14y + 4 = 0.
22. Find the equations of the tangent and normal to the parabola y2 = 4ax at the point
(at2, 2at).
23. Prove that the curves x = y2 and xy = k cut at right angles* if 8k2 = 1.

x2 y2
24. Find the equations of the tangent and normal to the hyperbola − = 1 at the
a2 b2
point (x0, y0).
25. Find the equation of the tangent to the curve y = 3x − 2 which is parallel to the
line 4 x − 2 y + 5 = 0 .
Choose the correct answer in Exercises 26 and 27.
26. The slope of the normal to the curve y = 2x2 + 3 sin x at x = 0 is
1 1
(A) 3 (B) (C) –3 (D) −
3 3
2
27. The line y = x + 1 is a tangent to the curve y = 4x at the point
(A) (1, 2) (B) (2, 1) (C) (1, – 2) (D) (– 1, 2)
6.5 Approximations
In this section, we will use differentials to approximate values of certain quantities.
Let f : D → R, D ⊂ R, be a given function
and let y = f (x ). Let ∆x denote a small
increment in x. Recall that the increment in y
corresponding to the increment in x, denoted
by ∆y, is given by ∆y = f (x + ∆x) – f (x). We
define the following
(i) The differential of x, denoted by dx, is
defined by dx = ∆x.
(ii) The differential of y, denoted by dy,
is defined by dy = f′( x) dx or
Fig 6.8
 dy 
dy =   ∆x.
 dx 
* Two curves intersect at right angle if the tangents to the curves at the point of intersection
are perpendicular to each other.

2015-16 (11-11-2014)
214 MATHEMATICS

In case dx = ∆x is relatively small when compared with x, dy is a good approximation


of ∆y and we denote it by dy ≈ ∆y.
For geometrical meaning of ∆x, ∆y, dx and dy, one may refer to Fig 6.8.

A Note In view of the above discussion and Fig 6.8, we may note that the
differential of the dependent variable is not equal to the increment of the variable
where as the differential of independent variable is equal to the increment of the
variable.

Example 21 Use differential to approximate 36.6 .


Solution Take y = x . Let x = 36 and let ∆x = 0.6. Then
∆y = x + ∆x − x = 36.6 − 36 = 36.6 − 6
or 36.6 = 6 + ∆y
Now dy is approximately equal to ∆y and is given by
 dy  1 1
dy =   ∆x = (0.6) = (0.6) = 0.05 (as y = x )
 dx  2 x 2 36

Thus, the approximate value of 36.6 is 6 + 0.05 = 6.05.


1

Example 22 Use differential to approximate (25) 3 .


1
Solution Let y = x3 . Let x = 27 and let ∆x = – 2. Then
1 1 1 1 1
∆y = ( x + ∆x) 3 − x3 = (25) 3 − (27) 3 = (25) 3 − 3
1
or (25) = 3 + ∆y
3

Now dy is approximately equal to ∆y and is given by


1
 dy  1
dy =   ∆x = 2
( −2) (as y = x 3 )
 
dx
3x 3
1 −2
= 1
( −2) = = − 0.074
3 2
27
3 ((27) )
1
Thus, the approximate value of (25) 3 is given by
3 + (– 0. 074) = 2.926

2015-16 (11-11-2014)
APPLICATION OF DERIVATIVES 215

Example 23 Find the approximate value of f (3.02), where f (x) = 3x2 + 5x + 3.


Solution Let x = 3 and ∆x = 0.02. Then
f (3. 02) = f (x + ∆x) = 3 (x + ∆x) 2 + 5(x + ∆x) + 3
Note that ∆y = f (x + ∆x) – f (x). Therefore
f (x + ∆x) = f (x) + ∆y
≈ f (x) + f ′(x) ∆x (as dx = ∆x)
or f (3.02) ≈ (3x + 5x + 3) + (6x + 5) ∆x
2

= (3(3)2 + 5(3) + 3) + (6(3) + 5) (0.02) (as x = 3, ∆x = 0.02)


= (27 + 15 + 3) + (18 + 5) (0.02)
= 45 + 0.46 = 45.46
Hence, approximate value of f (3.02) is 45.46.
Example 24 Find the approximate change in the volume V of a cube of side x meters
caused by increasing the side by 2%.
Solution Note that
V = x3

 dV 
or dV =   ∆x = (3x2) ∆x
 dx 
= (3x2) (0.02x) = 0.06x3 m3 (as 2% of x is 0.02x)
Thus, the approximate change in volume is 0.06 x3 m3.
Example 25 If the radius of a sphere is measured as 9 cm with an error of 0.03 cm,
then find the approximate error in calculating its volume.
Solution Let r be the radius of the sphere and ∆r be the error in measuring the radius.
Then r = 9 cm and ∆r = 0.03 cm. Now, the volume V of the sphere is given by
4 3
V= πr
3
dV
or = 4πr 2
dr
 dV 
 ∆r = (4πr ) ∆r
2
Therefore dV = 
 dr 
= 4π(9)2 (0.03) = 9.72πcm 3
Thus, the approximate error in calculating the volume is 9.72π cm 3.

2015-16 (11-11-2014)
216 MATHEMATICS

EXERCISE 6.4
1. Using differentials, find the approximate value of each of the following up to 3
places of decimal.
(i) 25.3 (ii) 49.5 (iii) 0.6

1 1 1
(iv) (0.009) 3 (v) (0.999)10 (vi) (15) 4
1 1 1
(vii) (26) 3 (viii) (ix)
(255) 4 (82) 4
1 1 1
(x) (401) 2 (xi) (0.0037) 2 (xii) (26.57) 3

1 3 1
(xiii) (81.5) 4 (xiv) (3.968) 2 (xv) (32.15) 5
2. Find the approximate value of f (2.01), where f (x) = 4x2 + 5x + 2.
3. Find the approximate value of f (5.001), where f (x) = x3 – 7x2 + 15.
4. Find the approximate change in the volume V of a cube of side x metres caused
by increasing the side by 1%.
5. Find the approximate change in the surface area of a cube of side x metres
caused by decreasing the side by 1%.
6. If the radius of a sphere is measured as 7 m with an error of 0.02 m, then find the
approximate error in calculating its volume.
7. If the radius of a sphere is measured as 9 m with an error of 0.03 m, then find the
approximate error in calculating its surface area.
8. If f(x) = 3x2 + 15x + 5, then the approximate value of f (3.02) is
(A) 47.66 (B) 57.66 (C) 67.66 (D) 77.66
9. The approximate change in the volume of a cube of side x metres caused by
increasing the side by 3% is
(A) 0.06 x3 m 3 (B) 0.6 x3 m3 (C) 0.09 x3 m3 (D) 0.9 x3 m3

6.6 Maxima and Minima


In this section, we will use the concept of derivatives to calculate the maximum or
minimum values of various functions. In fact, we will find the ‘turning points’ of the
graph of a function and thus find points at which the graph reaches its highest (or

2015-16 (11-11-2014)
APPLICATION OF DERIVATIVES 217

lowest) locally. The knowledge of such points is very useful in sketching the graph of
a given function. Further, we will also find the absolute maximum and absolute minimum
of a function that are necessary for the solution of many applied problems.
Let us consider the following problems that arise in day to day life.
(i) The profit from a grove of orange trees is given by P(x) = ax + bx2 , where a,b
are constants and x is the number of orange trees per acre. How many trees per
acre will maximise the profit?
(ii) A ball, thrown into the air from a building 60 metres high, travels along a path
x2
given by h ( x) = 60 + x − , where x is the horizontal distance from the building
60
and h(x) is the height of the ball . What is the maximum height the ball will
reach?
(iii) An Apache helicopter of enemy is flying along the path given by the curve
f (x) = x2 + 7. A soldier, placed at the point (1, 2), wants to shoot the helicopter
when it is nearest to him. What is the nearest distance?
In each of the above problem, there is something common, i.e., we wish to find out
the maximum or minimum values of the given functions. In order to tackle such problems,
we first formally define maximum or minimum values of a function, points of local
maxima and minima and test for determining such points.
Definition 3 Let f be a function defined on an interval I. Then
(a) f is said to have a maximum value in I, if there exists a point c in I such that
f (c ) > f ( x) , for all x ∈ I.
The number f (c) is called the maximum value of f in I and the point c is called a
point of maximum value of f in I.
(b) f is said to have a minimum value in I, if there exists a point c in I such that
f (c) < f (x), for all x ∈ I.
The number f (c), in this case, is called the minimum value of f in I and the point
c, in this case, is called a point of minimum value of f in I.
(c) f is said to have an extreme value in I if there exists a point c in I such that
f (c) is either a maximum value or a minimum value of f in I.
The number f (c), in this case, is called an extreme value of f in I and the point c
is called an extreme point.
Remark In Fig 6.9(a), (b) and (c), we have exhibited that graphs of certain particular
functions help us to find maximum value and minimum value at a point. Infact, through
graphs, we can even find maximum/minimum value of a function at a point at which it
is not even differentiable (Example 27).

2015-16 (11-11-2014)
218 MATHEMATICS

Fig 6.9
Example 26 Find the maximum and the minimum values,
if any, of the function f given by
f (x) = x2, x ∈ R.
Solution From the graph of the given function (Fig 6.10),
we have f(x) = 0 if x = 0. Also
f (x) ≥ 0, for all x ∈ R.
Therefore, the minimum value of f is 0 and the point
of minimum value of f is x = 0. Further, it may be observed
from the graph of the function that f has no maximum
value and hence no point of maximum value of f in R.
Fig 6.10
A Note If we restrict the domain of f to [– 2, 1] only,
2
then f will have maximum value(– 2) = 4 at x = – 2.

Example 27 Find the maximum and minimum values


of f , if any, of the function given by f(x) = | x |, x ∈ R.

Solution From the graph of the given function


(Fig 6.11) , note that
f (x) ≥ 0, for all x ∈ R and f (x) = 0 if x = 0.
Therefore, the function f has a minimum value 0
and the point of minimum value of f is x = 0. Also, the
graph clearly shows that f has no maximum value in
R and hence no point of maximum value in R. Fig 6.11

A Note
(i) If we restrict the domain of f to [– 2, 1] only, then f will have maximum value
|– 2| = 2.

2015-16 (11-11-2014)
APPLICATION OF DERIVATIVES 219

(ii) One may note that the function f in Example 27 is not differentiable at
x = 0.
Example 28 Find the maximum and the minimum values, if any, of the function
given by
f (x) = x, x ∈ (0, 1).
Solution The given function is an increasing (strictly) function in the given interval
(0, 1). From the graph (Fig 6.12) of the function f , it
seems that, it should have the minimum value at a
point closest to 0 on its right and the maximum value
at a point closest to 1 on its left. Are such points
available? Of course, not. It is not possible to locate
such points. Infact, if a point x0 is closest to 0, then
x0
we find < x0 for all x0 ∈(0,1) . Also, if x1 is
2
x1 + 1
closest to 1, then > x1 for all x1 ∈(0,1) .
2 Fig 6.12
Therefore, the given function has neither the maximum value nor the minimum
value in the interval (0,1).
Remark The reader may observe that in Example 28, if we include the points 0 and 1
in the domain of f , i.e., if we extend the domain of f to [0,1], then the function f has
minimum value 0 at x = 0 and maximum value 1 at x = 1. Infact, we have the following
results (The proof of these results are beyond the scope of the present text)
Every monotonic function assumes its maximum/minimum value at the end
points of the domain of definition of the function.
A more general result is
Every continuous function on a closed interval has a maximum and a minimum
value.

A Note By a monotonic function f


increasing in I or decreasing in I.
in an interval I, we mean that f is either

Maximum and minimum values of a function defined on a closed interval will be


discussed later in this section.
Let us now examine the graph of a function as shown in Fig 6.13. Observe that at
points A, B, C and D on the graph, the function changes its nature from decreasing to
increasing or vice-versa. These points may be called turning points of the given
function. Further, observe that at turning points, the graph has either a little hill or a little
valley. Roughly speaking, the function has minimum value in some neighbourhood
(interval) of each of the points A and C which are at the bottom of their respective

2015-16 (11-11-2014)
220 MATHEMATICS

Fig 6.13
valleys. Similarly, the function has maximum value in some neighbourhood of points B
and D which are at the top of their respective hills. For this reason, the points A and C
may be regarded as points of local minimum value (or relative minimum value) and
points B and D may be regarded as points of local maximum value (or relative maximum
value) for the function. The local maximum value and local minimum value of the
function are referred to as local maxima and local minima, respectively, of the function.
We now formally give the following definition
Definition 4 Let f be a real valued function and let c be an interior point in the domain
of f. Then
(a) c is called a point of local maxima if there is an h > 0 such that
f (c) > f (x), for all x in (c – h, c + h)
The value f (c) is called the local maximum value of f.
(b) c is called a point of local minima if there is an h > 0 such that
f (c) < f (x), for all x in (c – h, c + h)
The value f (c) is called the local minimum value of f .
Geometrically, the above definition states that if x = c is a point of local maxima of f,
then the graph of f around c will be as shown in Fig 6.14(a). Note that the function f is
increasing (i.e., f ′(x) > 0) in the interval (c – h, c) and decreasing (i.e., f ′(x) < 0) in the
interval (c, c + h).
This suggests that f′(c) must be zero.

Fig 6.14

2015-16 (11-11-2014)
APPLICATION OF DERIVATIVES 221

Similarly, if c is a point of local minima of f , then the graph of f around c will be as


shown in Fig 6.14(b). Here f is decreasing (i.e., f ′(x) < 0) in the interval (c – h, c) and
increasing (i.e., f ′(x) > 0) in the interval (c, c + h). This again suggest that f ′(c) must
be zero.
The above discussion lead us to the following theorem (without proof).
Theorem 2 Let f be a function defined on an open interval I. Suppose c ∈ I be any
point. If f has a local maxima or a local minima at x = c, then either f ′(c) = 0 or f is not
differentiable at c.

Remark The converse of above theorem need


not be true, that is, a point at which the derivative
vanishes need not be a point of local maxima or
local minima. For example, if f (x) = x3, then f ′(x)
= 3x2 and so f ′(0) = 0. But 0 is neither a point of
local maxima nor a point of local minima (Fig 6.15).

ANote A point c in the domain of a function


f at which either f ′(c) = 0 or f is not differentiable
is called a critical point of f. Note that if f is
continuous at c and f ′(c) = 0, then there exists
an h > 0 such that f is differentiable in the interval Fig 6.15
(c – h, c + h).
We shall now give a working rule for finding points of local maxima or points of
local minima using only the first order derivatives.
Theorem 3 (First Derivative Test) Let f be a function defined on an open interval I.
Let f be continuous at a critical point c in I. Then
(i) If f ′(x) changes sign from positive to negative as x increases through c, i.e., if
f ′(x) > 0 at every point sufficiently close to and to the left of c, and f ′(x) < 0 at
every point sufficiently close to and to the right of c, then c is a point of local
maxima.
(ii) If f ′(x) changes sign from negative to positive as x increases through c, i.e., if
f ′(x) < 0 at every point sufficiently close to and to the left of c, and f ′(x) > 0 at
every point sufficiently close to and to the right of c, then c is a point of local
minima.
(iii) If f ′(x) does not change sign as x increases through c, then c is neither a point of
local maxima nor a point of local minima. Infact, such a point is called point of
inflection (Fig 6.15).

2015-16 (11-11-2014)
222 MATHEMATICS

ANote If c is a point of local maxima of f , then f(c) is a local maximum value of


f. Similarly, if c is a point of local minima of f , then f(c) is a local minimum value of f.
Figures 6.15 and 6.16, geometrically explain Theorem 3.

Fig 6.16
Example 29 Find all points of local maxima and local minima of the function f
given by
f (x) = x3 – 3x + 3.
Solution We have
f (x) = x3 – 3x + 3
or f ′(x) = 3x2 – 3 = 3 (x – 1) (x + 1)
or f ′(x) = 0 at x = 1 and x = – 1
Thus, x = ± 1 are the only critical points which could possibly be the points of local
maxima and/or local minima of f . Let us first examine the point x = 1.
Note that for values close to 1 and to the right of 1, f ′(x) > 0 and for values close
to 1 and to the left of 1, f ′(x) < 0. Therefore, by first derivative test, x = 1 is a point
of local minima and local minimum value is f (1) = 1. In the case of x = –1, note that
f ′(x) > 0, for values close to and to the left of –1 and f ′(x) < 0, for values close to and
to the right of – 1. Therefore, by first derivative test, x = – 1 is a point of local maxima
and local maximum value is f (–1) = 5.
Values of x Sign of f ′ (x) = 3(x – 1) (x + 1)
to the right (say 1.1 etc.) >0
Close to 1 to the left (say 0.9 etc.) <0

to the right (say − 0.9 etc.) <0


Close to –1 to the left (say − 1.1 etc.) >0

2015-16 (11-11-2014)
APPLICATION OF DERIVATIVES 223

Example 30 Find all the points of local maxima and local minima of the function f
given by
f (x) = 2x3 – 6x2 + 6x +5.
Solution We have
f (x) = 2x3 – 6x2 + 6x + 5
or f ′(x) = 6x2 – 12x + 6 = 6 (x – 1) 2
or f ′(x) = 0 at x = 1
Thus, x = 1 is the only critical point of f . We shall now examine this point for local
maxima and/or local minima of f. Observe that f ′(x) ≥ 0, for all x ∈ R and in particular
f ′(x) > 0, for values close to 1 and to the left and to the right of 1. Therefore, by first
derivative test, the point x = 1 is neither a point of local maxima nor a point of local
minima. Hence x = 1 is a point of inflexion.
Remark One may note that since f ′(x), in Example 30, never changes its sign on R,
graph of f has no turning points and hence no point of local maxima or local minima.
We shall now give another test to examine local maxima and local minima of a
given function. This test is often easier to apply than the first derivative test.
Theorem 4 (Second Derivative Test) Let f be a function defined on an interval I
and c ∈ I. Let f be twice differentiable at c. Then
(i) x = c is a point of local maxima if f ′(c) = 0 and f ″(c) < 0
The value f (c) is local maximum value of f .
(ii) x = c is a point of local minima if f ′( c) = 0 and f ″(c) > 0
In this case, f (c) is local minimum value of f .
(iii) The test fails if f ′(c) = 0 and f ″(c) = 0.
In this case, we go back to the first derivative test and find whether c is a point of
local maxima, local minima or a point of inflexion.

A Note As f is twice differentiable at c, we mean


second order derivative of f exists at c.

Example 31 Find local minimum value of the function f


given by f (x) = 3 + | x |, x ∈ R.
Solution Note that the given function is not differentiable
at x = 0. So, second derivative test fails. Let us try first
derivative test. Note that 0 is a critical point of f . Now
Fig 6.17
to the left of 0, f (x) = 3 – x and so f ′(x) = – 1 < 0. Also

2015-16 (11-11-2014)
224 MATHEMATICS

to the right of 0, f (x) = 3 + x and so f ′(x) = 1 > 0. Therefore, by first derivative test,
x = 0 is a point of local minima of f and local minimum value of f is f (0) = 3.

Example 32 Find local maximum and local minimum values of the function f given by
f (x) = 3x4 + 4x3 – 12x2 + 12
Solution We have
f (x) = 3x4 + 4x3 – 12x2 + 12
or f ′(x) = 12x3 + 12x2 – 24x = 12x (x – 1) (x + 2)
or f ′(x) = 0 at x = 0, x = 1 and x = – 2.
Now f ″(x) = 36x2 + 24x – 24 = 12 (3x2 + 2x – 1)

 f ′′(0) = −12 < 0



or  f ′′(1) = 48 > 0
 f ′′(−2) = 84 > 0

Therefore, by second derivative test, x = 0 is a point of local maxima and local
maximum value of f at x = 0 is f (0) = 12 while x = 1 and x = – 2 are the points of local
minima and local minimum values of f at x = – 1 and – 2 are f (1) = 7 and f (–2) = –20,
respectively.
Example 33 Find all the points of local maxima and local minima of the function f
given by
f(x) = 2x3 – 6x2 + 6x +5.
Solution We have
f(x) = 2x3 – 6x2 + 6x +5

 f ′( x) = 6 x − 12 x + 6 = 6( x − 1)
2 2

or 
 f ′′( x) = 12( x − 1)
Now f ′(x) = 0 gives x =1. Also f ″(1) = 0. Therefore, the second derivative test
fails in this case. So, we shall go back to the first derivative test.
We have already seen (Example 30) that, using first derivative test, x =1 is neither
a point of local maxima nor a point of local minima and so it is a point of inflexion.
Example 34 Find two positive numbers whose sum is 15 and the sum of whose
squares is minimum.
Solution Let one of the numbers be x. Then the other number is (15 – x). Let S(x)
denote the sum of the squares of these numbers. Then

2015-16 (11-11-2014)
APPLICATION OF DERIVATIVES 225

S(x) = x2 + (15 – x)2 = 2x2 – 30x + 225


S′ ( x) = 4 x − 30
or 
S′ ( x) = 4

15  15 
Now S′(x) = 0 gives x = . Also S′   = 4 > 0 . Therefore, by second derivative
2  2
15
test, x = is the point of local minima of S. Hence the sum of squares of numbers is
2
15 15 15
minimum when the numbers are and 15 − = .
2 2 2
Remark Proceeding as in Example 34 one may prove that the two positive numbers,
k k
whose sum is k and the sum of whose squares is minimum, are and .
2 2
Example 35 Find the shortest distance of the point (0, c) from the parabola y = x2,
where 0 ≤ c ≤ 5.
Solution Let (h, k) be any point on the parabola y = x2. Let D be the required distance
between (h, k) and (0, c). Then

D = ( h − 0) 2 + ( k − c) 2 = h 2 + ( k − c)2 ... (1)


Since (h, k) lies on the parabola y = x2, we have k = h 2. So (1) gives

D ≡ D(k) = k + ( k − c) 2

1 + 2(k − c)
or D′(k) =
2 k + (k − c)2

2c − 1
Now D′(k) = 0 gives k =
2

2c − 1
Observe that when k < , then 2( k − c ) + 1 < 0 , i.e., D′( k ) < 0 . Also when
2
2c − 1 2c − 1
k> , then D′( k ) > 0 . So, by first derivative test, D (k) is minimum at k = .
2 2
Hence, the required shortest distance is given by

2015-16 (11-11-2014)
226 MATHEMATICS

2
 2c − 1  2 c − 1  2c − 1  4c − 1
D = + − c =
 2  2  2  2

ANote The reader may note that in Example 35, we have used first derivative
test instead of the second derivative test as the former is easy and short.

Example 36 Let AP and BQ be two vertical poles at


points A and B, respectively. If AP = 16 m, BQ = 22 m
and AB = 20 m, then find the distance of a point R on
AB from the point A such that RP2 + RQ 2 is minimum.

Solution Let R be a point on AB such that AR = x m.


Then RB = (20 – x) m (as AB = 20 m). From Fig 6.18,
we have
RP2 = AR2 + AP2
Fig 6.18
and RQ 2 = RB2 + BQ2
Therefore RP 2 + RQ2 = AR2 + AP2 + RB2 + BQ2
= x2 + (16)2 + (20 – x) 2 + (22)2
= 2x2 – 40x + 1140
Let S ≡ S(x) = RP2 + RQ2 = 2x2 – 40x + 1140.
Therefore S′(x) = 4x – 40.
Now S′(x) = 0 gives x = 10. Also S″(x) = 4 > 0, for all x and so S″(10) > 0.
Therefore, by second derivative test, x = 10 is the point of local minima of S. Thus, the
distance of R from A on AB is AR = x =10 m.
Example 37 If length of three sides of a trapezium other than base are equal to 10cm,
then find the area of the trapezium when it is maximum.
Solution The required trapezium is as given in Fig 6.19. Draw perpendiculars DP and

Fig 6.19

2015-16 (11-11-2014)
APPLICATION OF DERIVATIVES 227

CQ on AB. Let AP = x cm. Note that ∆APD ~ ∆BQC. Therefore, QB = x cm. Also, by
Pythagoras theorem, DP = QC = 100 − x 2 . Let A be the area of the trapezium. Then
1
A ≡ A(x) = (sum of parallel sides) (height)
2

(2 x + 10 + 10) ( 100 − x2 )
1
=
2
= ( x + 10) ( 100 − x2 )
+ ( 100 − x2 )
( −2 x)
or A′(x) = ( x + 10)
2 100 − x 2

−2 x2 − 10 x + 100
=
100 − x2
Now A′(x) = 0 gives 2x2 + 10x – 100 = 0, i.e., x = 5 and x = –10.
Since x represents distance, it can not be negative.
So, x = 5. Now
( −2 x )
100 − x2 ( −4 x −10) − ( −2 x 2 − 10 x + 100)
2 100 − x2
A″(x) =
100 − x 2

2 x3 − 300 x − 1000
= 3 (on simplification)
(100 − x2 ) 2

2(5)3 − 300(5) − 1000 −2250 −30


or A″(5) = 3
= = <0
75 75 75
(100 − (5)2 ) 2
Thus, area of trapezium is maximum at x = 5 and the area is given by
A(5) = (5 + 10) 100 − (5)2 = 15 75 = 75 3 cm 2

Example 38 Prove that the radius of the right circular cylinder of greatest curved
surface area which can be inscribed in a given cone is half of that of the cone.
Solution Let OC = r be the radius of the cone and OA = h be its height. Let a cylinder
with radius OE = x inscribed in the given cone (Fig 6.20). The height QE of the cylinder
is given by

2015-16 (11-11-2014)
228 MATHEMATICS

QE EC
= (since ∆QEC ~ ∆AOC)
OA OC
QE r −x
or =
h r
h(r − x)
or QE =
r
Let S be the curved surface area of the given
cylinder. Then
2πx h ( r − x) 2πh
S ≡ S(x) = = ( rx − x2 )
r r Fig 6.20

 ′ 2πh
S ( x) = r ( r − 2 x)
or 
S′ ( x) = − 4 πh
 r

r r r
Now S′(x) = 0 gives x = . Since S″(x) < 0 for all x, S′   < 0 . So x = is a
2  2 2
point of maxima of S. Hence, the radius of the cylinder of greatest curved surface area
which can be inscribed in a given cone is half of that of the cone.
6.6.1 Maximum and Minimum Values of a Function in a Closed Interval
Let us consider a function f given by
f (x) = x + 2, x ∈ (0, 1)
Observe that the function is continuous on (0, 1) and neither has a maximum value
nor has a minimum value. Further, we may note that the function even has neither a
local maximum value nor a local minimum value.
However, if we extend the domain of f to the closed interval [0, 1], then f still may
not have a local maximum (minimum) values but it certainly does have maximum value
3 = f (1) and minimum value 2 = f (0). The maximum value 3 of f at x = 1 is called
absolute maximum value (global maximum or greatest value) of f on the interval
[0, 1]. Similarly, the minimum value 2 of f at x = 0 is called the absolute minimum
value (global minimum or least value) of f on [0, 1].
Consider the graph given in Fig 6.21 of a continuous function defined on a closed
interval [a, d]. Observe that the function f has a local minima at x = b and local

2015-16 (11-11-2014)
APPLICATION OF DERIVATIVES 229

Fig 6.21
minimum value is f (b). The function also has a local maxima at x = c and local maximum
value is f (c).
Also from the graph, it is evident that f has absolute maximum value f (a) and
absolute minimum value f (d). Further note that the absolute maximum (minimum)
value of f is different from local maximum (minimum) value of f .
We will now state two results (without proof) regarding absolute maximum and
absolute minimum values of a function on a closed interval I.
Theorem 5 Let f be a continuous function on an interval I = [a, b]. Then f has the
absolute maximum value and f attains it at least once in I. Also, f has the absolute
minimum value and attains it at least once in I.
Theorem 6 Let f be a differentiable function on a closed interval I and let c be any
interior point of I. Then
(i) f ′(c) = 0 if f attains its absolute maximum value at c.
(ii) f ′(c) = 0 if f attains its absolute minimum value at c.
In view of the above results, we have the following working rule for finding absolute
maximum and/or absolute minimum values of a function in a given closed interval
[a, b].
Working Rule
Step 1: Find all critical points of f in the interval, i.e., find points x where either
f ′( x ) = 0 or f is not differentiable.
Step 2: Take the end points of the interval.
Step 3: At all these points (listed in Step 1 and 2), calculate the values of f .
Step 4: Identify the maximum and minimum values of f out of the values calculated in
Step 3. This maximum value will be the absolute maximum (greatest) value of
f and the minimum value will be the absolute minimum (least) value of f .

2015-16 (11-11-2014)
230 MATHEMATICS

Example 39 Find the absolute maximum and minimum values of a function f given by
f (x) = 2x3 – 15x2 + 36x +1 on the interval [1, 5].
Solution We have
f (x) = 2x3 – 15x2 + 36x + 1
or f ′(x) = 6x2 – 30x + 36 = 6 (x – 3) (x – 2)
Note that f ′(x) = 0 gives x = 2 and x = 3.
We shall now evaluate the value of f at these points and at the end points of the
interval [1, 5], i.e., at x = 1, x = 2, x = 3 and at x = 5. So
f (1) = 2 (13) – 15(12 ) + 36 (1) + 1 = 24
f (2) = 2 (23) – 15(22 ) + 36 (2) + 1 = 29
f (3) = 2 (33) – 15(32 ) + 36 (3) + 1 = 28
f (5) = 2 (53) – 15(52 ) + 36 (5) + 1 = 56
Thus, we conclude that absolute maximum value of f on [1, 5] is 56, occurring at
x =5, and absolute minimum value of f on [1, 5] is 24 which occurs at x = 1.
Example 40 Find absolute maximum and minimum values of a function f given by
4 1
f ( x) = 12 x 3 − 6 x 3 , x ∈[ −1, 1]
Solution We have
4 1
f (x) = 12 x 3 − 6 x 3
1
2 2(8 x − 1)
or f ′(x) = 16 x 3 − 2
= 2
x3 x3
1
Thus, f ′(x) = 0 gives x = . Further note that f ′(x) is not defined at x = 0. So the
8
1
critical points are x = 0 and x = . Now evaluating the value of f at critical points
8
1
x = 0, and at end points of the interval x = –1 and x = 1, we have
8
4 1
f (–1) = 12 ( −1) 3 − 6 ( −1) 3 = 18
f (0) = 12 (0) – 6(0) = 0

2015-16 (11-11-2014)
APPLICATION OF DERIVATIVES 231

4 1
 1
f   = 12  1  − 6  1  = − 9
3 3

 
8 8 8 4
4 1
f (1) = 12 (1) − 6 (1) = 6
3 3

Hence, we conclude that absolute maximum value of f is 18 that occurs at x = – 1


−9 1
and absolute minimum value of f is that occurs at x = .
4 8
Example 41 An Apache helicopter of enemy is flying along the curve given by
y = x2 + 7. A soldier, placed at (3, 7), wants to shoot down the helicopter when it is
nearest to him. Find the nearest distance.
Solution For each value of x, the helicopter’s position is at point (x, x 2 + 7).
Therefore, the distance between the helicopter and the soldier placed at (3,7) is

( x − 3) 2 + ( x2 + 7 − 7) 2 , i.e., ( x − 3) 2 + x4 .
Let f (x) = (x – 3)2 + x4
or f ′(x) = 2(x – 3) + 4x3 = 2(x – 1) (2x2 + 2x + 3)
Thus, f ′(x) = 0 gives x = 1 or 2x2 + 2x + 3 = 0 for which there are no real roots.
Also, there are no end points of the interval to be added to the set for which f ′ is zero,
i.e., there is only one point, namely, x = 1. The value of f at this point is given by
f (1) = (1 – 3)2 + (1)4 = 5. Thus, the distance between the solider and the helicopter is
f (1) = 5 .
Note that 5 is either a maximum value or a minimum value. Since

f (0) = (0 − 3)2 + (0)4 = 3 > 5 ,

it follows that 5 is the minimum value of f ( x) . Hence, 5 is the minimum


distance between the soldier and the helicopter.

EXERCISE 6.5
1. Find the maximum and minimum values, if any, of the following functions
given by
(i) f (x) = (2x – 1) 2 + 3 (ii) f (x) = 9x2 + 12x + 2
(iii) f (x) = – (x – 1)2 + 10 (iv) g (x) = x3 + 1

2015-16 (11-11-2014)
232 MATHEMATICS

2. Find the maximum and minimum values, if any, of the following functions
given by
(i) f (x) = |x + 2 | – 1 (ii) g (x) = – | x + 1| + 3
(iii) h (x) = sin(2x) + 5 (iv) f (x) = | sin 4x + 3|
(v) h (x) = x + 1, x ∈ (– 1, 1)
3. Find the local maxima and local minima, if any, of the following functions. Find
also the local maximum and the local minimum values, as the case may be:
(i) f (x) = x2 (ii) g (x) = x3 – 3x
π
(iii) h (x) = sin x + cos x, 0 < x <
2
(iv) f (x) = sin x – cos x, 0 < x < 2π
x 2
(v) f (x) = x3 – 6x2 + 9x + 15 (vi) g (x) = + , x >0
2 x
1
(vii) g ( x) = (viii) f ( x) = x 1 − x , 0 < x < 1
x +2
2

4. Prove that the following functions do not have maxima or minima:


(i) f (x) = ex (ii) g (x) = log x
3 2
(iii) h (x) = x + x + x +1
5. Find the absolute maximum value and the absolute minimum value of the following
functions in the given intervals:
(i) f (x) = x3, x ∈ [– 2, 2] (ii) f (x) = sin x + cos x , x ∈ [0, π]

1  9
(iii) f (x) = 4 x − x2 , x ∈  −2,  (iv) f ( x) = ( x − 1) 2 + 3, x ∈[ −3,1]
2  2
6. Find the maximum profit that a company can make, if the profit function is
given by
p (x) = 41 – 72x – 18x2
7. Find both the maximum value and the minimum value of
3x4 – 8x3 + 12x2 – 48x + 25 on the interval [0, 3].
8. At what points in the interval [0, 2π], does the function sin 2x attain its maximum
value?
9. What is the maximum value of the function sin x + cos x?
10. Find the maximum value of 2x3 – 24x + 107 in the interval [1, 3]. Find the
maximum value of the same function in [–3, –1].

2015-16 (11-11-2014)
APPLICATION OF DERIVATIVES 233

11. It is given that at x = 1, the function x4 – 62x2 + ax + 9 attains its maximum value,
on the interval [0, 2]. Find the value of a.
12. Find the maximum and minimum values of x + sin 2x on [0, 2π].
13. Find two numbers whose sum is 24 and whose product is as large as possible.
14. Find two positive numbers x and y such that x + y = 60 and xy3 is maximum.
15. Find two positive numbers x and y such that their sum is 35 and the product x2 y5
is a maximum.
16. Find two positive numbers whose sum is 16 and the sum of whose cubes is
minimum.
17. A square piece of tin of side 18 cm is to be made into a box without top, by
cutting a square from each corner and folding up the flaps to form the box. What
should be the side of the square to be cut off so that the volume of the box is the
maximum possible.
18. A rectangular sheet of tin 45 cm by 24 cm is to be made into a box without top,
by cutting off square from each corner and folding up the flaps. What should be
the side of the square to be cut off so that the volume of the box is maximum ?
19. Show that of all the rectangles inscribed in a given fixed circle, the square has
the maximum area.
20. Show that the right circular cylinder of given surface and maximum volume is
such that its height is equal to the diameter of the base.
21. Of all the closed cylindrical cans (right circular), of a given volume of 100 cubic
centimetres, find the dimensions of the can which has the minimum surface
area?
22. A wire of length 28 m is to be cut into two pieces. One of the pieces is to be
made into a square and the other into a circle. What should be the length of the
two pieces so that the combined area of the square and the circle is minimum?
23. Prove that the volume of the largest cone that can be inscribed in a sphere of
8
radius R is of the volume of the sphere.
27
24. Show that the right circular cone of least curved surface and given volume has
an altitude equal to 2 time the radius of the base.
25. Show that the semi-vertical angle of the cone of the maximum volume and of
given slant height is tan −1 2 .
26. Show that semi-vertical angle of right circular cone of given surface area and
−1  1 
maximum volume is sin   .
 3

2015-16 (11-11-2014)
234 MATHEMATICS

Choose the correct answer in the Exercises 27 and 29.


27. The point on the curve x2 = 2y which is nearest to the point (0, 5) is
(A) (2 2, 4) (B) (2 2, 0) (C) (0, 0) (D) (2, 2)

1− x + x 2
28. For all real values of x, the minimum value of is
1+ x + x 2
1
(A) 0 (B) 1 (C) 3 (D)
3
1
29. The maximum value of [ x( x − 1) + 1]3 , 0 ≤ x ≤ 1 is
1
 1 3 1
(A)   (B) (C) 1 (D) 0
 3 2

Miscellaneous Examples

Example 42 A car starts from a point P at time t = 0 seconds and stops at point Q. The
distance x, in metres, covered by it, in t seconds is given by

 t
x = t2  2 − 
 3

Find the time taken by it to reach Q and also find distance between P and Q.

Solution Let v be the velocity of the car at t seconds.

 t
Now x = t2  2 − 
 3

dx
Therefore v= = 4t – t2 = t(4 – t)
dt
Thus, v = 0 gives t = 0 and/or t = 4.
Now v = 0 at P as well as at Q and at P, t = 0. So, at Q, t = 4. Thus, the car will
reach the point Q after 4 seconds. Also the distance travelled in 4 seconds is given by

2 4  2  32
x] t = 4 = 4  2 −  = 16   = m
 3 3  3

2015-16 (11-11-2014)
APPLICATION OF DERIVATIVES 235

Example 43 A water tank has the shape of an inverted right circular cone with its axis
vertical and vertex lowermost. Its semi-vertical angle is tan–1 (0.5). Water is poured
into it at a constant rate of 5 cubic metre per hour. Find the rate at which the level of
the water is rising at the instant when the depth of water in the tank is 4 m.
r
Solution Let r, h and α be as in Fig 6.22. Then tan α = .
h

−1  r 
So α = tan   .
h
But α = tan (0.5) (given)
–1

r
or = 0.5
h
h
or r=
2
Let V be the volume of the cone. Then
Fig 6.22
2
1 2 1  h πh 3
V = πr h = π   h =
3 3  2 12

dV d  πh 3  dh
Therefore =  ⋅ (by Chain Rule)
dt dh  12  dt

π 2 dh
= h
4 dt
dV
Now rate of change of volume, i.e., = 5 m3/h and h = 4 m.
dt
π 2 dh
Therefore 5= (4) ⋅
4 dt
dh 5 35  22 
or = = m/h  π = 
dt 4π 88  7 
35
Thus, the rate of change of water level is m/h .
88
Example 44 A man of height 2 metres walks at a uniform speed of 5 km/h away from
a lamp post which is 6 metres high. Find the rate at which the length of his shadow
increases.

2015-16 (11-11-2014)
236 MATHEMATICS

Solution In Fig 6.23, Let AB be the lamp-post, the


lamp being at the position B and let MN be the man
at a particular time t and let AM = l metres. Then,
MS is the shadow of the man. Let MS = s metres.

Note that ∆MSN ~ ∆ASB

MS MN
or =
AS AB
Fig 6.23
or AS = 3s (as MN = 2 and AB = 6 (given))
Thus AM = 3s – s = 2s. But AM = l
So l = 2s
dl ds
Therefore = 2
dt dt
dl 5
Since = 5 km/h. Hence, the length of the shadow increases at the rate km/h.
dt 2
Example 45 Find the equation of the normal to the curve x2 = 4y which passes through
the point (1, 2).
Solution Differentiating x2 = 4y with respect to x, we get
dy x
=
dx 2
Let (h, k) be the coordinates of the point of contact of the normal to the curve
x2 = 4y. Now, slope of the tangent at (h, k) is given by
dy  h
dx  ( h, k ) = 2

−2
Hence, slope of the normal at (h, k) =
h
Therefore, the equation of normal at (h, k) is
−2
y–k= ( x − h) ... (1)
h
Since it passes through the point (1, 2), we have
−2 2
2− k = (1 − h ) or k = 2 + (1 − h ) ... (2)
h h

2015-16 (11-11-2014)
APPLICATION OF DERIVATIVES 237

Since (h, k) lies on the curve x2 = 4y, we have


h 2 = 4k ... (3)
From (2) and (3), we have h = 2 and k = 1. Substituting the values of h and k in (1),
we get the required equation of normal as
−2
y −1 = ( x − 2) or x + y = 3
2
Example 46 Find the equation of tangents to the curve
y = cos (x + y), – 2π ≤ x ≤ 2π
that are parallel to the line x + 2y = 0.
Solution Differentiating y = cos(x + y) with respect to x, we have

dy − sin ( x + y )
=
dx 1 + sin ( x + y )

− sin ( x + y )
or slope of tangent at (x, y) =
1 + sin ( x + y )
Since the tangents to the given curve are parallel to the line x + 2y = 0, whose slope
−1
is , we have
2
− sin( x + y) −1
1 + sin( x + y) =
2
or sin (x + y) = 1
π,
or x + y = nπ + (– 1)n n∈Z
2
 π
Then y = cos(x + y) = cos  nπ + (−1)n  , n ∈ Z
 2
= 0, for all n ∈ Z
−3 π π
Also, since −2 π ≤ x ≤ 2 π , we get x = and x = . Thus, tangents to the
2 2

 −3 π  π 
given curve are parallel to the line x + 2y = 0 only at points  , 0  and  , 0  .
 2  2 
Therefore, the required equation of tangents are

2015-16 (11-11-2014)
238 MATHEMATICS

−1 3π 
y–0= x+  or 2 x + 4 y + 3π = 0
2 2 
−1  π
and y–0= x−  or 2 x + 4 y − π = 0
2  2
Example 47 Find intervals in which the function given by
3 4 4 3 36
f (x) = x − x − 3 x2 + x + 11
10 5 5
is (a) strictly increasing (b) strictly decreasing.
Solution We have
3 4 4 3 36
f (x) = x − x − 3x2 + x + 11
10 5 5
3 4 36
Therefore f ′(x) = (4 x3 ) − (3 x2 ) − 3(2 x) +
10 5 5
6
= ( x −1) ( x + 2) (x − 3) (on simplification)
5
Now f ′(x) = 0 gives x = 1, x = – 2, or x = 3. The
points x = 1, – 2, and 3 divide the real line into four
disjoint intervals namely, (– ∞, – 2), (– 2, 1), (1, 3) Fig 6.24
and (3, ∞) (Fig 6.24).
Consider the interval (– ∞, – 2), i.e., when – ∞ < x < – 2.
In this case, we have x – 1 < 0, x + 2 < 0 and x – 3 < 0.
(In particular, observe that for x = –3, f ′(x) = (x – 1) (x + 2) (x – 3) = (– 4) (– 1)
(– 6) < 0)
Therefore, f ′(x) < 0 when – ∞ < x < – 2.
Thus, the function f is strictly decreasing in (– ∞, – 2).
Consider the interval (– 2, 1), i.e., when – 2 < x < 1.
In this case, we have x – 1 < 0, x + 2 > 0 and x – 3 < 0
(In particular, observe that for x = 0, f ′(x) = (x – 1) (x + 2) (x – 3) = (–1) (2) (–3)
= 6 > 0)
So f ′(x) > 0 when – 2 < x < 1.
Thus, f is strictly increasing in (– 2, 1).

2015-16 (11-11-2014)
APPLICATION OF DERIVATIVES 239

Now consider the interval (1, 3), i.e., when 1 < x < 3. In this case, we have
x – 1 > 0, x + 2 > 0 and x – 3 < 0.
So, f ′(x) < 0 when 1 < x < 3.
Thus, f is strictly decreasing in (1, 3).
Finally, consider the interval (3, ∞), i.e., when x > 3. In this case, we have x – 1 > 0,
x + 2 > 0 and x – 3 > 0. So f ′(x) > 0 when x > 3.
Thus, f is strictly increasing in the interval (3, ∞).
Example 48 Show that the function f given by
f (x) = tan– 1(sin x + cos x), x > 0
π
is always an strictly increasing function in  0,  .
 4
Solution We have
f (x) = tan–1 (sin x + cos x), x > 0
1
Therefore f ′(x) = (cos x − sin x)
1+ (sin x + cos x)2
cos x − sin x
= (on simplification)
2 + sin 2 x

 π
Note that 2 + sin 2x > 0 for all x in  0,  .
4
Therefore f ′(x) > 0 if cos x – sin x > 0
or f ′(x) > 0 if cos x > sin x or cot x > 1
π
Now cot x > 1 if tan x < 1, i.e., if 0 < x <
4
π
Thus f ′(x) > 0 in  0, 
 4

π
Hence f is strictly increasing function in  0,  .
 4
Example 49 A circular disc of radius 3 cm is being heated. Due to expansion, its
radius increases at the rate of 0.05 cm/s. Find the rate at which its area is increasing
when radius is 3.2 cm.

2015-16 (11-11-2014)
240 MATHEMATICS

Solution Let r be the radius of the given disc and A be its area. Then
A = πr 2
dA dr
or = 2 πr (by Chain Rule)
dt dt
dr
Now approximate rate of increase of radius = dr = ∆t = 0.05 cm/s.
dt
Therefore, the approximate rate of increase in area is given by

dA  dr 
dA = ( ∆t ) = 2 πr  ∆t 
dt  dt 
= 2π (3.2) (0.05) = 0.320π cm2 /s (r = 3.2 cm)
Example 50 An open topped box is to be constructed by removing equal squares from
each corner of a 3 metre by 8 metre rectangular sheet of aluminium and folding up the
sides. Find the volume of the largest such box.
Solution Let x metre be the length of a side of the removed squares. Then, the height
of the box is x, length is 8 – 2x and breadth is 3 – 2x (Fig 6.25). If V(x) is the volume
of the box, then

Fig 6.25

V (x) = x (3 – 2x) (8 – 2x)


= 4x3 – 22x2 + 24x

 V′( x) = 12 x − 44 x + 24 = 4( x − 3)(3x − 2)
2
Therefore 
 V′ ( x) = 24 x − 44

2
Now V′(x) = 0 gives x = 3, . But x ≠ 3 (Why?)
3
2 2   2
Thus, we have x = . Now V′   = 24   − 44 = − 28 < 0 .
3 3   3

2015-16 (11-11-2014)
APPLICATION OF DERIVATIVES 241

2 2
Therefore, x = is the point of maxima, i.e., if we remove a square of side
3 3
metre from each corner of the sheet and make a box from the remaining sheet, then
the volume of the box such obtained will be the largest and it is given by
3 2
 2
V   = 4   − 22   + 24  
2 2 2
 3  3  3 3 
200 3
= m
27

 x 
Example 51 Manufacturer can sell x items at a price of rupees  5 −  each. The
 100 

x 
cost price of x items is Rs  + 500 . Find the number of items he should sell to earn
5 
maximum profit.
Solution Let S (x) be the selling price of x items and let C (x) be the cost price of x
items. Then, we have
 x  x2
S(x) =  5 −  x = 5 x −
 100  100
x
and C (x) = + 500
5
Thus, the profit function P(x) is given by
x2 x
P(x) = S( x) − C ( x ) = 5 x − − − 500
100 5
24 x2
i.e. P(x) = x− − 500
5 100
24 x
or P′(x) = −
5 50
−1 −1
Now P′(x) = 0 gives x = 240. Also P′′( x) = . So P′′(240) = <0
50 50
Thus, x = 240 is a point of maxima. Hence, the manufacturer can earn maximum
profit, if he sells 240 items.

2015-16 (11-11-2014)
242 MATHEMATICS

Miscellaneous Exercise on Chapter 6


1. Using differentials, find the approximate value of each of the following:
1
1
(a)  17  ( 33) − 5
4
(b)
 81 

log x
2. Show that the function given by f (x) = has maximum at x = e.
x
3. The two equal sides of an isosceles triangle with fixed base b are decreasing at
the rate of 3 cm per second. How fast is the area decreasing when the two equal
sides are equal to the base ?
4. Find the equation of the normal to curve x 2 = 4y which passes through the point
(1, 2).
5. Show that the normal at any point θ to the curve
x = a cosθ + a θ sin θ, y = a sinθ – aθ cosθ
is at a constant distance from the origin.
6. Find the intervals in which the function f given by
4 sin x − 2 x − x cos x
f ( x) =
2 + cos x
is (i) strictly increasing (ii) strictly decreasing.
1
7. Find the intervals in which the function f given by f ( x) = x3 + , x ≠ 0 is
x3

(i) increasing (ii) decreasing.


x2 y2
8. Find the maximum area of an isosceles triangle inscribed in the ellipse + =1
a2 b2
with its vertex at one end of the major axis.
9. A tank with rectangular base and rectangular sides, open at the top is to be
constructed so that its depth is 2 m and volume is 8 m3. If building of tank costs
Rs 70 per sq metres for the base and Rs 45 per square metre for sides. What is
the cost of least expensive tank?
10. The sum of the perimeter of a circle and square is k, where k is some constant.
Prove that the sum of their areas is least when the side of square is double the
radius of the circle.

2015-16 (11-11-2014)
APPLICATION OF DERIVATIVES 243

11. A window is in the form of a rectangle surmounted by a semicircular opening.


The total perimeter of the window is 10 m. Find the dimensions of the window to
admit maximum light through the whole opening.
12. A point on the hypotenuse of a triangle is at distance a and b from the sides of
the triangle.
2 2 3
Show that the maximum length of the hypotenuse is (a 3 + b 3 ) 2 .
13. Find the points at which the function f given by f (x) = (x – 2) 4 (x + 1)3 has
(i) local maxima (ii) local minima
(iii) point of inflexion
14. Find the absolute maximum and minimum values of the function f given by
f (x) = cos2 x + sin x, x ∈ [0, π]
15. Show that the altitude of the right circular cone of maximum volume that can be
4r
inscribed in a sphere of radius r is .
3
16. Let f be a function defined on [a, b] such that f ′(x) > 0, for all x ∈ (a, b). Then
prove that f is an increasing function on (a, b).
17. Show that the height of the cylinder of maximum volume that can be inscribed in
2R
a sphere of radius R is . Also find the maximum volume.
3
18. Show that height of the cylinder of greatest volume which can be inscribed in a
right circular cone of height h and semi vertical angle α is one-third that of the
4
cone and the greatest volume of cylinder is πh 3 tan 2 α .
27
Choose the correct answer in the Exercises from 19 to 24.
19. A cylindrical tank of radius 10 m is being filled with wheat at the rate of 314
cubic metre per hour. Then the depth of the wheat is increasing at the rate of
(A) 1 m/h (B) 0.1 m/h
(C) 1.1 m/h (D) 0.5 m/h
20. The slope of the tangent to the curve x = t2 + 3t – 8, y = 2t2 – 2t – 5 at the point
(2,– 1) is
22 6 7 −6
(A) (B) (C) (D)
7 7 6 7

2015-16 (11-11-2014)
244 MATHEMATICS

21. The line y = mx + 1 is a tangent to the curve y2 = 4x if the value of m is


1
(A) 1 (B) 2 (C) 3 (D)
2
22. The normal at the point (1,1) on the curve 2y + x2 = 3 is
(A) x + y = 0 (B) x – y = 0
(C) x + y +1 = 0 (D) x – y = 1
2
23. The normal to the curve x = 4y passing (1,2) is
(A) x + y = 3 (B) x – y = 3
(C) x + y = 1 (D) x – y = 1
24. The points on the curve 9y = x , where the normal to the curve makes equal
2 3

intercepts with the axes are


 8  −8
(A)  4, ±  (B)  4, 
 3 3

 3  3
(C)  4, ±  (D)  ± 4, 
 8  8

Summary

® If a quantity y varies with another quantity x, satisfying some rule y = f ( x) ,


dy
then (or f ′( x ) ) represents the rate of change of y with respect to x and
dx
dy 
dx  x =x0 (or f ′( x0 ) ) represents the rate of change of y with respect to x at

x = x0 .
® If two variables x and y are varying with respect to another variable t, i.e., if
x = f ( t ) and y = g ( t ) , then by Chain Rule
dy dy dx dx
= , if ≠0.
dx dt dt dt
® A function f is said to be
(a) increasing on an interval (a, b) if
x1 < x2 in (a, b) ⇒ f (x1) ≤ f(x2) for all x1, x2 ∈ (a, b).

2015-16 (11-11-2014)
APPLICATION OF DERIVATIVES 245

Alternatively, if f ′(x) ≥ 0 for each x in (a, b)


(b) decreasing on (a,b) if
x1 < x2 in (a, b) ⇒ f (x1 ) ≥ f (x2) for all x1, x2 ∈ (a, b).
Alternatively, if f ′(x) ≤ 0 for each x in (a, b)
® The equation of the tangent at (x0 , y0) to the curve y = f (x) is given by

dy 
y − y0 = (x − x0 )
dx  ( x0 ,y 0 )

dy
® If
dx
does not exist at the point ( x0 , y0 ) , then the tangent at this point is
parallel to the y-axis and its equation is x = x0.
dy 
® If tangent to a curve y = f (x) at x = x0 is parallel to x-axis, then =0.
dx  x =x0

® Equation of the normal to the curve y = f (x) at a point ( x0 , y0 ) is given by

−1
y − y0 = ( x − x0 )
dy 
dx ( x0 , y0 )

dy
® If
dx
at the point ( x0 , y0 ) is zero, then equation of the normal is x = x0.

dy
® If
dx
at the point ( x0 , y0 ) does not exist, then the normal is parallel to x-axis
and its equation is y = y0 .
® Let y = f(x), ∆x be a small increment in x and ∆y be the increment in y
corresponding to the increment in x, i.e., ∆y = f (x + ∆x) – f (x). Then dy
given by
 dy 
dy = f ′(x ) dx or dy =   ∆x .
 dx 
is a good approximation of ∆y when dx = ∆x is relatively small and we denote
it by dy ≈ ∆y.
® A point c in the domain of a function f at which either f ′(c) = 0 or f is not
differentiable is called a critical point of f.

2015-16 (11-11-2014)
246 MATHEMATICS

® First Derivative Test Let f be a function defined on an open interval I. Let


f be continuous at a critical point c in I. Then
(i) If f ′(x) changes sign from positive to negative as x increases through c,
i.e., if f ′(x) > 0 at every point sufficiently close to and to the left of c,
and f ′(x) < 0 at every point sufficiently close to and to the right of c,
then c is a point of local maxima.
(ii) If f ′(x) changes sign from negative to positive as x increases through c,
i.e., if f ′(x) < 0 at every point sufficiently close to and to the left of c,
and f ′(x) > 0 at every point sufficiently close to and to the right of c,
then c is a point of local minima.
(iii) If f ′(x) does not change sign as x increases through c, then c is neither
a point of local maxima nor a point of local minima. Infact, such a point
is called point of inflexion.
® Second Derivative Test Let f be a function defined on an interval I and
c ∈ I. Let f be twice differentiable at c. Then
(i) x = c is a point of local maxima if f ′(c) = 0 and f ″(c) < 0
The values f (c) is local maximum value of f .
(ii) x = c is a point of local minima if f ′(c) = 0 and f ″(c) > 0
In this case, f (c) is local minimum value of f .
(iii) The test fails if f ′(c) = 0 and f ″(c) = 0.
In this case, we go back to the first derivative test and find whether c is
a point of maxima, minima or a point of inflexion.
® Working rule for finding absolute maxima and/or absolute minima
Step 1: Find all critical points of f in the interval, i.e., find points x where
either f ′(x) = 0 or f is not differentiable.
Step 2:Take the end points of the interval.
Step 3: At all these points (listed in Step 1 and 2), calculate the values of f .
Step 4: Identify the maximum and minimum values of f out of the values
calculated in Step 3. This maximum value will be the absolute maximum
value of f and the minimum value will be the absolute minimum value of f .

—v —

2015-16 (11-11-2014)
INTEGRALS 287

Chapter 7
INTEGRALS

v Just as a mountaineer climbs a mountain – because it is there, so


a good mathematics student studies new material because
it is there. — JAMES B. BRISTOL v

7.1 Introduction
Differential Calculus is centred on the concept of the
derivative. The original motivation for the derivative was
the problem of defining tangent lines to the graphs of
functions and calculating the slope of such lines. Integral
Calculus is motivated by the problem of defining and
calculating the area of the region bounded by the graph of
the functions.
If a function f is differentiable in an interval I, i.e., its
derivative f ′ exists at each point of I, then a natural question
arises that given f ′ at each point of I, can we determine
the function? The functions that could possibly have given
function as a derivative are called anti derivatives (or G .W. Leibnitz
primitive) of the function. Further, the formula that gives (1646 -1716)

all these anti derivatives is called the indefinite integral of the function and such
process of finding anti derivatives is called integration. Such type of problems arise in
many practical situations. For instance, if we know the instantaneous velocity of an
object at any instant, then there arises a natural question, i.e., can we determine the
position of the object at any instant? There are several such practical and theoretical
situations where the process of integration is involved. The development of integral
calculus arises out of the efforts of solving the problems of the following types:
(a) the problem of finding a function whenever its derivative is given,
(b) the problem of finding the area bounded by the graph of a function under certain
conditions.
These two problems lead to the two forms of the integrals, e.g., indefinite and
definite integrals, which together constitute the Integral Calculus.
288 MATHEMATICS

There is a connection, known as the Fundamental Theorem of Calculus, between


indefinite integral and definite integral which makes the definite integral as a practical
tool for science and engineering. The definite integral is also used to solve many interesting
problems from various disciplines like economics, finance and probability.
In this Chapter, we shall confine ourselves to the study of indefinite and definite
integrals and their elementary properties including some techniques of integration.
7.2 Integration as an Inverse Process of Differentiation
Integration is the inverse process of differentiation. Instead of differentiating a function,
we are given the derivative of a function and asked to find its primitive, i.e., the original
function. Such a process is called integration or anti differentiation.
Let us consider the following examples:
d
We know that (sin x) = cos x ... (1)
dx
d x3
( ) = x2 ... (2)
dx 3
d x
and ( e ) = ex ... (3)
dx
We observe that in (1), the function cos x is the derived function of sin x. We say
x3
that sin x is an anti derivative (or an integral) of cos x. Similarly, in (2) and (3),and
3
ex are the anti derivatives (or integrals) of x2 and ex, respectively. Again, we note that
for any real number C, treated as constant function, its derivative is zero and hence, we
can write (1), (2) and (3) as follows :
d d x3 d x
(sin x + C) = cos x , ( + C) = x2 and (e + C) = ex
dx dx 3 dx
Thus, anti derivatives (or integrals) of the above cited functions are not unique.
Actually, there exist infinitely many anti derivatives of each of these functions which
can be obtained by choosing C arbitrarily from the set of real numbers. For this reason
C is customarily referred to as arbitrary constant. In fact, C is the parameter by
varying which one gets different anti derivatives (or integrals) of the given function.
d
More generally, if there is a function F such that F (x ) = f ( x) , ∀ x ∈ I (interval),
dx
then for any arbitrary real number C, (also called constant of integration)
d
[F (x) + C] = f (x), x ∈ I
dx
INTEGRALS 289

Thus, {F + C, C ∈ R} denotes a family of anti derivatives of f.


Remark Functions with same derivatives differ by a constant. To show this, let g and h
be two functions having the same derivatives on an interval I.
Consider the function f = g – h defined by f (x) = g(x) – h (x), ∀ x ∈ I

df
Then = f′ = g′ – h′ giving f′ (x) = g′ (x) – h′ (x) ∀ x ∈ I
dx
or f ′ (x) = 0, ∀ x ∈ I by hypothesis,
i.e., the rate of change of f with respect to x is zero on I and hence f is constant.
In view of the above remark, it is justified to infer that the family {F + C, C ∈ R}
provides all possible anti derivatives of f.
We introduce a new symbol, namely, ∫ f (x ) dx which will represent the entire
class of anti derivatives read as the indefinite integral of f with respect to x.
Symbolically, we write ∫ f (x ) dx = F (x) + C .
dy
Notation Given that
dx
= f (x ) , we write y = ∫ f (x) dx .
For the sake of convenience, we mention below the following symbols/terms/phrases
with their meanings as given in the Table (7.1).

Table 7.1

Symbols/Terms/Phrases Meaning

∫ f (x ) dx Integral of f with respect to x

f (x) in ∫ f (x) dx Integrand

x in ∫ f (x ) dx Variable of integration
Integrate Find the integral
An integral of f A function F such that
F′(x) = f (x)
Integration The process of finding the integral
Constant of Integration Any real number C, considered as
constant function
290 MATHEMATICS

We already know the formulae for the derivatives of many important functions.
From these formulae, we can write down immediately the corresponding formulae
(referred to as standard formulae) for the integrals of these functions, as listed below
which will be used to find integrals of other functions.
Derivatives Integrals (Anti derivatives)

d  xn +1  n x n +1
 = x ; ∫ x dx =
n
(i) + C , n ≠ –1
dx  n + 1  n +1
Particularly, we note that
d
dx
( x) =1 ; ∫ dx = x + C
d
(ii)
dx
(sin x) = cos x ; ∫ cos x dx = sin x + C
d
(iii)
dx
( – cos x ) = sin x ; ∫ sin x dx = – cos x + C
d
( tan x) = sec2 x ; ∫ sec
2
(iv) x dx = tan x + C
dx
d
( – cot x ) = cosec 2 x ; ∫ cosec
2
(v) x dx = – cot x + C
dx
d
(vi)
dx
(sec x ) = sec x tan x ; ∫ sec x tan x dx = sec x + C
d
(vii)
dx
( – cosec x) = cosec x cot x ; ∫ cosec x cot x dx = – cosec x + C
d 1 dx
(
–1
(viii) dx sin x = ) ; ∫ = sin – 1 x + C
1 – x2 1–x 2

d 1 dx
(
–1
(ix) dx – cos x = ;) ∫ = – cos
–1
x+ C
1 – x2 1–x 2

d 1 dx
(x)
dx
( )
tan – 1 x =
1 + x2
; ∫ 1 + x2 = tan
–1
x+ C

d 1 dx
(xi)
dx
(
– cot – 1 x =)1 + x2
; ∫ 1 + x2 = – cot
–1
x+ C
INTEGRALS 291

d 1 dx
( –1
(xii) dx sec x = ) ; ∫x = sec– 1 x + C
x x2 – 1 2
x –1
d 1 dx
( –1
(xiii) dx – cosec x = ) ; ∫x = – cosec– 1 x + C
x x2 – 1 2
x –1
d x
( e ) = ex ; ∫e
x
(xiv) dx = ex + C
dx
d 1 1
(xv)
dx
( log | x |) = ;
x ∫ x dx = log | x | +C
d  ax  ax
∫ a dx =
x x
(xvi)  = a ; +C
dx  log a  log a

A Note In practice, we normally do not mention the interval over which the various
functions are defined. However, in any specific problem one has to keep it in mind.
7.2.1 Geometrical interpretation of indefinite integral
Let f (x) = 2x. Then ∫ f (x ) dx = x + C . For different values of C, we get different
2

integrals. But these integrals are very similar geometrically.


Thus, y = x2 + C, where C is arbitrary constant, represents a family of integrals. By
assigning different values to C, we get different members of the family. These together
constitute the indefinite integral. In this case, each integral represents a parabola with
its axis along y-axis.
Clearly, for C = 0, we obtain y = x2 , a parabola with its vertex on the origin. The
curve y = x2 + 1 for C = 1 is obtained by shifting the parabola y = x2 one unit along
y-axis in positive direction. For C = – 1, y = x2 – 1 is obtained by shifting the parabola
y = x2 one unit along y-axis in the negative direction. Thus, for each positive value of C,
each parabola of the family has its vertex on the positive side of the y-axis and for
negative values of C, each has its vertex along the negative side of the y-axis. Some of
these have been shown in the Fig 7.1.
Let us consider the intersection of all these parabolas by a line x = a. In the Fig 7.1,
we have taken a > 0. The same is true when a < 0. If the line x = a intersects the
parabolas y = x2, y = x2 + 1, y = x2 + 2, y = x2 – 1, y = x2 – 2 at P0 , P1, P2, P–1 , P–2 etc.,
dy
respectively, then at these points equals 2a. This indicates that the tangents to the
dx
curves at these points are parallel. Thus, ∫ 2 x dx = x + C = FC ( x) (say), implies that
2
292 MATHEMATICS

Fig 7.1
the tangents to all the curves y = F C (x), C ∈ R, at the points of intersection of the
curves by the line x = a, (a ∈ R), are parallel.
Further, the following equation (statement) ∫ f ( x) dx = F (x ) + C = y (say) ,
represents a family of curves. The different values of C will correspond to different
members of this family and these members can be obtained by shifting any one of the
curves parallel to itself. This is the geometrical interpretation of indefinite integral.
7.2.2 Some properties of indefinite integral
In this sub section, we shall derive some properties of indefinite integrals.
(I) The process of differentiation and integration are inverses of each other in the
sense of the following results :
d
dx ∫
f ( x) dx = f (x)

and ∫ f ′(x) dx = f (x) + C, where C is any arbitrary constant.


INTEGRALS 293

Proof Let F be any anti derivative of f, i.e.,


d
F(x) = f (x)
dx

Then ∫ f (x ) dx = F(x) + C
d d
Therefore
dx ∫ f (x) dx = dx
( F ( x) + C )

d
= F (x ) = f ( x)
dx
Similarly, we note that
d
f ′(x) = f ( x)
dx

and hence ∫ f ′(x) dx = f (x) + C


where C is arbitrary constant called constant of integration.
(II) Two indefinite integrals with the same derivative lead to the same family of
curves and so they are equivalent.
Proof Let f and g be two functions such that

d d
∫ dx ∫
f ( x) dx = g (x ) dx
dx

d 
f ( x) dx – ∫ g (x) dx  = 0
dx  ∫
or

Hence ∫ f (x ) dx – ∫ g (x) dx = C, where C is any real number (Why?)

or ∫ f (x ) dx = ∫ g (x) dx + C
So the families of curves {∫ f (x) dx + C , C ∈ R}1 1

and {∫ g(x) dx + C , C ∈ R} are identical.


2 2

Hence, in this sense, ∫ f (x ) dx and ∫ g (x) dx are equivalent.


294 MATHEMATICS

A Note The equivalence of the families {∫ f (x) dx + C ,C ∈ R} and 1 1

{∫ g(x) dx + C ,C ∈ R} is customarily expressed by writing ∫ f (x ) dx = ∫ g (x) dx ,


2 2

without mentioning the parameter.

(III) ∫[ f (x) + g (x )] dx = ∫ f (x) dx + ∫ g (x) dx


Proof By Property (I), we have
d 
[ f ( x) + g (x )] dx  = f (x) + g (x)
dx  ∫  ... (1)

On the otherhand, we find that


d  d d
∫ f ( x) dx+ ∫ g ( x) dx  = ∫ f (x) dx + dx ∫ g (x) dx
dx   dx
= f (x) + g (x) ... (2)
Thus, in view of Property (II), it follows by (1) and (2) that

∫ ( f (x) + g (x) ) dx = ∫ f (x ) dx + ∫ g (x) dx .


(IV) For any real number k, ∫ k f ( x) dx = k ∫ f ( x) dx

d
dx ∫
Proof By the Property (I), k f ( x) dx = k f (x) .

d  d
Also
dx 
k ∫ f (x) dx  = k
dx ∫ f (x) dx = k f ( x)

Therefore, using the Property (II), we have ∫k f ( x) dx = k ∫ f (x) dx .


(V) Properties (III) and (IV) can be generalised to a finite number of functions
f1, f2 , ..., fn and the real numbers, k1, k2, ..., kn giving

∫[ k1 f1 (x ) + k2 f2 (x) + ... + kn f n (x)] dx


= k1 ∫ f1 ( x) dx + k2 ∫ f 2 ( x) dx + ... + kn ∫ fn (x) dx .
To find an anti derivative of a given function, we search intuitively for a function
whose derivative is the given function. The search for the requisite function for finding
an anti derivative is known as integration by the method of inspection. We illustrate it
through some examples.
INTEGRALS 295

Example 1 Write an anti derivative for each of the following functions using the
method of inspection:
1
(i) cos 2x (ii) 3x2 + 4x3 (iii) ,x≠0
x
Solution
(i) We look for a function whose derivative is cos 2x. Recall that
d
sin 2x = 2 cos 2x
dx
1 d d 1 
or cos 2x = (sin 2x) =  sin 2 x 
2 dx dx  2 
1
Therefore, an anti derivative of cos 2x issin 2 x .
2
(ii) We look for a function whose derivative is 3x2 + 4x3 . Note that
d 3
dx
(
x + x 4 = 3x2 + 4x3. )
Therefore, an anti derivative of 3x2 + 4x3 is x3 + x4 .
(iii) We know that
d 1 d 1 1
(log x) = , x > 0 and [log ( – x)] = ( – 1) = , x < 0
dx x dx –x x
d 1
Combining above, we get
dx
( log x ) = , x ≠ 0
x
1 1.
Therefore, ∫ x dx = log x is one of the anti derivatives of
x
Example 2 Find the following integrals:
3
x3 – 1 2 1
∫ ∫ (x 2 + 2 e – x ) dx
x
(i)
x2
dx (ii) ∫ (x 3 + 1) dx (iii)

Solution
(i) We have

x3 – 1
∫ x 2
dx = ∫ x dx – ∫ x– 2 dx (by Property V)
296 MATHEMATICS

 x1 + 1   x– 2 + 1 
=  + C1 –
 + C 2  ; C , C are constants of integration
 1 +1   – 2 +1  1 2

x2 x– 1 2
= + C1 – – C 2 = x + 1 + C1 – C2
2 –1 2 x
x2 1
= + + C , where C = C 1 – C2 is another constant of integration.
2 x

A Note From now onwards, we shall write only one constant of integration in the
final answer.
(ii) We have
2 2

∫ (x 3 + 1) dx = ∫ x 3 dx + ∫ dx
2
+1
5
x3 3
= 2 + x + C = x3 + x + C
+1 5
3
3 3
1 1
∫ (x 2 + 2 e – ) dx = ∫ x 2 dx + ∫ 2 e x dx – ∫ x dx
x
(iii) We have
x
3
+1
x2
= 3 + 2 e x – log x + C
+1
2
5
2
= x 2 + 2 e x – log x + C
5
Example 3 Find the following integrals:
(i) ∫ (sin x + cos x ) dx (ii) ∫ cosec x (cosec x + cot x) dx
1 – sin x
(iii) ∫ cos 2 x
dx

Solution
(i) We have
∫ (sin x + cos x ) dx = ∫ sin x dx + ∫ cos x dx
= – cos x + sin x + C
INTEGRALS 297

(ii) We have

∫ (cosec x (cosec x + cot x) dx = ∫ cosec x dx + ∫ cosec x cot x dx


2

= – cot x – cosec x + C
(iii) We have
1 – sin x 1 sin x
∫ cos x2
dx = ∫ 2 dx – ∫
cos x cos 2 x
dx

= ∫ sec x dx – ∫ tan x sec x dx


2

= tan x – sec x + C

Example 4 Find the anti derivative F of f defined by f (x) = 4x3 – 6, where F (0) = 3
Solution One anti derivative of f (x) is x4 – 6x since
d 4
( x – 6 x) = 4x3 – 6
dx
Therefore, the anti derivative F is given by

F(x) = x4 – 6x + C, where C is constant.

Given that F(0) = 3, which gives,

3 = 0 – 6 × 0 + C or C = 3
Hence, the required anti derivative is the unique function F defined by
F(x) = x4 – 6x + 3.
Remarks
(i) We see that if F is an anti derivative of f, then so is F + C, where C is any
constant. Thus, if we know one anti derivative F of a function f, we can write
down an infinite number of anti derivatives of f by adding any constant to F
expressed by F(x) + C, C ∈ R. In applications, it is often necessary to satisfy an
additional condition which then determines a specific value of C giving unique
anti derivative of the given function.
(ii) Sometimes, F is not expressible in terms of elementary functions viz., polynomial,
logarithmic, exponential, trigonometric functions and their inverses etc. We are
therefore blocked for finding ∫ f (x ) dx . For example, it is not possible to find
2

∫e
– x2
dx by inspection since we can not find a function whose derivative is e– x
298 MATHEMATICS

(iii) When the variable of integration is denoted by a variable other than x, the integral
formulae are modified accordingly. For instance
y4+ 1 1
∫ y dy =
4
+ C = y5 + C
4+1 5

7.2.3 Comparison between differentiation and integration


1. Both are operations on functions.
2. Both satisfy the property of linearity, i.e.,
d d d
(i) [k1 f1 (x) + k2 f 2 (x )] = k1 f1 (x) + k2 f 2 (x )
dx dx dx
(ii) ∫[ k1 f1 ( x) + k 2 f2 ( x) ] dx = k1 ∫ f1 (x) dx + k2 ∫ f2 (x) dx
Here k1 and k2 are constants.
3. We have already seen that all functions are not differentiable. Similarly, all functions
are not integrable. We will learn more about nondifferentiable functions and
nonintegrable functions in higher classes.
4. The derivative of a function, when it exists, is a unique function. The integral of
a function is not so. However, they are unique upto an additive constant, i.e., any
two integrals of a function differ by a constant.
5. When a polynomial function P is differentiated, the result is a polynomial whose
degree is 1 less than the degree of P. When a polynomial function P is integrated,
the result is a polynomial whose degree is 1 more than that of P.
6. We can speak of the derivative at a point. We never speak of the integral at a
point, we speak of the integral of a function over an interval on which the integral
is defined as will be seen in Section 7.7.
7. The derivative of a function has a geometrical meaning, namely, the slope of the
tangent to the corresponding curve at a point. Similarly, the indefinite integral of
a function represents geometrically, a family of curves placed parallel to each
other having parallel tangents at the points of intersection of the curves of the
family with the lines orthogonal (perpendicular) to the axis representing the variable
of integration.
8. The derivative is used for finding some physical quantities like the velocity of a
moving particle, when the distance traversed at any time t is known. Similarly,
the integral is used in calculating the distance traversed when the velocity at time
t is known.
9. Differentiation is a process involving limits. So is integration, as will be seen in
Section 7.7.
INTEGRALS 299

10. The process of differentiation and integration are inverses of each other as
discussed in Section 7.2.2 (i).

EXERCISE 7.1
Find an anti derivative (or integral) of the following functions by the method of inspection.
1. sin 2x 2. cos 3x 3. e 2x
2 3x
4. (ax + b) 5. sin 2x – 4 e
Find the following integrals in Exercises 6 to 20:
1
6. ∫ (4 e + 1) dx 7. ∫ x (1 – 2 ) dx 8. ∫ ( ax + bx + c ) dx
3x 2 2
x
2
 1  x 3 + 5 x2 – 4
9. ∫ (2 x + e ) dx
2 x
10. ∫  x–

 dx 11.
x
∫ x2
dx

x3 + 3 x + 4 x3 − x2 + x – 1
12. ∫ x
dx 13. ∫ x –1
dx 14. ∫ (1 – x) x dx

∫ x ( 3 x + 2 x + 3) dx ∫ (2 x – 3cos x + e ) dx
2 x
15. 16.

∫ (2 x – 3sin x + 5 x ) dx ∫ sec x (sec x + tan x) dx


2
17. 18.

sec2 x 2 – 3sin x
19. ∫ 2
dx 20. ∫ dx.
cosec x cos 2 x
Choose the correct answer in Exercises 21 and 22.
 1 
21. The anti derivative of  x +  equals
 x
1 1 2
1 3 2 3 1 2
(A) x + 2 x2 + C (B) x + x +C
3 3 2
3 1 3 1
2 2 3 2 1 2
(C) x + 2 x2 + C (D) x + x +C
3 2 2
d 3
22. If f ( x) = 4 x3 − 4 such that f (2) = 0. Then f (x) is
dx x
4 1 129 3 1 129
(A) x + − (B) x + +
x3 8 x4 8
4 1 129 3 1 129
(C) x + 3 + (D) x + 4 −
x 8 x 8
300 MATHEMATICS

7.3 Methods of Integration


In previous section, we discussed integrals of those functions which were readily
obtainable from derivatives of some functions. It was based on inspection, i.e., on the
search of a function F whose derivative is f which led us to the integral of f. However,
this method, which depends on inspection, is not very suitable for many functions.
Hence, we need to develop additional techniques or methods for finding the integrals
by reducing them into standard forms. Prominent among them are methods based on:
1. Integration by Substitution
2. Integration using Partial Fractions
3. Integration by Parts
7.3.1 Integration by substitution
In this section, we consider the method of integration by substitution.
The given integral ∫ f (x ) dx can be transformed into another form by changing
the independent variable x to t by substituting x = g (t).
Consider I= ∫ f (x ) dx
dx
Put x = g(t) so that = g′(t).
dt
We write dx = g′(t) dt

Thus I= ∫ f (x) dx = ∫ f (g (t )) g ′(t) dt


This change of variable formula is one of the important tools available to us in the
name of integration by substitution. It is often important to guess what will be the useful
substitution. Usually, we make a substitution for a function whose derivative also occurs
in the integrand as illustrated in the following examples.
Example 5 Integrate the following functions w.r.t. x:
(i) sin mx (ii) 2x sin (x2 + 1)

tan 4 x sec 2 x sin (tan – 1 x)


(iii) (iv)
x 1 + x2

Solution
(i) We know that derivative of mx is m. Thus, we make the substitution
mx = t so that mdx = dt.
1 1 1
Therefore, ∫ sin mx dx = m ∫ sin t dt = – m cos t + C = – m cos mx + C
INTEGRALS 301

(ii) Derivative of x2 + 1 is 2x. Thus, we use the substitution x2 + 1 = t so that


2x dx = dt.

∫ 2 x sin (x + 1) dx = ∫ sin t dt = – cos t + C = – cos (x2 + 1) + C


2
Therefore,
1
1 –2 1
(iii) Derivative of x is x = . Thus, we use the substitution
2 2 x
1
x = t so that dx = dt giving dx = 2t dt.
2 x
tan 4 x sec 2 2t tan 4 t sec2 t dt
x
Thus, ∫ x t
dx = ∫ = 2 ∫ tan t sec t dt
4 2

Again, we make another substitution tan t = u so that sec2 t dt = du


u5
Therefore, 2 ∫ tan 4 t sec2 t dt = 2 ∫ u 4 du = 2 +C
5
2
tan 5 t + C (since u = tan t)
=
5
2 5
= tan x + C (since t = x )
5
tan 4 x sec 2 x 2
Hence, ∫ x
dx = tan5 x + C
5
Alternatively, make the substitution tan x = t
1
(iv) Derivative of tan– 1x = . Thus, we use the substitution
1+ x 2
dx
tan–1 x = t so that = dt.
1 + x2
sin (tan – 1 x)
Therefore , ∫ dx = ∫ sin t dt = – cos t + C = – cos (tan –1x) + C
1 + x2
Now, we discuss some important integrals involving trigonometric functions and
their standard integrals using substitution technique. These will be used later without
reference.
(i) ∫ tan x dx = log sec x + C
We have
sin x
∫ tan x dx = ∫ cos x dx
302 MATHEMATICS

Put cos x = t so that sin x dx = – dt


dt
Then ∫ tan x dx = – ∫ t = – log t + C = – log cos x + C
or ∫ tan x dx = log sec x + C
(ii) ∫ cot x dx = log sin x + C

cos x
We have ∫ cot x dx = ∫ sin x dx

Put sin x = t so that cos x dx = dt


dt
Then ∫ cot x dx = ∫ t = log t + C = log sin x + C
(iii) ∫ sec x dx = log sec x + tan x + C
We have
sec x (sec x + tan x)
∫ sec x dx = ∫
sec x + tan x
dx

Put sec x + tan x = t so that sec x (tan x + sec x) dx = dt


dt
Therefore, ∫ sec x dx = ∫ = log t + C = log sec x + tan x + C
t
(iv) ∫ cosec x dx = log cosec x – cot x + C
We have
cosec x (cosec x + cot x)
∫ cosec x dx = ∫ (cosec x + cot x)
dx

Put cosec x + cot x = t so that – cosec x (cosec x + cot x) dx = dt


dt
So ∫ cosec x dx = – ∫ t = – log | t | = – log |cosec x + cot x | + C
cosec2 x − cot 2 x
= – log +C
cosec x − cot x
= log cosec x – cot x + C
Example 6 Find the following integrals:
sin x 1
∫ sin ∫ sin (x + a ) dx ∫ 1 + tan x dx
3
(i) x cos 2 x dx (ii) (iii)
INTEGRALS 303

Solution
(i) We have

∫ sin x cos 2 x dx = ∫ sin 2 x cos 2 x ( sin x) dx


3

= ∫ (1 – cos x) cos x (sin x) dx


2 2

Put t = cos x so that dt = – sin x dx

∫ sin x cos 2 x (sin x) dx = − ∫ (1 – t 2 ) t 2 dt


2
Therefore,

 t3 t5 
= – ∫ ( t – t ) dt = –  –  + C
2 4

3 5
1 1
= – cos3 x + cos5 x + C
3 5
(ii) Put x + a = t. Then dx = dt. Therefore
sin x sin (t – a)
∫ sin (x + a) dx = ∫ sin t
dt

sin t cos a – cos t sin a


= ∫ sin t
dt

= cos a ∫ dt – sin a ∫ cot t dt

= (cos a ) t – (sin a )  log sin t + C1 

= (cos a ) ( x + a ) – (sin a) log sin (x + a) + C1 

= x cos a + a cos a – (sin a) log sin ( x + a ) – C1 sin a

sin x
Hence, ∫ sin (x + a ) dx = x cos a – sin a log |sin (x + a)| + C,

where, C = – C1 sin a + a cos a, is another arbitrary constant.


dx cos x dx
(iii) ∫ 1 + tan x = ∫ cos x + sin x
1 (cos x + sin x + cos x – sin x) dx
=
2 ∫ cos x + sin x
304 MATHEMATICS

1 1 cos x – sin x
= 2 ∫ dx + 2 ∫ cos x + sin x dx
x C1 1 cos x – sin x
= + +
2 2 2 ∫ cos x + sin x dx ... (1)

cos x – sin x
Now, consider I = ∫ dx
cos x + sin x
Put cos x + sin x = t so that (cos x – sin x) dx = dt
dt
Therefore I=∫
= log t + C 2 = log cos x + sin x + C 2
t
Putting it in (1), we get
dx x C1 1 C
∫ 1 + tan x = 2 + + log cos x + sin x + 2
2 2 2
x 1 C C
= + log cos x + sin x + 1 + 2
2 2 2 2
x 1  C C 
= + log cos x + sin x + C ,  C = 1 + 2 
2 2  2 2 

EXERCISE 7.2
Integrate the functions in Exercises 1 to 37:

1.
2x
2.
( log x )2 3.
1
1 + x2 x x + x log x
4. sin x sin (cos x) 5. sin ( ax + b ) cos ( ax + b )

6. ax + b 7. x x+ 2 8. x 1 + 2 x2
1 x
9. (4 x + 2) x 2 + x + 1 10. 11. , x> 0
x– x x+ 4

3
1
5
x2 1
12. (x – 1) 3 x 13. 14. , x > 0, m ≠ 1
(2 + 3 x3 ) 3 x (log x) m
x x
15. 16. e2 x + 3 17. 2
9 – 4x2 ex
INTEGRALS 305

–1
etan x e2 x – 1 e2 x – e– 2 x
18. 19. 20.
1 + x2 e 2x + 1 e2 x + e– 2 x

sin – 1 x
21. tan2 (2x – 3) 22. sec2 (7 – 4x) 23.
1 – x2

2cos x – 3sin x 1 cos x


24. 25. 26.
6cos x + 4sin x cos x (1 – tan x) 2
2
x
cos x
27. sin 2x cos 2 x 28. 29. cot x log sin x
1 + sin x

sin x sin x 1
30. 1 + cos x 31. (1 + cos x ) 2 32. 1 + cot x

33.
1
34.
tan x
35.
(1 + log x )2
1 – tan x sin x cos x x

36.
(x + 1) ( x + log x )
2
37.
(
x3 sin tan – 1 x4 )
8
x 1+ x
Choose the correct answer in Exercises 38 and 39.
10 x 9 + 10 x log e10 dx
38. ∫ x10 + 10 x
equals

(A) 10x – x10 + C (B) 10x + x10 + C


(C) (10x – x10 )–1 + C (D) log (10x + x10) + C
dx
39. ∫ sin2 x cos 2 x equals
(A) tan x + cot x + C (B) tan x – cot x + C
(C) tan x cot x + C (D) tan x – cot 2x + C
7.3.2 Integration using trigonometric identities
When the integrand involves some trigonometric functions, we use some known identities
to find the integral as illustrated through the following example.

Example 7 Find (i) ∫ cos x dx (ii) ∫ sin 2 x cos 3 x dx (iii) ∫ sin x dx


2 3
306 MATHEMATICS

Solution
(i) Recall the identity cos 2x = 2 cos2 x – 1, which gives
1 + cos 2x
cos2 x =
2
1 1 1
∫ cos ∫ (1 + cos 2x) dx = ∫ dx + ∫ cos 2 x dx
2
Therefore, x dx =
2 2 2
x 1
= + sin 2 x + C
2 4
1
(ii) Recall the identity sin x cos y = [sin (x + y) + sin (x – y)] (Why?)
2
1
sin 5 x dx • ∫ sin x dx
∫ sin 2 x cos 3 x dx 2 ∫
Then = 

1  1 
=  – 5 cos 5 x + cos x  + C
2  
1 1
cos 5 x + cos x + C
= –
10 2
(iii) From the identity sin 3x = 3 sin x – 4 sin3 x, we find that
3sin x – sin 3 x
sin3 x =
4
3 1
∫ sin ∫ sin x dx – ∫ sin 3 x dx
3
Therefore, x dx =
4 4
3 1
= – cos x + cos 3 x + C
4 12

∫ sin x dx = ∫ sin 2 x sin x dx = ∫ (1 – cos 2 x) sin x dx


3
Alternatively,
Put cos x = t so that – sin x dx = dt
t3
∫ sin x dx = − ∫ 1 – t 2 dt = – ∫ dt + ∫ t 2 dt = – t +
( )
3
Therefore, +C
3
1
cos 3x + C
= – cos x +
3
Remark It can be shown using trigonometric identities that both answers are equivalent.
INTEGRALS 307

EXERCISE 7.3
Find the integrals of the functions in Exercises 1 to 22:
1. sin2 (2x + 5) 2. sin 3x cos 4x 3. cos 2x cos 4x cos 6x
3 3 3
4. sin (2x + 1) 5. sin x cos x 6. sin x sin 2x sin 3x
1 – cos x cos x
7. sin 4x sin 8x 8. 9.
1 + cos x 1 + cos x
sin 2 x
10. sin4 x 11. cos4 2x 12.
1 + cos x
cos 2x – cos 2α cos x – sin x
13. 14. 15. tan3 2x sec 2x
cos x – cos α 1 + sin 2 x

sin 3 x + cos3 x cos 2 x + 2sin 2 x


16. tan4 x 17. 18.
sin 2 x cos 2 x cos 2 x
1 cos 2 x
19. 20. 21. sin – 1 (cos x)
sin x cos 3 x ( cos x + sin x )2
1
22.
cos (x – a ) cos (x – b )
Choose the correct answer in Exercises 23 and 24.
sin 2 x − cos2 x
23. ∫ sin 2 x cos2 x dx is equal to
(A) tan x + cot x + C (B) tan x + cosec x + C
(C) – tan x + cot x + C (D) tan x + sec x + C
e x (1 + x)
24. ∫ cos 2 (ex x) dx equals
(A) – cot (exx) + C (B) tan (xex) + C
(C) tan (ex) + C (D) cot (ex) + C
7.4 Integrals of Some Particular Functions
In this section, we mention below some important formulae of integrals and apply them
for integrating many other related standard integrals:
dx 1 x–a
(1) ∫ 2 2
= log +C
x –a 2a x +a
308 MATHEMATICS

dx 1 a+x
(2) ∫ a 2 – x 2 = 2a log a– x
+C

dx 1 x
∫ x 2 + a 2 = a tan
–1
(3) +C
a

dx

2 2
(4) = log x + x – a + C
2 2
x –a

dx x
(5) ∫ a –x2 2
= sin – 1
a
+C

dx
(6) ∫ x +a2 2
= log x + x 2 + a 2 + C

We now prove the above results:

1 1
(1) We have 22
=
x –a ( x – a ) (x + a )

1  (x + a ) – (x – a )  1  1 1 
 = –
2a  ( x – a ) ( x + a )  2a  x – a x + a 

=

dx 1  dx dx 
Therefore, ∫ x2 – a 2 = 2 a  ∫ x – a – ∫ x + a 

1
= [log | (x – a )| – log | (x + a )|] + C
2a

1 x–a
= log +C
2a x +a

(2) In view of (1) above, we have


1 1  ( a + x) + ( a − x)  1  1 1 
=   = +
2a  a − x a + x 

2 2
a –x 2 a  ( a + x) ( a − x) 
INTEGRALS 309

dx 1  dx dx 
Therefore, ∫ a 2 – x2 =  ∫
2a  a − x
+∫
a + x 
1
= [− log | a − x | + log | a + x |] + C
2a
1 a+x
= log +C
2a a− x

ANote The technique used in (1) will be explained in Section 7.5.


(3) Put x = a tan θ. Then dx = a sec2 θ dθ.
dx a sec2 θ d θ
Therefore, ∫ x2 + a 2 = ∫ a 2 tan 2 θ + a 2
1 1 1 –1 x
= ∫ dθ = θ + C = tan +C
a a a a
(4) Let x = a sec θ. Then dx = a sec θ tan θ d θ.
dx a secθ tanθ d θ
Therefore, ∫ x2 − a 2 = ∫ a 2 sec2θ − a 2
= ∫ secθ dθ = log secθ + tanθ + C1

x x2
= log + – 1 + C1
a a2
2 2
= log x + x – a − log a + C1
2 2
= log x + x – a + C , where C = C1 – log |a|
(5) Let x = a sinθ. Then dx = a cosθ dθ.
dx a cosθ d θ
Therefore, ∫ a 2 − x2
= ∫ a 2 – a 2 sin 2θ
x
= ∫ d θ = θ + C = sin
–1
+C
a
(6) Let x = a tan θ. Then dx = a sec2 θ dθ.
dx a sec2 θ dθ
Therefore, ∫ x2 + a 2
= ∫ a 2 tan 2 θ + a 2
= ∫ secθ dθ = log (secθ + tan θ) + C1
310 MATHEMATICS

x x2
= log + + 1 + C1
a a2

2 2
= log x + x + a − log |a | + C1

2 2
= log x + x + a + C , where C = C1 – log |a|

Applying these standard formulae, we now obtain some more formulae which
are useful from applications point of view and can be applied directly to evaluate
other integrals.
dx
(7) To find the integral ∫ ax2 + bx + c , we write
 2 b c  b   c b2
2

ax + bx + c = a  x + x +  = a  x +
2
 + – 
 a a  2 a   a 4a 2  

b c b2 2
Now, put x + = t so that dx = dt and writing – 2 = ± k . We find the
2a a 4a

1 dt  c b2 
integral reduced to the form ∫
a t ± k2
2 depending upon the sign of  – 2
 a 4a 
and hence can be evaluated.
dx
(8) To find the integral of the type ∫ 2
ax + bx + c
, proceeding as in (7), we

obtain the integral using the standard formulae.


px + q
(9) To find the integral of the type ∫ ax2 + bx + c dx , where p, q, a, b, c are

constants, we are to find real numbers A, B such that


d
px + q = A (ax2 + bx + c) + B = A (2ax + b) + B
dx
To determine A and B, we equate from both sides the coefficients of x and the
constant terms. A and B are thus obtained and hence the integral is reduced to
one of the known forms.
INTEGRALS 311

( px + q) dx
(10) For the evaluation of the integral of the type
ax2 + bx + c
∫ , we proceed

as in (9) and transform the integral into known standard forms.


Let us illustrate the above methods by some examples.
Example 8 Find the following integrals:
dx dx
(i) ∫ x2 − 16 (ii) ∫ 2 x − x2
Solution
dx dx 1 x–4
(i) We have ∫ x2 − 16 = ∫ x2 – 4 2 =
8
log
x+ 4
+ C [by 7.4 (1)]

dx dx
(ii) ∫ 2 x − x2
=∫
2
1 – ( x – 1)
Put x – 1 = t. Then dx = dt.

dx dt
Therefore, ∫ 2x − x 2
= ∫ 1– t 2
–1
= sin (t ) + C [by 7.4 (5)]

–1
= sin (x – 1) + C
Example 9 Find the following integrals :
dx dx dx
(i) ∫ x2 − 6 x + 13 (ii) ∫ 3 x2 + 13 x − 10 (iii) ∫ 5x 2 − 2x
Solution
(i) We have x2 – 6x + 13 = x2 – 6x + 32 – 32 + 13 = (x – 3)2 + 4

dx 1
So, ∫ x2 − 6 x + 13 = ∫ ( x – 3)2 + 22 dx
Let x – 3 = t. Then dx = dt
dx dt 1 t
∫ x2 − 6 x + 13 = ∫ t 2 + 2 2 = 2 tan
–1
Therefore, +C [by 7.4 (3)]
2
1 x–3
= tan– 1 +C
2 2
312 MATHEMATICS

(ii) The given integral is of the form 7.4 (7). We write the denominator of the integrand,

 2 13 x 10 
3x 2 + 13 x – 10 = 3  x + – 
 3 3

 13   17  
2 2

= 3   x +    

  6   6   (completing the square)

dx 1 dx
Thus = ∫
∫ 3 x2 + 13 x − 10 3  13  2  17  2
x+  − 
 6 6 

13
Put x + = t . Then dx = dt.
6
dx 1 dt
Therefore, ∫ 3 x2 + 13 x − 10 =
3 ∫ 2
 17 
t2 −  
 6 

17
t–
1 6 +C
= log 1 [by 7.4 (i)]
17 17
3 × 2× t+
6 6

13 17
x+ –
1 6 6 +C
= log 1
17 13 17
x+ +
6 6

1 6x − 4
= log + C1
17 6 x + 30

1 3x − 2 1 1
= log + C1 + log
17 x+5 17 3

1 3x − 2 1 1
= log + C , where C = C1 + log
17 x+5 17 3
INTEGRALS 313

dx dx
(iii) We have ∫ 5x 2 − 2x
=∫
 2x 
5 x 2 – 
 5 

1 dx
=
5
∫ 2 2
(completing the square)
 1 1 
x– –
5   5 

1
Put x – = t . Then dx = dt.
5
dx 1 dt
Therefore, ∫ 2
=
5
∫ 2
5x − 2x  1
t2 –  
 5
2
1 1 
= log t + t 2 –   +C [by 7.4 (4)]
5 5 

1 1 2x
= log x – + x2 – +C
5 5 5

Example 10 Find the following integrals:


x+ 2 x+3
(i) ∫ 2 x 2 + 6 x + 5 dx (ii) ∫ 5 − 4 x + x2
dx

Solution
(i) Using the formula 7.4 (9), we express
d
x+2= A
dx
( )
2 x 2 + 6 x + 5 + B = A (4 x + 6) + B
Equating the coefficients of x and the constant terms from both sides, we get
1 1
4A = 1 and 6A + B = 2 or A = and B = .
4 2
x+ 2 1 4x + 6 1 dx
Therefore, ∫ 2 x 2 + 6 x + 5 = 4 ∫ 2 x2 + 6 x + 5 dx + 2 ∫ 2 x2 + 6 x + 5
1 1
= I1 + I2 (say) ... (1)
4 2
314 MATHEMATICS

In I1, put 2x2 + 6x + 5 = t, so that (4x + 6) dx = dt


dt
Therefore, I1 = ∫ t
= log t + C1

2
= log | 2 x + 6 x + 5 | + C1 ... (2)

dx 1 dx
and I2 = ∫ 2x 2 + 6x + 5 = 2 ∫ 5
x2 + 3 x +
2

1 dx
= ∫
2  2
3 1 
2

 x + +
  
 2 2 

3
Put x + = t , so that dx = dt, we get
2
1 dt 1
I2 =
2 ∫ 1 
2 =
1
tan – 1 2 t + C 2 [by 7.4 (3)]
2
t +  2×
 2 2

–1  3 –1
= tan 2  x +  + C 2 = tan ( 2 x + 3 ) + C 2 ... (3)
 2
Using (2) and (3) in (1), we get

x+ 2 1 1
∫ 2 x 2 + 6 x + 5 dx = 4 log 2 x
2
+ 6x + 5 + tan – 1 ( 2 x + 3 ) + C
2

C1 C2
where, C=+
4 2
(ii) This integral is of the form given in 7.4 (10). Let us express

d
x+3= A (5 – 4 x – x 2 ) + B = A (– 4 – 2x) + B
dx
Equating the coefficients of x and the constant terms from both sides, we get
1
– 2A = 1 and – 4 A + B = 3, i.e., A = – and B = 1
2
INTEGRALS 315

x+3 1 (– 4 – 2 x) dx + dx
Therefore, ∫ 2
dx = –
2 ∫ 5 − 4x − x 2 ∫ 5 − 4 x − x2
5 − 4x − x
1
I +I= – ... (1)
2 1 2
In I1, put 5 – 4x – x2 = t, so that (– 4 – 2x) dx = dt.
( – 4 − 2 x) dx dt
Therefore, I1 = ∫ 5 − 4x − x 2
=∫
t
= 2 t + C1

= 2 5 – 4x – x 2 + C1 ... (2)

dx dx
Now consider I2 = ∫ 5 − 4x − x 2
=∫
9 – ( x + 2) 2
Put x + 2 = t, so that dx = dt.
dt t

–1
Therefore, I2 = = sin + C2 [by 7.4 (5)]
2
3 −t 2 3

–1 x+ 2
= sin + C2 ... (3)
3
Substituting (2) and (3) in (1), we obtain
x+3 x+ 2 C
∫ 5 – 4x – x 2
= – 5 – 4x – x2 + sin – 1
3
+ C , where C = C2 – 1
2

EXERCISE 7.4
Integrate the functions in Exercises 1 to 23.
3 x2 1 1
1. 2. 3.
x6 +1 1 + 4x 2 ( 2 – x) 2 + 1
1 3x x2
4. 5. 6.
9 – 25 x 2 1 + 2 x4 1 − x6

x –1 x2 sec2 x
7. 8. 9.
x2 – 1 x6 + a 6 tan 2 x + 4
316 MATHEMATICS

1 1 1
10. 11. 2 12.
x2 + 2 x + 2 9 x + 6x + 5 7 – 6 x – x2

1 1 1
13. 14. 15.
( x – 1)( x – 2 ) 8 + 3x – x2 ( x – a )( x – b )
4 x +1 x+2 5x − 2
16. 17. 18.
2
2x + x – 3 2
x –1 1 + 2 x + 3 x2

6x + 7 x+2 x+2
19. 20. 21.
( x – 5)( x – 4) 4x – x 2 2
x + 2x + 3

x+3 5x + 3
22. 2 23. .
x – 2x − 5 x2 + 4 x + 10
Choose the correct answer in Exercises 24 and 25.
dx
24. ∫ x2 + 2 x + 2 equals
(A) x tan–1 (x + 1) + C (B) tan–1 (x + 1) + C
(C) (x + 1) tan–1 x + C (D) tan–1 x + C
dx
25. ∫ 9 x − 4 x2
equals

1  9x − 8  1  8x − 9 
(A) sin –1  +C (B) sin –1  +C
9  8  2  9 

1  9x − 8  1  9x − 8 
sin –1  sin –1  +C
(C)  +C (D)
2  9 
3  8 
7.5 Integration by Partial Fractions
Recall that a rational function is defined as the ratio of two polynomials in the form
P(x)
, where P (x) and Q(x) are polynomials in x and Q(x) ≠ 0. If the degree of P(x)
Q(x)
is less than the degree of Q(x), then the rational function is called proper, otherwise, it
is called improper. The improper rational functions can be reduced to the proper rational
INTEGRALS 317

P( x) P(x) P (x )
functions by long division process. Thus, if is improper, then = T( x) + 1 ,
Q( x) Q(x) Q( x)

P1 (x)
where T(x) is a polynomial in x and is a proper rational function. As we know
Q(x)
how to integrate polynomials, the integration of any rational function is reduced to the
integration of a proper rational function. The rational functions which we shall consider
here for integration purposes will be those whose denominators can be factorised into
P(x) P(x)
linear and quadratic factors. Assume that we want to evaluate ∫ Q(x ) dx , where Q(x)
is proper rational function. It is always possible to write the integrand as a sum of
simpler rational functions by a method called partial fraction decomposition. After this,
the integration can be carried out easily using the already known methods. The following
Table 7.2 indicates the types of simpler partial fractions that are to be associated with
various kind of rational functions.
Table 7.2

S.No. Form of the rational function Form of the partial fraction


px + q A B
1. ,a≠b +
(x –a ) ( x– b ) x –a x–b

px + q A B
+
2. x – a ( x – a)2
(x – a ) 2

px2 + qx + r A B C
3. + +
(x – a ) ( x – b) ( x – c) x – a x – b x –c

px 2 + qx + r A B C
4. + 2
+
(x – a ) 2 ( x – b ) x – a (x – a ) x–b

px2 + qx + r A Bx + C
5. + 2 ,
(x – a ) ( x2 + bx + c) x – a x + bx + c
where x2 + bx + c cannot be factorised further
In the above table, A, B and C are real numbers to be determined suitably.
318 MATHEMATICS

dx
Example 11 Find ∫ (x + 1) (x + 2)
Solution The integrand is a proper rational function. Therefore, by using the form of
partial fraction [Table 7.2 (i)], we write

1 A B
= + ... (1)
(x + 1) ( x + 2) x +1 x + 2
where, real numbers A and B are to be determined suitably. This gives
1 = A (x + 2) + B (x + 1).
Equating the coefficients of x and the constant term, we get
A+B=0
and 2A + B = 1
Solving these equations, we get A =1 and B = – 1.
Thus, the integrand is given by
1 1 –1
= +
(x + 1) ( x + 2) x +1 x + 2

dx dx dx
Therefore, ∫ (x + 1) (x + 2) = ∫ x + 1 – ∫ x + 2
= log x + 1 − log x + 2 + C

x +1
= log +C
x +2

Remark The equation (1) above is an identity, i.e. a statement true for all (permissible)
values of x. Some authors use the symbol ‘≡’ to indicate that the statement is an
identity and use the symbol ‘=’ to indicate that the statement is an equation, i.e., to
indicate that the statement is true only for certain values of x.
x2 + 1
Example 12 Find ∫ x2 − 5 x + 6
dx

x2 + 1
Solution Here the integrand is not proper rational function, so we divide
x 2 – 5x + 6
x2 + 1 by x2 – 5x + 6 and find that
INTEGRALS 319

x2 + 1 5x – 5 5x – 5
= 1+ 2 =1 +
x 2 – 5x + 6 x – 5x + 6 (x – 2) (x – 3)
5x – 5 A B
Let = +
(x – 2) ( x – 3) x –2 x–3
So that 5x – 5 = A (x – 3) + B (x – 2)
Equating the coefficients of x and constant terms on both sides, we get A + B = 5
and 3A + 2B = 5. Solving these equations, we get A = – 5 and B = 10
x2 + 1 5 10
Thus, 2 = 1− +
x – 5x + 6 x – 2 x –3
x2 + 1 1 dx
Therefore, ∫ x2 – 5 x + 6 dx = ∫ dx − 5 ∫ x – 2 dx + 10∫ x – 3
= x – 5 log |x – 2 | + 10 log |x – 3 | + C.
3x − 2
Example 13 Find ∫ dx
(x + 1)2 (x + 3)
Solution The integrand is of the type as given in Table 7.2 (4). We write

3x – 2 A B C
2 = + 2
+
(x + 1) ( x + 3) x + 1 ( x + 1) x+3
So that 3x – 2 = A (x + 1) (x + 3) + B (x + 3) + C (x + 1)2
= A (x2 + 4x + 3) + B (x + 3) + C (x2 + 2x + 1 )
Comparing coefficient of x , x and constant term on both sides, we get
2

A + C = 0, 4A + B + 2C = 3 and 3A + 3B + C = – 2. Solving these equations, we get


11 –5 –11 . Thus the integrand is given by
A= ,B = and C =
4 2 4
3x − 2 11 5 11
2 = – 2

(x + 1) ( x + 3) 4 ( x + 1) 2 ( x + 1) 4 ( x + 3)
3x − 2 11 dx 5 dx 11 dx
Therefore, ∫ (x + 1)2 (x + 3) = ∫ – ∫
4 x + 1 2 ( x + 1) 2
− ∫
4 x+3
11 5 11
= log x+1 + − log x + 3 + C
4 2 ( x + 1) 4
11 x +1 5
= log + +C
4 x + 3 2 (x +1)
320 MATHEMATICS

x2
Example 14 Find ∫ 2 dx
(x + 1) ( x2 + 4)

x2
Solution Consider and put x2 = y.
( x2 + 1) ( x 2 + 4)

x2 y
Then 2 2 =
(x + 1) ( x + 4) (y + 1) ( y + 4)

y A B
Write = +
(y + 1) ( y + 4) y +1 y + 4
So that y = A (y + 4) + B (y + 1)
Comparing coefficients of y and constant terms on both sides, we get A + B = 1
and 4A + B = 0, which give
1 4
A= − and B =
3 3

x2 1 4
Thus, 2 2 = – 2
+ 2
(x + 1) ( x + 4) 3 ( x + 1) 3 (x + 4)

x 2 dx 1 dx 4 dx
Therefore, ∫ (x 2 + 1) (x2 + 4) = – 3 ∫ x2 + 1 + 3 ∫ x 2 + 4
1 –1 4 1 –1 x
= – tan x + × tan +C
3 3 2 2
1 –1 2 –1 x
= – tan x + tan +C
3 3 2
In the above example, the substitution was made only for the partial fraction part
and not for the integration part. Now, we consider an example, where the integration
involves a combination of the substitution method and the partial fraction method.
(3 sin φ – 2 ) cos φ
Example 15 Find ∫ 5 – cos 2φ – 4 sin φ d φ
Solution Let y = sin φ
Then dy = cosφ dφ
INTEGRALS 321

( 3 sin φ – 2 ) cos φ (3y – 2) dy


Therefore, ∫ 5 – cos 2φ – 4 sinφ d φ = ∫ 5 – (1 – y2 ) – 4 y
3y – 2
= ∫ y 2 – 4 y + 4 dy
3y – 2
= ∫ ( y – 2 )2 = I (say)
3y – 2 A B
Now, we write = + [by Table 7.2 (2)]
( y – 2) 2
y − 2 (y − 2)2
Therefore, 3y – 2 = A (y – 2) + B
Comparing the coefficients of y and constant term, we get A = 3 and B – 2A = – 2,
which gives A = 3 and B = 4.
Therefore, the required integral is given by
3 4 dy dy
I = ∫[ + 2
] dy = 3 ∫ +4∫
y – 2 (y – 2) y–2 (y – 2) 2

 1 
= 3 log y − 2 + 4  – +C
 y−2
4
= 3 log sin φ − 2 + +C
2 – sin φ

4
= 3 log (2 − sin φ) + + C (since, 2 – sin φ is always positive)
2 − sin φ

x 2 + x + 1 dx
Example 16 Find ∫ (x + 2) (x 2 + 1)
Solution The integrand is a proper rational function. Decompose the rational function
into partial fraction [Table 2.2(5)]. Write

x2 + x + 1 A Bx + C
2 = + 2
(x + 1) ( x + 2) x + 2 ( x + 1)
Therefore, x2 + x + 1 = A (x2 + 1) + (Bx + C) (x + 2)
322 MATHEMATICS

Equating the coefficients of x2, x and of constant term of both sides, we get
A + B =1, 2B + C = 1 and A + 2C = 1. Solving these equations, we get
3 2 1
A = , B = and C =
5 5 5
Thus, the integrand is given by
2 1
x+
x2 + x + 1 3 3 1  2x + 1 
2 = + 52 5 = +  2 
(x + 1) ( x + 2) 5 (x + 2) x + 1 5 (x + 2) 5  x + 1 

x2 + x +1 3 dx 1 2x 1 1
Therefore, ∫ (x 2 +1) (x + 2) dx = 5 ∫ x + 2 + 5 ∫ x 2 + 1 dx + 5 ∫ x2 + 1 dx
3 1 1
= log x + 2 + log x 2 + 1 + tan–1x + C
5 5 5

EXERCISE 7.5
Integrate the rational functions in Exercises 1 to 21.
x 1 3x –1
1. 2. 2 3.
(x + 1) ( x + 2) x –9 (x – 1) (x – 2) ( x – 3)

x 2x 1 – x2
4. 5. 2 6.
(x –1) (x – 2) (x – 3) x + 3x + 2 x (1 – 2 x )

x x 3x + 5
7. 8. 9.
2
(x + 1) (x – 1) 2
(x –1) (x + 2) x – x2 − x + 1
3

2x − 3 5x x3 + x + 1
10. 2 11. 12.
(x – 1) (2x + 3) (x + 1) ( x2 − 4) x2 −1
2 3x – 1 1
13. 14. 15.
(1 − x) (1 + x2 ) (x + 2) 2 4
x −1
1
16. [Hint: multiply numerator and denominator by x n – 1 and put xn = t ]
x (x n + 1)
cos x
17. [Hint : Put sin x = t]
(1 – sin x) (2 – sin x)
INTEGRALS 323

( x2 + 1) (x 2 + 2) 2x 1
18. 19. 20.
(x 2 + 3) ( x2 + 4) (x + 1) ( x2 + 3)
2
x (x 4 – 1)
1
21. x [Hint : Put ex = t]
(e – 1)
Choose the correct answer in each of the Exercises 22 and 23.
x dx
22. ∫ (x − 1) (x − 2) equals

( x − 1) 2 ( x − 2) 2
(A) log +C (B) log +C
x−2 x −1
2
 x −1 
(C) log   +C (D) log ( x − 1) ( x − 2) + C
 x − 2
dx
23. ∫ x (x 2 + 1) equals

1 2 1 2
(A) log x − log (x +1) + C (B) log x + log (x +1) + C
2 2
1 1 2
(C) − log x + log (x 2 +1) + C (D) log x + log (x +1) + C
2 2
7.6 Integration by Parts
In this section, we describe one more method of integration, that is found quite useful in
integrating products of functions.
If u and v are any two differentiable functions of a single variable x (say). Then, by
the product rule of differentiation, we have
d dv du
( uv) = u + v
dx dx dx
Integrating both sides, we get
dv du
uv = ∫ u dx + ∫ v dx
dx dx
dv du
or ∫ u dx dx = uv – ∫v dx dx ... (1)

dv
Let u = f (x) and = g (x). Then
dx
du
= f ′(x) and v = ∫ g ( x) dx
dx
324 MATHEMATICS

Therefore, expression (1) can be rewritten as

∫ f (x ) g (x) dx = f (x) ∫ g ( x) dx – ∫ [ ∫ g (x) dx] f ′( x) dx

i.e., ∫ f (x ) g (x) dx = f (x) ∫ g (x) dx – ∫ [ f ′ ( x) ∫ g ( x) dx] dx


If we take f as the first function and g as the second function, then this formula
may be stated as follows:
“The integral of the product of two functions = (first function) × (integral
of the second function) – Integral of [(differential coefficient of the first function)
× (integral of the second function)]”
Example 17 Find ∫ x cos x dx
Solution Put f (x) = x (first function) and g (x) = cos x (second function).
Then, integration by parts gives
d
∫ x cos x dx = x ∫ cos x dx – ∫[ dx (x) ∫ cos x dx] dx
= x sin x – ∫ sin x dx = x sin x + cos x + C
Suppose, we take f (x) = cos x and g (x) = x. Then
d
∫ x cos x dx = cos x ∫ x dx – ∫[ (cos x) ∫ x dx] dx
dx
x2 x2
= ( cos x) + ∫ sin x dx
2 2
Thus, it shows that the integral ∫ x cos x dx is reduced to the comparatively more
complicated integral having more power of x. Therefore, the proper choice of the first
function and the second function is significant.
Remarks
(i) It is worth mentioning that integration by parts is not applicable to product of
functions in all cases. For instance, the method does not work for ∫ x sin x dx .
The reason is that there does not exist any function whose derivative is
x sin x.
(ii) Observe that while finding the integral of the second function, we did not add
any constant of integration. If we write the integral of the second function cos x
INTEGRALS 325

as sin x + k, where k is any constant, then

∫ x cos x dx = x (sin x + k ) − ∫ (sin x + k ) dx

= x (sin x + k ) − ∫ (sin x dx − ∫ k dx

= x (sin x + k ) − cos x – kx + C = x sin x + cos x + C


This shows that adding a constant to the integral of the second function is
superfluous so far as the final result is concerned while applying the method of
integration by parts.
(iii) Usually, if any function is a power of x or a polynomial in x, then we take it as the
first function. However, in cases where other function is inverse trigonometric
function or logarithmic function, then we take them as first function.
Example 18 Find ∫ log x dx
Solution To start with, we are unable to guess a function whose derivative is log x. We
take log x as the first function and the constant function 1 as the second function. Then,
the integral of the second function is x.
d
Hence, ∫ (logx.1) dx = log x ∫1 dx − ∫[ dx (log x) ∫1 dx] dx
1
= (log x ) ⋅ x – ∫ x dx = x log x – x + C .
x
∫ x e dx
x
Example 19 Find

Solution Take first function as x and second function as ex. The integral of the second
function is ex.

∫ x e dx = x ex − ∫ 1 ⋅ e x dx = xex – ex + C.
x
Therefore,

x sin – 1 x
Example 20 Find ∫ 1 − x2
dx

x
Solution Let first function be sin – 1x and second function be .
1 − x2
x dx
First we find the integral of the second function, i.e., ∫ 1 − x2
.

Put t =1 – x2. Then dt = – 2x dx


326 MATHEMATICS

x dx 1 dt
∫ ∫
2
Therefore, = – = – t = − 1− x
1− x 2 2 t

x sin – 1 x
Hence, ∫ 1− x 2
dx = (sin – 1 x) – 1 − x2 − ∫( 1− x
) 1
2
( – 1 − x 2 ) dx

2 −1 2 −1
= – 1− x sin x + x + C = x – 1 − x sin x + C
Alternatively, this integral can also be worked out by making substitution sin–1 x = θ and
then integrating by parts.

∫e
x
Example 21 Find sin x dx

Solution Take ex as the first function and sin x as second function. Then, integrating
by parts, we have

I = ∫ ex sin x dx = ex ( – cos x) + ∫ e xcos x dx


= – e x cos x + I1 (say) ... (1)
x
Taking e and cos x as the first and second functions, respectively, in I1 , we get
I 1 = e sin x – ∫ e sin x dx
x x

Substituting the value of I1 in (1), we get


I = – ex cos x + ex sin x – I or 2I = ex (sin x – cos x)
ex
Hence, I = ∫ e x sin x dx =
(sin x – cos x) + C
2
Alternatively, above integral can also be determined by taking sin x as the first function
and ex the second function.

∫e
x
7.6.1 Integral of the type [ f ( x) + f ′ ( x)] dx

I = ∫ e x [ f ( x) + f ′( x)] dx = ∫ e f ( x) dx + ∫ e f ′(x) dx
x x
We have

= I1 + ∫ e f ′(x) dx, where I1 = ∫ e f ( x) dx


x x
... (1)
x
Taking f(x) and e as the first function and second function, respectively, in I 1 and

∫ f ′(x) e dx + C
x
integrating it by parts, we have I1 = f (x) ex –
Substituting I1 in (1), we get
I = e f ( x) − ∫ f ′( x) e dx + ∫ e f ′( x) dx + C = ex f (x) + C
x x x
INTEGRALS 327

∫e
x
Thus, [ f ( x ) + f ′( x )] dx = e x f ( x ) + C

1 (x 2 + 1) ex
Example 22 Find (i) ∫ e x (tan – 1 x +
1 + x2
) dx (ii) ∫ (x + 1)2 dx
Solution
1
(i) We have I = ∫ e (tan x +
x –1
) dx
1 + x2
1
Consider f (x) = tan– 1x, then f ′(x) =
1 + x2
Thus, the given integrand is of the form ex [ f (x) + f ′(x)].
1
Therefore, I = ∫ ex (tan – 1 x + ) dx = ex tan– 1 x + C
1+ x 2

( x2 +1) ex 2
x x – 1 +1+1)
(ii) We have I = ∫ dx = ∫ e [ ] dx
(x +1) 2 (x +1) 2

x2 – 1 2 x –1 2
= ∫e x [ 2
+ 2
] dx = ∫ e x [ + ] dx
(x +1) ( x+1) x +1 (x +1) 2

x −1 2
Consider f (x ) = , then f ′( x) =
x +1 ( x + 1) 2
Thus, the given integrand is of the form ex [f (x) + f ′(x)].
x2 + 1 x x −1 x
Therefore, ∫ (x + 1)2 e dx = x + 1 e + C

EXERCISE 7.6
Integrate the functions in Exercises 1 to 22.
1. x sin x 2. x sin 3x 3. x2 ex 4. x log x
5. x log 2x 6. x2 log x 7. x sin– 1 x 8. x tan–1 x
x cos −1 x
9. x cos–1 x 10. (sin–1x) 2 11. 12. x sec2 x
2
1− x
13. tan –1x 14. x (log x)2 15. (x2 + 1) log x
328 MATHEMATICS

x ex  1 + sin x 
16. ex (sinx + cosx) 17. 18. ex  
(1 + x) 2  1 + cos x 

1 1 
x (x − 3) e x
19. e  – 2  20. 21. e2x sin x
x x  ( x − 1) 3

– 1  2x 
22. sin  2 
 1+ x 
Choose the correct answer in Exercises 23 and 24.

∫x e
2 x3
23. dx equals

1 x3 1 x2
(A) e +C (B) e +C
3 3
1 x3 1 x2
(C) e +C (D) e +C
2 2

∫e
x
24. sec x (1 + tan x) dx equals
(A) ex cos x + C (B) ex sec x + C
(C) ex sin x + C (D) ex tan x + C
7.6.2 Integrals of some more types
Here, we discuss some special types of standard integrals based on the technique of
integration by parts :

(i)
∫ x 2 − a 2 dx (ii) ∫ x 2 + a 2 dx (iii) ∫ a2 − x 2 dx

(i) Let I = ∫ x2 − a2 dx
Taking constant function 1 as the second function and integrating by parts, we
have
1 2x
I= x x2 − a 2 − ∫ x dx
2 x − a2
2

x2 x2 − a 2 + a 2
= x x −a −∫
2 2
dx = x x2 − a 2 − ∫ dx
2
x −a 2 x2 − a 2
INTEGRALS 329

dx
= x x2 − a 2 − ∫ x 2 − a 2 dx − a 2 ∫
x − a2
2

dx
= x x2 − a 2 − I − a 2 ∫
x − a2
2

dx
or 2I = x x2 − a 2 − a 2 ∫
x 2 − a2
x a2

2 2
or I= x 2 – a2 dx =
x – a – log x + x 2 – a 2 + C
2 2
Similarly, integrating other two integrals by parts, taking constant function 1 as the
second function, we get
1 a2
(ii) ∫ x 2 + a 2 dx =
2
x x2 + a2 +
2
log x + x 2 + a 2 + C

1 a2 x
∫ a 2 – x 2 dx = 2 2
(iii) x a – x + sin–1 + C
2 2 a
Alternatively, integrals (i), (ii) and (iii) can also be found by making trigonometric
substitution x = a sec θ in (i), x = a tanθ in (ii) and x = a sin θ in (iii) respectively.

Example 23 Find ∫ x2 + 2 x + 5 dx

Solution Note that

∫ x 2 + 2 x + 5 dx = ∫ (x + 1)2 + 4 dx
Put x + 1 = y, so that dx = dy. Then

∫ x 2 + 2 x + 5 dx = ∫ y2 + 2 2 dy

1 4
= y y2 + 4 + log y + y2 + 4 + C [using 7.6.2 (ii)]
2 2
1
= (x + 1) x2 + 2x + 5 + 2 log x + 1+ x2 + 2x + 5 + C
2
Example 24 Find ∫ 3 − 2x − x 2 dx

Solution Note that ∫ 3 − 2 x − x 2 dx = ∫ 4 − ( x + 1) 2 dx


330 MATHEMATICS

Put x + 1 = y so that dx = dy.


Thus ∫ 3 − 2x − x 2 dx = ∫ 4 − y 2 dy
1 4 y
= y 4 − y2 + sin –1 + C [using 7.6.2 (iii)]
2 2 2
1  x + 1
= ( x + 1) 3 − 2 x − x2 + 2 sin –1   +C
2  2 

EXERCISE 7.7
Integrate the functions in Exercises 1 to 9.
1. 4 − x2 2. 1 − 4x 2 3. x2 + 4 x + 6
4. x2 + 4 x + 1 5. 1 − 4x − x2 6. x2 + 4 x − 5
x2
7. 1 + 3x − x 2 8. x2 + 3 x 9. 1+
9
Choose the correct answer in Exercises 10 to 11.
10. ∫ 1 + x2 dx is equal to
x 1
(A)
2 2
(
1+ x 2 + log x + 1 + x2 ) +C
3 3
2 2
(B) (1+ x 2 )2 + C (C) x (1+ x 2 )2 + C
3 3
x2 1
(D) 1 + x 2 + x2 log x + 1 + x 2 + C
2 2
11. ∫ x 2 − 8 x + 7 dx is equal to
1
(A) ( x − 4) x 2 − 8x + 7 + 9log x − 4 + x 2 − 8x + 7 + C
2
1
(B) ( x + 4) x2 − 8x + 7 + 9 log x + 4 + x2 − 8x + 7 + C
2
1
(C) ( x − 4) x 2 − 8x + 7 − 3 2 log x − 4 + x2 − 8x + 7 + C
2
1 9
(D) ( x − 4) x 2 − 8 x + 7 − log x − 4 + x 2 − 8 x + 7 + C
2 2
INTEGRALS 331

7.7 Definite Integral


In the previous sections, we have studied about the indefinite integrals and discussed
few methods of finding them including integrals of some special functions. In this
section, we shall study what is called definite integral of a function. The definite integral
b
has a unique value. A definite integral is denoted by ∫ a f (x) dx , where a is called the
lower limit of the integral and b is called the upper limit of the integral. The definite
integral is introduced either as the limit of a sum or if it has an anti derivative F in the
interval [a, b], then its value is the difference between the values of F at the end
points, i.e., F(b) – F(a). Here, we shall consider these two cases separately as discussed
below:
7.7.1 Definite integral as the limit of a sum
Let f be a continuous function defined on close interval [a, b]. Assume that all the
values taken by the function are non negative, so the graph of the function is a curve
above the x-axis.
b
The definite integral ∫a f (x) dx is the area bounded by the curve y = f (x), the
ordinates x = a, x = b and the x-axis. To evaluate this area, consider the region PRSQP
between this curve, x-axis and the ordinates x = a and x = b (Fig 7.2).
Y
S

M
C D
f (x) L
Q y=

P A B R
X' X
O a = x0 x1 x2 xr-1 xr xn=b
Y'
Fig 7.2

Divide the interval [a, b] into n equal subintervals denoted by [x0, x1], [x1, x2] ,...,
[xr – 1, xr ], ..., [xn – 1, xn ], where x0 = a, x1 = a + h, x2 = a + 2h, ... , xr = a + rh and
b −a
xn = b = a + nh or n = . We note that as n → ∞, h → 0.
h
332 MATHEMATICS

The region PRSQP under consideration is the sum of n subregions, where each
subregion is defined on subintervals [xr – 1, xr ], r = 1, 2, 3, …, n.
From Fig 7.2, we have
area of the rectangle (ABLC) < area of the region (ABDCA) < area of the rectangle
(ABDM) ... (1)
Evidently as xr – xr–1 → 0, i.e., h → 0 all the three areas shown in (1) become
nearly equal to each other. Now we form the following sums.
n −1
sn = h [f(x0) + … + f (xn - 1)] = h ∑ f ( xr ) ... (2)
r= 0
n
and Sn = h [ f ( x1 ) + f ( x2 ) +…+ f ( xn )] = h ∑ f ( xr ) ... (3)
r =1
Here, sn and Sn denote the sum of areas of all lower rectangles and upper rectangles
raised over subintervals [xr–1, xr ] for r = 1, 2, 3, …, n, respectively.
In view of the inequality (1) for an arbitrary subinterval [xr–1, xr], we have
sn < area of the region PRSQP < Sn ... (4)
As n → ∞ strips become narrower and narrower, it is assumed that the limiting
values of (2) and (3) are the same in both cases and the common limiting value is the
required area under the curve.
Symbolically, we write
b
lim Sn = lim sn = area of the region PRSQP =
n→∞ n →∞ ∫ a f (x)dx ... (5)
It follows that this area is also the limiting value of any area which is between that
of the rectangles below the curve and that of the rectangles above the curve. For
the sake of convenience, we shall take rectangles with height equal to that of the
curve at the left hand edge of each subinterval. Thus, we rewrite (5) as
b
∫a f ( x) dx = lim h [ f ( a ) + f ( a + h ) + ... + f (a + ( n – 1) h ]
h →0

b 1
or ∫a f ( x) dx = (b – a ) lim
n →∞ n
[ f ( a) + f ( a + h ) + ... + f ( a + ( n – 1) h ] ... (6)

b–a
where h= → 0 as n → ∞
n
The above expression (6) is known as the definition of definite integral as the limit
of sum.
Remark The value of the definite integral of a function over any particular interval
depends on the function and the interval, but not on the variable of integration that we
INTEGRALS 333

choose to represent the independent variable. If the independent variable is denoted by


b b
t or u instead of x, we simply write the integral as ∫ a f (t ) dt or ∫a f ( u ) du instead of

b
∫a f (x) dx . Hence, the variable of integration is called a dummy variable.

2
∫0 (x
2
Example 25 Find + 1) dx as the limit of a sum.

Solution By definition
b 1
∫a f ( x) dx = (b – a ) lim
n →∞ n
[ f ( a) + f ( a + h ) + ... + f ( a + ( n – 1) h ],

b–a
where, h=
n
2–0 2
In this example, a = 0, b = 2, f (x) = x2 + 1, h = =
n n
Therefore,
2 1 2 4 2 ( n – 1)
∫ 0 (x
2
+ 1) dx = 2 lim [ f (0) + f ( ) + f ( ) + ... + f ( )]
n →∞ n n n n

1 22 42  (2n – 2) 2 
= 2 lim [1 + ( 2 + 1) + ( 2 + 1) + ... +  + 1 ]
n →∞ n n n  n2 

1 1
= 2 lim [(1 + 1 + ... + 1) + 2 (22 + 42 + ... + (2n – 2) 2 ]
n→ ∞ n 14 4 244 3 n
n -terms

1 22
= 2 lim [ n + 2 (12 + 22 + ... + (n – 1) 2 ]
n →∞ n n

1 4 ( n − 1) n (2n – 1)
= 2 lim [n + 2 ]
n →∞ n n 6
1 2 (n − 1) (2n – 1)
= 2 lim [n + ]
n →∞ n 3 n
2 1 1 4 14
= 2 lim [1 + (1 − ) (2 – )] = 2 [1 + ] =
n →∞ 3 n n 3 3
334 MATHEMATICS

2 x
Example 26 Evaluate ∫0 e dx as the limit of a sum.
Solution By definition

2 1  0 2 4 2n – 2

∫0
x
e dx = (2 – 0) lim e + e + e + ... + e n 
n n
n→ ∞ n
 

2
Using the sum to n terms of a G.P., where a = 1, r = en , we have

2n
 
2 1 e n –1 1  e2– 1 
∫0
x
e dx = 2 lim [ 2 ] = 2 lim
n →∞ n n →∞ n  2 
e −1
n
 e n– 1 
2 ( e2 – 1) ( eh − 1)
= = e2 – 1 [using lim = 1]
 2  h →0 h
e n – 1 
lim 
2 
⋅2
n →∞
 
 n 

EXERCISE 7.8
Evaluate the following definite integrals as limit of sums.
b 5 3 2
1. ∫ a x dx 2. ∫ 0 ( x + 1) dx 3. ∫2 x dx

4 1 4
∫ 1 (x ∫ −1 e ∫ 0 (x + e
2 x 2x
4. − x) dx 5. dx 6. ) dx

7.8 Fundamental Theorem of Calculus


7.8.1 Area function
b
We have defined ∫a f ( x) dx as the area of

the region bounded by the curve y = f(x),


the ordinates x = a and x = b and x-axis. Let x
x
be a given point in [a, b]. Then
∫a f (x) dx
represents the area of the light shaded region Fig 7.3
INTEGRALS 335

in Fig 7.3 [Here it is assumed that f (x) > 0 for x ∈ [a, b], the assertion made below is
equally true for other functions as well]. The area of this shaded region depends upon
the value of x.
In other words, the area of this shaded region is a function of x. We denote this
function of x by A(x). We call the function A(x) as Area function and is given by
x
A (x) = ∫a f ( x ) dx ... (1)
Based on this definition, the two basic fundamental theorems have been given.
However, we only state them as their proofs are beyond the scope of this text book.
7.8.2 First fundamental theorem of integral calculus
Theorem 1 Let f be a continuous function on the closed interval [a, b] and let A (x) be
the area function. Then A′′ (x) = f (x), for all x ∈ [a, b].
7.8.3 Second fundamental theorem of integral calculus
We state below an important theorem which enables us to evaluate definite integrals
by making use of anti derivative.
Theorem 2 Let f be continuous function defined on the closed interval [a, b] and F be
b
an anti derivative of f. Then ∫a f ( x) dx = [F(x )] ba = F (b) – F(a).

Remarks
b
(i) In words, the Theorem 2 tells us that ∫a f ( x) dx = (value of the anti derivative F
of f at the upper limit b – value of the same anti derivative at the lower limit a).
(ii) This theorem is very useful, because it gives us a method of calculating the
definite integral more easily, without calculating the limit of a sum.
(iii) The crucial operation in evaluating a definite integral is that of finding a function
whose derivative is equal to the integrand. This strengthens the relationship
between differentiation and integration.
b
(iv) In ∫a f ( x) dx , the function f needs to be well defined and continuous in [a, b].
1
3
For instance, the consideration of definite integral ∫ −2 x( x2 – 1) 2 dx is erroneous
1
2
since the function f expressed by f(x) = x (x –1) 2 is not defined in a portion
– 1 < x < 1 of the closed interval [– 2, 3].
336 MATHEMATICS

b
Steps for calculating ∫a f ( x) dx .

(i) Find the indefinite integral ∫ f ( x) dx . Let this be F(x). There is no need to keep
integration constant C because if we consider F(x) + C instead of F(x), we get
b
∫a f ( x) dx = [F ( x) + C] ba = [F(b ) + C] – [F( a) + C] = F( b ) – F( a) .
Thus, the arbitrary constant disappears in evaluating the value of the definite
integral.
b
∫a
b
(ii) Evaluate F(b) – F(a) = [F ( x)] a , which is the value of f ( x) dx .
We now consider some examples
Example 27 Evaluate the following integrals:
3 9 x
(i) ∫2 x2 dx (ii) ∫4 3
dx
(30 – x 2 )2
π
2 x dx
(iii) ∫1 ( x + 1) ( x + 2)
(iv) ∫ 0
4 sin 3 2 t cos 2 t dt

Solution
3 x3
(i) Let I = ∫ 2 x dx . Since ∫ x dx =
2 2
= F ( x) ,
3
Therefore, by the second fundamental theorem, we get
27 8 19
I = F (3) – F (2) = – =
3 3 3
9 x
(ii) Let I = ∫ 4 3
dx . We first find the anti derivative of the integrand.
(30 – x 2 )2
3
3 2
Put 30 – x 2 = t. Then – x dx = dt or x dx = – dt
2 3

 
x 2 dt 2 1 2  1  = F ( x)
Thus, ∫ 3
dx = – ∫ 2 =
3 t 3 t  = 3 
  3 

(30 – x 2 ) 2  (30 – x 2 ) 
INTEGRALS 337

Therefore, by the second fundamental theorem of calculus, we have


9
 
2 1 
I = F(9) – F(4) =  3 
3
 (30 – x 2 ) 
4

2  1 1  2  1 1  19
=  −  =  − =
3  (30 – 27) 30 – 8  3  3 22  99

2 x dx
(iii) Let I = ∫
1 ( x + 1) ( x + 2)

x –1 2
Using partial fraction, we get = +
(x + 1) (x + 2) x + 1 x + 2

x dx
So ∫ (x + 1) ( x + 2) = – log x + 1 + 2 log x + 2 = F( x)

Therefore, by the second fundamental theorem of calculus, we have


I = F(2) – F(1) = [– log 3 + 2 log 4] – [– log 2 + 2 log 3]
 32 
= – 3 log 3 + log 2 + 2 log 4 = log  
 27 
π
(iv) Let I = ∫ 2t cos2 t dt . Consider ∫ sin 2t cos2 t dt
3
4
sin3
0

1
Put sin 2t = u so that 2 cos 2t dt = du or cos 2t dt = du
2

1 3
∫ sin 2∫
3
So 2t cos2 t dt = u du

1 4 1 4
= [ u ] = sin 2t = F ( t ) say
8 8
Therefore, by the second fundamental theorem of integral calculus

π 1 π 1
I = F ( ) – F (0) = [sin 4 – sin 4 0] =
4 8 2 8
338 MATHEMATICS

EXERCISE 7.9
Evaluate the definite integrals in Exercises 1 to 20.
1 31 2
∫ −1( x + 1) dx ∫ 2 x dx ∫ 1 (4 x
3
1. 2. 3. – 5 x2 + 6 x + 9) dx

π π π
5 x
∫ 0 sin 2x dx ∫4
4
∫ 0 cos 2x dx ∫
2 e dx 4
4. 5. 6. 7. tan x dx
0

π
dx dx 3 dx
∫ 2 x2 − 1
1 1
8. ∫ 4
π
cosec x dx 9. ∫0 10. ∫ 01 + x2 11.
6 1– x2
π 3 x dx 1 2x + 3 1
12. ∫ 0
2
cos2 x dx 13. ∫ 2 x2 + 1 14. ∫ 0 5 x 2 + 1 dx 15. ∫0 x e
x2
dx

π
2 5x 2 π x x
∫1 ∫ ∫ 0 (sin
2
16. 17. 4
(2 sec2 x + x3 + 2) dx 18. – cos2 ) dx
x 2 + 4x + 3 0 2 2

2 6x + 3 πx 1
∫ 0 x 2 + 4 dx ∫0(x e
x
19. 20. ) dx + sin
4
Choose the correct answer in Exercises 21 and 22.
3 dx
21. ∫1 1 + x2
equals

π 2π π π
(A) (B) (C) (D)
3 3 6 12
2
dx
22. ∫ 0
3
4 + 9 x2
equals

π π π π
(A) (B) (C) (D)
6 12 24 4
7.9 Evaluation of Definite Integrals by Substitution
In the previous sections, we have discussed several methods for finding the indefinite
integral. One of the important methods for finding the indefinite integral is the method
of substitution.
INTEGRALS 339

b
To evaluate ∫a f ( x) dx , by substitution, the steps could be as follows:
1. Consider the integral without limits and substitute, y = f (x) or x = g(y) to reduce
the given integral to a known form.
2. Integrate the new integrand with respect to the new variable without mentioning
the constant of integration.
3. Resubstitute for the new variable and write the answer in terms of the original
variable.
4. Find the values of answers obtained in (3) at the given limits of integral and find
the difference of the values at the upper and lower limits.

A Note In order to quicken this method, we can proceed as follows: After


performing steps 1, and 2, there is no need of step 3. Here, the integral will be kept
in the new variable itself, and the limits of the integral will accordingly be changed,
so that we can perform the last step.

Let us illustrate this by examples.


1
∫ −15 x
4
Example 28 Evaluate x5 + 1 dx .

Solution Put t = x5 + 1, then dt = 5x4 dx.

3 3
2 2 2
∫ 5 x x + 1 dx = ∫
4 5
Therefore, t dt = t = ( x5 + 1) 2
3 3
1
2 
3
1
∫ −1 5 x
4
Hence, x + 1 dx = ( x5 + 1) 2 
5
3   – 1

2 5 
3 3
= ( 5
)
(1 + 1) 2 – (– 1) + 1 2 
3  

2 2 
3 3
2 4 2
= 2 − 0 2  = (2 2) =
3   3 3
Alternatively, first we transform the integral and then evaluate the transformed integral
with new limits.
340 MATHEMATICS

Let t = x5 + 1. Then dt = 5 x4 dx.


Note that, when x = – 1, t = 0 and when x = 1, t = 2
Thus, as x varies from – 1 to 1, t varies from 0 to 2
1 2
∫ −1 5 x ∫0
4
Therefore x 5 + 1 dx = t dt
2
2  2 2 2 
3 3 3
2 4 2
= t  = 2 – 0 2
 = (2 2) =
3   3   3 3
0

tan – 1 x
1
Example 29 Evaluate ∫ 0 1 + x2 dx
1
Solution Let t = tan – 1x, then dt = dx . The new limits are, when x = 0, t = 0 and
1 + x2
π π
when x = 1, t = . Thus, as x varies from 0 to 1, t varies from 0 to .
4 4
π
π
1 tan x–1 t2  4 1  π2  π2
Therefore ∫0 1+ x 2
dx = ∫ 0
4
t dt   =  – 0  =
 2 0 2  16  32

EXERCISE 7.10
Evaluate the integrals in Exercises 1 to 8 using substitution.
π
1 x 1 –1 2x 
1. ∫0 2
x +1
dx 2. ∫ 0
2
sin φ cos 5 φ d φ 3. ∫ 0 sin 
 1+ x
2  dx

π
2 sin x
4. ∫0 x x + 2 (Put x + 2 = t2) 5. ∫ 2
0 1 + cos 2 x
dx

2 dx 1 dx 21 1 
∫ 0 x + 4 – x2 ∫ −1 x 2 + 2 x + 5 ∫ 1  x – 2 x 2  e
2x
6. 7. 8. dx

Choose the correct answer in Exercises 9 and 10.


1
1 ( x − x3 ) 3
9. The value of the integral ∫1 x4
dx is
3
(A) 6 (B) 0 (C) 3 (D) 4
x
10. If f (x) = ∫ 0 t sin t dt , then f′(x) is
(A) cosx + x sin x (B) x sinx
(C) x cosx (D) sinx + x cosx
INTEGRALS 341

7.10 Some Properties of Definite Integrals


We list below some important properties of definite integrals. These will be useful in
evaluating the definite integrals more easily.
b b
P0 : ∫a f ( x) dx = ∫ f ( t ) dt
a

b a a
P1 : ∫a f ( x) dx = – ∫
b
f ( x) dx . In particular, ∫a f ( x ) dx = 0

b c b
P2 : ∫a f ( x) dx = ∫ f ( x) dx + ∫ f ( x) dx
a c

b b
P3 : ∫a f (x) dx = ∫ f (a + b − x) dx
a

a a
P4 : ∫0 f ( x ) dx = ∫ f ( a − x) dx
0
(Note that P4 is a particular case of P3 )
2a a a
P5 : ∫0 f ( x ) dx = ∫ f ( x) dx + ∫ f (2a − x) dx
0 0

2a a
P6 : ∫0 f ( x ) dx = 2 ∫ f ( x) dx, if f (2 a − x) = f ( x) and
0
0 if f (2a – x) = – f (x)
a a
P7 : (i) ∫ − a f (x) dx = 2 ∫ 0 f ( x) dx , if f is an even function, i.e., if f (– x) = f (x).
a
(ii) ∫ − a f (x) dx = 0 , if f is an odd function, i.e., if f (– x) = – f (x).
We give the proofs of these properties one by one.
Proof of P0 It follows directly by making the substitution x = t.
Proof of P1 Let F be anti derivative of f. Then, by the second fundamental theorem of
b a
calculus, we have ∫a f ( x) dx = F (b ) – F ( a ) = – [F ( a) − F ( b )] = − ∫ f ( x) dx
b

a
Here, we observe that, if a = b, then ∫a f ( x ) dx = 0 .
Proof of P2 Let F be anti derivative of f. Then
b
∫a f ( x) dx = F(b) – F(a) ... (1)
c
∫a f ( x) dx = F(c) – F(a) ... (2)
b
and ∫c f ( x) dx = F(b) – F(c) ... (3)
342 MATHEMATICS

c b b
Adding (2) and (3), we get ∫a f ( x) dx + ∫ f (x) dx = F(b) – F(a) = ∫ f ( x) dx
c a
This proves the property P2.
Proof of P3 Let t = a + b – x. Then dt = – dx. When x = a, t = b and when x = b, t = a.
Therefore
b a
∫a f ( x) dx = −∫ f ( a + b – t) dt
b
b
= ∫a f (a + b – t ) dt (by P1 )
b
= ∫a f (a + b – x) dx by P0
Proof of P4 Put t = a – x. Then dt = – dx. When x = 0, t = a and when x = a, t = 0. Now
proceed as in P3.
2a a 2a
Proof of P5 Using P2 , we have ∫0 f (x) dx = ∫ f (x) dx + ∫
0 a
f ( x) dx .
Let t = 2a – x in the second integral on the right hand side. Then
dt = – dx. When x = a, t = a and when x = 2a, t = 0. Also x = 2a – t.
Therefore, the second integral becomes
2a 0 a a
∫a f ( x ) dx = – ∫ f (2a – t) dt =
a ∫0 f (2 a – t ) dt = ∫0 f (2a – x) dx
2a a a
Hence ∫0 f (x ) dx = ∫0 f (x ) dx + ∫ f (2a − x) dx
0
2a a a
Proof of P6 Using P5, we have ∫0 f ( x ) dx = ∫
0
f ( x) dx +∫ f (2 a − x) dx
0
... (1)

Now, if f (2a – x) = f (x), then (1) becomes


2a a a a
∫0 f (x ) dx = ∫0 f ( x ) dx + ∫ f ( x ) dx = 2 ∫ f ( x) dx,
0 0

and if f(2a – x) = – f (x), then (1) becomes


2a a a
∫0 f ( x ) dx = ∫0 f ( x ) dx − ∫ f ( x) dx = 0
0
Proof of P7 Using P2 , we have
a 0 a
∫ − a f (x) dx = ∫ − a f (x) dx + ∫ 0 f ( x) dx . Then

Let t = – x in the first integral on the right hand side.


dt = – dx. When x = – a, t = a and when
x = 0, t = 0. Also x = – t.
INTEGRALS 343

a 0 a
Therefore ∫ − a f (x) dx = – ∫ a f (–t) dt +∫ 0 f ( x) dx
a a
= ∫0 f (– x) dx + ∫ f ( x) dx
0
(by P0) ... (1)
(i) Now, if f is an even function, then f (–x) = f (x) and so (1) becomes
a a a a
∫ − a f (x) dx = ∫ 0 f ( x) dx + ∫
0
f ( x) dx = 2 ∫ f ( x) dx
0

(ii) If f is an odd function, then f (–x) = – f(x) and so (1) becomes


a a a
∫ − a f (x) dx = −∫ 0 f ( x) dx + ∫ 0 f (x) dx = 0

2
Example 30 Evaluate ∫ −1 x3 – x dx

Solution We note that x3 – x ≥ 0 on [– 1, 0] and x3 – x ≤ 0 on [0, 1] and that


x3 – x ≥ 0 on [1, 2]. So by P2 we write
2 0 1 2
∫ −1 x3 – x dx = ∫ −1 ( x – x) dx + ∫ – ( x3 – x ) dx + ∫ ( x3 – x ) dx
3
0 1

0 1 2
∫ −1 ( x – x) dx + ∫ ( x – x3 ) dx + ∫ ( x3 – x) dx
3
= 0 1

0 1 2
 x 4 x2   x 2 x4   x4 x2 
=  –  + –  +  – 
4 2  –1  2 4 0  4 2 1

 1 1  1 1  1 1 
= –  –  +  –  + (4 – 2) –  – 
4 2 2 4 4 2
1 1 1 1 1 1 3 3 11
= – + + − +2− + = − + 2=
4 2 2 4 4 2 2 4 4
π


4
Example 31 Evaluate –π
sin 2 x dx
4
Solution We observe that sin2 x is an even function. Therefore, by P7 (i), we get
π π

∫ 4
–π
sin 2 x dx = 2 ∫ 4 sin 2 x dx
0
4
344 MATHEMATICS

π π
(1 − cos 2 x )
= 2∫ 4 dx = ∫ 4
(1 − cos 2 x) dx
0 2 0

π
 1 4 π 1 π π 1
=  x – sin 2 x =  – sin  – 0 = –
 2 0 4 2 2 4 2
π x sin x
Example 32 Evaluate ∫ 0 1 + cos 2 x dx
π x sin x
Solution Let I = ∫ 0 1 + cos 2 x dx . Then, by P , we have 4

π ( π − x ) sin ( π − x ) dx
I= ∫0 1 + cos 2 ( π − x)
π ( π − x ) sin x dx π sin x dx
= ∫0 2
1 + cos x
= π∫0
1 + cos 2 x
−I

π sin x dx
or 2 I = π ∫0
1 + cos 2 x
π π sin x dx
2 ∫ 0 1 + cos 2 x
or I=

Put cos x = t so that – sin x dx = dt. When x = 0, t = 1 and when x = π, t = – 1.


Therefore, (by P1) we get
–π −1 dt π 1 dt
I=
2 ∫1 1+ t 2 =
2 ∫ −1 1 + t 2
1 dt 1
= π∫0 2 (by P7 ,
since is even function)
1+ t 1+ t 2
2
–1 1 π  π
= π  tan t  0 = π  tan 1 – tan 0  = π  – 0  =
–1 −1

4  4
1
∫−1 sin
5
Example 33 Evaluate x cos 4 x dx

1
∫ −1sin
5
Solution Let I = x cos 4 x dx . Let f(x) = sin5 x cos4 x. Then
f (– x) = sin5 (– x) cos4 (– x) = – sin5 x cos4 x = – f (x), i.e., f is an odd function.
Therefore, by P7 (ii), I = 0
INTEGRALS 345

π
sin 4 x
Example 34 Evaluate ∫ 2
0 sin 4 x + cos4 x
dx

π
sin 4 x
Solution Let I = ∫ 2
0 sin 4 x + cos4 x
dx ... (1)

Then, by P4
π
π sin 4 ( − x) π
2 cos 4 x
I= ∫0 2
π π
dx =
∫ 2
0 cos 4 x + sin 4 x
dx ... (2)
sin 4 ( − x) + cos 4 ( − x)
2 2
Adding (1) and (2), we get
π π π
sin 4 x + cos 4 x π
2I = ∫ 0
2
4
sin x + cos x4
dx = ∫ 2 dx = [ x ] 2 =
0 0 2
π
Hence I=
4
π
dx

3
Example 35 Evaluate π
6
1 + tan x
π π
dx cos x dx
∫ =∫
3 3
Solution Let I = π π
... (1)
6
1+ tan x 6
cos x + sin x

π π 
π cos  + − x  dx
3 6 

3
Then, by P3 I= π
π π  π π 
6 cos  + − x  + sin  + − x 
3 6  3 6 
π
sin x

3
= π
dx ... (2)
6
sin x + cos x
Adding (1) and (2), we get
π π
π π π
2I = ∫ π
3
dx = [ x] 3
= − = . Hence I = π
π 3 6 6
6 6
12
346 MATHEMATICS

π
Example 36 Evaluate ∫ 0
2
log sin x dx


2
Solution Let I = log sin x dx
0

Then, by P4
π π
π 
I= ∫ 0
2
log sin  − x  dx = ∫ 2 log cos x dx
2  0

Adding the two values of I, we get


π
2I = ∫ ( log sin x + log cos x ) dx
0
2

π
= ∫ ( log sin x cos x + log 2 − log 2 ) dx (by adding and subtracting log 2)
0
2

π π

∫ sin 2x dx − ∫ 2 log 2 dx
2
= log (Why?)
0 0

π
Put 2x = t in the first integral. Then 2 dx = dt, when x = 0, t = 0 and when x = ,
2
t = π.
1 π π
Therefore 2I =
2 ∫ 0 log sin t dt − 2 log 2
π
2 π
=
2 ∫ 0
2
log sin t dt −
2
log 2 [by P6 as sin (π – t) = sin t)

π
π
= ∫
0
2
log sin x dx −
2
log 2 (by changing variable t to x)

π
= I − log 2
2
π
–π
Hence ∫ 0
2
log sin x dx =
2
log 2 .
INTEGRALS 347

EXERCISE 7.11
By using the properties of definite integrals, evaluate the integrals in Exercises 1 to 19.
3
π π π
sin x sin 2 x dx
1.
∫ 0
2
cos2 x dx 2. ∫ 0
2
sin x + cos x
dx 3. ∫ 0
2
3 3
sin 2 x + cos 2 x

π
cos 5 x dx 5 8
4. ∫ 0
2
sin 5 x + cos 5 x
5. ∫ − 5 | x + 2 | dx 6. ∫2 x − 5 dx

π
1 2
∫ 0 x (1 − x ) ∫ ∫0 x
n
7. dx 8. 4
log (1 + tan x) dx 9. 2 − x dx
0

π π

∫ ∫
2
10. 2
(2 log sin x − log sin 2 x) dx 11. –π
sin 2 x dx
0
2
π
π x dx 2π
12. ∫0 1 + sin x
13. ∫
2
–π
sin 7 x dx 14. ∫0 cos 5 x dx
2
π
sin x − cos x a x
∫0
π
15. ∫ 0
2
1 + sin x cos x
dx 16. ∫ 0 log (1 + cos x ) dx 17.
x + a−x
dx
4
18. ∫0 x − 1 dx
a a
19. Show that ∫0 f (x )g( x) dx = 2 ∫ f (x) dx , if f and g are defined as f(x) = f(a – x)
0

and g(x) + g(a – x) = 4


Choose the correct answer in Exercises 20 and 21.
π

20. The value of ∫ 2


−π
( x3 + x cos x + tan 5 x + 1) dx is
2

(A) 0 (B) 2 (C) π (D) 1


π
 4 + 3 sin x 
21. The value of ∫ 0
2
log   dx is
 4 + 3 cos x 
3
(A) 2 (B) (C) 0 (D) –2
4
348 MATHEMATICS

Miscellaneous Examples

Example 37 Find ∫ cos 6 x 1 + sin 6 x dx

Solution Put t = 1 + sin 6x, so that dt = 6 cos 6x dx


1
1 2
Therefore ∫ cos 6 x 1 + sin 6 x dx = 6∫
t dt
3 3
1 2 2 1
= × ( t ) + C = (1 + sin 6 x) 2 + C
6 3 9
1
4 4
Example 38 Find ∫ ( x −5 x) dx
x
1
1 1 4
4 (1 − )
(x − x) 4 x3 dx
Solution We have
∫ x5
dx = ∫
x4
1 3
Put 1 − 3
= 1 – x – 3 = t, so that 4 dx = dt
x x
1 5
1 5
( x 4 − x) 4 1 4 1 4 4  1 4
Therefore ∫ dx = ∫ t dt = × t 4 + C = 1 − 3  + C
x 5
3 3 5 15  x 

x 4 dx
Example 39 Find ∫ (x − 1) (x 2 + 1)
Solution We have

x4 1
2 = ( x + 1) + 3 2
( x − 1) ( x + 1) x − x + x −1

1
= ( x + 1) + ... (1)
( x − 1) ( x 2 + 1)

1 A Bx + C
Now express 2 = + 2 ... (2)
( x − 1) ( x + 1) ( x − 1) ( x + 1)
INTEGRALS 349

So 1 = A (x2 + 1) + (Bx + C) (x – 1)
= (A + B) x2 + (C – B) x + A – C
Equating coefficients on both sides, we get A + B = 0, C – B = 0 and A – C = 1,

which give A = 1 , B = C = – 1 . Substituting values of A, B and C in (2), we get


2 2
1 1 1 x 1
2 = − 2
− 2 ... (3)
(x − 1) (x + 1) 2( x − 1) 2 ( x + 1) 2( x + 1)
Again, substituting (3) in (1), we have
x4 1 1 x 1
2 = ( x + 1) + − 2
− 2
( x − 1) ( x + x + 1) 2( x − 1) 2 ( x + 1) 2( x + 1)
Therefore
x4 x2 1 1 1
∫ (x − 1) (x 2 + x + 1) dx =
2
+ x + log x − 1 – log ( x2 + 1) – tan – 1 x + C
2 4 2

 1 
Example 40 Find ∫  log (log x) + (log x)2  dx
 1 
Solution Let I = ∫ log (log x ) +  dx
 (log x) 2 

1
= ∫ log (log x) dx + ∫ dx
(log x ) 2
In the first integral, let us take 1 as the second function. Then integrating it by
parts, we get
1 dx
I = x log (log x ) − ∫ x dx + ∫
x log x (log x) 2
dx dx
= x log (log x ) − ∫ +∫ ... (1)
log x (log x)2
dx
Again, consider ∫ log x , take 1 as the second function and integrate it by parts,
dx  x  1  1  
we have ∫ log x =  log x – ∫ x  – (log x)2  x   dx  ... (2)
 
350 MATHEMATICS

Putting (2) in (1), we get


x dx dx x
I = x log (log x) − −∫ 2
+∫ 2 =
x log (log x ) − +C
log x (log x) (log x) log x

Example 41 Find ∫  cot x + tan x  dx

Solution We have

I= ∫ 
cot x + tan x  dx = ∫ tan x (1 + cot x) dx
Put tan x = t2 , so that sec2 x dx = 2t dt
2t dt
or dx =
1+ t 4

 1  2t
Then I = ∫ t 1 + 2  4
dt
 t  (1 + t )

 1  1
(t 2 + 1) 1 + 2  dt  1 + 2  dt
dt = 2 ∫ 
t 
=2∫ 
t 
= 2∫
t4 +1  2 1  1
2
 t +   t −  +2
 t2   t

1  1
Put t − = y, so that 1 + 2  dt = dy. Then
t  t 

 1
t− 
dy y –1  t
I = 2∫ 2
= 2 tan – 1 + C = 2 tan +C
y2 + ( 2) 2 2

 t 2 − 1 – 1  tan x − 1 
= 2 tan – 1   + C = 2 tan  +C
 2t   2 tan x 

sin 2 x cos 2 x dx
Example 42 Find ∫ 9 – cos4 (2 x)

sin 2 x cos 2 x
Solution Let I = ∫ dx
9 – cos 4 2 x
INTEGRALS 351

Put cos2 (2x) = t so that 4 sin 2x cos 2x dx = – dt

1 dt 1 t  1 1 
Therefore I=–
4 ∫ 9 – t2
=–
4
sin –1   + C = − sin − 1  cos 2 2 x  + C
 3 4 3 
3


2
Example 43 Evaluate x sin (π x) dx
−1

 x sin π x for −1 ≤ x ≤ 1

Solution Here f (x) = | x sin πx | =  3
 − x sin π x for 1 ≤ x ≤
 2

3 3
1
Therefore ∫ 2
−1
| x sin π x | dx = ∫ −1 x sin π x dx +∫ 2 − x sin π x dx
1

3
1
= ∫ −1 x sin π x dx − ∫ 2 x sin π x dx
1

Integrating both integrals on righthand side, we get


3
3 1
 – x cos π x sin π x   − x cos π x sin π x  2
∫ 2
−1
| x sin π x | dx = 
 π
+ 2 
π − 1 
− 
π
+
π 2 1

2  1 1 3 1
= − − − = +
π  π2 π  π π2
π x dx
Example 44 Evaluate ∫ 0 a 2 cos2 x + b 2 sin 2 x
π x dx π (π − x) dx
Solution Let I = ∫0 2 2 2 2
a cos x + b sin x
= ∫ 0 a cos (π − x) + b2 sin 2 (π − x)
2 2 (using P4)

π dx π x dx
= π∫ 2 2 22
−∫ 2
0 a cos x + b sin x 0 a cos x + b 2 sin 2 x
2

π dx
= π ∫0 −I
a cos x + b 2 sin 2 x
2 2

π dx
Thus 2I = π ∫ 0
a cos x + b 2 sin 2 x
2 2
352 MATHEMATICS

π
π π dx π dx
or I= ∫ 2 2 2 2
= ⋅2 ∫ 2 2 2
2 0 a cos x + b sin x 2 0 a cos x + b 2 sin 2 x
(using P6)

π π
 4 dx dx 
π∫ ∫ π a2 cos 2 x + b2 sin2 x 
2
+
=  0 a cos x + b 2 sin 2 x
2 2
 4

π π
 4 sec2 xdx 2
2 cosec xdx

= π  ∫ 2 2 2
+ ∫ 2 2 2
 0 a + b tan x π a cot x + b 
4

 1 dt 0 du 
= π  ∫ 2 2 2 − ∫ 2 2 2  ( put tan x = t and cot x = u)
 0 a +b t 1 a u +b 

1 0
π  –1 bt  π  –1 au  π  –1 b –1 a  π2
 tan  – tan  tan + tan
ab  b  2ab
= = =
ab  a  0 ab  b 1 a

Miscellaneous Exercise on Chapter 7


Integrate the functions in Exercises 1 to 24.
1 1 1 a
1. 2. 3. [Hint:Put x = ]
x − x3 x+a + x+b x ax − x 2 t

1 1 1 1
4. 3 5. [Hint: = , put x = t6]
1 1 1 1 1
 1

x 2 ( x4 + 1) 4 x2 + x3 x2 + x3 x3  1 + x 6

 
 

5x sin x e5 log x − e4 log x


6. 7. 8.
( x + 1) ( x 2 + 9) sin ( x − a) e3 log x − e 2 log x
cos x sin 8 − cos8 x 1
9. 10. 11.
4 − sin x 2 1 − 2 sin 2 x cos 2 x cos ( x + a ) cos ( x + b)

x3 ex 1
12. 13. 14.
1− x 8 (1 + e x ) (2 + ex ) ( x + 1) ( x 2 + 4)
2

15. cos 3 x elog sinx 16. e3 logx (x4 + 1)– 1 17. f ′ (ax + b) [f (ax + b)] n
1 sin − 1 x − cos − 1 x
18. 19. , x ∈ [0, 1]
sin 3 x sin ( x + α) sin − 1 x + cos −1 x
INTEGRALS 353

1− x 2 + sin 2x x x2 + x + 1
20. 21. e 22.
1+ x 1 + cos 2x ( x + 1) 2 ( x + 2)

1− x x2 + 1 log ( x 2 + 1) − 2 log x


23. tan – 1 24.
1+ x x4
Evaluate the definite integrals in Exercises 25 to 33.
π π
π x  1 − sin x  4 sin x cos x cos2 x dx
25. ∫π e   dx 26.
 1 − cos x  ∫ 0 cos 4 x + sin 4 x
dx 27. ∫ 2
0 cos 2 x + 4 sin 2 x
2
π π
sin x + cos x dx 4 sin x + cos x
1
28. ∫ π
3

sin 2 x
dx 29. ∫0 1+ x − x
30. ∫ 0 9 + 16 sin 2 x
dx
6
π
π x tan x
31. ∫ 0
2
sin 2 x tan −1 (sin x) dx 32. ∫ 0 sec x + tan x dx
4
33. ∫ 1 [ x − 1| + | x − 2 | + | x − 3|] dx
Prove the following (Exercises 34 to 39)
3 dx 2 2 1
∫ 1 x2 ( x + 1) = 3 + log 3 ∫0 x e
x
34. 35. dx = 1

1 π
2
36. ∫ −1 x17 cos 4 x dx = 0 37. ∫ 0
2 sin 3 x dx =
3
π
1 π
∫ ∫ 0 sin
−1
38. 4
2 tan 3 x dx = 1 − log 2 39. x dx = −1
0 2
1 2 −3 x
40. Evaluate ∫0e dx as a limit of a sum.
Choose the correct answers in Exercises 41 to 44.
dx
41. ∫ ex + e− x is equal to

(A) tan–1 (ex) + C (B) tan–1 (e–x) + C


(C) log (ex – e– x) + C (D) log (ex + e–x) + C
cos2 x
42. ∫ dx is equal to
(sin x + cos x)2
–1
(A) +C (B) log |sin x + cos x | + C
sin x + cos x
1
(C) log |sin x − cos x | + C (D)
(sin x + cos x)2
354 MATHEMATICS

b
43. If f (a + b – x) = f (x), then ∫a x f ( x) dx is equal to

a+b b a+b b
(A)
2 ∫ a f (b − x) dx (B)
2 ∫ a f (b + x) dx
b−a b a+b b
(C)
2 ∫ a f (x) dx (D)
2 ∫ a f ( x) dx
−1  2 x −1 
1
44. The value of ∫ 0 tan   dx is
 1 + x − x2 
π
(A) 1 (B) 0 (C) –1 (D)
4

Summary
® Integration is the inverse process of differentiation. In the differential calculus,
we are given a function and we have to find the derivative or differential of
this function, but in the integral calculus, we are to find a function whose
differential is given. Thus, integration is a process which is the inverse of
differentiation.
d
Let F( x) = f ( x) . Then we write ∫ f ( x) dx = F ( x) + C . These integrals
dx
are called indefinite integrals or general integrals, C is called constant of
integration. All these integrals differ by a constant.
® From the geometric point of view, an indefinite integral is collection of family
of curves, each of which is obtained by translating one of the curves parallel
to itself upwards or downwards along the y-axis.
® Some properties of indefinite integrals are as follows:
1. ∫[ f (x ) + g (x )] dx = ∫ f ( x) dx + ∫ g ( x) dx
2. For any real number k, ∫ k f ( x) dx = k ∫ f ( x) dx
More generally, if f1, f2 , f 3, ... , fn are functions and k1, k2, ... ,kn are real
numbers. Then

∫[k1 f1( x) + k2 f 2 (x) + ... + kn f n (x)] dx


= k1 ∫ f1( x) dx + k2 ∫ f2 (x ) dx + ... + kn ∫ f n ( x) dx
INTEGRALS 355

® Some standard integrals

x n +1
∫ x dx = + C , n ≠ – 1. Particularly, ∫ dx = x + C
n
(i)
n +1

(ii) ∫ cos x dx = sin x + C (iii) ∫ sin x dx = – cos x + C


∫ sec ∫ cosec
2 2
(iv) x dx = tan x + C (v) x dx = – cot x + C

(vi) ∫ sec x tan x dx = sec x + C


dx
(vii) ∫ cosec x cot x dx = – cosec x + C (viii) ∫ 1− x 2
= sin − 1 x + C

dx dx
(ix) ∫ 1− x 2
= − cos − 1 x + C (x) ∫ 1 + x 2 = tan
−1
x+C

dx
∫ 1 + x 2 = − cot ∫e
−1 x
(xi) x+C (xii) dx = ex + C

ax dx
(xiii) ∫ a dx = ∫x
x
+C (xiv) = sec− 1 x + C
log a 2
x −1
dx 1
(xv) ∫x = − cosec− 1 x + C (xvi) ∫ x dx = log | x | + C
x2 − 1
® Integration by partial fractions

P( x )
Recall that a rational function is ratio of two polynomials of the form ,
Q( x)
where P(x) and Q (x) are polynomials in x and Q (x) ≠ 0. If degree of the
polynomial P (x) is greater than the degree of the polynomial Q (x), then we
P( x) P ( x)
may divide P (x) by Q (x) so that = T ( x) + 1 , where T(x) is a
Q( x) Q( x)
polynomial in x and degree of P1 (x) is less than the degree of Q(x). T (x)
P1 ( x)
being polynomial can be easily integrated. can be integrated by
Q( x)
356 MATHEMATICS

P1 ( x)
expressing as the sum of partial fractions of the following type:
Q( x)
px + q A B
1. = + ,a≠b
(x − a ) (x − b ) x− a x−b

px + q A B
2. = +
(x − a )2 x − a ( x − a )2

px 2 + qx + r A B C
3. = + +
(x − a ) (x − b ) (x − c ) x− a x−b x−c

px2 + qx + r A B C
4. = + 2
+
(x − a )2 ( x − b ) x − a ( x − a) x−b

px2 + qx + r A Bx + C
5. = + 2
( x − a ) ( x 2 + bx + c ) x − a x + bx + c
where x2 + bx + c can not be factorised further.
® Integration by substitution
A change in the variable of integration often reduces an integral to one of the
fundamental integrals. The method in which we change the variable to some
other variable is called the method of substitution. When the integrand involves
some trigonometric functions, we use some well known identities to find the
integrals. Using substitution technique, we obtain the following standard
integrals.
(i) ∫ tan x dx = log sec x + C (ii) ∫ cot x dx = log sin x + C

(iii) ∫ sec x dx = log sec x + tan x + C

(iv) ∫ cosec x dx = log cosec x − cot x + C


® Integrals of some special functions
dx 1 x−a
(i) ∫ x2 − a 2 = 2 a log x +a
+C

dx 1 a+x dx 1 x
(ii) ∫ a 2 − x2 = 2 a log a−x
+C (iii) ∫ x2 + a 2 = a tan
−1
a
+C
INTEGRALS 357

dx x
(iv) ∫
dx
= log x + x2 − a 2 + C (v) ∫ 2
a −x 2
= sin − 1
a
+C
x2 − a 2
dx
(vi) ∫
x +a 2 2
= log | x + x2 + a 2 | + C

® Integration by parts
For given functions f1 and f2, we have
d 
∫ f1( x) ⋅ f 2 (x) dx = f1( x) ∫ f 2 (x) dx − ∫  dx
f1 ( x) ⋅ ∫ f 2 ( x) dx dx , i.e., the

integral of the product of two functions = first function × integral of the
second function – integral of {differential coefficient of the first function ×
integral of the second function}. Care must be taken in choosing the first
function and the second function. Obviously, we must take that function as
the second function whose integral is well known to us.
® ∫ e x[ f ( x) + f ′( x)] dx = ∫ e x f (x) dx + C
® Some special types of integrals
x 2 a2
(i) ∫ x 2 − a 2 dx =
2
x − a 2 − log x + x2 − a 2 + C
2
x 2 a2
(ii) ∫ x 2 + a 2 dx =
2
x + a2 +
2
log x + x 2 + a 2 + C

x 2 a2 x
(iii) ∫ a 2 − x 2 dx =
2
a − x2 +
2
sin −1 + C
a
dx dx
(iv) Integrals of the types ∫ ax2 + bx + c or ∫ ax 2 + bx + c
can be

transformed into standard form by expressing

 2 b c  b   c b2
2

ax + bx + c = a  x + x +
2 = a   x +  + − 
 a a    2a   a 4a2  

px + q dx px + q dx
(v) Integrals of the types ∫ ax2 + bx + c or ∫ ax 2 + bx + c
can be
358 MATHEMATICS

transformed into standard form by expressing


d
px + q = A ( ax 2 + bx + c) + B = A (2ax + b ) + B , where A and B are
dx
determined by comparing coefficients on both sides.
b
® We have defined ∫a f (x) dx as the area of the region bounded by the curve
y = f (x), a ≤ x ≤ b, the x-axis and the ordinates x = a and x = b. Let x be a
x
given point in [a, b]. Then ∫a f ( x) dx represents the Area function A (x).

This concept of area function leads to the Fundamental Theorems of Integral


Calculus.
® First fundamental theorem of integral calculus
x
Let the area function be defined by A(x) = ∫a f (x) dx for all x ≥ a, where
the function f is assumed to be continuous on [a, b]. Then A′ (x) = f (x) for all
x ∈ [a, b].
® Second fundamental theorem of integral calculus
Let f be a continuous function of x defined on the closed interval [a, b] and
d
let F be another function such that F( x) = f ( x) for all x in the domain of
dx
b
∫a
b
f, then f ( x) dx = [ F( x) + C] a = F ( b) − F ( a ) .

This is called the definite integral of f over the range [a, b], where a and b
are called the limits of integration, a being the lower limit and b the
upper limit.

—v —
APPLICATION OF INTEGRALS 359

Chapter 8
APPLICATION OF INTEGRALS

v One should study Mathematics because it is only through Mathematics that


nature can be conceived in harmonious form. – BIRKHOFF v

8.1 Introduction
In geometry, we have learnt formulae to calculate areas
of various geometrical figures including triangles,
rectangles, trapezias and circles. Such formulae are
fundamental in the applications of mathematics to many
real life problems. The formulae of elementary geometry
allow us to calculate areas of many simple figures.
However, they are inadequate for calculating the areas
enclosed by curves. For that we shall need some concepts
of Integral Calculus.
In the previous chapter, we have studied to find the
area bounded by the curve y = f (x), the ordinates x = a,
x = b and x-axis, while calculating definite integral as the
limit of a sum. Here, in this chapter, we shall study a specific
application of integrals to find the area under simple curves, A.L. Cauchy
area between lines and arcs of circles, parabolas and (1789-1857)
ellipses (standard forms only). We shall also deal with finding
the area bounded by the above said curves.
8.2 Area under Simple Curves
In the previous chapter, we have studied
definite integral as the limit of a sum and
how to evaluate definite integral using
Fundamental Theorem of Calculus. Now,
we consider the easy and intuitive way of
finding the area bounded by the curve
y = f (x), x-axis and the ordinates x = a and
x = b. From Fig 8.1, we can think of area
under the curve as composed of large
number of very thin vertical strips. Consider
an arbitrary strip of height y and width dx,
then dA (area of the elementary strip) = ydx,
where, y = f(x). Fig 8.1
360 MATHEMATICS

This area is called the elementary area which is located at an arbitrary position
within the region which is specified by some value of x between a and b. We can think
of the total area A of the region between x-axis, ordinates x = a, x = b and the curve
y = f (x) as the result of adding up the elementary areas of thin strips across the region
PQRSP. Symbolically, we express
b b b
A= ∫ a
d A = ∫ ydx = ∫ f ( x) dx
a a

The area A of the region bounded by


the curve x = g (y), y-axis and the lines y = c,
y = d is given by
d d
A= ∫ c
xdy = ∫ g( y) dy
c

Here, we consider horizontal strips as shown in


the Fig 8.2 Fig 8.2
Remark If the position of the curve under consideration is below the x-axis, then since
f (x) < 0 from x = a to x = b, as shown in Fig 8.3, the area bounded by the curve, x-axis
and the ordinates x = a, x = b come out to be negative. But, it is only the numerical
value of the area which is taken into consideration. Thus, if the area is negative, we
b
take its absolute value, i.e., ∫ a
f ( x) dx .

Fig 8.3

Generally, it may happen that some portion of the curve is above x-axis and some is
below the x-axis as shown in the Fig 8.4. Here, A1 < 0 and A2 > 0. Therefore, the area
A bounded by the curve y = f (x), x-axis and the ordinates x = a and x = b is given
by A = |A1 | + A2.
APPLICATION OF INTEGRALS 361

Fig 8.4

Example 1 Find the area enclosed by the circle x2 + y2 = a2.

Solution From Fig 8.5, the whole area enclosed


by the given circle
= 4 (area of the region AOBA bounded by
the curve, x-axis and the ordinates x = 0 and
x = a) [as the circle is symmetrical about both
x-axis and y-axis]
a
= 4 ∫ 0 ydx (taking vertical strips)

a
= 4∫ a2 − x2 dx
0

Since x2 + y2 = a 2 gives y = ± a2 − x 2 Fig 8.5

As the region AOBA lies in the first quadrant, y is taken as positive. Integrating, we get
the whole area enclosed by the given circle

a
x 2 a2 x
= 4 a − x2 + sin –1 
2 2 a 0

 a a2 −1    a2   π 
= 4  × 0 + sin 1 − 0  = 4     =πa
2

 2 2    2   2 
362 MATHEMATICS

Alternatively, considering horizontal strips as shown in Fig 8.6, the whole area of the
region enclosed by circle
a a
= 4 ∫ xdy = 4 ∫ a 2 − y 2 dy (Why?)
0 0

a
y 2 a2 y
= 4 a − y2 + sin − 1 
2 2 a 0
 a a2  
= 4  × 0 + sin − 1 1 − 0 
 2 2  
a2 π
= 4 = πa 2
2 2
Fig 8.6
x2 y2
Example 2 Find the area enclosed by the ellipse 2 + 2 =1
a b
Solution From Fig 8.7, the area of the region ABA′B′A bounded by the ellipse

 area of the region AOBA in the first quadrant bounded 


= 4 
 by the curve, x − axis and the ordinates x = 0, x = a 
(as the ellipse is symmetrical about both x-axis and y-axis)
a
= 4 ∫ 0 ydx (taking verticalstrips)

x2 y2 b 2
Now 2 + 2 = 1 gives y =± a − x2 , but as the region AOBA lies in the first
a b a
quadrant, y is taken as positive. So, the required area is
ab
= 4∫0 a 2 − x2 dx
a
a
4b  x 2 2 a 2 –1 x 
=  a − x + sin  (Why?)
a 2 2 a 0

4b  a a2 −1
 
=  × 0 + sin 1 − 0 
a  2 2  

4b a 2 π
= =π ab
a 2 2 Fig 8.7
APPLICATION OF INTEGRALS 363

Alternatively, considering horizontal strips as


shown in the Fig 8.8, the area of the ellipse is
b
b a
= 4 ∫ xdy = 4 ∫ b 2 − y 2 dy (Why?)
0 b0

b
4a y 2 b 2 –1 y 
=  b − y 2
+ sin 
b 2 2 b 0

4a  b b 2 –1   Fig 8.8
= b  × 0 + sin 1  − 0 
 2 2  

4a b 2 π
= =πab
b 2 2
8.2.1 The area of the region bounded by a curve and a line
In this subsection, we will find the area of the region bounded by a line and a circle,
a line and a parabola, a line and an ellipse. Equations of above mentioned curves will be
in their standard forms only as the cases in other forms go beyond the scope of this
textbook.
Example 3 Find the area of the region bounded
by the curve y = x2 and the line y = 4.
Solution Since the given curve represented by
the equation y = x2 is a parabola symmetrical
about y-axis only, therefore, from Fig 8.9, the
required area of the region AOBA is given by
4
2∫ xdy =
0
Fig 8.9
 area of the region BONB bounded by curve, y − axis 
2 
 and the lines y = 0 and y = 4 
4
2 
3
4 4 32
= 2∫0 y dy = 2 ×  y 2  = × 8 = (Why?)
3  0 3 3

Here, we have taken horizontal strips as indicated in the Fig 8.9.


364 MATHEMATICS

Alternatively, we may consider the vertical


strips like PQ as shown in the Fig 8.10 to
obtain the area of the region AOBA. To this
end, we solve the equations x2 = y and y = 4
which gives x = –2 and x = 2.
Thus, the region AOBA may be stated as
the region bounded by the curve y = x 2, y = 4
and the ordinates x = –2 and x = 2.
Therefore, the area of the region AOBA
2 Fig 8.10
= ∫ −2 ydx

[ y = ( y-coordinate of Q) – (y-coordinate of P) = 4 – x 2 ]

( )
2
= 2 ∫ 0 4 − x dx
2
(Why?)

2
 x3   8  32
= 2 4 x −  = 2 4 × 2 −  =
 3 0  3 3
Remark From the above examples, it is inferred that we can consider either vertical
strips or horizontal strips for calculating the area of the region. Henceforth, we shall
consider either of these two, most preferably vertical strips.
Example 4 Find the area of the region in the first quadrant enclosed by the x-axis,
the line y = x, and the circle x2 + y2 = 32.
Y
Solution The given equations are
y= x ... (1) y=x
2 2 B
and x + y = 32 ... (2)
(4,4)
Solving (1) and (2), we find that the line
and the circle meet at B(4, 4) in the first
quadrant (Fig 8.11). Draw perpendicular A
BM to the x-axis. X' X
M
O (4 2 ,0)
Therefore, the required area = area of
the region OBMO + area of the region
BMAB.
Now, the area of the region OBMO
4 4
= ∫ 0 ydx = ∫ 0 xdx ... (3)
Y'
1 2 4
= x  = 8 Fig 8.11
2  0
APPLICATION OF INTEGRALS 365

Again, the area of the region BMAB


4 2 4 2
= ∫4 ydx = ∫4 32 − x2 dx

4 2
1 1 x 
=  x 32 − x2 + × 32 × sin –1 
2 2 4 2 4

1 1 –1  4 1 1 
=  4 2 × 0 + × 32 × sin 1  −  32 − 16 + × 32 × sin –1 
2 2  2 2 2
= 8 π – (8 + 4π) = 4π – 8 ... (4)
Adding (3) and (4), we get, the required area = 4π.

x2 y2
Example 5 Find the area bounded by the ellipse 2 + 2 =1 and the ordinates x = 0
a b
and x = ae, where, b 2 = a2 (1 – e2) and e < 1.

Solution The required area (Fig 8.12) of the region BOB′RFSB is enclosed by the
ellipse and the lines x = 0 and x = ae. Y
Note that the area of the region BOB′RFSB
B x = ae
S
ae b ae 2
= 2∫0
a ∫0
ydx = 2 a − x2 dx
F (ae, o)
X′ X
ae O
2b  x 2 a x
2

=  a − x2 + sin –1 
a 2 2 a 0
R
B'
2b 
= ae a2 − a2e2 + a2 sin–1 e
2a   Y′
Fig 8.12
= ab  e 1 − e + sin e
2 –1

EXERCISE 8.1
1. Find the area of the region bounded by the curve y2 = x and the lines x = 1,
x = 4 and the x-axis in the first quadrant.
2. Find the area of the region bounded by y2 = 9x, x = 2, x = 4 and the x-axis in the
first quadrant.
366 MATHEMATICS

3. Find the area of the region bounded by x2 = 4y, y = 2, y = 4 and the y-axis in the
first quadrant.
x2 y2
4. Find the area of the region bounded by the ellipse + =1 .
16 9
x2 y2
5. Find the area of the region bounded by the ellipse + =1 .
4 9
6. Find the area of the region in the first quadrant enclosed by x-axis, line x = 3 y
and the circle x2 + y2 = 4.
a
7. Find the area of the smaller part of the circle x2 + y2 = a2 cut off by the line x =
.
2
8. The area between x = y2 and x = 4 is divided into two equal parts by the line
x = a, find the value of a.
9. Find the area of the region bounded by the parabola y = x2 and y = x .
10. Find the area bounded by the curve x2 = 4y and the line x = 4y – 2.
11. Find the area of the region bounded by the curve y2 = 4x and the line x = 3.
Choose the correct answer in the following Exercises 12 and 13.
12. Area lying in the first quadrant and bounded by the circle x2 + y2 = 4 and the lines
x = 0 and x = 2 is
π π π
(A) π (B) (C) (D)
2 3 4
13. Area of the region bounded by the curve y = 4x, y-axis and the line y = 3 is
2

9 9 9
(A) 2 (B) (C) (D)
4 3 2
8.3 Area between Two Curves
Intuitively, true in the sense of Leibnitz, integration is the act of calculating the area by
cutting the region into a large number of small strips of elementary area and then
adding up these elementary areas. Suppose we are given two curves represented by
y = f (x), y = g (x), where f (x) ≥ g(x) in [a, b] as shown in Fig 8.13. Here the points of
intersection of these two curves are given by x = a and x = b obtained by taking
common values of y from the given equation of two curves.
For setting up a formula for the integral, it is convenient to take elementary area in
the form of vertical strips. As indicated in the Fig 8.13, elementary strip has height
APPLICATION OF INTEGRALS 367

f(x) – g (x) and width dx so that the elementary area

Fig 8.13
dA = [f (x) – g(x)] dx, and the total area A can be taken as
b
A= ∫ a [f (x )− g (x)]dx
Alternatively,
A = [area bounded by y = f (x), x-axis and the lines x = a, x = b]
– [area bounded by y = g (x), x-axis and the lines x = a, x = b]
b b b
= ∫a f ( x) dx − ∫ g ( x) dx = ∫ [ f ( x) − g ( x) ] dx, where f (x) ≥ g (x) in [a, b]
a a

If f (x) ≥ g (x) in [a, c] and f (x) ≤ g (x) in [c, b], where a < c < b as shown in the
Fig 8.14, then the area of the regions bounded by curves can be written as
Total Area = Area of the region ACBDA + Area of the region BPRQB
c b
= ∫ a [ f (x )− g (x)] dx +∫ c [g (x) − f ( x)]dx

y = g (x)
Y y = f (x) P
C
B R
A

D y = g (x) Q y = f (x)

x=a x =c x =b
X′ O X

Y′ Fig 8.14
368 MATHEMATICS

Example 6 Find the area of the region bounded by the two parabolas y = x2 and y2 = x.
Solution The point of intersection of these two
parabolas are O (0, 0) and A (1, 1) as shown in
the Fig 8.15.
Here, we can set y 2 = x or y = x = f(x) and y = x2
= g (x), where, f (x) ≥ g (x) in [0, 1].
Therefore, the required area of the shaded region
1
= ∫ 0 [ f (x )− g (x)] dx
1
1  2 3 x3 
 2
= ∫ 0  x − x  dx =  x 2 − 
 3 3  0 Fig 8.15

2 1 1
= − =
3 3 3
Example 7 Find the area lying above x-axis and included between the circle
x2 + y2 = 8x and inside of the parabola y2 = 4x.
Solution The given equation of the circle x2 + y 2 = 8x can be expressed as
(x – 4)2 + y2 = 16. Thus, the centre of the Y
circle is (4, 0) and radius is 4. Its intersection
P (4, 4)
with the parabola y2 = 4x gives
x2 + 4x = 8x
or x2 – 4x = 0
or x (x – 4) = 0
X¢ X
or x = 0, x = 4 O C (4, 0) Q (8, 0)
Thus, the points of intersection of these
two curves are O(0, 0) and P(4,4) above the
x-axis.
From the Fig 8.16, the required area of
the region OPQCO included between these Fig 8.16
two curves above x-axis is Y¢
= (area of the region OCPO) + (area of the region PCQP)
4 8
= ∫ 0 ydx + ∫ 4 ydx
4 8
= 2∫ x dx + ∫ 42 − (x − 4)2 dx (Why?)
0 4
APPLICATION OF INTEGRALS 369

4
2 
3 4
= 2 ×  x 2  + ∫ 4 − t dt , where, x − 4 = t
2 2
(Why?)
3   0 0

4
32  t 1 t
= +  4 2 − t 2 + × 4 2 × sin –1 
3 2 2 4 0

32  4 1  32  π  32 4
= +  × 0 + × 4 2 × sin –1 1 = +  0 + 8 ×  = + 4π = (8+ 3π)
3 2 2  3  2 3 3

Example 8 In Fig 8.17, AOBA is the part of the ellipse 9x2 + y2 = 36 in the first
quadrant such that OA = 2 and OB = 6. Find the area between the arc AB and the
chord AB.

x2 y2
Solution Given equation of the ellipse 9x2 + y2 = 36 can be expressed as + = 1 or
4 36

x2 y2
+ = 1and hence, its shape is as given in Fig 8.17.
22 62

Accordingly, the equation of the chord AB is


6− 0
y– 0 = (x − 2)
0− 2
or y = – 3(x – 2)
or y = – 3x + 6
Area of the shaded region as shown in the Fig 8.17.
2 2
= 3∫ 4 − x2 dx − ∫ (6 − 3 x) dx (Why?)
0 0

2 Fig 8.17
x  3x 2 
2
x 4
= 3 4 − x 2 + sin –1  − 6 x − 
2 2 2 0  2 0

2   12  π
= 3  × 0 + 2sin –1 (1)  − 12 −  = 3 × 2 × − 6 = 3π – 6
2   2 2
370 MATHEMATICS

Example 9 Using integration find the area of region bounded by the triangle whose
vertices are (1, 0), (2, 2) and (3, 1).
Solution Let A(1, 0), B(2, 2) and C (3, 1) be
the vertices of a triangle ABC (Fig 8.18).
Area of ∆ABC
= Area of ∆ABD + Area of trapezium
BDEC – Area of ∆AEC
Now equation of the sides AB, BC and
CA are given by
Fig 8.18
1
y = 2 (x – 1), y = 4 – x, y = (x – 1), respectively.
2
2 3 3 x −1
Hence, area of ∆ ABC = ∫ 1 2 (x − 1) dx + ∫ 2 (4 − x) dx − ∫ 1 2
dx
2 3 3
 x2   x2  1  x2 
= 2  − x +  4 x −  −  − x 
2 1  2 2 2  2 1
 2 2   1    32   2 2   1  3 2   1 
= 2  − 2  −  − 1   +  4 × 3 −  −  4 × 2 −   –  − 3  −  − 1 
 2   2    2   2   2  2   2 
3
=
2
Example 10 Find the area of the region enclosed between the two circles: x2 + y2 = 4
and (x – 2)2 + y2 = 4.
Solution Equations of the given circles are
x2 + y2 = 4 ... (1)
and (x – 2)2 + y2 = 4 ... (2)
Equation (1) is a circle with centre O at the
origin and radius 2. Equation (2) is a circle with
centre C (2, 0) and radius 2. Solving equations
(1) and (2), we have
(x –2)2 + y2 = x2 + y2
or x – 4x + 4 + y2 = x2 + y2
2

or x = 1 which gives y = ± 3
Thus, the points of intersection of the given
circles are A(1, 3 ) and A′(1, – 3 ) as shown in
Fig 8.19
the Fig 8.19.
APPLICATION OF INTEGRALS 371

Required area of the enclosed region OACA′O between circles


= 2 [area of the region ODCAO] (Why?)
= 2 [area of the region ODAO + area of the region DCAD]

= 2  ∫ 0 y dx + ∫ 1 y dx 
1 2

 

= 2  ∫ 0 4 − ( x − 2) dx + ∫ 1 4 − x dx
1 2 2 2
(Why?)
 
1
1 1  x − 2 
= 2  ( x − 2) 4 − ( x − 2) 2 + × 4sin –1  
2 2  2   0
2
1 1 x
+ 2  x 4 − x 2 + × 4 sin –1 
2 2 2 1
1 2
  x − 2   –1 x 
=  ( x − 2) 4 − ( x − 2) 2 + 4 sin –1    +  x 4 − x + 4sin
2

  2  0  2 1

 –1  −1   −1   −1 1 
=   − 3 + 4 sin    − 4sin ( −1)  + 4 sin 1 − 3 − 4 sin
–1

  2    2 

 π π  π π
= − 3 − 4×  + 4× + 4× − 3 − 4× 
 6 2  2 6

 2π   2π 
= − 3 − + 2 π  +  2π − 3 − 
 3   3 

= −2 3
3
EXERCISE 8.2
1. Find the area of the circle 4x2 + 4y2 = 9 which is interior to the parabola x2 = 4y.
2. Find the area bounded by curves (x – 1)2 + y2 = 1 and x2 + y2 = 1.
3. Find the area of the region bounded by the curves y = x2 + 2, y = x, x = 0 and
x = 3.
4. Using integration find the area of region bounded by the triangle whose vertices
are (– 1, 0), (1, 3) and (3, 2).
5. Using integration find the area of the triangular region whose sides have the
equations y = 2x + 1, y = 3x + 1 and x = 4.
372 MATHEMATICS

Choose the correct answer in the following exercises 6 and 7.


6. Smaller area enclosed by the circle x2 + y2 = 4 and the line x + y = 2 is
(A) 2 (π – 2) (B) π – 2 (C) 2π – 1 (D) 2 (π + 2)
7. Area lying between the curves y = 4x and y = 2x is
2

2 1 1 3
(A) (B) (C) (D)
3 3 4 4

Miscellaneous Examples
Example 11 Find the area of the parabola y2 = 4ax bounded by its latus rectum.
Solution From Fig 8.20, the vertex of the parabola Y
L
y2 = 4ax is at origin (0, 0). The equation of the
latus rectum LSL′ is x = a. Also, parabola is
symmetrical about the x-axis.
The required area of the region OLL′O
= 2 (area of the region OLSO) (a, 0)
a a
X′ X
O S
= 2∫ 0 ydx = 2 ∫ 0 4ax dx
a
= 2× 2 a ∫0 xdx

a
L'
2 2
3
= 4 a × x  Y′
3   Fig 8.20
0

8  3 8
= a  a 2  = a2
3   3

Example 12 Find the area of the region bounded


by the line y = 3x + 2, the x-axis and the ordinates
x = –1 and x = 1.
Solution As shown in the Fig 8.21, the line
−2
y = 3x + 2 meets x-axis at x = and its graph
3
 −2
lies below x-axis for x ∈ −1,  and above
 3
 −2  Fig 8.21
x-axis for x ∈ , 1 .
 3 
APPLICATION OF INTEGRALS 373

The required area = Area of the region ACBA + Area of the region ADEA
−2
1
= ∫ 3
−1
(3 x + 2) dx + ∫ −2 (3 x + 2) dx
3

−2
1
 3 x2  3  3 x2  1 25 13
=  + 2 x  + + 2x  = + =
 2  −1  2  −2 6 6 3
3

Example 13 Find the area bounded by


the curve y = cos x between x = 0 and
x = 2π.

Solution From the Fig 8.22, the required


area = area of the region OABO + area
of the region BCDB + area of the region
DEFD.
Fig 8.22
Thus, we have the required area

π 3π

∫ 0 cos x dx + ∫ π cos x dx + ∫ 3π cos x dx
2 2
=
2 2

π 3π

= [ sin x ] + [sin x ] + [ sin x] 3 π
2 2
0 π
2 2

=1+2+1=4
Example 13 Prove that the curves y2 = 4x and x2 = 4y
divide the area of the square bounded by x = 0, x = 4,
y = 4 and y = 0 into three equal parts.
Solution Note that the point of intersection of the
parabolas y2 = 4x and x2 = 4y are (0, 0) and (4, 4) as Fig 8.23
374 MATHEMATICS

shown in the Fig 8.23.


Now, the area of the region OAQBO bounded by curves y2 = 4x and x2 = 4y.
4
4 x2  2 2 x3 
3
= ∫0  2 x − 4  dx =  2 × 3 x − 12 
  0

32 16 16
− == ... (1)
3 3 3
Again, the area of the region OPQAO bounded by the curves x2 = 4y, x = 0, x = 4
and x-axis

x2 1 3 4 16
4
∫ 0 4 dx = 12  x  0 = 3
= ... (2)

Similarly, the area of the region OBQRO bounded by the curve y2 = 4x, y-axis,
y = 0 and y = 4
4 4 y2 1 4 16
= ∫ 0 xdy = ∫ 0 dy = y 3  = ... (3)
4 12   0 3
From (1), (2) and (3), it is concluded that the area of the region OAQBO = area of
the region OPQAO = area of the region OBQRO, i.e., area bounded by parabolas
y2 = 4x and x2 = 4y divides the area of the square in three equal parts.

Example 14 Find the area of the region Y


R
{(x, y) : 0 ≤ y ≤ x + 1, 0 ≤ y ≤ x + 1, 0 ≤ x ≤ 2}
2
Q(1, 2)
Solution Let us first sketch the region whose area is to
be found out. This region is the intersection of the x=2
following regions. P (0,1) x = 1
A1 = {(x, y) : 0 ≤ y ≤ x2 + 1}, X' X
O T S
A2 = {(x, y) : 0 ≤ y ≤ x + 1}
Y'
and A3 = {(x, y) : 0 ≤ x ≤ 2}
Fig 8.24
The points of intersection of y = x2 + 1 and y = x + 1 are points P(0, 1) and Q(1, 2).
From the Fig 8.24, the required region is the shaded region OPQRSTO whose area
= area of the region OTQPO + area of the region TSRQT
1 2
∫0(x + 1) dx + ∫ ( x + 1) dx
2
= 1 (Why?)
APPLICATION OF INTEGRALS 375

1 2
 x3   x 2 
=  + x   +  + x  
 3   0  2  1

 1     1  23
=  + 1 − 0  +  ( 2 + 2 ) −  + 1  =
 3     2  6

Miscellaneous Exercise on Chapter 8


1. Find the area under the given curves and given lines:
(i) y = x2, x = 1, x = 2 and x-axis
(ii) y = x4, x = 1, x = 5 and x-axis
2. Find the area between the curves y = x and y = x2.
3. Find the area of the region lying in the first quadrant and bounded by y = 4x2,
x = 0, y = 1 and y = 4.
0
4. Sketch the graph of y = x + 3 and evaluate ∫− 6 x + 3 dx .
5. Find the area bounded by the curve y = sin x between x = 0 and x = 2π.
6. Find the area enclosed between the parabola y2 = 4ax and the line y = mx.
7. Find the area enclosed by the parabola 4y = 3x2 and the line 2y = 3x + 12.
x2 y2
8. Find the area of the smaller region bounded by the ellipse + = 1 and the
9 4
x y
line + =1 .
3 2
x2 y2
9. Find the area of the smaller region bounded by the ellipse 2 + 2 =1 and the
a b
x y
line + = 1 .
a b
10. Find the area of the region enclosed by the parabola x2 = y, the line y = x + 2 and
the x-axis.

11. Using the method of integration find the area bounded by the curve x + y = 1 .
[Hint: The required region is bounded by lines x + y = 1, x– y = 1, – x + y = 1 and
– x – y = 1].
376 MATHEMATICS

12. Find the area bounded by curves {(x, y) : y ≥ x2 and y = | x |}.


13. Using the method of integration find the area of the triangle ABC, coordinates of
whose vertices are A(2, 0), B (4, 5) and C (6, 3).
14. Using the method of integration find the area of the region bounded by lines:
2x + y = 4, 3x – 2y = 6 and x – 3y + 5 = 0
15. Find the area of the region {(x, y) : y2 ≤ 4x, 4x2 + 4y2 ≤ 9}
Choose the correct answer in the following Exercises from 16 to 20.
16. Area bounded by the curve y = x3, the x-axis and the ordinates x = – 2 and x = 1 is

−15 15 17
(A) – 9 (B) (C) (D)
4 4 4
17. The area bounded by the curve y = x | x |, x-axis and the ordinates x = – 1 and
x = 1 is given by

1 2 4
(A) 0 (B) (C) (D)
3 3 3
[Hint : y = x2 if x > 0 and y = – x2 if x < 0].
18. The area of the circle x2 + y2 = 16 exterior to the parabola y2 = 6x is

4 4 4 4
(A) (4 π − 3) (B) (4 π + 3) (C) (8π − 3 ) (D) (8π + 3)
3 3 3 3

π
19. The area bounded by the y-axis, y = cos x and y = sin x when 0 ≤ x ≤ is
2

(A) 2 ( 2 − 1) (B) 2 −1 (C) 2 +1 (D) 2

Summary
® The area of the region bounded by the curve y = f (x), x-axis and the lines
b b
x = a and x = b (b > a) is given by the formula: Area = ∫ ydx = ∫ f ( x) dx .
a a

® The area of the region bounded by the curve x = φ (y), y-axis and the lines
d d
y = c, y = d is given by the formula: Area = ∫ xdy = ∫ φ ( y)dy .
c c
APPLICATION OF INTEGRALS 377

® The area of the region enclosed between two curves y = f (x), y = g (x) and
the lines x = a, x = b is given by the formula,
b
Area = ∫
a
[ f ( x) − g ( x)]dx , where, f (x) ≥ g (x) in [a, b]
® If f (x ) ≥ g (x ) in [a, c ] and f (x) ≤ g (x) in [c, b], a < c < b, then
c b
Area = ∫
a
[ f ( x) − g( x)]dx + ∫ c [ g( x) − f ( x)]dx .

Historical Note
The origin of the Integral Calculus goes back to the early period of development
of Mathematics and it is related to the method of exhaustion developed by the
mathematicians of ancient Greece. This method arose in the solution of problems
on calculating areas of plane figures, surface areas and volumes of solid bodies
etc. In this sense, the method of exhaustion can be regarded as an early method
of integration. The greatest development of method of exhaustion in the early
period was obtained in the works of Eudoxus (440 B.C.) and Archimedes
(300 B.C.)
Systematic approach to the theory of Calculus began in the 17th century.
In 1665, Newton began his work on the Calculus described by him as the theory
of fluxions and used his theory in finding the tangent and radius of curvature at
any point on a curve. Newton introduced the basic notion of inverse function
called the anti derivative (indefinite integral) or the inverse method of tangents.
During 1684-86, Leibnitz published an article in the Acta Eruditorum
which he called Calculas summatorius, since it was connected with the summation
of a number of infinitely small areas, whose sum, he indicated by the symbol ‘∫’.
In 1696, he followed a suggestion made by J. Bernoulli and changed this article to
Calculus integrali. This corresponded to Newton’s inverse method of tangents.
Both Newton and Leibnitz adopted quite independent lines of approach which
was radically different. However, respective theories accomplished results that
were practically identical. Leibnitz used the notion of definite integral and what is
quite certain is that he first clearly appreciated tie up between the antiderivative
and the definite integral.
Conclusively, the fundamental concepts and theory of Integral Calculus
and primarily its relationships with Differential Calculus were developed in the
work of P.de Fermat, I. Newton and G. Leibnitz at the end of 17th century.
378 MATHEMATICS

However, this justification by the concept of limit was only developed in the
works of A.L. Cauchy in the early 19th century. Lastly, it is worth mentioning the
following quotation by Lie Sophie’s:
“It may be said that the conceptions of differential quotient and integral which
in their origin certainly go back to Archimedes were introduced in Science by the
investigations of Kepler, Descartes, Cavalieri, Fermat and Wallis .... The discovery
that differentiation and integration are inverse operations belongs to Newton
and Leibnitz”.

—v—
DIFFERENTIAL EQUATIONS 379

Chapter 9
DIFFERENTIAL EQUATIONS

he
™ He who seeks for methods without having a definite problem in mind
seeks for the most part in vain. – D. HILBERT ™

is
9.1 Introduction
In Class XI and in Chapter 5 of the present book, we

bl
discussed how to differentiate a given function f with respect
to an independent variable, i.e., how to find f ′(x) for a given
function f at each x in its domain of definition. Further, in
pu
the chapter on Integral Calculus, we discussed how to find
a function f whose derivative is the function g, which may
also be formulated as follows:
be T

For a given function g, find a function f such that


re
o R

dy
= g (x), where y = f (x) ... (1)
dx
An equation of the form (1) is known as a differential Henri Poincare
tt E

equation. A formal definition will be given later. (1854-1912 )

These equations arise in a variety of applications, may it be in Physics, Chemistry,


C

Biology, Anthropology, Geology, Economics etc. Hence, an indepth study of differential


equations has assumed prime importance in all modern scientific investigations.
no N

In this chapter, we will study some basic concepts related to differential equation,
general and particular solutions of a differential equation, formation of differential
equations, some methods to solve a first order - first degree differential equation and
some applications of differential equations in different areas.
©

9.2 Basic Concepts


We are already familiar with the equations of the type:
x2 – 3x + 3 = 0 ... (1)
sin x + cos x = 0 ... (2)
x+y=7 ... (3)
380 MATHEMATICS

Let us consider the equation:


dy
x +y =0 ... (4)
dx
We see that equations (1), (2) and (3) involve independent and/or dependent variable
(variables) only but equation (4) involves variables as well as derivative of the dependent
variable y with respect to the independent variable x. Such an equation is called a

he
differential equation.
In general, an equation involving derivative (derivatives) of the dependent variable
with respect to independent variable (variables) is called a differential equation.

is
A differential equation involving derivatives of the dependent variable with respect
to only one independent variable is called an ordinary differential equation, e.g.,
3
d 2 y ⎛ dy ⎞

bl
2 2 + ⎜ ⎟ = 0 is an ordinary differential equation .... (5)
dx ⎝ dx ⎠
Of course, there are differential equations involving derivatives with respect to
pu
more than one independent variables, called partial differential equations but at this
stage we shall confine ourselves to the study of ordinary differential equations only.
Now onward, we will use the term ‘differential equation’ for ‘ordinary differential
be T

equation’.
re
$ Note
o R

1. We shall prefer to use the following notations for derivatives:


tt E

dy d2y d3y
= y′ , 2 = y ′′, 3 = y′′′
dx dx dx
C

2. For derivatives of higher order, it will be inconvenient to use so many dashes


dny
as supersuffix therefore, we use the notation yn for nth order derivative .
dx n
no N

9.2.1. Order of a differential equation


Order of a differential equation is defined as the order of the highest order derivative of
©

the dependent variable with respect to the independent variable involved in the given
differential equation.
Consider the following differential equations:
dy
= ex ... (6)
dx
DIFFERENTIAL EQUATIONS 381

d2y
+y =0 ... (7)
dx 2
3
⎛ d3y ⎞ 2 ⎛ d2y ⎞
⎜ 3 ⎟+ x ⎜ 2 ⎟ = 0 ... (8)
⎝ dx ⎠ ⎝ dx ⎠

he
The equations (6), (7) and (8) involve the highest derivative of first, second and
third order respectively. Therefore, the order of these equations are 1, 2 and 3 respectively.

9.2.2 Degree of a differential equation

is
To study the degree of a differential equation, the key point is that the differential
equation must be a polynomial equation in derivatives, i.e., y′, y″, y″′ etc. Consider the

bl
following differential equations:
2
d 3 y ⎛ d 2 y ⎞ dy
+ 2⎜ 2 ⎟ − + y = 0
pu dx3 ⎝ dx ⎠ dx
... (9)
be T

2
⎛ dy ⎞ ⎛ dy ⎞
⎜ ⎟ + ⎜ ⎟ − sin y = 0
2
... (10)
⎝ ⎠ ⎝ ⎠
re
dx dx
o R

dy ⎛ dy ⎞
+ sin ⎜ ⎟ = 0 ... (11)
dx ⎝ dx ⎠
tt E

We observe that equation (9) is a polynomial equation in y″′, y″ and y′, equation (10)
is a polynomial equation in y′ (not a polynomial in y though). Degree of such differential
C

equations can be defined. But equation (11) is not a polynomial equation in y′ and
degree of such a differential equation can not be defined.
no N

By the degree of a differential equation, when it is a polynomial equation in


derivatives, we mean the highest power (positive integral index) of the highest order
derivative involved in the given differential equation.
In view of the above definition, one may observe that differential equations (6), (7),
©

(8) and (9) each are of degree one, equation (10) is of degree two while the degree of
differential equation (11) is not defined.

$ Note Order and degree (if defined) of a differential equation are always
positive integers.
382 MATHEMATICS

Example 1 Find the order and degree, if defined, of each of the following differential
equations:
2
dy d2y ⎛ dy ⎞ dy
(i) − cos x = 0 (ii) xy 2 + x ⎜ ⎟ − y =0
dx dx ⎝ dx ⎠ dx
(iii) y ′′′ + y 2 + e y′ = 0

he
Solution
dy
(i) The highest order derivative present in the differential equation is , so its
dx

is
dy
order is one. It is a polynomial equation in y′ and the highest power raised to
dx

bl
is one, so its degree is one.
d2y
(ii) The highest order derivative present in the given differential equation is , so
dx 2
pu
d2y dy
its order is two. It is a polynomial equation in 2 and and the highest
be T

dx dx
re
d2y
o R

power raised to is one, so its degree is one.


dx 2
(iii) The highest order derivative present in the differential equation is y ′′′ , so its
tt E

order is three. The given differential equation is not a polynomial equation in its
derivatives and so its degree is not defined.
C

EXERCISE 9.1
Determine order and degree (if defined) of differential equations given in Exercises
no N

1 to 10.
4
d4y ⎛ ds ⎞ d 2s
1. + sin( y′′′) = 0 2. y′ + 5y = 0 3. ⎜ ⎟ + 3s 2 = 0
dx 4 ⎝ dt ⎠ dt
©

2
⎛ d2y ⎞ ⎛ dy ⎞ d2y
4. ⎜ 2 ⎟ + cos ⎜ ⎟ = 0 5. = cos3 x + sin 3 x
⎝ dx ⎠ ⎝ dx ⎠ dx 2

6. ( y ′′′) 2 + (y″)3 + (y′)4 + y5 = 0 7. y ′′′ + 2y″ + y′ = 0


DIFFERENTIAL EQUATIONS 383

8. y′ + y = ex 9. y″ + (y′)2 + 2y = 0 10. y″ + 2y′ + sin y = 0


11. The degree of the differential equation
3
⎛ d 2 y ⎞ ⎛ dy ⎞ 2 ⎛ dy ⎞
⎜ 2 ⎟ + ⎜ ⎟ + sin ⎜ ⎟ + 1 = 0 is
⎝ dx ⎠ ⎝ dx ⎠ ⎝ dx ⎠
(A) 3 (B) 2 (C) 1 (D) not defined

he
12. The order of the differential equation
d2y dy
2 x2 2
− 3 + y = 0 is
dx dx

is
(A) 2 (B) 1 (C) 0 (D) not defined
9.3. General and Particular Solutions of a Differential Equation

bl
In earlier Classes, we have solved the equations of the type:
pu x2 + 1 = 0 ... (1)
2
sin x – cos x = 0 ... (2)
Solution of equations (1) and (2) are numbers, real or complex, that will satisfy the
given equation i.e., when that number is substituted for the unknown x in the given
be T

equation, L.H.S. becomes equal to the R.H.S..


re
d2y
o R

Now consider the differential equation + y=0 ... (3)


dx 2
In contrast to the first two equations, the solution of this differential equation is a
tt E

function φ that will satisfy it i.e., when the function φ is substituted for the unknown y
(dependent variable) in the given differential equation, L.H.S. becomes equal to R.H.S..
The curve y = φ (x) is called the solution curve (integral curve) of the given
C

differential equation. Consider the function given by


y = φ (x) = a sin (x + b), ... (4)
no N

where a, b ∈ R. When this function and its derivative are substituted in equation (3),
L.H.S. = R.H.S.. So it is a solution of the differential equation (3).
π
Let a and b be given some particular values say a = 2 and b = , then we get a
©

4
⎛ π⎞
function y = φ1(x) = 2sin ⎜ x + ⎟ ... (5)
⎝ 4⎠
When this function and its derivative are substituted in equation (3) again
L.H.S. = R.H.S.. Therefore φ1 is also a solution of equation (3).
384 MATHEMATICS

Function φ consists of two arbitrary constants (parameters) a, b and it is called


general solution of the given differential equation. Whereas function φ1 contains no
arbitrary constants but only the particular values of the parameters a and b and hence
is called a particular solution of the given differential equation.
The solution which contains arbitrary constants is called the general solution
(primitive) of the differential equation.

he
The solution free from arbitrary constants i.e., the solution obtained from the general
solution by giving particular values to the arbitrary constants is called a particular
solution of the differential equation.
Example 2 Verify that the function y = e– 3x is a solution of the differential equation

is
d 2 y dy
+ − 6y = 0
dx 2 dx

bl
Solution Given function is y = e– 3x. Differentiating both sides of equation with respect
to x , we get
dy
= −3 e −3 x
pu dx
... (1)

Now, differentiating (1) with respect to x, we have


be T

d2y
= 9 e – 3x
re
dx 2
o R

d 2 y dy
Substituting the values of , and y in the given differential equation, we get
dx 2 dx
tt E

L.H.S. = 9 e– 3x + (–3e– 3x) – 6.e– 3x = 9 e– 3x – 9 e– 3x = 0 = R.H.S..


Therefore, the given function is a solution of the given differential equation.
C

Example 3 Verify that the function y = a cos x + b sin x, where, a, b ∈ R is a solution


2
of the differential equation d y + y = 0
no N

dx 2
Solution The given function is
y = a cos x + b sin x ... (1)
©

Differentiating both sides of equation (1) with respect to x, successively, we get


dy
= – a sin x + b cos x
dx
d2y
= – a cos x – b sin x
dx 2
DIFFERENTIAL EQUATIONS 385

d2y
Substituting the values of and y in the given differential equation, we get
dx 2
L.H.S. = (– a cos x – b sin x) + (a cos x + b sin x) = 0 = R.H.S.
Therefore, the given function is a solution of the given differential equation.

he
EXERCISE 9.2
In each of the Exercises 1 to 10 verify that the given functions (explicit or implicit) is a
solution of the corresponding differential equation:
1. y = ex + 1 : y″ – y′ = 0

is
2
2. y = x + 2x + C : y′ – 2x – 2 = 0
3. y = cos x + C : y′ + sin x = 0

bl
xy
4. y = 1 + x2 : y′ =
1 + x2
xy′ = y (x ≠ 0)
pu
5. y = Ax :

6. y = x sin x : xy′ = y + x x 2 − y 2 (x ≠ 0 and x > y or x < – y)


be T

y2
(xy ≠ 1)
re
7. xy = log y + C : y′ =
1 − xy
o R

8. y – cos y = x : (y sin y + cos y + x) y′ = y


9. x + y = tan–1y : y2 y′ + y2 + 1 = 0
tt E

dy
10. y = a 2 − x 2 x ∈ (–a, a) : x+y
dx
= 0 (y ≠ 0)
C

11. The number of arbitrary constants in the general solution of a differential equation
of fourth order are:
no N

(A) 0 (B) 2 (C) 3 (D) 4


12. The number of arbitrary constants in the particular solution of a differential equation
of third order are:
(A) 3 (B) 2 (C) 1 (D) 0
©

9.4 Formation of a Differential Equation whose General Solution is given


We know that the equation
x2 + y2 + 2x – 4y + 4 = 0 ... (1)
represents a circle having centre at (– 1, 2) and radius 1 unit.
386 MATHEMATICS

Differentiating equation (1) with respect to x, we get

dy x +1
= (y ≠ 2) ... (2)
dx 2− y
which is a differential equation. You will find later on [See (example 9 section 9.5.1.)]
that this equation represents the family of circles and one member of the family is the

he
circle given in equation (1).
Let us consider the equation
x2 + y2 = r 2 ... (3)
By giving different values to r, we get different members of the family e.g.

is
x2 + y2 = 1, x2 + y2 = 4, x2 + y2 = 9 etc. (see Fig 9.1).
Thus, equation (3) represents a family of concentric
circles centered at the origin and having different radii.

bl
We are interested in finding a differential equation
that is satisfied by each member of the family. The
pu
differential equation must be free from r because r is
different for different members of the family. This
equation is obtained by differentiating equation (3) with
be T

respect to x, i.e.,
re
dy dy
o R

2x + 2y =0 or x+y =0 ... (4) Fig 9.1


dx dx
which represents the family of concentric circles given by equation (3).
tt E

Again, let us consider the equation


y = mx + c ... (5)
C

By giving different values to the parameters m and c, we get different members of


the family, e.g.,
no N

y=x (m = 1, c = 0)
y= 3x (m = 3 , c = 0)
y=x+1 (m = 1, c = 1)
©

y=–x (m = – 1, c = 0)
y=–x–1 (m = – 1, c = – 1) etc. ( see Fig 9.2).
Thus, equation (5) represents the family of straight lines, where m, c are parameters.
We are now interested in finding a differential equation that is satisfied by each
member of the family. Further, the equation must be free from m and c because m and
DIFFERENTIAL EQUATIONS 387

c are different for different members of the family. 3x 1


Y x+
This is obtained by differentiating equation (5) with y= y= y= x
–x
respect to x, successively we get y= y=
–x
–1
dy d2y
= m , and =0 ... (6) X’ X
dx dx 2 O

he
The equation (6) represents the family of straight
lines given by equation (5).
Note that equations (3) and (5) are the general Y’

is
solutions of equations (4) and (6) respectively.
Fig 9.2
9.4.1 Procedure to form a differential equation that will represent a given

bl
family of curves
(a) If the given family F1 of curves depends on only one parameter then it is
represented by an equation of the form
pu F1 (x, y, a) = 0 ... (1)
For example, the family of parabolas y2 = ax can be represented by an equation
be T

of the form f (x, y, a) : y2 = ax.


re
Differentiating equation (1) with respect to x, we get an equation involving
o R

y′, y, x, and a, i.e.,


g (x, y, y′, a) = 0 ... (2)
The required differential equation is then obtained by eliminating a from equations
tt E

(1) and (2) as


F (x, y, y′) = 0 ... (3)
C

(b) If the given family F2 of curves depends on the parameters a, b (say) then it is
represented by an equation of the from
no N

F2 (x, y, a, b) = 0 ... (4)


Differentiating equation (4) with respect to x, we get an equation involving
y′, x, y, a, b, i.e.,
©

g (x, y, y′, a, b) = 0 ... (5)


But it is not possible to eliminate two parameters a and b from the two equations
and so, we need a third equation. This equation is obtained by differentiating
equation (5), with respect to x, to obtain a relation of the form
h (x, y, y′, y″, a, b) = 0 ... (6)
388 MATHEMATICS

The required differential equation is then obtained by eliminating a and b from


equations (4), (5) and (6) as
F (x, y, y′, y″) = 0 ... (7)

$ Note The order of a differential equation representing a family of curves is


same as the number of arbitrary constants present in the equation corresponding to

he
the family of curves.
Example 4 Form the differential equation representing the family of curves y = mx,
where, m is arbitrary constant.

is
Solution We have
y = mx ... (1)
Differentiating both sides of equation (1) with respect to x, we get

bl
dy
=m
dx
dy
pu
Substituting the value of m in equation (1) we get y
dx
x

dy
x
be T

or –y=0
dx
which is free from the parameter m and hence this is the required differential equation.
re
o R

Example 5 Form the differential equation representing the family of curves


y = a sin (x + b), where a, b are arbitrary constants.
tt E

Solution We have
y = a sin (x + b) ... (1)
Differentiating both sides of equation (1) with respect to x, successively we get
C

dy
= a cos (x + b) ... (2)
dx
no N

d2y
= – a sin (x + b) ... (3)
dx 2
©

Eliminating a and b from equations (1), (2) and (3), we get


d2y
+y =0 ... (4)
dx 2
which is free from the arbitrary constants a and b and hence this the required differential
equation.
DIFFERENTIAL EQUATIONS 389

Example 6 Form the differential equation


representing the family of ellipses having foci on
x-axis and centre at the origin.
Solution We know that the equation of said family
of ellipses (see Fig 9.3) is

he
x2 y 2
+ =1 ... (1)
a 2 b2 Fig 9.3

2 x 2 y dy
Differentiating equation (1) with respect to x, we get + =0

is
a 2 b 2 dx

y ⎛ dy ⎞ −b 2

bl
or ⎜ ⎟= 2 ... (2)
x ⎝ dx ⎠ a
Differentiating both sides of equation (2) with respect to x, we get
pu
dy
2 x y
y d y dx dy
be T

=0
x dx 2 x 2
dx
re
o R

2
d2y dy dy
or xy x –y =0 ... (3)
dx 2 dx dx
tt E

which is the required differential equation.


Example 7 Form the differential equation of the family Y
C

of circles touching the x-axis at origin.


Solution Let C denote the family of circles touching
no N

x-axis at origin. Let (0, a) be the coordinates of the


centre of any member of the family (see Fig 9.4).
Therefore, equation of family C is
©

x2 + (y – a)2 = a2 or x2 + y2 = 2ay ... (1) X


X’ O
where, a is an arbitrary constant. Differentiating both
sides of equation (1) with respect to x,we get
dy dy Y’
2x 2 y = 2a Fig 9.4
dx dx
390 MATHEMATICS

dy
x y
dy dy dx
or x y = a or a = ... (2)
dx dx dy
dx
Substituting the value of a from equation (2) in equation (1), we get

he
dy
x y
dx
x2 + y2 = 2 y
dy
dx

is
dy 2 dy
or (x y 2 ) = 2 xy 2 y 2
dx dx

bl
dy 2 xy
or = 2 2
dx x –y
pu
This is the required differential equation of the given family of circles.
Example 8 Form the differential equation representing the family of parabolas having
be T

vertex at origin and axis along positive direction of x-axis.


re
Solution Let P denote the family of above said parabolas (see Fig 9.5) and let (a, 0) be the
o R

focus of a member of the given family, where a is an arbitrary constant. Therefore, equation
of family P is
tt E

y2 = 4ax ... (1)


Differentiating both sides of equation (1) with respect to x, we get
C

dy
2y = 4a ... (2)
dx
Substituting the value of 4a from equation (2)
no N

in equation (1), we get

⎛ dy ⎞
y2 = ⎜ 2 y ⎟ ( x)
⎝ dx ⎠
©

dy
or y 2 − 2 xy =0
dx
which is the differential equation of the given family
of parabolas. Fig 9.5
DIFFERENTIAL EQUATIONS 391

EXERCISE 9.3
In each of the Exercises 1 to 5, form a differential equation representing the given
family of curves by eliminating arbitrary constants a and b.
x y
1. + =1 2. y2 = a (b2 – x2) 3. y = a e3x + b e– 2x
a b

he
4. y = e2x (a + bx) 5. y = ex (a cos x + b sin x)
6. Form the differential equation of the family of circles touching the y-axis at
origin.
7. Form the differential equation of the family of parabolas having vertex at origin

is
and axis along positive y-axis.
8. Form the differential equation of the family of ellipses having foci on y-axis and
centre at origin.

bl
9. Form the differential equation of the family of hyperbolas having foci on x-axis
and centre at origin.
pu
10. Form the differential equation of the family of circles having centre on y-axis
and radius 3 units.
11. Which of the following differential equations has y = c1 ex + c2 e–x as the general
be T

solution?
re
d2y d2y d2y d2y
+ = − = + = −1 = 0
o R

(A) y 0 (B) y 0 (C) 1 0 (D)


dx 2 dx 2 dx 2 dx 2
12. Which of the following differential equations has y = x as one of its particular
solution?
tt E

d2y dy d2y dy
(A) 2
− x2 + xy = x (B) 2
+x + xy = x
C

dx dx dx dx
d 2 y 2 dy d2y dy
(C) x xy 0 (D) + x + xy = 0
dx 2 2
no N

dx dx dx
9.5. Methods of Solving First Order, First Degree Differential Equations
In this section we shall discuss three methods of solving first order first degree differential
equations.
©

9.5.1 Differential equations with variables separable


A first order-first degree differential equation is of the form
dy
= F (x, y) ... (1)
dx
392 MATHEMATICS

If F (x, y) can be expressed as a product g (x) h(y), where, g(x) is a function of x


and h(y) is a function of y, then the differential equation (1) is said to be of variable
separable type. The differential equation (1) then has the form
dy
= h (y) . g (x) ... (2)
dx
If h (y) ≠ 0, separating the variables, (2) can be rewritten as

he
1
dy = g (x) dx ... (3)
h( y )
Integrating both sides of (3), we get

is
1
∫ h( y) dy = ∫ g ( x) dx ... (4)

bl
Thus, (4) provides the solutions of given differential equation in the form
pu H (y) = G (x) + C
1
Here, H (y) and G (x) are the anti derivatives of h( y ) and g (x) respectively and
C is the arbitrary constant.
be T

dy x + 1
re
Example 9 Find the general solution of the differential equation = , (y ≠ 2)
o R

dx 2 − y
Solution We have
tt E

dy x +1
= ... (1)
dx 2− y
C

Separating the variables in equation (1), we get


(2 – y) dy = (x + 1) dx ... (2)
no N

Integrating both sides of equation (2), we get

∫ (2 − y) dy = ∫ ( x + 1) dx
©

y2 x2
or 2y − = + x + C1
2 2
or x2 + y2 + 2x – 4y + 2 C1 = 0
or x2 + y2 + 2x – 4y + C = 0, where C = 2C1
which is the general solution of equation (1).
DIFFERENTIAL EQUATIONS 393

dy 1 + y 2
Example 10 Find the general solution of the differential equation = .
dx 1 + x 2
Solution Since 1 + y2 ≠ 0, therefore separating the variables, the given differential
equation can be written as

he
dy dx
2 = ... (1)
1+ y 1 + x2
Integrating both sides of equation (1), we get
dy dx
∫ 1 + y2 ∫ 1 + x2

is
=

or tan–1 y = tan–1x + C

bl
which is the general solution of equation (1).
dy
Example 11 Find the particular solution of the differential equation = − 4 xy 2 given
pu dx
that y = 1, when x = 0.
Solution If y ≠ 0, the given differential equation can be written as
be T

dy
re
= – 4x dx ... (1)
y2
o R

Integrating both sides of equation (1), we get


dy
∫ y2 = − 4∫ x dx
tt E

1
C

or − = – 2x2 + C
y
1
no N

or y= ... (2)
2x − C 2

Substituting y = 1 and x = 0 in equation (2), we get, C = – 1.


Now substituting the value of C in equation (2), we get the particular solution of the
©

1
given differential equation as y = .
2x + 1
2

Example 12 Find the equation of the curve passing through the point (1, 1) whose
differential equation is x dy = (2x2 + 1) dx (x ≠ 0).
394 MATHEMATICS

Solution The given differential equation can be expressed as

2 x2 1
dy* = dx*
x

⎛ 1⎞
or dy = ⎜ 2x + ⎟ dx ... (1)
⎝ x⎠

he
Integrating both sides of equation (1), we get

⎛ 1⎞
∫ dy
= ∫ ⎜ 2x + ⎟ dx
⎝ x⎠

is
2
or y = x + log | x | + C ... (2)
Equation (2) represents the family of solution curves of the given differential equation

bl
but we are interested in finding the equation of a particular member of the family which
passes through the point (1, 1). Therefore substituting x = 1, y = 1 in equation (2), we
get C = 0.
pu
Now substituting the value of C in equation (2) we get the equation of the required
curve as y = x2 + log | x |.
Example 13 Find the equation of a curve passing through the point (–2, 3), given that
be T

2x
re
the slope of the tangent to the curve at any point (x, y) is .
y2
o R

dy
Solution We know that the slope of the tangent to a curve is given by .
dx
tt E

dy 2x
so, = 2 ... (1)
dx y
C

Separating the variables, equation (1) can be written as


y2 dy = 2x dx ... (2)
no N

Integrating both sides of equation (2), we get

∫y ∫ 2 x dx
2
dy =
y3
©

or = x2 + C ... (3)
3
dy
* The notation due to Leibnitz is extremely flexible and useful in many calculation and formal
dx
transformations, where, we can deal with symbols dy and dx exactly as if they were ordinary numbers. By
treating dx and dy like separate entities, we can give neater expressions to many calculations.
Refer: Introduction to Calculus and Analysis, volume-I page 172, By Richard Courant,
Fritz John Spinger – Verlog New York.
DIFFERENTIAL EQUATIONS 395

Substituting x = –2, y = 3 in equation (3), we get C = 5.


Substituting the value of C in equation (3), we get the equation of the required curve as
1
y3
= x 2 + 5 or y = (3x 2 + 15) 3
3
Example 14 In a bank, principal increases continuously at the rate of 5% per year. In
how many years Rs 1000 double itself?

he
Solution Let P be the principal at any time t. According to the given problem,
dp ⎛ 5 ⎞
= ⎜ ⎟×P
dt ⎝ 100 ⎠

is
dp P
or = ... (1)
dt 20

bl
separating the variables in equation (1), we get
dp dt
= ... (2)
P 20
pu
Integrating both sides of equation (2), we get
t
+ C1
be T

log P =
20
t
re
e 20 ⋅ eC1
o R

or P=
t
or P = C e 20 (where e C1 = C ) ... (3)
tt E

Now P = 1000, when t = 0


Substituting the values of P and t in (3), we get C = 1000. Therefore, equation (3),
C

gives
t

P = 1000 e 20
no N

Let t years be the time required to double the principal. Then


t
2000 = 1000 e 20 ⇒ t = 20 loge2
©

EXERCISE 9.4
For each of the differential equations in Exercises 1 to 10, find the general solution:
dy 1 − cos x dy
1. = 2. = 4 − y 2 (−2 < y < 2)
dx 1 + cos x dx
396 MATHEMATICS

dy
3. + y = 1 ( y ≠ 1) 4. sec2 x tan y dx + sec2 y tan x dy = 0
dx
dy
5. (ex + e–x) dy – (ex – e–x) dx = 0 6. = (1 + x 2 ) (1 + y 2 )
dx
dy
= − y5

he
7. y log y dx – x dy = 0 8. x5
dx
dy
9. = sin −1 x 10. ex tan y dx + (1 – ex) sec2 y dy = 0
dx

is
For each of the differential equations in Exercises 11 to 14, find a particular solution
satisfying the given condition:

bl
dy
11. ( x + x + x + 1)
3 2
= 2x2 + x; y = 1 when x = 0
dx
dy
12.
pu
x ( x 2 − 1)
dx
= 1 ; y = 0 when x = 2

⎛ dy ⎞
be T

13. cos ⎜ ⎟ = a (a ∈ R); y = 2 when x = 0


⎝ dx ⎠
re
o R

dy
14. = y tan x ; y = 1 when x = 0
dx
tt E

15. Find the equation of a curve passing through the point (0, 0) and whose differential
equation is y′ = ex sin x.
C

dy
16. For the differential equation xy = ( x + 2) ( y + 2) , find the solution curve
dx
passing through the point (1, –1).
no N

17. Find the equation of a curve passing through the point (0, –2) given that at any
point (x, y) on the curve, the product of the slope of its tangent and y coordinate
of the point is equal to the x coordinate of the point.
©

18. At any point (x, y) of a curve, the slope of the tangent is twice the slope of the
line segment joining the point of contact to the point (– 4, –3). Find the equation
of the curve given that it passes through (–2, 1).
19. The volume of spherical balloon being inflated changes at a constant rate. If
initially its radius is 3 units and after 3 seconds it is 6 units. Find the radius of
balloon after t seconds.
DIFFERENTIAL EQUATIONS 397

20. In a bank, principal increases continuously at the rate of r% per year. Find the
value of r if Rs 100 double itself in 10 years (loge2 = 0.6931).
21. In a bank, principal increases continuously at the rate of 5% per year. An amount
of Rs 1000 is deposited with this bank, how much will it worth after 10 years
(e0.5 = 1.648).
22. In a culture, the bacteria count is 1,00,000. The number is increased by 10% in 2

he
hours. In how many hours will the count reach 2,00,000, if the rate of growth of
bacteria is proportional to the number present?
dy
23. The general solution of the differential equation = e x + y is

is
dx
(A) ex + e–y = C (B) ex + ey = C

bl
(C) e–x + ey = C (D) e–x + e–y = C

9.5.2 Homogeneous differential equations


pu
Consider the following functions in x and y
F1 (x, y) = y2 + 2xy, F2 (x, y) = 2x – 3y,
be T

⎛ y⎞
F3 (x, y) = cos ⎜ ⎟ , F4 (x, y) = sin x + cos y
⎝x⎠
re
o R

If we replace x and y by λx and λy respectively in the above functions, for any nonzero
constant λ, we get
tt E

F1 (λx, λy) = λ2 (y2 + 2xy) = λ2 F1 (x, y)


F2 (λx, λy) = λ (2x – 3y) = λ F2 (x, y)
C

⎛ λy ⎞ ⎛ y⎞
F3 (λx, λy) = cos ⎜ ⎟ = cos ⎜ ⎟ = λ0 F3 (x, y)
⎝ λx ⎠ ⎝x⎠
no N

F4 (λx, λy) = sin λx + cos λy ≠ λn F4 (x, y), for any n ∈ N


Here, we observe that the functions F1, F2, F3 can be written in the form
F(λx, λy) = λn F (x, y) but F4 can not be written in this form. This leads to the following
©

definition:
A function F(x, y) is said to be homogeneous function of degree n if
F(λx, λy) = λn F(x, y) for any nonzero constant λ.
We note that in the above examples, F1, F2, F3 are homogeneous functions of
degree 2, 1, 0 respectively but F4 is not a homogeneous function.
398 MATHEMATICS

We also observe that


⎛ y2 2 y ⎞ 2 ⎛ y⎞
F1(x, y) = x 2 ⎜ 2 + ⎟ = x h1 ⎜ ⎟
⎝x x ⎠ ⎝ x⎠
2⎛ 2x ⎞ ⎛x⎞
F1(x, y) = y ⎜ 1 + ⎟ = y h2 ⎜ ⎟
2
or
⎝ y ⎠ ⎝ y⎠

he
⎛ 3y ⎞ ⎛ y⎞
F2(x, y) = x1 ⎜ 2 − ⎟ = x1h3 ⎜ ⎟
⎝ x ⎠ ⎝x⎠
1⎛ x ⎞ ⎛x⎞
F2(x, y) = y ⎜ 2 − 3 ⎟ = y h4 ⎜ ⎟
1

is
or
⎝ y ⎠ ⎝ y⎠
⎛ y⎞ ⎛ y⎞
⎟ = x h5 ⎜ ⎟
0 0
F3(x, y) = x cos ⎜

bl
⎝ x⎠ ⎝x⎠
n ⎛ y⎞
F4(x, y) ≠ x h6 ⎜ ⎟ , for any n ∈ N
⎝x⎠
pu
n ⎛x⎞
or F4 (x, y) ≠ y h7 ⎜ ⎟ , for any n ∈ N
⎝ y⎠
be T

Therefore, a function F (x, y) is a homogeneous function of degree n if


re
o R

n ⎛ y⎞ ⎛ x⎞
F (x, y) = x g ⎜ ⎟ or ynh ⎜ ⎟
⎝x⎠ ⎝ y⎠
tt E

dy
A differential equation of the form = F (x, y) is said to be homogenous if
dx
C

F(x, y) is a homogenous function of degree zero.


To solve a homogeneous differential equation of the type

⎛ y⎞
no N

dy
= F ( x, y ) = g ⎜ ⎟ ... (1)
dx ⎝x⎠
We make the substitution y = v.x ... (2)
Differentiating equation (2) with respect to x, we get
©

dy dv
= v+x ... (3)
dx dx
dy
Substituting the value of from equation (3) in equation (1), we get
dx
DIFFERENTIAL EQUATIONS 399

dv
v+x = g (v)
dx
dv
or x = g (v) – v ... (4)
dx
Separating the variables in equation (4), we get

he
dv dx
= ... (5)
g (v ) − v x
Integrating both sides of equation (5), we get

is
dv 1
∫ g (v ) − v = ∫ x dx + C ... (6)

bl
Equation (6) gives general solution (primitive) of the differential equation (1) when
y
we replace v by .
x
pu dx
$Note If the homogeneous differential equation is in the form
dy
= F( x , y )
be T

where, F (x, y) is homogenous function of degree zero, then we make substitution


re
o R

x
= v i.e., x = vy and we proceed further to find the general solution as discussed
y
dx ⎛ x⎞
= F( x, y ) = h ⎜ ⎟ .
tt E

above by writing
dy ⎝ y⎠
dy
C

Example 15 Show that the differential equation (x – y) = x + 2y is homogeneous


dx
and solve it.
no N

Solution The given differential equation can be expressed as


dy x + 2y
= ... (1)
dx x− y
©

x 2y
Let F (x, y) =
x y

( x 2 y) 0
Now F (λx, λy) = F ( x, y )
( x y)
400 MATHEMATICS

Therefore, F (x, y) is a homogenous function of degree zero. So, the given differential
equation is a homogenous differential equation.
Alternatively,
⎛ 2y ⎞
1+
dy ⎜ x ⎟ g⎛ y ⎞
=⎜ ⎟ = ⎜ ⎟ ... (2)
dx ⎜ 1− y ⎟ ⎝x⎠

he
⎝ x ⎠
y
R.H.S. of differential equation (2) is of the form g and so it is a homogeneous
x

is
function of degree zero. Therefore, equation (1) is a homogeneous differential equation.
To solve it we make the substitution

bl
y = vx ... (3)
Differentiating equation (3) with respect to, x we get
dy dv
pu dx
= v+x
dx
... (4)

dy
be T

Substituting the value of y and in equation (1) we get


dx
re
o R

dv 1 + 2v
v+x =
dx 1− v
tt E

dv 1 + 2v
or x = −v
dx 1− v
C

dv v2 v 1
or x =
dx 1 v
no N

v 1 dx
or 2
dv =
v v 1 x
Integrating both sides of equation (5), we get
©

v 1 dx
2
dv =
v v 1 x
1 2v 1 3
or dv = – log | x | + C1
2 v2 v 1
DIFFERENTIAL EQUATIONS 401

1 2v 1 3 1
or dv dv log x C1
2 v2 v 1 2
2 v v 1
1 3 1
or log v 2 v 1 2
dv log x C1
2 2 1
2
3
v
2 2

he
1 3 2 2v 1
or log v 2 v 1 . tan 1
log x C1
2 2 3 3

is
1 1 2v 1
or log v 2 v 1 log x 2 3 tan 1
C1 (Why?)
2 2 3

bl
y
Replacing v by , we get
pu x
1 y2 y 1 2y x
or log 2 1 log x 2 3 tan 1
C1
2 x x 2 3x
be T

1 ⎛ y2 y ⎞ ⎛ 2y + x ⎞
log ⎜ 2 + + 1⎟ x 2 = 3 tan −1 ⎜ ⎟ + C1
re
or 2 ⎝x x ⎠ ⎝ 3x ⎠
o R

⎛ 2y + x ⎞
or log ( y 2 + xy + x 2 ) = 2 3 tan −1 ⎜ ⎟ + 2C1
⎝ 3x ⎠
tt E

⎛ x + 2y ⎞
log ( x 2 + xy + y 2 ) = 2 3 tan −1 ⎜ ⎟ +C
C

or
⎝ 3x ⎠
which is the general solution of the differential equation (1)
no N

⎛ y ⎞ dy ⎛ y⎞
Example 16 Show that the differential equation x cos ⎜ ⎟ = y cos ⎜ ⎟ + x is
⎝ x ⎠ dx ⎝x⎠
homogeneous and solve it.
©

Solution The given differential equation can be written as


⎛ y⎞
y cos ⎜ ⎟ + x
dy ⎝x⎠
= ... (1)
dx ⎛ y⎞
x cos ⎜ ⎟
⎝x⎠
402 MATHEMATICS

dy
It is a differential equation of the form = F( x, y ) .
dx

⎛ y⎞
y cos ⎜ ⎟ + x
⎝ x⎠
Here F (x, y) =
⎛ y⎞
x cos ⎜ ⎟

he
⎝x⎠
Replacing x by λx and y by λy, we get

⎛ y⎞
λ [ y cos ⎜ ⎟ + x]

is
⎝x⎠ = λ 0 [F( x, y )]
F (λx, λy) =
⎛ y⎞
λ ⎜ x cos ⎟
⎝ ⎠

bl
x
Thus, F (x, y) is a homogeneous function of degree zero.
Therefore, the given differential equation is a homogeneous differential equation.
pu
To solve it we make the substitution
y = vx ... (2)
be T

Differentiating equation (2) with respect to x, we get


re
dy dv
o R

= v+x ... (3)


dx dx
dy
tt E

Substituting the value of y and in equation (1), we get


dx
dv v cos v + 1
C

v+x =
dx cos v
dv v cos v + 1
−v
no N

or x =
dx cos v
dv 1
or x =
dx cos v
©

dx
or cos v dv =
x
1
Therefore ∫ cos v dv = ∫ x dx
DIFFERENTIAL EQUATIONS 403

or sin v = log | x | + log | C |


or sin v = log | Cx |
y
Replacing v by , we get
x
⎛ y⎞
sin ⎜ ⎟ = log | Cx |

he
⎝x⎠
which is the general solution of the differential equation (1).

⎛ x x

Example 17 Show that the differential equation 2 y e dx + ⎝ y − 2 x e y ⎟⎠ dy = 0 is
⎜ y

is
homogeneous and find its particular solution, given that, x = 0 when y = 1.
Solution The given differential equation can be written as

bl
x

dx 2x e − yy
= ... (1)
pu dy x

2y e y
be T

2 xe − y
y
re
Let F(x, y) =
o R

2 ye y
tt E

⎛ x

λ ⎜ 2 xe y − y ⎟
⎜ ⎟
⎝ ⎠ = λ 0 [F( x, y )]
C

Then F (λx, λy) =


⎛ x

λ ⎜ 2 ye y ⎟
⎜ ⎟
⎝ ⎠
no N

Thus, F (x, y) is a homogeneous function of degree zero. Therefore, the given


differential equation is a homogeneous differential equation.
To solve it, we make the substitution
©

x = vy ... (2)
Differentiating equation (2) with respect to y, we get
dx dv
= v+ y
dy dy
404 MATHEMATICS

dx
Substituting the value of x and in equation (1), we get
dy

dv 2v ev − 1
v+ y =
dy 2ev

he
dv 2v ev − 1
or y = −v
dy 2ev
dv 1
or y = − v

is
dy 2e

− dy
or 2ev dv =

bl
y

dy
or ∫ 2e
v
⋅ dv = − ∫
pu y
or 2 ev = – log |y| + C
be T

x
and replacing v by y , we get
re
x
o R

y
2 e + log | y | = C ... (3)
Substituting x = 0 and y = 1 in equation (3), we get
tt E

2 e0 + log | 1| = C ⇒ C = 2
Substituting the value of C in equation (3), we get
C

x
y
2 e + log | y | = 2
which is the particular solution of the given differential equation.
no N

Example 18 Show that the family of curves for which the slope of the tangent at any

point (x, y) on it is x + y , is given by x2 – y2 = cx.


2 2
©

2 xy
dy
Solution We know that the slope of the tangent at any point on a curve is .
dx
dy x2 + y2
Therefore, =
dx 2 xy
DIFFERENTIAL EQUATIONS 405

y2
1+
dy x2
or = ... (1)
dx 2y
x
Clearly, (1) is a homogenous differential equation. To solve it we make substitution

he
y = vx
Differentiating y = vx with respect to x, we get

dy dv
= v+x

is
dx dx

dv 1 + v 2

bl
or v+x =
dx 2v

dv 1 − v 2
or
pu x
dx
=
2v
be T

2v dx
dv =
1− v 2
x
re
o R

2v dx
or dv = −
v −1
2
x
tt E

2v 1
Therefore ∫ v 2 − 1 dv = −∫ x dx
C

or log | v2 – 1 | = – log | x | + log | C1 |


or log | (v2 – 1) (x) | = log |C1|
no N

or (v2 – 1) x = ± C1

y
Replacing v by , we get
©

⎛ y2 ⎞
⎜ 2 − 1 ⎟ x = ± C1
⎝x ⎠
or (y2 – x2) = ± C1 x or x2 – y2 = Cx
406 MATHEMATICS

EXERCISE 9.5
In each of the Exercises 1 to 10, show that the given differential equation is homogeneous
and solve each of them.

x+ y
1. (x2 + xy) dy = (x2 + y2) dx 2. y′ =
x

he
2 2
3. (x – y) dy – (x + y) dx = 0 4. (x – y ) dx + 2xy dy = 0
dy
5. x2 = x 2 − 2 y 2 + xy 6. x dy – y dx = x 2 + y 2 dx
dx

is
⎧ ⎛ y⎞ ⎛ y ⎞⎫ ⎧ ⎛ y⎞ ⎛ y ⎞⎫
7. ⎨ x cos ⎜ ⎟ + y sin ⎜ ⎟ ⎬ y dx = ⎨ y sin ⎜ ⎟ − x cos ⎜ ⎟ ⎬ x dy
⎩ ⎝x⎠ ⎝ x ⎠⎭ ⎩ ⎝ x⎠ ⎝ x ⎠⎭

bl
dy ⎛ y⎞ ⎛ y⎞
8. x − y + x sin ⎜ ⎟ = 0 9. y dx + x log ⎜ ⎟ dy − 2 x dy = 0
dx ⎝ x⎠ ⎝x⎠
pu x x
y y x
10. 1 e dx e 1 dy 0
be T

y
re
For each of the differential equations in Exercises from 11 to 15, find the particular
o R

solution satisfying the given condition:


11. (x + y) dy + (x – y) dx = 0; y = 1 when x = 1
tt E

12. x2 dy + (xy + y2) dx = 0; y = 1 when x = 1


y
C

13. x sin 2 y dx x dy 0; y when x = 1


x 4

⎛ y⎞
no N

dy y
14. − + cosec ⎜ ⎟ = 0 ; y = 0 when x = 1
dx x ⎝x⎠

dy
15. 2 xy + y − 2 x = 0 ; y = 2 when x = 1
2 2
©

dx
dx ⎛x⎞
16. A homogeneous differential equation of the from = h ⎜ ⎟ can be solved by
dy ⎝ y⎠
making the substitution.
(A) y = vx (B) v = yx (C) x = vy (D) x = v
DIFFERENTIAL EQUATIONS 407

17. Which of the following is a homogeneous differential equation?


(A) (4x + 6y + 5) dy – (3y + 2x + 4) dx = 0
(B) (xy) dx – (x3 + y3) dy = 0
(C) (x3 + 2y2) dx + 2xy dy = 0
(D) y2 dx + (x2 – xy – y2) dy = 0

he
9.5.3 Linear differential equations
A differential equation of the from
dy
+ Py = Q
dx

is
where, P and Q are constants or functions of x only, is known as a first order linear
differential equation. Some examples of the first order linear differential equation are

bl
dy
+ y = sin x
dx
dy ⎛ 1 ⎞
+ ⎜ ⎟ y = ex
pu dx ⎝ x ⎠
dy ⎛ y ⎞ 1
+⎜
be T


dx ⎝ x log x ⎠
=
x
re
Another form of first order linear differential equation is
o R

dx
+ P1 x = Q 1
dy
tt E

where, P1 and Q1 are constants or functions of y only. Some examples of this type of
differential equation are
C

dx
+ x = cos y
dy
no N

dx −2 x
+ = y2e – y
dy y
To solve the first order linear differential equation of the type
©

dy
Py = Q ... (1)
dx
Multiply both sides of the equation by a function of x say g (x) to get
dy
g (x) + P. (g (x)) y = Q . g (x) ... (2)
dx
408 MATHEMATICS

Choose g (x) in such a way that R.H.S. becomes a derivative of y . g (x).


dy d
i.e. g (x) + P. g (x) y = [y . g (x)]
dx dx

dy dy
or g (x) + P. g (x) y = g (x) + y g′ (x)
dx dx

he
⇒ P. g (x) = g′ (x)
g ′ ( x)
or P=
g ( x)

is
Integrating both sides with respect to x, we get

g ′ ( x)
∫ Pdx = ∫ g ( x) dx

∫ P ⋅ dx = log (g (x))

bl
or
pu
or g (x) = e ∫ P dx
be T

On multiplying the equation (1) by g(x) = e ∫ , the L.H.S. becomes the derivative
P dx
re
o R

of some function of x and y. This function g(x) = e ∫


P dx
is called Integrating Factor
(I.F.) of the given differential equation.
tt E

Substituting the value of g (x) in equation (2), we get

P dx dy P dx P dx
C

e Pe y = Q e
dx

d P dx
no N

P dx
or ye = Qe
dx
Integrating both sides with respect to x, we get
©

P dx P dx
y e = Qe dx

P dx P dx
or y= e Qe dx C

which is the general solution of the differential equation.


DIFFERENTIAL EQUATIONS 409

Steps involved to solve first order linear differential equation:


dy
(i) Write the given differential equation in the form + Py = Q where P, Q are
dx
constants or functions of x only.
(ii) Find the Integrating Factor (I.F) = e P dx .

he
(iii) Write the solution of the given differential equation as
y (I.F) = Q × I.F dx C

dx
In case, the first order linear differential equation is in the form dy + P1 x = Q1 ,

is
P1 dy
where, P1 and Q1 are constants or functions of y only. Then I.F = e and the

bl
solution of the differential equation is given by
x . (I.F) = ∫ ( Q1 × I.F) dy + C
pu dy
Example 19 Find the general solution of the differential equation − y = cos x .
dx
be T

Solution Given differential equation is of the form


re
dy
o R

+ Py = Q , where P = –1 and Q = cos x


dx
1 dx x
I.F = e e
tt E

Therefore
Multiplying both sides of equation by I.F, we get
C

dy − x
e− x − e y = e–x cos x
dx
no N

or
dy
( y e− x ) = e–x cos x
dx
On integrating both sides with respect to x, we get
ye– x = ∫ e cos x dx + C
−x
©

... (1)

I = ∫ e cos x dx
−x
Let

⎛ e− x ⎞
⎝ −1 ⎠ ∫
−x
= cos x ⎜ ⎟ − ( − sin x) (−e ) dx
410 MATHEMATICS

= − cos x e − ∫ sin x e dx
−x −x

= − cos x e − ⎡⎣sin x(– e ) − ∫ cos x (−e ) dx ⎤⎦


−x −x −x

= − cos x e + sin x e − ∫ cos x e dx


−x −x −x

or I = – e–x cos x + sin x e–x – I

he
or 2I = (sin x – cos x) e–x
(sin x − cos x) e − x
or I=
2

is
Substituting the value of I in equation (1), we get
⎛ sin x − cos x ⎞ − x
ye– x = ⎜ ⎟e + C
⎝ 2 ⎠

bl
⎛ sin x − cos x ⎞
⎟ + Ce
x
or y= ⎜
⎝ 2 ⎠
pu
which is the general solution of the given differential equation.
dy
be T

Example 20 Find the general solution of the differential equation x + 2 y = x 2 ( x ≠ 0) .


dx
re
Solution The given differential equation is
o R

dy
+ 2y = x2
x ... (1)
dx
tt E

Dividing both sides of equation (1) by x, we get


dy 2
+ y =x
C

dx x
dy 2
+ Py = Q , where P = and Q = x.
no N

which is a linear differential equation of the type


dx x
2 2

So I.F = e ∫ x dx = e2 log x = e
log x
= x 2 [as elog f ( x ) = f ( x)]
Therefore, solution of the given equation is given by
©

y . x2 = ∫ ( x) ( x ) dx + C = ∫ x dx + C
2 3

x2
or y=+ C x −2
4
which is the general solution of the given differential equation.
DIFFERENTIAL EQUATIONS 411

Example 21 Find the general solution of the differential equation y dx – (x + 2y2) dy = 0.


Solution The given differential equation can be written as
dx x
− = 2y
dy y

dx 1

he
This is a linear differential equation of the type + P1 x = Q1 , where P1 = − and
dy y
1
∫ − y dy −1 1
Q1 = 2y. Therefore I.F = e = e − log y = elog ( y ) =

is
y
Hence, the solution of the given differential equation is
⎛1⎞

bl
= ∫ (2 y ) ⎜ ⎟ dy + C
1
x
pu y ⎝ y⎠
x
or = ∫ (2dy ) + C
y
be T

x
or = 2y + C
y
re
o R

or x = 2y2 + Cy
which is a general solution of the given differential equation.
tt E

Example 22 Find the particular solution of the differential equation


dy
+ y cot x = 2x + x2 cot x (x ≠ 0)
C

dx
π
given that y = 0 when x = .
no N

2
dy
Solution The given equation is a linear differential equation of the type + Py = Q ,
dx
©

where P = cot x and Q = 2x + x2 cot x. Therefore


cot x dx
I.F = e e log sin x sin x
Hence, the solution of the differential equation is given by
y . sin x = ∫ (2x + x2 cot x) sin x dx + C
412 MATHEMATICS

or y sin x = ∫ 2x sin x dx + ∫ x2 cos x dx + C


⎛ 2 x2 ⎞ ⎛ 2 x2 ⎞
⎝ 2 ⎠ ∫
− ⎟ dx + ∫ x cos x dx + C
2
or y sin x = sin x ⎜ ⎟ cos x ⎜
⎝ 2 ⎠

y sin x = x sin x − ∫ x cos x dx + ∫ x cos x dx + C


2 2 2
or
or y sin x = x2 sin x + C ... (1)

he
π
Substituting y = 0 and x = in equation (1), we get
2
2
⎛π⎞ ⎛π⎞

is
0 = ⎜ ⎟ sin ⎜ ⎟ + C
⎝2⎠ ⎝2⎠

− π2

bl
or C=
4
Substituting the value of C in equation (1), we get
pu y sin x = x sin x −
2 π2
4
be T

π2
or y = x2 − (sin x ≠ 0)
re
4 sin x
o R

which is the particular solution of the given differential equation.


Example 23 Find the equation of a curve passing through the point (0, 1). If the slope
tt E

of the tangent to the curve at any point (x, y) is equal to the sum of the x coordinate
(abscissa) and the product of the x coordinate and y coordinate (ordinate) of that point.
C

dy
Solution We know that the slope of the tangent to the curve is .
dx
no N

dy
Therefore, = x + xy
dx
dy
or − xy = x ... (1)
©

dx
dy
This is a linear differential equation of the type + Py = Q , where P = – x and Q = x.
dx
− x2
I . F = e∫
− x dx
Therefore, =e 2
DIFFERENTIAL EQUATIONS 413

Hence, the solution of equation is given by

= ∫ ( x) ( e ) dx + C
− x2 − x2
y ⋅e 2 2 ... (2)

− x2
Let I = ∫ ( x) e 2 dx

he
− x2
Let = t , then – x dx = dt or x dx = – dt.
2
− x2
I = − ∫ e dt = −e = –

is
t t
Therefore, e 2
Substituting the value of I in equation (2), we get

bl
− x2 x2
y e 2 = e 2 +C
x2
or
pu y = −1 + C e2 ... (3)
Now (3) represents the equation of family of curves. But we are interested in
be T

finding a particular member of the family passing through (0, 1). Substituting x = 0 and
y = 1 in equation (3) we get
re
1 = – 1 + C . e0 or C = 2
o R

Substituting the value of C in equation (3), we get


x2
tt E

y = −1 + 2e2
which is the equation of the required curve.
C

EXERCISE 9.6
no N

For each of the differential equations given in Exercises 1 to 12, find the general solution:
dy dy dy y
1. + 2 y = sin x 2. + 3 y = e−2 x 3. + = x2
dx dx dx x
©

dy ⎛ π⎞ dy ⎛ π⎞
4. + (sec x) y = tan x ⎜ 0 ≤ x < ⎟ 2
5. cos x + y = tan x ⎜ 0 ≤ x < ⎟
dx ⎝ 2⎠ dx ⎝ 2⎠
dy dy 2
6. x+ 2 y = x 2 log x 7. x log x + y = log x
dx dx x
8. (1 + x ) dy + 2xy dx = cot x dx (x ≠ 0)
2
414 MATHEMATICS

dy dy
9. x + y − x + xy cot x = 0 ( x ≠ 0) 10. ( x + y ) = 1
dx dx
dy
12. ( x + 3 y )
= y ( y > 0) .
2
11. y dx + (x – y2) dy = 0
dx
For each of the differential equations given in Exercises 13 to 15, find a particular

he
solution satisfying the given condition:
dy π
13. + 2 y tan x = sin x; y = 0 when x =
dx 3
dy 1

is
14. (1 + x ) + 2 xy = ; y = 0 when x = 1
2
dx 1 + x2
dy π
− 3 y cot x = sin 2 x; y = 2 when x =

bl
15.
dx 2
16. Find the equation of a curve passing through the origin given that the slope of the
tangent to the curve at any point (x, y) is equal to the sum of the coordinates of
pu
the point.
17. Find the equation of a curve passing through the point (0, 2) given that the sum of
be T

the coordinates of any point on the curve exceeds the magnitude of the slope of
the tangent to the curve at that point by 5.
re
o R

dy
The Integrating Factor of the differential equation x − y = 2 x is
2
18.
dx
1
tt E

(A) e –x (B) e –y (C) (D) x


x
19. The Integrating Factor of the differential equation
C

dx
(1 − y 2 )
+ yx = ay ( 1 y 1) is
dy
no N

1 1 1 1
(A) 2 (B) (C) (D)
y 1 y −1
2
1 − y2 1 − y2

Miscellaneous Examples
©

Example 24 Verify that the function y = c1 eax cos bx + c2 eax sin bx, where c1, c2 are
arbitrary constants is a solution of the differential equation
d2y
− 2a + ( a 2 + b 2 ) y = 0
dy
2
dx dx
DIFFERENTIAL EQUATIONS 415

Solution The given function is


y = eax [c1 cosbx + c2 sinbx] ... (1)
Differentiating both sides of equation (1) with respect to x, we get
dy
= e ax – bc1 sin bx b c2 cos bx c1 cos bx c2 sin bx e ax a
dx

he
dy
or = e ax [(b c2 + a c1 ) cos bx + (a c2 − b c1 )sin bx] ... (2)
dx
Differentiating both sides of equation (2) with respect to x, we get

is
d2y
= e ax [(b c2 a c1 ) ( b sin bx) (a c2 b c1 ) (b cos bx)]
dx 2

bl
+ [(b c2 + a c1 ) cos bx + ( a c2 − b c1 ) sin bx] e ax . a

= e ax [(a 2 c2 − 2ab c1 − b 2 c2 ) sin bx + ( a 2 c1 + 2ab c2 − b 2 c1 ) cos bx]


pu
d 2 y dy
Substituting the values of , and y in the given differential equation, we get
dx 2 dx
be T
re
L.H.S. = e ax [a 2 c2 − 2abc1 − b 2 c2 )sin bx + (a 2 c1 + 2abc2 − b 2 c1 ) cos bx]
o R

2ae ax [(bc2 ac1 ) cos bx (ac2 bc1 )sin bx]


tt E

(a 2 b 2 ) e ax [c1 cos bx c2 sin bx]

( )
⎡ a 2 c2 − 2abc1 − b 2 c2 − 2a 2 c2 + 2abc1 + a 2 c2 + b 2 c2 sin bx ⎤
C

= e ⎢ ⎥
ax

⎢ + (a 2 c + 2abc − b 2 c − 2abc − 2a 2 c + a 2 c + b 2 c )cos bx ⎥


⎣ 1 2 1 2 1 1 1 ⎦
no N

= e ax [0 × sin bx + 0cos bx] = eax × 0 = 0 = R.H.S.


Hence, the given function is a solution of the given differential equation.
Example 25 Form the differential equation of the family of circles in the second
©

quadrant and touching the coordinate axes.


Solution Let C denote the family of circles in the second quadrant and touching the
coordinate axes. Let (–a, a) be the coordinate of the centre of any member of
this family (see Fig 9.6).
416 MATHEMATICS

Equation representing the family C is


Y
(x + a)2 + (y – a)2 = a2 ... (1)
2 2 2
or x + y + 2ax – 2ay + a = 0 ... (2)
Differentiating equation (2) with respect to x, we get (–a, a)

dy dy X’ X
2x + 2 y + 2 a − 2a =0 O

he
dx dx
dy ⎛ dy ⎞
or x+ y = a ⎜ − 1⎟
dx ⎝ dx ⎠
Y’

is
x + y y′
or a= Fig 9.6
y′ − 1

bl
Substituting the value of a in equation (1), we get

x + y y′ ⎤ ⎡ x + y y′ ⎤ ⎡ x + y y′ ⎤
2 2 2

⎢ x + y′ − 1 ⎥ + ⎢ y − y′ − 1 ⎥ = ⎢ y′ − 1 ⎥
pu ⎣ ⎦ ⎣ ⎦ ⎣ ⎦
or [xy′ – x + x + y y′] + [y y′ – y – x – y y′] = [x + y y′]2
2 2

or (x + y)2 y′2 + [x + y]2 = [x + y y′]2


be T

or (x + y)2 [(y′)2 + 1] = [x + y y′]2


re
o R

which is the differential equation representing the given family of circles.

⎛ dy ⎞
Example 26 Find the particular solution of the differential equation log ⎜ ⎟ = 3 x + 4 y
⎝ dx ⎠
tt E

given that y = 0 when x = 0.


C

Solution The given differential equation can be written as


dy
= e(3x + 4y)
no N

dx
dy
or = e3x . e4y ... (1)
dx
©

Separating the variables, we get


dy
= e3x dx
e4 y

∫e dy = ∫ e3x dx
−4 y
Therefore
DIFFERENTIAL EQUATIONS 417

e− 4 y e3 x
or = +C
−4 3
or 4 e3x + 3 e– 4y + 12 C = 0 ... (2)
Substituting x = 0 and y = 0 in (2), we get
−7

he
4 + 3 + 12 C = 0 or C =
12
Substituting the value of C in equation (2), we get
4 e3x + 3 e– 4y – 7 = 0,
which is a particular solution of the given differential equation.

is
Example 27 Solve the differential equation

bl
⎛ y⎞ ⎛ y⎞
(x dy – y dx) y sin ⎜ ⎟ = (y dx + x dy) x cos ⎜ ⎟ .
⎝ x⎠ ⎝x⎠
Solution The given differential equation can be written as
pu
⎡ ⎛ y⎞ 2 ⎛ y ⎞⎤ ⎡ ⎛ y⎞ ⎛ y ⎞⎤
⎢ x y sin ⎜⎝ x ⎟⎠ − x cos ⎜⎝ x ⎟⎠ ⎥ dy = ⎢ xy cos ⎜⎝ x ⎟⎠ + y sin ⎜⎝ x ⎟⎠ ⎥ dx
2

⎣ ⎦ ⎣ ⎦
be T

⎛ y⎞ ⎛ y⎞
re
xy cos ⎜ ⎟ + y 2 sin ⎜ ⎟
o R

dy ⎝ ⎠
x ⎝x⎠
or =
dx ⎛ y⎞ ⎛ y⎞
xy sin ⎜ ⎟ − x 2 cos ⎜ ⎟
⎝x⎠ ⎝x⎠
tt E

Dividing numerator and denominator on RHS by x2, we get


C

⎛ y⎞ ⎛ y ⎞ ⎛ y⎞
2
y
cos ⎜ ⎟ + ⎜ 2 ⎟ sin ⎜ ⎟
dy x ⎝ x⎠ ⎝x ⎠ ⎝x⎠
= ... (1)
dx y ⎛ y⎞ ⎛ y⎞
no N

sin ⎜ ⎟ − cos ⎜ ⎟
x ⎝x⎠ ⎝x⎠

dy ⎛ y⎞
Clearly, equation (1) is a homogeneous differential equation of the form = g⎜ ⎟.
©

dx ⎝x⎠
To solve it, we make the substitution
y = vx ... (2)
dy dv
or = v+x
dx dx
418 MATHEMATICS

dv v cos v + v 2 sin v
or v+x = (using (1) and (2))
dx v sin v − cos v
dv 2v cos v
or x =
dx v sin v − cos v

⎛ v sin v − cos v ⎞

he
2 dx
or ⎜ ⎟ dv =
⎝ v cos v ⎠ x
⎛ v sin v − cos v ⎞ 1
Therefore ∫ ⎜⎝ v cos v
⎟ dv = 2 ∫ dx
⎠ x

is
1 1
or ∫ tan v dv − ∫ v dv = 2 ∫ x dx

bl
or log sec v − log | v | = 2log | x | + log | C1 |
sec v
log
or
pu v x2
= log | C1 |

secv
be T

or = ± C1 ... (3)
v x2
re
y
o R

Replacing v by in equation (3), we get


x
⎛ y⎞
tt E

sec ⎜ ⎟
⎝x⎠
= C where, C = ± C1
⎛ y⎞ 2
⎜ ⎟ (x )
C

⎝x⎠

⎛ y⎞
sec ⎜ ⎟ = C xy
no N

or
⎝x⎠
which is the general solution of the given differential equation.
Example 28 Solve the differential equation
©

(tan–1y – x) dy = (1 + y2) dx.


Solution The given differential equation can be written as

dx x tan −1 y
+ = ... (1)
dy 1 + y 2 1 + y2
DIFFERENTIAL EQUATIONS 419

dx
Now (1) is a linear differential equation of the form + P1 x = Q1,
dy

1 tan −1 y .
where, P1 = 2 and Q1 =
1+ y 1 + y2

he
1
Therefore, I . F = ∫ 1+ y 2 dy −1
e = e tan y
Thus, the solution of the given differential equation is
⎛ tan−1 y ⎞ tan −1 y
= ∫⎜

is
tan −1 y 2 ⎟
e dy + C ... (2)
⎝ 1+ y ⎠
xe

⎛ tan −1 y ⎞ tan −1 y

bl
Let
pu I= ∫ ⎜⎝ 1 + y 2 ⎟⎠ e dy
⎛ 1 ⎞
Substituting tan–1 y = t so that ⎜ 2 ⎟
dy = dt , we get
⎝1+ y ⎠
be T

∫ t e dt = t e
t
I= t
– ∫1 . et dt = t et – et = et (t – 1)
re
−1
o R

or I = e tan y (tan–1y –1)


Substituting the value of I in equation (2), we get
−1 −1
tt E

x . e tan y
= e tan y
(tan −1 y − 1) + C
−1
or x = (tan −1y − 1) + C e− tan y
C

which is the general solution of the given differential equation.


no N

Miscellaneous Exercise on Chapter 9


1. For each of the differential equations given below, indicate its order and degree
(if defined).
©

2 3 2
d2y ⎛ dy ⎞ ⎛ dy ⎞ ⎛ dy ⎞
(i) + 5 x ⎜ ⎟ − 6 y = log x (ii) ⎜ ⎟ − 4 ⎜ ⎟ + 7 y = sin x
dx 2
⎝ dx ⎠ ⎝ dx ⎠ ⎝ dx ⎠

d4y ⎛ d3y ⎞
(iii) − sin ⎜ 3 ⎟=0
dx 4 ⎝ dx ⎠
420 MATHEMATICS

2. For each of the exercises given below, verify that the given function (implicit or
explicit) is a solution of the corresponding differential equation.
d2y dy
x –x
(i) y = a e + b e + x 2
: x 2
+ 2 − xy + x 2 − 2 = 0
dx dx

d2y dy
− 2 + 2y = 0

he
x
(ii) y = e (a cos x + b sin x) : 2
dx dx

d2y
(iii) y = x sin 3x : + 9 y − 6cos3 x = 0
dx 2

is
dy
: (x + y ) − xy = 0
2 2
(iv) x2 = 2y2 log y
dx

bl
3. Form the differential equation representing the family of curves given by
(x – a)2 + 2y2 = a2, where a is an arbitrary constant.
4. Prove that x2 – y2 = c (x2 + y2)2 is the general solution of differential equation
pu
(x3 – 3x y2) dx = (y3 – 3x2y) dy, where c is a parameter.
5. Form the differential equation of the family of circles in the first quadrant which
be T

touch the coordinate axes.


re
dy 1 − y2
o R

6. Find the general solution of the differential equation + =0.


dx 1 − x2
tt E

dy y 2 + y + 1
7. Show that the general solution of the differential equation + = 0 is
dx x 2 + x + 1
C

given by (x + y + 1) = A (1 – x – y – 2xy), where A is parameter.

⎛ π⎞
8. Find the equation of the curve passing through the point ⎜ 0, ⎟ whose differential
⎝ 4⎠
no N

equation is sin x cos y dx + cos x sin y dy = 0.


9. Find the particular solution of the differential equation
(1 + e2x) dy + (1 + y2) ex dx = 0, given that y = 1 when x = 0.
©

x
⎛ x ⎞
10. Solve the differential equation y e y dx = ⎜⎝ x e y + y 2 ⎟⎠ dy ( y ≠ 0) .
11. Find a particular solution of the differential equation (x – y) (dx + dy) = dx – dy,
given that y = –1, when x = 0. (Hint: put x – y = t)
DIFFERENTIAL EQUATIONS 421

⎡ e −2 x y ⎤ dx
12. Solve the differential equation ⎢ − ⎥ = 1 ( x ≠ 0) .
⎣ x x ⎦ dy

dy
13. Find a particular solution of the differential equation + y cot x = 4x cosec x
dx

he
π.
(x ≠ 0), given that y = 0 when x =
2

dy
14. Find a particular solution of the differential equation (x + 1) = 2 e–y – 1, given

is
dx
that y = 0 when x = 0.
15. The population of a village increases continuously at the rate proportional to the

bl
number of its inhabitants present at any time. If the population of the village was
20, 000 in 1999 and 25000 in the year 2004, what will be the population of the
village in 2009?
pu
y dx − x dy
16. The general solution of the differential equation = 0 is
be T

y
(A) xy = C (B) x = Cy2 (C) y = Cx (D) y = Cx2
re
o R

dx
17. The general solution of a differential equation of the type + P1 x = Q1 is
dy
tt E

(A) y e ∫
P1 dy
(
= ∫ Q1e ∫
P1 dy
) dy + C
C

(B) y . e ∫
P1 dx
(
= ∫ Q1e ∫
P1 dx
) dx + C
no N

(C) x e ∫
P1 dy
(
= ∫ Q1e ∫
P1 dy
) dy + C
(D) x e ∫ (
= ∫ Q1e ∫ ) dx + C
©

P1 dx P1 dx

18. The general solution of the differential equation ex dy + (y ex + 2x) dx = 0 is


(A) x ey + x2 = C (B) x ey + y2 = C
(C) y ex + x2 = C (D) y ey + x2 = C
422 MATHEMATICS

Summary
 An equation involving derivatives of the dependent variable with respect to
independent variable (variables) is known as a differential equation.
 Order of a differential equation is the order of the highest order derivative
occurring in the differential equation.

he
 Degree of a differential equation is defined if it is a polynomial equation in its
derivatives.
 Degree (when defined) of a differential equation is the highest power (positive
integer only) of the highest order derivative in it.

is
 A function which satisfies the given differential equation is called its solution.
The solution which contains as many arbitrary constants as the order of the
differential equation is called a general solution and the solution free from

bl
arbitrary constants is called particular solution.
 To form a differential equation from a given function we differentiate the
function successively as many times as the number of arbitrary constants in
pu
the given function and then eliminate the arbitrary constants.
 Variable separable method is used to solve such an equation in which variables
be T

can be separated completely i.e. terms containing y should remain with dy


and terms containing x should remain with dx.
re
o R

 A differential equation which can be expressed in the form


dy dx
f ( x, y ) or g ( x, y ) where, f (x, y) and g(x, y) are homogenous
tt E

dx dy
functions of degree zero is called a homogeneous differential equation.
C

dy
 A differential equation of the form
dx
+ Py Q , where P and Q are constants

or functions of x only is called a first order linear differential equation.


no N

Historical Note
©

One of the principal languages of Science is that of differential equations.


Interestingly, the date of birth of differential equations is taken to be November,
11,1675, when Gottfried Wilthelm Freiherr Leibnitz (1646 - 1716) first put in black
1
∫ y dy = 2 y , thereby introducing both the symbols ∫ and dy.
2
and white the identity
DIFFERENTIAL EQUATIONS 423

Leibnitz was actually interested in the problem of finding a curve whose tangents
were prescribed. This led him to discover the ‘method of separation of variables’
1691. A year later he formulated the ‘method of solving the homogeneous
differential equations of the first order’. He went further in a very short time
to the discovery of the ‘method of solving a linear differential equation of the
first-order’. How surprising is it that all these methods came from a single man

he
and that too within 25 years of the birth of differential equations!
In the old days, what we now call the ‘solution’ of a differential equation,
was used to be referred to as ‘integral’ of the differential equation, the word
being coined by James Bernoulli (1654 - 1705) in 1690. The word ‘solution was

is
first used by Joseph Louis Lagrange (1736 - 1813) in 1774, which was almost
hundred years since the birth of differential equations. It was Jules Henri Poincare
(1854 - 1912) who strongly advocated the use of the word ‘solution’ and thus the

bl
word ‘solution’ has found its deserved place in modern terminology. The name of
the ‘method of separation of variables’ is due to John Bernoulli (1667 - 1748),
a younger brother of James Bernoulli.
pu
Application to geometric problems were also considered. It was again John
Bernoulli who first brought into light the intricate nature of differential equations.
be T

In a letter to Leibnitz, dated May 20, 1715, he revealed the solutions of the
differential equation
re
o R

x2 y″ = 2y,
which led to three types of curves, viz., parabolas, hyperbolas and a class of
cubic curves. This shows how varied the solutions of such innocent looking
tt E

differential equation can be. From the second half of the twentieth century attention
has been drawn to the investigation of this complicated nature of the solutions of
differential equations, under the heading ‘qualitative analysis of differential
C

equations’. Now-a-days, this has acquired prime importance being absolutely


necessary in almost all investigations.
no N

—™—
©
424 MATHEMATICS

Chapter 10
VECTOR ALGEBRA

he
™ In most sciences one generation tears down what another has built and what
one has established another undoes. In Mathematics alone each generation
builds a new story to the old structure. – HERMAN HANKEL ™

is
10.1 Introduction

bl
In our day to day life, we come across many queries such
as – What is your height? How should a football player hit
the ball to give a pass to another player of his team? Observe
pu
that a possible answer to the first query may be 1.6 meters,
a quantity that involves only one value (magnitude) which
is a real number. Such quantities are called scalars.
be T

However, an answer to the second query is a quantity (called


re
force) which involves muscular strength (magnitude) and
o R

direction (in which another player is positioned). Such


quantities are called vectors. In mathematics, physics and
tt E

engineering, we frequently come across with both types of


W.R. Hamilton
quantities, namely, scalar quantities such as length, mass, (1805-1865)
time, distance, speed, area, volume, temperature, work,
C

money, voltage, density, resistance etc. and vector quantities like displacement, velocity,
acceleration, force, weight, momentum, electric field intensity etc.
no N

In this chapter, we will study some of the basic concepts about vectors, various
operations on vectors, and their algebraic and geometric properties. These two type of
properties, when considered together give a full realisation to the concept of vectors,
©

and lead to their vital applicability in various areas as mentioned above.


10.2 Some Basic Concepts
Let ‘l’ be any straight line in plane or three dimensional space. This line can be given
two directions by means of arrowheads. A line with one of these directions prescribed
is called a directed line (Fig 10.1 (i), (ii)).
VECTOR ALGEBRA 425

he
Fig 10.1

is
Now observe that if we restrict the line l to the line segment AB, then a magnitude
is prescribed on the line l with one of the two directions, so that we obtain a directed

bl
line segment (Fig 10.1(iii)). Thus, a directed line segment has magnitude as well as
direction.
Definition 1 A quantity that has magnitude as well as direction is called a vector.
pu uuur
Notice that a directed line segment is a vector (Fig 10.1(iii)), denoted as AB or
uuur r
simply as ar , and read as ‘vector AB ’ or ‘vector a ’.
be T

uuur
The point A from where the vector AB starts is called its initial point, and the
re
o R

point B where it ends is called its terminal point. The distance between initial and
terminal points of a vector is called the magnitude (or length) of the vector, denoted as
uuur r
| AB |, or | a |, or a. The arrow indicates the direction of the vector.
tt E

r
$Note Since the length is never negative, the notation | a | < 0 has no meaning.
C

Position Vector
From Class XI, recall the three dimensional right handed rectangular coordinate
no N

system (Fig 10.2(i)). Consider a point P in space, having coordinates (x, y, z) with
uuur
respect to the origin O (0, 0, 0). Then, the vector OP having O and P as its initial and
terminal points, respectively, is called the position vector of the point P with respect
uuur r
©

to O. Using distance formula (from Class XI), the magnitude of OP (or r ) is given by
uuur
| OP | = x2 + y 2 + z 2
In practice, the position vectors of points A, B, C, etc., with respect to the origin O
r r r
are denoted by a , b , c , etc., respectively (Fig 10.2 (ii)).
426 MATHEMATICS

he
is
Fig 10.2
Direction Cosines
uuur r
Consider the position vector OP or r of a point P(x, y, z) as in Fig 10.3. The angles α,

bl
r
β, γ made by the vector r with the positive directions of x, y and z-axes respectively,
are called its direction angles. The cosine values of these angles, i.e., cos α, cos β and
r
cos γ are called direction cosines of the vector r , and usually denoted by l, m and n,
pu
respectively. Z

C
be T
re
o R

z P(x,y,z)
r
O y
Y
tt E

B
P
x
C

A
O
90°
X
no N

Fig 10.3 X
From Fig 10.3, one may note that the triangle OAP is right angled, and in it, we
©

x r
have cos α = ( r stands for | r |) . Similarly, from the right angled triangles OBP and
r
y z
OCP, we may write cos β = and cos γ = . Thus, the coordinates of the point P may
r r
also be expressed as (lr, mr,nr). The numbers lr, mr and nr, proportional to the direction
r
cosines are called as direction ratios of vector r , and denoted as a, b and c, respectively.
VECTOR ALGEBRA 427

$Note One may note that l 2 + m2 + n2 = 1 but a2 + b2 + c2 ≠ 1, in general.

10.3 Types of Vectors


Zero Vector A vector whose initial and terminal points coincide, is called a zero
r
vector (or null vector), and denoted as 0 . Zero vector can not be assigned a definite

he
direction as it has zero magnitude. Or, alternatively otherwise, it may be regarded as
uuur uuur
having any direction. The vectors AA, BB represent the zero vector,
Unit Vector A vector whose magnitude is unity (i.e., 1 unit) is called a unit vector. The
r
unit vector in the direction of a given vector a is denoted by â .

is
Coinitial Vectors Two or more vectors having the same initial point are called coinitial
vectors.

bl
Collinear Vectors Two or more vectors are said to be collinear if they are parallel to
the same line, irrespective of their magnitudes and directions.
r
pu r
Equal Vectors Two vectors a and b are said to be equal, if they have the same
magnitude and direction regardless of the positions of their initial points, and written
r r
be T

as a = b .
re
Negative of a Vector A vector whose magnitude is the same as that of a given vector
o R

uuur
(say, AB ), but direction is opposite to that of it, is called negative of the given vector.
uuur uuur uuur uuur
For example, vector BA is negative of the vector AB , and written as BA = − AB .
tt E

Remark The vectors defined above are such that any of them may be subject to its
parallel displacement without changing its magnitude and direction. Such vectors are
C

called free vectors. Throughout this chapter, we will be dealing with free vectors only.

Example 1 Represent graphically a displacement


no N

of 40 km, 30° west of south.


uuur
Solution The vector OP represents the required
displacement (Fig 10.4).
©

Example 2 Classify the following measures as


scalars and vectors.
(i) 5 seconds
(ii) 1000 cm3
Fig 10.4
428 MATHEMATICS

(iii) 10 Newton (iv) 30 km/hr (v) 10 g/cm3


(vi) 20 m/s towards north

Solution
(i) Time-scalar (ii) Volume-scalar (iii) Force-vector
(iv) Speed-scalar (v) Density-scalar (vi) Velocity-vector

he
Example 3 In Fig 10.5, which of the vectors are:
(i) Collinear (ii) Equal (iii) Coinitial

Solution

is
r r r
(i) Collinear vectors : a , c and d .
r r

bl
(ii) Equal vectors : a and c .
r r r
(iii) Coinitial vectors : b , c and d .
Fig 10.5
pu EXERCISE 10.1
1. Represent graphically a displacement of 40 km, 30° east of north.
be T

2. Classify the following measures as scalars and vectors.


re
(i) 10 kg (ii) 2 meters north-west (iii) 40°
o R

(iv) 40 watt (v) 10–19 coulomb (vi) 20 m/s2


3. Classify the following as scalar and vector quantities.
tt E

(i) time period (ii) distance (iii) force


(iv) velocity (v) work done
C

4. In Fig 10.6 (a square), identify the following vectors.


(i) Coinitial (ii) Equal
no N

(iii) Collinear but not equal


5. Answer the following as true or false.
r r
(i) a and − a are collinear.
©

(ii) Two collinear vectors are always equal in


magnitude. Fig 10.6
(iii) Two vectors having same magnitude are collinear.
(iv) Two collinear vectors having the same magnitude are equal.
VECTOR ALGEBRA 429

10.4 Addition of Vectors


uuur
A vector AB simply means the displacement from a
point A to the point B. Now consider a situation that a
girl moves from A to B and then from B to C
(Fig 10.7). The net displacement made by the girl from
uuur

he
point A to the point C, is given by the vector AC and Fig 10.7
expressed as
uuur uuur uuur
AC = AB + BC

is
This is known as the triangle law of vector addition.
r r
In general, if we have two vectors a and b (Fig 10.8 (i)), then to add them, they

bl
are positioned so that the initial point of one coincides with the terminal point of the
other (Fig 10.8(ii)).
C C
b
pu
+b b b
a
be T

a
A B A B
re
a a
o R

–b
a –
b
(i) (ii) (iii)
tt E

C’
Fig 10.8
r
For example, in Fig 10.8 (ii), we have shifted vector b without changing its magnitude
C

r
and direction, so that it’s initial point coincides with the terminal point of a . Then, the
r r
vector a + b , represented by the third side AC of the triangle ABC, gives us the sum
no N

r r
(or resultant) of the vectors a and b i.e., in triangle ABC (Fig 10.8 (ii)), we have
uuur uuur uuur
AB + BC = AC
uuur uuur
©

Now again, since AC = − CA , from the above equation, we have


uuur uuur uuur uuur r
AB + BC + CA = AA = 0
This means that when the sides of a triangle are taken in order, it leads to zero
resultant as the initial and terminal points get coincided (Fig 10.8(iii)).
430 MATHEMATICS

uuuur uuur
Now, construct a vector BC′ so that its magnitude is same as the vector BC , but
the direction opposite to that of it (Fig 10.8 (iii)), i.e.,
uuuur uuur
BC′ = − BC
Then, on applying triangle law from the Fig 10.8 (iii), we have
uuuur uuur uuuur uuur uuur r r

he
AC′ = AB + BC′ = AB + (−BC) = a − b
uuuur r r
The vector AC′ is said to represent the difference of a and b .
Now, consider a boat in a river going from one bank of the river to the other in a

is
direction perpendicular to the flow of the river. Then, it is acted upon by two velocity
vectors–one is the velocity imparted to the boat by its engine and other one is the
velocity of the flow of river water. Under the simultaneous influence of these two

bl
velocities, the boat in actual starts travelling with a different velocity. To have a precise
idea about the effective speed and direction
(i.e., the resultant velocity) of the boat, we have
pu
the following law of vector addition.
r r
If we have two vectors a and b represented
be T

by the two adjacent sides of a parallelogram in


re
magnitude and direction (Fig 10.9), then their
o R

r r
sum a + b is represented in magnitude and
direction by the diagonal of the parallelogram
tt E

through their common point. This is known as Fig 10.9


the parallelogram law of vector addition.
C

$Note From Fig 10.9, using the triangle law, one may note that
uuur uuur uuur
OA + AC = OC
no N

uuur uuur uuur uuur uuur


or OA + OB = OC (since AC = OB )
which is parallelogram law. Thus, we may say that the two laws of vector
©

addition are equivalent to each other.


Properties of vector addition
r r
Property 1 For any two vectors a and b ,
r r r r
a+b = b +a (Commutative property)
VECTOR ALGEBRA 431

Proof Consider the parallelogram ABCD


uuur r uuur r
(Fig 10.10). Let AB a and BC b , then using
the triangle law, from triangle ABC, we have
uuur r r
AC = a + b
Now, since the opposite sides of a

he
parallelogram are equal and parallel, from
uuur uuur r
Fig 10.10, we have, AD = BC = b and
uuur uuur r
DC = AB = a . Again using triangle law, from Fig 10.10
triangle ADC, we have

is
uuuur uuur uuur r r
AC = AD + DC = b + a

bl
r r r r
Hence a +b = b +a
r r r
Property 2 For any three vectors a , b and c
pu r r r r r r
(a + b ) + c = a + (b + c ) (Associative property)
r r
be T

r uuur uuur uuur


Proof Let the vectors a , b and c be represented by PQ, QR and RS , respectively,
re
as shown in Fig 10.11(i) and (ii).
o R
tt E
C
no N

Fig 10.11
©

r r uuur uuur uuur


Then a + b = PQ + QR = PR
r r uuur uuur uuur
and b + c = QR + RS = QS
r r r uuur uuur uur
So (a + b ) + c = PR + RS = PS
432 MATHEMATICS

r r r uuur uuur uur


and a + (b + c ) = PQ + QS = PS
r r r r r r
Hence (a + b ) + c = a + (b + c )
Remark The associative property of vector addition enables us to write the sum of
r r r r r r
three vectors a , b , c as a + b + c without using brackets.

he
r
Note that for any vector a , we have
r r r r r
a+0 = 0+a =a
r
Here, the zero vector 0 is called the additive identity for the vector addition.

is
10.5 Multiplication of a Vector by a Scalar
r r
Let a be a given vector and λ a scalar. Then the product of the vector a by the scalar

bl
r r
λ, denoted as λ a , is called the multiplication of vector a by the scalar λ. Note that,
r r r
λ a is also a vector, collinear to the vector a . The vector λ a has the direction same
r
(or opposite) to that of vector a according as the value of λ is positive (or negative).
pu r r
Also, the magnitude of vector λ a is | λ | times the magnitude of the vector a , i.e.,
r r
| λa | = | λ | | a |
be T

A geometric visualisation of multiplication of a vector by a scalar is given


re
in Fig 10.12.
o R

a
tt E
2a

–2
2 a
2 a
a

1
1
C

Fig 10.12
r r
When λ = – 1, then λa = − a, which is a vector having magnitude equal to the
no N

r r r
magnitude of a and direction opposite to that of the direction of a . The vector – a is
r
called the negative (or additive inverse) of vector a and we always have
r r r r r
a + (– a ) = (– a ) + a = 0
©

1 r r
Also, if λ = r , provided a 0, i.e. a is not a null vector, then
|a|
r r 1 r
| λa |=| λ | | a | = r | a | 1
|a|
VECTOR ALGEBRA 433

r r
So, λ a represents the unit vector in the direction of a . We write it as
1 r
â = r a
|a|
r r
$ Note For any scalar k, k 0 = 0.

he
10.5.1 Components of a vector
Let us take the points A(1, 0, 0), B(0, 1, 0) and C(0, 0, 1) on the x-axis, y-axis and
z-axis, respectively. Then, clearly
uuur uuur uuur
| OA |= 1, | OB | = 1 and | OC | = 1

is
uuur uuur uuur
The vectors OA, OB and OC , each having magnitude 1,
are called unit vectors along the axes OX, OY and OZ,

bl
respectively, and denoted by iˆ, ˆj and kˆ , respectively
(Fig 10.13). Fig 10.13
pu uuur
Now, consider the position vector OP of a point P (x, y, z) as in Fig 10.14. Let P1
be the foot of the perpendicular from P on the plane XOY. We, thus, see that P1 P is
be T
re
o R
tt E
C
no N
©

Fig 10.14

parallel to z-axis. As iˆ, ˆj and kˆ are the unit vectors along the x, y and z-axes,
uuur uuur
respectively, and by the definition of the coordinates of P, we have P1P = OR = zkˆ .
uuur uuur uuur
Similarly, QP1 = OS = yjˆ and OQ = xiˆ .
434 MATHEMATICS

uuur uuur uuur


Therefore, it follows that OP1 = OQ + QP1 = xiˆ + yjˆ
uuur uuur uuuur
and OP = OP1 + P1P = xiˆ + yjˆ + zkˆ
Hence, the position vector of P with reference to O is given by
uuur r
OP (or r ) = xiˆ + yjˆ + zkˆ

he
This form of any vector is called its component form. Here, x, y and z are called
r
as the scalar components of r , and xiˆ, yjˆ and zkˆ are called the vector components
r
of r along the respective axes. Sometimes x, y and z are also termed as rectangular

is
components.
r
The length of any vector r = xiˆ + yjˆ + zkˆ , is readily determined by applying the

bl
Pythagoras theorem twice. We note that in the right angle triangle OQP1 (Fig 10.14)
uuuur uuuur uuur
| OP 1 | = | OQ |2 +|QP1|2 = x 2 + y 2 ,
pu
and in the right angle triangle OP1P, we have
uuur uuur uuur
OP = | OP1 |2 | P1P |2 ( x2 y2 ) z2
be T

r
Hence, the length of any vector r = xiˆ + yjˆ + zkˆ is given by
re
o R

r
| r | = | xiˆ + yjˆ + zkˆ | = x 2 + y 2 + z 2
r r
If a and b are any two vectors given in the component form a1iˆ + a2 ˆj + a3 kˆ and
tt E

b1iˆ + b2 ˆj + b3 kˆ , respectively, then


r
C

r
(i) the sum (or resultant) of the vectors a and b is given by
r r
a + b = (a1 + b1 )iˆ + (a2 + b2 ) ˆj + (a3 + b3 )kˆ
no N

r r
(ii) the difference of the vector a and b is given by
r r
a − b = (a1 − b1 )iˆ + (a2 − b2 ) ˆj + (a3 − b3 )kˆ
r
©

r
(iii) the vectors a and b are equal if and only if
a1 = b1, a2 = b2 and a3 = b3
r
(iv) the multiplication of vector a by any scalar λ is given by
r
λa = ( a1 )iˆ ( a2 ) ˆj ( a3 )kˆ
VECTOR ALGEBRA 435

The addition of vectors and the multiplication of a vector by a scalar together give
the following distributive laws:
r r
Let a and b be any two vectors, and k and m be any scalars. Then
(i) kar + mar = (k + m)ar
r r
(ii) k (ma ) = (km)a
r r r

he
r
(iii) k (a b ) ka kb

Remarks
r
(i) One may observe that whatever be the value of λ, the vector λa is always

is
r r r
collinear to the vector a . In fact, two vectors a and b are collinear if and only
r r r r
if there exists a nonzero scalar λ such that b = λa . If the vectors a and b are

bl
r r
given in the component form, i.e. a = a iˆ + a ˆj + a kˆ and b = b iˆ + b ˆj + b kˆ ,
1 2 3 1 2 3
then the two vectors are collinear if and only if
pu
b1iˆ + b2 ˆj + b3 kˆ = λ(a1iˆ + a2 ˆj + a3 kˆ)
be T

⇔ b1iˆ + b2 ˆj + b3 kˆ = (λa1 )iˆ + (λa2 ) ˆj + (λa3 )kˆ


re
⇔ b1 = λa1 , b2 = λa2 , b3 = λ a3
o R

b1 b b
⇔ = 2 = 3 =λ
a1 a2 a3
tt E

r r
(ii) If a a1iˆ a2 ˆj a3 kˆ , then a1, a2, a3 are also called direction ratios of a .
C

(iii) In case if it is given that l, m, n are direction cosines of a vector, then liˆ + mjˆ + nkˆ

= (cos α)iˆ + (cos β) ˆj + (cos γ )kˆ is the unit vector in the direction of that vector,
no N

where α, β and γ are the angles which the vector makes with x, y and z axes
respectively.
r
Example 4 Find the values of x, y and z so that the vectors a = xiˆ + 2 ˆj + zkˆ and
©

r
b = 2iˆ + yjˆ + kˆ are equal.
Solution Note that two vectors are equal if and only if their corresponding components
r r
are equal. Thus, the given vectors a and b will be equal if and only if
x = 2, y = 2, z = 1
436 MATHEMATICS

r r r r r r
Example 5 Let a = iˆ + 2 ˆj and b = 2iˆ + ˆj . Is | a | = | b | ? Are the vectors a and b
equal?
r r
Solution We have | a | = 12 + 22 = 5 and | b | 22 12 5
r r
So, | a | = | b | . But, the two vectors are not equal since their corresponding components

he
are distinct.
r
Example 6 Find unit vector in the direction of vector a = 2iˆ + 3 ˆj + kˆ

r 1 r

is
Solution The unit vector in the direction of a vector a is given by aˆ = r a .
|a|
r
22 + 32 + 12 = 14

bl
Now |a| =

1 2 ˆ 3 ˆ 1 ˆ
Therefore aˆ = (2iˆ + 3 ˆj + kˆ) = i+ j+ k
pu 14 14 14 14
r
Example 7 Find a vector in the direction of vector a = iˆ − 2 ˆj that has magnitude
be T

7 units.
re
Solution The unit vector in the direction of the given vector ar is
o R

1 r 1 ˆ 1 ˆ 2 ˆ
aˆ = r a = (i − 2 ˆj ) = i− j
|a| 5 5 5
tt E

r
Therefore, the vector having magnitude equal to 7 and in the direction of a is
C

∧ ⎛ 1 ∧ 2 ∧⎞ 7 ˆ 14 ˆ
i−
7a = 7⎜ i− j⎟ = j
⎝ 5 5 ⎠ 5 5
no N

Example 8 Find the unit vector in the direction of the sum of the vectors,
r r
a = 2iˆ + 2 ˆj – 5kˆ and b = 2iˆ + ˆj + 3kˆ .
Solution The sum of the given vectors is
©

r r r
a b ( c , say) = 4iˆ 3 ˆj 2kˆ

r
and |c | = 42 + 32 + (−2) 2 = 29
VECTOR ALGEBRA 437

Thus, the required unit vector is


1 r 1 4 ˆ 3 ˆ 2 ˆ
cˆ = r c = (4iˆ + 3 ˆj − 2kˆ) = i+ j− k
|c | 29 29 29 29
r
Example 9 Write the direction ratio’s of the vector a = iˆ + ˆj − 2kˆ and hence calculate
its direction cosines.

he
r
Solution Note that the direction ratio’s a, b, c of a vector r = xiˆ + yjˆ + zkˆ are just
the respective components x, y and z of the vector. So, for the given vector, we have
a = 1, b = 1 and c = –2. Further, if l, m and n are the direction cosines of the given

is
vector, then
a 1 b 1 c −2 r
l= r = , m= r = , n= r = as |r | = 6

bl
|r | 6 |r | 6 | r | 6

⎛ 1 1 2 ⎞
Thus, the direction cosines are ⎜ , ,– ⎟.
pu ⎝ 6 6 6⎠

10.5.2 Vector joining two points


be T

If P1(x1, y1, z1) and P2(x2, y2, z2) are any two points, then the vector joining P1 and P2
uuuur
re
is the vector P1P2 (Fig 10.15).
o R

Joining the points P1 and P2 with the origin


O, and applying triangle law, from the triangle
tt E

OP1P2 , we have
uuur uuuur uuuur
OP1 + P1P2 = OP2 .
C

Using the properties of vector addition, the


above equation becomes
uuuur uuuur uuur
no N

P1P2 = OP2 − OP1 Fig 10.15

uuuur
i.e. P1P2 = ( x2iˆ + y2 ˆj + z2 kˆ) − ( x1iˆ + y1 ˆj + z1kˆ)
©

= ( x2 − x1 )iˆ + ( y2 − y1 ) ˆj + ( z2 − z1 )kˆ
uuuur
The magnitude of vector P1P2 is given by
uuuur
P1P2 = ( x2 − x1 ) 2 + ( y2 − y1 ) 2 + ( z2 − z1 ) 2
438 MATHEMATICS

Example 10 Find the vector joining the points P(2, 3, 0) and Q(– 1, – 2, – 4) directed
from P to Q.
Solution Since the vector is to be directed from P to Q, clearly P is the initial point
uuur
and Q is the terminal point. So, the required vector joining P and Q is the vector PQ ,
given by
uuur

he
PQ = (−1 − 2)iˆ + ( −2 − 3) ˆj + (− 4 − 0)kˆ
uuur
i.e. PQ = −3iˆ − 5 ˆj − 4kˆ.
10.5.3 Section formula

is
uuur uuur
Let P and Q be two points represented by the position vectors OP and OQ , respectively,
with respect to the origin O. Then the line segment

bl
joining the points P and Q may be divided by a third
point, say R, in two ways – internally (Fig 10.16)
and externally (Fig 10.17). Here, we intend to find
pu uuur
the position vector OR for the point R with respect
to the origin O. We take the two cases one by one.
be T

Case I When R divides PQ internally (Fig 10.16).


re
uuur uuur uuur Fig 10.16
o R

If R divides PQ such that m RQ = n PR ,


uuur
where m and n are positive scalars, we say that the point R divides PQ internally in the
tt E

ratio of m : n. Now from triangles ORQ and OPR, we have


uuur uuur uuur r r
RQ = OQ − OR = b − r
C

uuur uuur uuur r r


and PR = OR − OP = r − a ,
r r r r
m (b − r ) = n ( r − a ) (Why?)
no N

Therefore, we have
r r
r mb + na
or r = (on simplification)
m+n
©

Hence, the position vector of the point R which divides P and Q internally in the
ratio of m : n is given by
r r
uuur mb + na
OR =
m+n
VECTOR ALGEBRA 439

Case II When R divides PQ externally (Fig 10.17).


We leave it to the reader as an exercise to verify
that the position vector of the point R which divides
the line segment PQ externally in the ratio
PR m
m : n i.e. is given by
QR n

he
r r
uuur mb − na
OR = m − n Fig 10.17
Remark If R is the midpoint of PQ , then m = n. And therefore, from Case I, the

is
uuur
midpoint R of PQ , will have its position vector as
r r
uuur a +b

bl
OR =
2
uuur r r
Example 11 Consider two points P and Q with position vectors OP = 3a − 2b and
pu
uuur r r
OQ a b . Find the position vector of a point R which divides the line joining P and Q
in the ratio 2:1, (i) internally, and (ii) externally.
be T

Solution
re
(i) The position vector of the point R dividing the join of P and Q internally in the
o R

ratio 2:1 is
r r r r r
uuur 2( a + b ) + (3a − 2b ) 5a
tt E

OR = =
2 +1 3
(ii) The position vector of the point R dividing the join of P and Q externally in the
C

ratio 2:1 is
r r r r
r r
uuur 2( a + b ) − (3a − 2b )
OR = = 4b − a
no N

2 −1
Example 12 Show that the points A(2iˆ ˆj kˆ), B(iˆ 3 ˆj 5kˆ), C(3iˆ 4 j 4kˆ) are
the vertices of a right angled triangle.
©

Solution We have
uuur
AB = (1 − 2)iˆ + (−3 + 1) ˆj + (−5 − 1)kˆ iˆ 2 ˆj 6kˆ
uuur
BC = (3 − 1)iˆ + (−4 + 3) ˆj + (−4 + 5)kˆ = 2iˆ − ˆj + kˆ
uuur
and CA = (2 − 3)iˆ + (−1 + 4) ˆj + (1 + 4)kˆ = −iˆ + 3 ˆj + 5kˆ
440 MATHEMATICS

Further, note that


uuur uuur uuur
| AB |2 = 41 = 6 + 35 =| BC |2 + | CA |2
Hence, the triangle is a right angled triangle.

EXERCISE 10.2

he
1. Compute the magnitude of the following vectors:
r r r 1 ˆ 1 ˆ 1 ˆ
a = iˆ + ˆj + k ;
b = 2iˆ − 7 ˆj − 3kˆ; c = i+ j− k
3 3 3

is
2. Write two different vectors having same magnitude.
3. Write two different vectors having same direction.

bl
4. Find the values of x and y so that the vectors 2iˆ + 3 ˆj and xiˆ + yjˆ are equal.
5. Find the scalar and vector components of the vector with initial point (2, 1) and
terminal point (– 5, 7).
pu r r r
6. Find the sum of the vectors a = iˆ − 2 ˆj + kˆ, b = −2iˆ + 4 ˆj + 5kˆ and c = iˆ − 6 ˆj – 7 kˆ .
r
7. Find the unit vector in the direction of the vector a = iˆ + ˆj + 2kˆ .
be T

uuur
re
8. Find the unit vector in the direction of vector PQ, where P and Q are the points
o R

(1, 2, 3) and (4, 5, 6), respectively.


r r
9. For given vectors, a = 2iˆ − ˆj + 2kˆ and b = −iˆ + ˆj − kˆ , find the unit vector in the
tt E

r r
direction of the vector a + b .
10. Find a vector in the direction of vector 5iˆ − ˆj + 2kˆ which has magnitude 8 units.
C

11. Show that the vectors 2iˆ − 3 ˆj + 4kˆ and − 4iˆ + 6 ˆj − 8kˆ are collinear.
12. Find the direction cosines of the vector iˆ + 2 ˆj + 3kˆ .
no N

13. Find the direction cosines of the vector joining the points A (1, 2, –3) and
B(–1, –2, 1), directed from A to B.
14. Show that the vector iˆ + ˆj + kˆ is equally inclined to the axes OX, OY and OZ.
©

15. Find the position vector of a point R which divides the line joining two points P
and Q whose position vectors are iˆ + 2 ˆj − kˆ and – iˆ + ˆj + kˆ respectively, in the
ratio 2 : 1
(i) internally (ii) externally
VECTOR ALGEBRA 441

16. Find the position vector of the mid point of the vector joining the points P(2, 3, 4)
and Q(4, 1, –2).
r
17. Show that the points A, B and C with position vectors, a = 3iˆ − 4 ˆj − 4kˆ,
r r
b = 2iˆ − ˆj + kˆ and c = iˆ − 3 ˆj − 5kˆ , respectively form the vertices of a right angled
triangle.

he
18. In triangle ABC (Fig 10.18), which of the following is not true:
uuur uuuur uuur r
(A) AB + BC + CA = 0
uuur uuur uuur r
(B) AB + BC − AC = 0

is
uuur uuur uuur r
(C) AB + BC − CA = 0

bl
uuur uuur uuur r Fig 10.18
(D) AB − CB + CA = 0
r r
19. If a and b are two collinear vectors, then which of the following are incorrect:
pu r r
(A) b = λa , for some scalar λ
r
be T

r
(B) a = ± b
r
re
r
(C) the respective components of a and b are not proportional
o R

r r
(D) both the vectors a and b have same direction, but different magnitudes.
tt E

10.6 Product of Two Vectors


So far we have studied about addition and subtraction of vectors. An other algebraic
C

operation which we intend to discuss regarding vectors is their product. We may


recall that product of two numbers is a number, product of two matrices is again a
matrix. But in case of functions, we may multiply them in two ways, namely,
no N

multiplication of two functions pointwise and composition of two functions. Similarly,


multiplication of two vectors is also defined in two ways, namely, scalar (or dot)
product where the result is a scalar, and vector (or cross) product where the
result is a vector. Based upon these two types of products for vectors, they have
©

found various applications in geometry, mechanics and engineering. In this section,


we will discuss these two types of products.
10.6.1 Scalar (or dot) product of two vectors
r r r r
Definition 2 The scalar product of two nonzero vectors a and b , denoted by a ⋅ b , is
442 MATHEMATICS

r r r r
defined as a ⋅ b = | a | | b | cos θ ,
r r
where, θ is the angle between a and b , 0 (Fig 10.19).
r r r r
If either a = 0 or b = 0, then θ is not defined, and in this case,
Fig 10.19
r r
we define a b 0

he
Observations
r r
1. a ⋅ b is a real number.
r r r r r r
2. Let a and b be two nonzero vectors, then a ⋅ b = 0 if and only if a and b are

is
perpendicular to each other. i.e.
r r r r
a ⋅b =0⇔ a ⊥b

bl
r r r r
3. If θ = 0, then a ⋅ b = | a | | b |
r r r
In particular, a ⋅ a = | a |2 , as θ in this case is 0.
pu r r r r
4. If θ = π, then a ⋅ b = − | a | | b |
r r r
| a |2 , as θ in this case is π.
be T

In particular, a ( a )
5. In view of the Observations 2 and 3, for mutually perpendicular unit vectors
re
o R

iˆ, ˆj and kˆ, we have


iˆ ⋅ iˆ = ˆj ⋅ ˆj = kˆ ⋅ kˆ = 1,
tt E

iˆ ⋅ ˆj = ˆj ⋅ kˆ = kˆ iˆ 0
r r
C

6. The angle between two nonzero vectors a and b is given by


r r rr
a b –1 ⎛ a.b ⎞
cos r r , or θ = cos ⎜ r r ⎟
no N

| a || b | ⎝ | a || b | ⎠
7. The scalar product is commutative. i.e.
r r r r
a⋅b = b ⋅ a (Why?)
©

Two important properties of scalar product


r r r
Property 1 (Distributivity of scalar product over addition) Let a, b and c be
any three vectors, then
r r r r r r r
a ⋅ (b + c ) = a ⋅ b + a ⋅ c
VECTOR ALGEBRA 443

r r
Property 2 Let a and b be any two vectors, and λ be any scalar. Then
r r r r r r r r
(λ a ) ⋅ b = ( a ) b (a b ) a ( b )
r r
If two vectors a and b are given in component form as a1iˆ + a2 ˆj + a3 kˆ and

b1iˆ + b2 ˆj + b3 kˆ , then their scalar product is given as

he
r r
a ⋅ b = (a1iˆ + a2 ˆj + a3 kˆ) ⋅ (b1iˆ + b2 ˆj + b3 kˆ)
= a1iˆ ⋅ (b1iˆ + b2 ˆj + b3 kˆ) + a2 ˆj ⋅ (b1iˆ + b2 ˆj + b3 kˆ) + a3 kˆ ⋅ (b1iˆ + b2 ˆj + b3 kˆ)

is
= a1b1 (iˆ ⋅ iˆ) + a1b2 (iˆ ⋅ ˆj ) + a1b3 (iˆ ⋅ kˆ) + a2b1 ( ˆj ⋅ iˆ) + a2b2 ( ˆj ⋅ ˆj ) + a2b3 ( ˆj ⋅ kˆ)
+ a3b1 (kˆ ⋅ iˆ) + a3b2 (kˆ ⋅ ˆj ) + a3b3 ( kˆ ⋅ kˆ) (Using the above Properties 1 and 2)

bl
= a1 b 1 + a 2 b 2 + a 3 b 3 (Using Observation 5)
r r
Thus a ⋅ b = a1b1 + a2 b2 + a3b3
pu
10.6.2 Projection of a vector on a line
uuur
Suppose a vector AB makes an angle θ with a given directed line l (say), in the
uuur r
be T

anticlockwise direction (Fig 10.20). Then the projection of AB on l is a vector p


uuur r
re
(say) with magnitude | AB | cos θ , and the direction of p being the same (or opposite)
o R

r
to that of the line l, depending upon whether cos θ is positive or negative. The vector p
tt E

B
B
a a
θ
C

θ l l
A p C C A
p
(90 < θ < 180 )
0 0 0 0
(0 < θ < 90 )
no N

(i) (ii)

p θ θ p
C l C l
©

A
A
a a
B B
(180 < θ < 270 )
0 0 0 0
(270 < θ < 360 )
(iii) (iv)
Fig 10.20
444 MATHEMATICS

r
is called the projection vector, and its magnitude | p | is simply called as the projection
uuur
of the vector AB on the directed line l.
For example, in each of the following figures (Fig 10.20 (i) to (iv)), projection vector
uuur uuur
of AB along the line l is vector AC .
Observations

he
r
1. If p̂ is the unit vector along a line l, then the projection of a vector a on the line
r
l is given by a pˆ .
r r
2. Projection of a vector a on other vector b , is given by

is
r
r ˆ r ⎛ b ⎞ 1 r r
a ⋅ b, or a ⋅ ⎜ r ⎟ , or r (a ⋅ b )
⎝|b |⎠ |b |

bl
uuur uuur
3. If θ = 0, then the projection vector of AB will be AB itself and if θ = π, then the
uuur uuur
projection vector of AB will be BA .
pu π 3π uuur
4. If θ = or θ = , then the projection vector of AB will be zero vector.
2 2
be T

r
Remark If α, β and γ are the direction angles of vector a = a iˆ + a ˆj + a kˆ , then its
1 2 3
re
direction cosines may be given as
o R

r
a iˆ a1 a2 a3
cos r ˆ r , cos r , and cos r
| a || i | | a | |a| |a|
tt E

r r r
Also, note that | a | cos α, | a |cosβ and | a |cosγ are respectively the projections of
r r
a along OX, OY and OZ. i.e., the scalar components a1, a2 and a3 of the vector a ,
C

r
are precisely the projections of a along x-axis, y-axis and z-axis, respectively. Further,
r
if a is a unit vector, then it may be expressed in terms of its direction cosines as
no N

r
a = cos αiˆ + cos βˆj + cos γkˆ
r r
Example 13 Find the angle between two vectors a and b with magnitudes 1 and 2
r r
respectively and when a ⋅ b =1 .
©

r r r r
Solution Given a b 1, | a | 1 and | b | 2 . We have

r r
1 a b 1 1
cos r r cos
| a || b | 2 3
VECTOR ALGEBRA 445

r r
Example 14 Find angle ‘θ’ between the vectors a = iˆ + ˆj − kˆ and b = iˆ − ˆj + kˆ .
r r
Solution The angle θ between two vectors a and b is given by
r r
a ⋅ br
cosθ = r
| a || b |
r r
a ⋅ b = (iˆ + ˆj − kˆ) ⋅ ( iˆ − ˆj + kˆ) = 1 − 1 − 1 = −1 .

he
Now
−1
Therefore, we have cosθ =
3
1 1
θ = cos

is
hence the required angle is
3
r ˆ ˆ r
Example 15 If a = 5i − j − 3k and b = i + 3 j − 5kˆ , then show that the vectors
ˆ ˆ ˆ
r r r r

bl
a + b and a − b are perpendicular.
Solution We know that two nonzero vectors are perpendicular if their scalar product
pu
is zero.
r r
Here a + b = (5iˆ − ˆj − 3kˆ) + ( iˆ + 3 ˆj − 5kˆ) = 6iˆ + 2 ˆj − 8kˆ
be T

r r
and a − b = (5iˆ − ˆj − 3kˆ) − ( iˆ + 3 ˆj − 5kˆ) = 4iˆ − 4 ˆj + 2kˆ
r r r r
re
(a + b ) ⋅ (a − b ) = (6iˆ + 2 ˆj − 8kˆ) ⋅ (4iˆ − 4 ˆj + 2kˆ) = 24 − 8 − 16 = 0.
o R

So
r r r r
Hence a + b and a − b are perpendicular vectors.
r
tt E

Example 16 Find the projection of the vector a = 2iˆ + 3 ˆj + 2kˆ on the vector
r
b = iˆ + 2 ˆj + kˆ .
C

r r
Solution The projection of vector a on the vector b is given by
1 r r (2 × 1 + 3 × 2 + 2 × 1) 10 5
= =
no N

r (a ⋅ b ) = 6
|b | (1) 2 + (2) 2 + (1) 2 6 3
r r r r r r
Example 17 Find | a − b | , if two vectors a and b are such that | a | 2, | b | 3
r r
©

and a ⋅ b = 4 .
Solution We have
r r r r r r
| a b |2 = (a − b ) ⋅ (a − b )
rr r r r r r r
= a.a − a ⋅ b − b ⋅ a + b ⋅ b
446 MATHEMATICS

r r r r
= | a |2 −2( a ⋅ b )+ | b |2
= (2) 2 − 2(4) + (3) 2
r r
Therefore |a −b | = 5
r r r r r r
Example 18 If a is a unit vector and ( x − a ) ⋅ ( x + a ) = 8 , then find | x | .
r r

he
Solution Since a is a unit vector, | a |= 1 . Also,
r r r r
(x − a) ⋅ (x + a) = 8
r r r r r r r r
or x⋅ x + x ⋅a − a⋅ x − a⋅a = 8
r r

is
or | x |2 1 = 8 i.e. | x | 2 = 9
r
Therefore | x | = 3 (as magnitude of a vector is non negative).
r r r r r

bl
r
Example 19 For any two vectors a and b , we always have | a ⋅ b | ≤ | a | | b | (Cauchy-
Schwartz inequality).
r r r r
Solution The inequality holds trivially when either a = 0 or b = 0 . Actually, in such a
pu r r r r r r
situation we have | a ⋅ b | = 0 = | a | | b | . So, let us assume that | a | ≠ 0 ≠ | b | .
Then, we have
be T

r r
| a ⋅b |
r r = | cos θ | ≤ 1
re
| a || b |
o R

r r r
Therefore | a ⋅ b | ≤ | ar | | b |
r r
tt E

Example 20 For any two vectors a and b , we always C


b
r r r r a +
have | a + b | ≤ | a | + | b | (triangle inequality).
b
C

A B
Solution The inequality holds trivially in case either a
r r r r r r r
a = 0 or b = 0 (How?). So, let | a | 0 | b | . Then,
no N

r r r r r r r r Fig 10.21
| a + b |2 = (a + b ) 2 = (a + b ) ⋅ (a + b )
r r r r r r r r
= a ⋅a + a ⋅b + b ⋅ a + b ⋅b
r r r r
©

= | a |2 +2a ⋅ b + | b |2 (scalar product is commutative)


r r r r
≤ | a |2 +2 | a ⋅ b | + | b |2 (since x ≤ | x | ∀ x ∈ R )
r r r r
≤ | a |2 +2 | a || b | + | b |2 (from Example 19)
r r 2
= (| a | | b |)
VECTOR ALGEBRA 447

r r r r
Hence |a b |≤ |a | |b |
Remark If the equality holds in triangle inequality (in the above Example 20), i.e.
r r r r
| a + b | = | a | + | b |,
uuur uuur uuur
then | AC | = | AB | + | BC |

he
showing that the points A, B and C are collinear.
Example 21 Show that the points A (−2iˆ + 3 ˆj + 5kˆ), B( iˆ + 2 ˆj + 3kˆ) and C(7iˆ − kˆ)
are collinear.

is
Solution We have
uuur
AB = (1 2) iˆ (2 3) ˆj (3 5) kˆ 3iˆ ˆj 2kˆ ,
uuur

bl
BC = (7 1) iˆ (0 2) ˆj ( 1 3)kˆ 6iˆ 2 ˆj 4kˆ ,
uuur
AC = (7 2) iˆ (0 3) ˆj ( 1 5)kˆ 9iˆ 3 ˆj 6kˆ
uuur uuur uuur
pu | AB | = 14, | BC | 2 14 and | AC | 3 14
uuur uuur uuur
Therefore AC = | AB | + | BC |
be T

Hence the points A, B and C are collinear.


re
uuur uuur uuur r
o R

$ Note In Example 21, one may note that although AB + BC + CA = 0 but the
points A, B and C do not form the vertices of a triangle.
tt E

EXERCISE 10.3
r r
C

1. Find the angle between two vectors a and b with magnitudes 3 and 2 ,
r
respectively having ar ⋅ b = 6 .
no N

2. Find the angle between the vectors iˆ − 2 ˆj + 3kˆ and 3iˆ − 2 ˆj + kˆ


3. Find the projection of the vector iˆ − ˆj on the vector iˆ + ˆj .
©

4. Find the projection of the vector iˆ + 3 ˆj + 7 kˆ on the vector 7iˆ − ˆj + 8kˆ .


5. Show that each of the given three vectors is a unit vector:
1 ˆ 1 1 ˆ
(2i + 3 ˆj + 6kˆ), (3iˆ − 6 ˆj + 2kˆ), (6i + 2 ˆj − 3kˆ)
7 7 7
Also, show that they are mutually perpendicular to each other.
448 MATHEMATICS

r r r r r r r r
6. Find | a | and | b | , if (a + b ) ⋅ (a − b ) = 8 and | a |= 8 | b | .
r r r r
7. Evaluate the product (3a − 5b ) ⋅ (2a + 7b ) .
r r
8. Find the magnitude of two vectors a and b , having the same magnitude and
1

he
such that the angle between them is 60o and their scalar product is .
2
r r r r r r
9. Find | x | , if for a unit vector a , ( x − a ) ⋅ ( x + a ) = 12 .
r r r r r
10. If a = 2iˆ + 2 ˆj + 3kˆ, b = − iˆ + 2 ˆj + kˆ and c = 3iˆ + ˆj are such that a + λb is

is
r
perpendicular to c , then find the value of λ.
r r r r r r r r
11. Show that | a | b + | b | a is perpendicular to | a | b − | b | a , for any two nonzero

bl
r r
vectors a and b .
r r r r r
12. If a ⋅ a = 0 and a ⋅ b = 0 , then what can be concluded about the vector b ?
pu r r r r r r r
13. If a , b , c are unit vectors such that a + b + c = 0 , find the value of
r r r r r r
be T

a⋅b +b ⋅c + c ⋅a .
r r r r r r
re
14. If either vector a = 0 or b = 0, then a ⋅ b = 0 . But the converse need not be
o R

true. Justify your answer with an example.


15. If the vertices A, B, C of a triangle ABC are (1, 2, 3), (–1, 0, 0), (0, 1, 2),
tt E

uuur
respectively, then find ∠ABC. [∠ABC is the angle between the vectors BA
uuur
and BC ].
C

16. Show that the points A(1, 2, 7), B(2, 6, 3) and C(3, 10, –1) are collinear.
17. Show that the vectors 2iˆ − ˆj + kˆ, iˆ − 3 ˆj − 5kˆ and 3iˆ − 4 ˆj − 4kˆ form the vertices
no N

of a right angled triangle.


r r
18. If a is a nonzero vector of magnitude ‘a’ and λ a nonzero scalar, then λ a is unit
vector if
©

(A) λ = 1 (B) λ = – 1 (C) a = | λ | (D) a = 1/| λ |

10.6.3 Vector (or cross) product of two vectors


In Section 10.2, we have discussed on the three dimensional right handed rectangular
coordinate system. In this system, when the positive x-axis is rotated counterclockwise
VECTOR ALGEBRA 449

into the positive y-axis, a right handed (standard) screw would advance in the direction
of the positive z-axis (Fig 10.22(i)).
In a right handed coordinate system, the thumb of the right hand points in the
direction of the positive z-axis when the fingers are curled in the direction away from
the positive x-axis toward the positive y-axis (Fig 10.22(ii)).

he
is
bl
pu Fig 10.22 (i), (ii)
r r r r
Definition 3 The vector product of two nonzero vectors a and b , is denoted by a b
be T

and defined as
r r r r
a × b = | a || b | sin θ nˆ ,
re
o R

r r
where, θ is the angle between a and b , 0 ≤ θ ≤ π and n̂ is
r r
a unit vector perpendicular to both a and b , such that
tt E

r r
a , b and nˆ form a right handed system (Fig 10.23). i.e., the
r r
C

right handed system rotated from a to b moves in the


Fig 10.23
direction of n̂ .
r r r r r r r
no N

If either a = 0 or b = 0 , then θ is not defined and in this case, we define a × b = 0 .


Observations
r r
1. a × b is a vector.
©

r r r r r r r
2. Let a and b be two nonzero vectors. Then a × b = 0 if and only if a and b
are parallel (or collinear) to each other, i.e.,
r r r r r
a ×b = 0 ⇔ a b
450 MATHEMATICS

r r r r r r
In particular, a × a = 0 and a × (−a ) = 0 , since in the first situation, θ = 0
and in the second one, θ = π, making the value of sin θ to be 0.
r r r r
3. If then a b | a || b | .
2
4. In view of the Observations 2 and 3, for mutually perpendicular

he
unit vectors iˆ, ˆj and kˆ (Fig 10.24), we have
r
iˆ × iˆ = ˆj × ˆj = kˆ × kˆ = 0
iˆ × ˆj = kˆ, ˆj × kˆ = iˆ, kˆ × iˆ = ˆj Fig 10.24

is
r r
5. In terms of vector product, the angle between two vectors a and b may be
given as

bl
r r
| a ×b |
sin θ = r r
| a || b |
pu r r
6. It is always true that the vector product is not commutative, as a × b = − b × a .
r r

r r r r r r
Indeed, a × b = | a || b | sin θ nˆ , where a , b and nˆ form a right handed system,
be T

r r r r r r
i.e., θ is traversed from a to b , Fig 10.25 (i). While, b × a =| a || b | sin θ nˆ1 , where
re
r r r
o R

r
b , a and nˆ1 form a right handed system i.e. θ is traversed from b to a ,
Fig 10.25(ii).
tt E
C
no N
©

Fig 10.25 (i), (ii)


r r
Thus, if we assume a and b to lie in the plane of the paper, then nˆ and nˆ1 both
will be perpendicular to the plane of the paper. But, n̂ being directed above the
paper while n̂1 directed below the paper. i.e. nˆ1 = − nˆ .
VECTOR ALGEBRA 451

r r r r
Hence a × b = | a || b | sin nˆ
r r r r
= − | a || b | sin θnˆ1 = − b × a
7. In view of the Observations 4 and 6, we have
ˆj × iˆ = − kˆ, kˆ × ˆj = − iˆ and iˆ × kˆ = − ˆj.
r r

he
8. If a and b represent the adjacent sides of a triangle then its area is given as
1 r r
|a b |.
2
By definition of the area of a triangle, we have from

is
Fig 10.26,
1
AB ⋅ CD.

bl
Area of triangle ABC =
2 Fig 10.26
r r
But AB = | b | (as given), and CD = | a | sin θ.
pu 1 r r 1 r r
Thus, Area of triangle ABC = | b || a | sin θ = | a × b | .
2 2
be T

r r
9. If a and b represent the adjacent sides of a parallelogram, then its area is
re
r r
given by | a × b | .
o R

From Fig 10.27, we have


tt E

Area of parallelogram ABCD = AB. DE.


r
But AB = | b | (as given), and
C

r
DE = | a | sin θ .
Thus, Fig 10.27
no N

r r r r
Area of parallelogram ABCD = | b || a | sin θ = | a × b | .
We now state two important properties of vector product.
r r r
Property 3 (Distributivity of vector product over addition): If a , b and c
©

are any three vectors and λ be a scalar, then


r r r r r r r
(i) a × ( b + c ) = a b a c
r r r r r r
(ii) λ (a × b ) = (λa ) × b = a × (λb )
452 MATHEMATICS

r r
Let a and b be two vectors given in component form as a1iˆ + a2 ˆj + a3 kˆ and
b1iˆ + b2 ˆj + b3 kˆ , respectively. Then their cross product may be given by

iˆ ˆj kˆ
r r = a a
a ×b 1 2 a3

he
b1 b2 b3
Explanation We have
r r
a × b = (a1iˆ + a2 ˆj + a3 kˆ) × (b1iˆ + b2 ˆj + b3 kˆ)
= a1b1 (iˆ × iˆ) + a1b2 (iˆ × ˆj ) + a1b3 (iˆ × kˆ) + a2b1 ( ˆj × iˆ)

is
+ a2b2 ( ˆj × ˆj ) + a2b3 ( ˆj × kˆ)

bl
+ a3b1 ( kˆ × iˆ) + a3b2 (kˆ × ˆj ) + a3b3 (kˆ × kˆ) (by Property 1)
pu = a1b2 (iˆ × ˆj ) − a1b3 (kˆ × iˆ) − a2b1 (iˆ × ˆj )
+ a2b3 ( ˆj × kˆ) + a3b1 ( kˆ × iˆ) − a3b2 ( ˆj × kˆ)
(as iˆ × iˆ = ˆj × ˆj = kˆ × kˆ = 0 and iˆ × kˆ = − kˆ × iˆ, ˆj × iˆ = −iˆ × ˆj and kˆ × ˆj = − ˆj × kˆ)
be T

= a1b2 kˆ − a1b3 ˆj − a2b1kˆ + a2b3iˆ + a3b1 ˆj − a3b2iˆ


re
(as iˆ × ˆj = kˆ, ˆj × kˆ = iˆ and kˆ × iˆ = ˆj )
o R

= (a2b3 − a3b2 )iˆ − (a1b3 − a3b1 ) ˆj + (a1b2 − a2b1 )kˆ


tt E

iˆ ˆj kˆ
= a1 a2 a3
C

b1 b2 b3
r r r r
Example 22 Find | a × b |, if a = 2iˆ + ˆj + 3kˆ and b = 3iˆ + 5 ˆj − 2kˆ
no N

Solution We have
iˆ ˆj kˆ
r r
a ×b = 2 1 3
©

3 5 −2

= iˆ(−2 − 15) − (− 4 − 9) ˆj + (10 – 3) kˆ = −17iˆ + 13 ˆj + 7 kˆ


r r
Hence |a b| = (−17) 2 + (13) 2 + (7) 2 = 507
VECTOR ALGEBRA 453

r r
Example 23 Find a unit vector perpendicular to each of the vectors (a + b ) and
r r r
(a − b ), where ar = iˆ + ˆj + kˆ, b = iˆ + 2 ˆj + 3kˆ .
r r r r
Solution We have a + b = 2iˆ + 3 ˆj + 4kˆ and a − b = − ˆj − 2kˆ
r r r r
A vector which is perpendicular to both a + b and a − b is given by

he
iˆ ˆj kˆ
r r r r r
(a + b ) × ( a − b ) = 2 3 4 = −2iˆ + 4 ˆj − 2k (= c , say)
ˆ
0 −1 −2

is
r
Now | c | = 4 + 16 + 4 = 24 = 2 6
Therefore, the required unit vector is

bl
r
c −1 ˆ 2 ˆ 1 ˆ
r = i+ j− k
|c | 6 6 6
pu
$Note There are two perpendicular directions to any plane. Thus, another unit
r r r r 1 ˆ 2 ˆ 1 ˆ
be T

vector perpendicular to a + b and a − b will be i− j+ k . But that will


6 6 6
re
r r r r
o R

be a consequence of (a − b ) × ( a + b ) .

Example 24 Find the area of a triangle having the points A(1, 1, 1), B(1, 2, 3)
tt E

and C(2, 3, 1) as its vertices.


uuur uuur
Solution We have AB = ˆj + 2kˆ and AC = iˆ + 2 ˆj . The area of the given triangle
C

1 uuur uuur
is | AB × AC | .
2
no N

iˆ ˆj kˆ
uuur uuur
AB × AC = 0 1 2 = − 4iˆ + 2 ˆj − k
Now, ˆ
©

1 2 0
uuur uuur
Therefore | AB × AC | = 16 + 4 + 1 = 21
1
Thus, the required area is 21
2
454 MATHEMATICS

Example 25 Find the area of a parallelogram whose adjacent sides are given
r r
by the vectors a = 3iˆ + ˆj + 4kˆ and b = iˆ − ˆj + kˆ
r r
Solution The area of a parallelogram with a and b as its adjacent sides is given
r r
by | a × b | .

he
iˆ ˆj kˆ
r r
Now a × b = 3 1 4 = 5iˆ + ˆj − 4kˆ
1 −1 1

is
r r
Therefore | a ×b | = 25 + 1 + 16 = 42

bl
and hence, the required area is 42 .

EXERCISE 10.4
pu r r r r
1. Find | a × b |, if a = iˆ − 7 ˆj + 7kˆ and b = 3iˆ − 2 ˆj + 2kˆ .
r r r r
2. Find a unit vector perpendicular to each of the vector a + b and a − b , where
be T

r r
a = 3iˆ + 2 ˆj + 2kˆ and b = iˆ + 2 ˆj − 2kˆ .
re
o R

r π π
3. If a unit vector a makes angles with iˆ, with ˆj and an acute angle θ with
3 4
r
tt E

k̂ , then find θ and hence, the components of a .


4. Show that
r r r r r
(a − b ) × ( a + b ) = 2( ar × b )
C

r
5. Find λ and μ if (2iˆ + 6 ˆj + 27 kˆ) × (iˆ + λˆj + μkˆ) = 0 .
r r r r r
no N

6. Given that a b 0 and a × b = 0 . What can you conclude about the vectors
r r
a and b ?
r r r
7. Let the vectors a , b , c be given as a1iˆ + a2 ˆj + a3 kˆ, b1iˆ + b2 ˆj + b3 kˆ,
©

r r r r r r r
c1iˆ + c2 ˆj + c3 kˆ . Then show that a × ( b + c ) = a × b + a × c .
r r r r r r r
8. If either a = 0 or b = 0, then a × b = 0 . Is the converse true? Justify your
answer with an example.
9. Find the area of the triangle with vertices A(1, 1, 2), B(2, 3, 5) and C(1, 5, 5).
VECTOR ALGEBRA 455

10. Find the area of the parallelogram whose adjacent sides are determined by the
r r
vectors a = iˆ − ˆj + 3kˆ and b = 2iˆ − 7 ˆj + kˆ .
r r r r 2 r r
11. Let the vectors a and b be such that | a |= 3 and | b |= , then a × b is a
3
r r
unit vector, if the angle between a and b is
(A) π/6 (B) π/4 (C) π/3 (D) π/2

he
12. Area of a rectangle having vertices A, B, C and D with position vectors
1 1 1 1
– iˆ + ˆj + 4kˆ, iˆ + ˆj + 4kˆ , iˆ − ˆj + 4kˆ and – iˆ − ˆj + 4kˆ , respectively is
2 2 2 2

is
1
(A) (B) 1
2
(C) 2 (D) 4

bl
Miscellaneous Examples
Example 26 Write all the unit vectors in XY-plane.
pu r ∧ ∧
Solution Let r = x i + y j be a unit vector in XY-plane (Fig 10.28). Then, from the
figure, we have x = cos θ and y = sin θ (since | rr | = 1). So, we may write the vector rr as
be T

r uuur
r ( = OP ) = cos iˆ sin ˆj ... (1)
re
r
o R

Clearly, | r | = cos 2 θ + sin 2 θ = 1


tt E
C
no N
©

Fig 10.28
Also, as θ varies from 0 to 2π, the point P (Fig 10.28) traces the circle x2 + y2 = 1
counterclockwise, and this covers all possible directions. So, (1) gives every unit vector
in the XY-plane.
456 MATHEMATICS

Example 27 If iˆ ˆj kˆ, 2iˆ 5 ˆj , 3iˆ 2 ˆj 3kˆ and iˆ 6 ˆj kˆ are the position


uuur
vectors of points A, B, C and D respectively, then find the angle between AB and
uuur uuur uuur
CD . Deduce that AB and CD are collinear.
Solution Note that if θ is the angle between AB and CD, then θ is also the angle
uuur uuur

he
between AB and CD .
uuur
Now AB = Position vector of B – Position vector of A
= (2iˆ + 5 ˆj ) − (iˆ + ˆj + kˆ) = iˆ + 4 ˆj − kˆ

is
uuur
Therefore | AB | = (1) 2 + (4) 2 + (−1) 2 = 3 2
uuur uuur
CD = − 2iˆ − 8 ˆj + 2kˆ and |CD |= 6 2

bl
Similarly
uuur uuur
AB CD
Thus cos θ = uuur uuur
pu |AB||CD|

1(−2) + 4(−8) + (−1)(2) −36


= =
= −1
be T

(3 2)(6 2) 36
uuur uuur
re
Since 0 ≤ θ ≤ π, it follows that θ = π. This shows that AB and CD are collinear.
o R

uuur 1 uuur uuur uuur


Alternatively, AB CD which implies that AB and CD are collinear vectors.
2
tt E

r r r r r r
Example 28 Let a , b and c be three vectors such that | a |= 3, | b |= 4, | c |= 5 and
r r r
each one of them being perpendicular to the sum of the other two, find | a + b + c | .
C

r r r r r r r r r
Solution Given a ⋅ (b + c ) = 0, b ⋅ (c + a ) = 0, c ⋅ (a + b ) = 0.
no N

r r r r r r r r r r r r
Now | a + b + c |2 = (a + b + c ) 2 = ( a + b + c ) ⋅ (a + b + c )
r r r r r r r r r r
= a ⋅ a + a ⋅ (b + c ) + b ⋅ b + b ⋅ (a + c )
r r r rr
©

+ c .(a + b ) + c .c
r r r
= | a |2 + | b |2 + | c |2
= 9 + 16 + 25 = 50
r r r
Therefore | a + b + c | = 50 = 5 2
VECTOR ALGEBRA 457

r r r r r r r
Example 29 Three vectors a, b and c satisfy the condition a + b + c = 0 . Evaluate
r r r r r r r r r
the quantity μ = a ⋅ b + b ⋅ c + c ⋅ a , if | a |= 1, | b |= 4 and | c |= 2 .
r r r r
Solution Since a + b + c = 0 , we have
r r r r
a (a b c ) = 0

he
r r r r r r
or a ⋅ a + a ⋅b + a ⋅c = 0
r r r r r 2
Therefore a ⋅ b + a ⋅ c = − a = −1 ... (1)

is
r r r r
Again, b ⋅ (a + b + c ) = 0
r r r r r 2
a ⋅b + b ⋅c = − b

bl
or = −16 ... (2)
r r r r
Similarly a ⋅ c + b ⋅ c = – 4. ... (3)
pu
Adding (1), (2) and (3), we have
r r r r r r
2 ( a ⋅ b + b ⋅ c + a ⋅ c ) = – 21
be T

−21
re
or 2μ = – 21, i.e., μ =
o R

2
Example 30 If with reference to the right handed system of mutually perpendicular
r r r
tt E

unit vectors iˆ, ˆj and kˆ, α = 3iˆ − ˆj , β = 2iˆ + ˆj – 3kˆ , then express β in the form
r r r r r r r
1 2 , where 1 is parallel to and 2 is perpendicular to α .
C

r r r
Solution Let 1 , is a scalar, i.e., β1 = 3λiˆ − λˆj .
r r r
no N

Now β2 = β − β1 = (2 − 3λ )iˆ + (1 + λ ) ˆj − 3kˆ .


r r r r
Now, since β 2 is to be perpendicular to α , we should have α ⋅β2 = 0 . i.e.,
3(2 − 3λ ) − (1 + λ ) = 0
©

1
or λ=
2
r 3 1 r 1 3
Therefore β1 = iˆ − ˆj and β2 = iˆ + ˆj – 3kˆ
2 2 2 2
458 MATHEMATICS

Miscellaneous Exercise on Chapter 10


1. Write down a unit vector in XY-plane, making an angle of 30° with the positive
direction of x-axis.
2. Find the scalar components and magnitude of the vector joining the points
P (x1, y1, z1) and Q (x2, y2, z2).
3. A girl walks 4 km towards west, then she walks 3 km in a direction 30° east of

he
north and stops. Determine the girl’s displacement from her initial point of
departure.
r r r r r r
4. If a = b + c , then is it true that | a |=| b | + | c | ? Justify your answer.

is
5. Find the value of x for which x(iˆ + ˆj + kˆ) is a unit vector.
6. Find a vector of magnitude 5 units, and parallel to the resultant of the vectors
r r

bl
a = 2iˆ + 3 ˆj − kˆ and b = iˆ − 2 ˆj + kˆ .
r r r
7. If a = iˆ + ˆj + kˆ , b = 2iˆ − ˆj + 3kˆ and c = iˆ − 2 ˆj + kˆ , find a unit vector parallel
r r
pu
to the vector 2a – b + 3c .
r
8. Show that the points A (1, – 2, – 8), B (5, 0, – 2) and C (11, 3, 7) are collinear, and
find the ratio in which B divides AC.
be T

9. Find the position vector of a point R which divides the line joining two points
re
r r r r
P and Q whose position vectors are (2a + b ) and (a – 3b ) externally in the ratio
o R

1 : 2. Also, show that P is the mid point of the line segment RQ.
10. The two adjacent sides of a parallelogram are 2iˆ − 4 ˆj + 5kˆ and iˆ − 2 ˆj − 3kˆ .
tt E

Find the unit vector parallel to its diagonal. Also, find its area.
11. Show that the direction cosines of a vector equally inclined to the axes OX, OY
C

1 1 1
and OZ are , , .
3 3 3
no N

r r r r
12. Let a = iˆ + 4 ˆj + 2kˆ, b = 3iˆ − 2 ˆj + 7 kˆ and c = 2iˆ − ˆj + 4kˆ . Find a vector d
r r r r
which is perpendicular to both a and b , and c ⋅ d = 15 .
©

13. The scalar product of the vector iˆ + ˆj + kˆ with a unit vector along the sum of
vectors 2iˆ + 4 ˆj − 5kˆ and λiˆ + 2 ˆj + 3kˆ is equal to one. Find the value of λ.
r r r
14. If a , b , c are mutually perpendicular vectors of equal magnitudes, show that
r r r r r r
the vector a + b + c is equally inclined to a , b and c .
VECTOR ALGEBRA 459

r r r r r r r r
15. Prove that (a + b ) ⋅ (a + b ) =| a |2 + | b |2 , if and only if a , b are perpendicular,
r r r r
given a ≠ 0, b ≠ 0 .
Choose the correct answer in Exercises 16 to 19.
r r r r
16. If θ is the angle between two vectors a and b , then a ⋅ b ≥ 0 only when
π π

he
(A) 0 < θ < (B) 0 ≤ θ ≤
2 2
(C) 0 < θ < π (D) 0 ≤ θ ≤ π
r r r r
17. Let a and b be two unit vectors and θ is the angle between them. Then a + b

is
is a unit vector if
π π π 2π
(A) θ = (B) θ = (C) θ = (D) θ =

bl
4 3 2 3
18. The value of iˆ.( ˆj kˆ) ˆj (iˆ kˆ) kˆ (iˆ ˆj ) is
pu
(A) 0 (B) –1 (C) 1 (D) 3
r r r r r r
19. If θ is the angle between any two vectors a and b , then | a ⋅ b | = | a × b | when
θ is equal to
be T

π π
re
(A) 0 (B) (C) (D) π
o R

4 2
tt E

Summary
uuur r
C

 Position vector of a point P(x, y, z) is given as OP( = r ) = xiˆ + yjˆ + zkˆ , and its
magnitude by x2 + y2 + z 2 .
no N

 The scalar components of a vector are its direction ratios, and represent its
projections along the respective axes.
 The magnitude (r), direction ratios (a, b, c) and direction cosines (l, m, n) of
©

any vector are related as:


a b c
l= , m= , n=
r r r
r
 The vector sum of the three sides of a triangle taken in order is 0 .
460 MATHEMATICS

 The vector sum of two coinitial vectors is given by the diagonal of the
parallelogram whose adjacent sides are the given vectors.
 The multiplication of a given vector by a scalar λ, changes the magnitude of
the vector by the multiple | λ |, and keeps the direction same (or makes it
opposite) according as the value of λ is positive (or negative).
r

he
r a
For a given vector a , the vector aˆ = r gives the unit vector in the direction
 |a|
r
of a .
 The position vector of a point R dividing a line segment joining the points

is
r r
P and Q whose position vectors are a and b respectively, in the ratio m : n
r r
na + mb

bl
(i) internally, is given by .
m+n
r r
mb − na
pu
(ii) externally, is given by
m−n
.

r r
 The scalar product of two given vectors a and b having angle θ between
be T

them is defined as
re
r r r r
a ⋅ b = | a || b | cos θ .
o R

r r r r
Also, when a ⋅ b is given, the angle ‘θ’ between the vectors a and b may be
tt E

determined by
r r
a ⋅b
cos θ = r r
C

| a || b |
r r
 If θ is the angle between two vectors a and b , then their cross product is
no N

given as
r r r r
a × b = | a || b | sin θ nˆ
r r
where n̂ is a unit vector perpendicular to the plane containing a and b . Such
©

r r
that a, b, nˆ form right handed system of coordinate axes.
r r
 If we have two vectors a and b , given in component form as
r r
a = a1iˆ + a2 ˆj + a3 kˆ and b = b1iˆ + b2 ˆj + b3 kˆ and λ any scalar,
VECTOR ALGEBRA 461

r r
then a + b = (a1 + b1 ) iˆ + (a2 + b2 ) ˆj + ( a3 + b3 ) kˆ ;
r
λa = (λa1 )iˆ + (λa2 ) ˆj + (λa3 )kˆ ;
rr
a.b = a1b1 + a2 b2 + a3b3 ;

iˆ ˆj kˆ

he
r r
and a × b = a1 b1 c1 .
a2 b2 c2

is
Historical Note
The word vector has been derived from a Latin word vectus, which means

bl
“to carry”. The germinal ideas of modern vector theory date from around 1800
when Caspar Wessel (1745-1818) and Jean Robert Argand (1768-1822) described
that how a complex number a + ib could be given a geometric interpretation with
pu
the help of a directed line segment in a coordinate plane. William Rowen Hamilton
(1805-1865) an Irish mathematician was the first to use the term vector for a
be T

directed line segment in his book Lectures on Quaternions (1853). Hamilton’s


method of quaternions (an ordered set of four real numbers given as:
re
o R

a + biˆ + cjˆ + dkˆ, iˆ, ˆj , kˆ following certain algebraic rules) was a solution to the
problem of multiplying vectors in three dimensional space. Though, we must
mention here that in practice, the idea of vector concept and their addition was
tt E

known much earlier ever since the time of Aristotle (384-322 B.C.), a Greek
philosopher, and pupil of Plato (427-348 B.C.). That time it was supposed to be
C

known that the combined action of two or more forces could be seen by adding
them according to parallelogram law. The correct law for the composition of
forces, that forces add vectorially, had been discovered in the case of perpendicular
no N

forces by Stevin-Simon (1548-1620). In 1586 A.D., he analysed the principle of


geometric addition of forces in his treatise DeBeghinselen der Weeghconst
(“Principles of the Art of Weighing”), which caused a major breakthrough in the
development of mechanics. But it took another 200 years for the general concept
©

of vectors to form.
In the 1880, Josaih Willard Gibbs (1839-1903), an American physicist
and mathematician, and Oliver Heaviside (1850-1925), an English engineer, created
what we now know as vector analysis, essentially by separating the real (scalar)
462 MATHEMATICS

part of quaternion from its imaginary (vector) part. In 1881 and 1884, Gibbs
printed a treatise entitled Element of Vector Analysis. This book gave a systematic
and concise account of vectors. However, much of the credit for demonstrating
the applications of vectors is due to the D. Heaviside and P.G. Tait (1831-1901)
who contributed significantly to this subject.

he
—™—

is
bl
pu
be T
re
o R
tt E
C
no N
©
THREE D IMENSIONAL G EOMETRY 463

Chapter 11
THREE DIMENSIONAL GEOMETRY

d
he
v The moving power of mathematical invention is not
reasoning but imagination. – A. DEMORGAN v

is
11.1 Introduction
In Class XI, while studying Analytical Geometry in two

bl
dimensions, and the introduction to three dimensional
geometry, we confined to the Cartesian methods only. In
the previous chapter of this book, we have studied some
pu
basic concepts of vectors. We will now use vector algebra
to three dimensional geometry. T he purpose of this
approach to 3-dimensional geometry is that it makes the
be T

study simple and elegant*.


In this chapter, we shall study the direction cosines
re
o R

and direction ratios of a line joining two points and also


discuss about the equations of lines and planes in space
under different conditions, angle between two lines, two
Leonhard Euler
tt E

planes, a line and a plane, shortest distance between two (1707-1783)


skew lines and distance of a point from a plane. Most of
the above results are obtained in vector form. Nevertheless, we shall also translate
C

these results in the Cartesian form which, at times, presents a more clear geometric
and analytic picture of the situation.
no N

11.2 Direction Cosines andDirection Ratios of a Line


From Chapter 10, recall that if a directed line L passing through the origin makes
angles a, b and g with x, y and z-axes, respectively, called direction angles, then cosine
of these angles, namely, cos a, cos b and cos g are called direction cosines of the
©

directed line L.
If we reverse the direction of L,then the direction angles arereplaced by their supplements,
i.e.,p-a p-b p-g
, and . Thus, the signs of the direction cosines are reversed.
* Fo r va ri ou s ac ti vi ti es i n th re e di me ns io na l ge omet ry, on e ma y re fe r to t he B oo k
“A Hand Book for designing Mathematics Laboratory in Schools”, NCERT, 2005
464 M ATHE MATI CS

d
he
is
bl
Fig 11.1
Note that a given line in space can be extended in two opposite directions and so it
has two sets of direction cosines. In order to have a unique set of direction cosines for
pu
a given line in space, we must take the given line as a directed line. T hese unique
direction cosines are denoted by l, m and n.
be T

Remark If the given line in space does not pass through the origin, then, in order to find
its direction cosines, we draw a line through the origin and parallel to the given line.
re

Nowtake one of the directed lines from the origin and find its direction cosines as two
o R

parallel line have same set of direction cosines.


Any three numbers which are proportional to the direction cosines of a line are
tt E

called the direction ratios of the line. If l, m, n are direction cosines and a, b, c are
direction ratios of a line, then a = ll, b=lm and c = ln, for any nonzero l Î R.

ANote
C

Some authors also call direction ratios as direction numbers.


Let a, b, c be direction ratios of a line and let l, m and n be the direction cosines
(d.c’s) of the line. Then
no N

l m n
= = = k (say), k being a constant.
a b c
l = ak, m = bk, n = ck
©

T herefore ... (1)


But l + m 2 + n2 =
2
1
T herefore k (a + b 2 + c2) =
2 2
1
1
or k= ±
a + b 2 + c2
2
THREE D IMENSIONAL G EOMETRY 465

Hence, from (1), the d.c.’s of the line are


a b c
l =± ,m = ± ,n = ±
a 2 + b 2 + c2 a 2 + b 2 + c2 a 2 + b 2 + c2

d
where, depending on the desired sign of k, either a positive or a negative sign is to be
taken for l, m and n.

he
For any line, if a, b, c are direction ratios of a line, then ka, kb, kc; k ¹ 0 is also a
set of direction ratios. So, any two sets of direction ratios of a line are also proportional.
Also, for any line there are infinitely many sets of direction ratios.
11.2.1 Relation between the direction cosines of a line

is
Consider a line RSwith direction cosines l, m, n. T hrough Z
the origin draw a line parallel to the given line and take a S
point P(x, y, z) on this line. From P draw a perpendicular

bl
R
PA on the x-axis (Fig. 11.2). P (x, y , z)

Let OP = r. T hen cos a=OA =x


. T his gives x = lr.. O
a
pu
a Y
OP r P
Similarly, y = mr and z = nr A r
be T

T hus x + y + z2 = r2 (l2 + m 2 + n 2)
2 2
X a
O x A
But x2 + y2 + z2 = r 2
re
o R

Hence l2 + m 2 + n2 = 1 Fig 11.2


11.2.2 Direction cosines of a line passing through two points
tt E

Since one and only one line passes through two given points, we can determine the
direction cosines of a line passing through the given points P(x1, y1, z1) and Q(x2, y2, z2)
as follows (Fig 11.3 (a)).
C
no N
©

Fig 11.3
466 M ATHE MATI CS

Let l, m, n be the direction cosines of the line PQ and let it makes angles a, b and g
with the x, y and z-axis, respectively.
Draw perpendiculars from P and Q to XY-plane to meet at R and S. Draw a
perpendicular from P to QS to meet at N. Now, in right angle triangle PNQ, ÐPQN=

d
g (Fig 11.3 (b).
NQ z -z
= 2 1

he
T herefore, cosg =
PQ PQ

x2 - x1 y -y
Similarly cosa = and cos b = 2 1
PQ PQ

is
Hence, the direction cosines of the line segment joining the points P(x1, y1, z1) and
Q(x2, y2, z2) are
x2 - x1 y2 - y1 z 2 - z1

bl
, ,
PQ PQ PQ

-x ) +( y -y ) +(z -z )
pu 2 2 2
whe re PQ = ( x2 1 2 1 2 1

ANote T he direction ratios of the line segment joining P(x1, y1, z1) and Q(x2, y2, z2)
be T

may be taken as
re
x2 – x1, y2 – y1, z2 – z1 or x1 – x2, y1 – y2, z1 – z2
o R

Example 1 If a line makes angle 90°, 60° and 30° with the positive direction of x, y and
z-axis respectively, find its direction cosines.
tt E

1
Solution Let the d .c . 's of the lines be l , m, n. T hen l = cos 90 0 = 0, m = cos 60 0 = ,
2
C

3
n = cos 30 0 = .
2
Example 2 If a line has direction ratios 2, – 1, – 2, determine its direction cosines.
no N

Solution Direction cosines are


2 -1 -2
, ,
2 + (-1) + (-2) 2 + (-1) + (-2 ) 2 + ( - 1) + ( -2) 2
2 2 2 2 2 2 2
©

2 -1 -2
or , ,
3 3 3
Example 3 Find the direction cosines of the line passing through the two points
(– 2, 4, – 5) and (1, 2, 3).
THREE D IMENSIONAL G EOMETRY 467

Solution We know the direction cosines of the line passing through two points
P(x1, y1, z1) and Q(x2, y2, z2) are given by
x2 - x1 y2 - y1 z 2 - z1
, ,
PQ PQ PQ

d
( x2 - x1 ) 2 + ( y2 - y1 ) 2 + ( z2 - z1 )
2
whe re PQ =

he
Here P is (– 2, 4, – 5) and Q is (1, 2, 3).
So PQ = (1 - ( -2)) 2 + (2 - 4) 2 + (3 - ( -5)) 2 = 77
T hus, the direction cosines of the line joining two points is

is
3 -2 8
, ,
77 77 77
Example 4 Find the direction cosines of x, y and z-axis.

bl
Solution T he x-axis makes angles 0°, 90° and 90° respectively with x, y and z-axis.
T herefore, the direction cosines of x-axis are cos 0°, cos 90°, cos 90° i.e., 1,0,0.
pu
Similarly, direction cosines of y-axis and z-axis are 0, 1, 0 and 0, 0, 1 respectively.
Example 5 Show that the points A (2, 3, – 4), B (1, – 2, 3) and C (3, 8, – 11) are
be T

collinear.
Solution Direction ratios of line joining A and B are
re
o R

1 – 2, – 2 – 3, 3 + 4 i.e., – 1, – 5, 7.
T he direction ratios of line joining B and C are
3 –1, 8 + 2, – 11 – 3, i.e., 2, 10, – 14.
tt E

It is clear that direction ratios ofAB and BC are proportional, hence, AB is parallel
to BC. But po int B is common to both AB and BC. T herefore , A, B, C are
C

collinear points.

EXERCIS E 11.1
no N

1. If a line makes angles 90°, 135°, 45° with the x, y and z-axes respectively, find its
direction cosines.
2. Find the direction cosines of a line which makes equal angles with the coordinate
©

axes.
3. If a line has the direction ratios – 18, 12, – 4, then what are its direction cosines ?
4. Show that the points (2, 3, 4), (– 1, – 2, 1), (5, 8, 7) are collinear.
5. Find the direction cosines of the sides of the triangle whose vertices are
(3, 5, – 4), (– 1, 1, 2) and (– 5, – 5, – 2).
468 M ATHE MATI CS

11.3 Equation of a Line in S pace


We have studied equation of lines in two dimensions in Class XI, we shall now study
the vector and cartesian equations of a line in space.
A line is uniquely determined if

d
(i) it passes through a given point and has given direction, or
(ii) it passes through two given points.

he
r
11.3.1 Equation of a line through a given point and parallel to a given vector b
r
Let a be the position vector of the given point
A wit h r esp ect to th e o rigin O o f t he

is
rectangular coordinate system. Let l be the
line which passes through the point A and is
r

bl
r
parallel to a given vector b . Let r be the
position vector of an arbitrary point P on the
line (Fig 11.4).
pu
uuur r
T hen AP is parallel to the vector b , i.e.,
uuur r
AP = lb , where l is some real number.. Fig 11.4
be T

uuur uuur uuur


AP = OP – OA
re
But
r
-
o R

r r
i.e. lb = r a
Conversely, for each value of the parameter l, this equation gives the position
tt E

vector of a point P on the line. Hence, the vector equation of the line is given by
r r r
r = a+λ b ... (1)
r
= ++
C

Remark If b aiˆ bjˆ ckˆ , then a, b, c are direction ratios of the line and conversely,,
r
if a, b, c are direction ratios of a line, then b = aiˆ + bjˆ + ckˆ will be the parallel to
r
no N

the line. Here, b should not be confused with | b |.


Derivation of carte sian form from ve ctor form
Let the coordinates of the given point A be (x1, y1, z1) and the direction ratios of
©

the line be a, b, c. Consider the coordinates of any point P be (x, y, z). T hen
r r
r = x iˆ + yˆj + zkˆ ; a = x1 iˆ + y 1 ˆj + z1 kˆ
r
and b = a iˆ + b ˆj + c kˆ
Substituting these values in (1) and equating the coefficients of iˆ , jˆ and k̂ , we get
x = x1 + la; y = y1 + l b; z = z1+ lc ... (2)
THREE D IMENSIONAL G EOMETRY 469

T hese are parametric equations of the line. Eliminating the parameter l from (2),
we get
x – x1 y – y1 z – z1
= = ... (3)
a b c

d
T his is the Cartesian equation of the line.

ANote If l, m, n are the direction cosines of the line, the equation of the line is

he
x – x1 y – y1 z – z 1
= =
l m n
Example 6 Find the vector and the Cartesian equations of the line through the point

is
(5, 2, – 4) and which is parallel to the vector 3 ˆi + 2 ˆj - 8 kˆ .
Solution We have

bl
r r
a = 5 ˆi + 2 ˆj - 4 kˆ and b = 3 iˆ + 2 ˆj - 8 kˆ
T herefore, the vector equation of the line is
r
pu
r = 5 iˆ + 2 jˆ - 4 kˆ + l ( 3 iˆ + 2 jˆ - 8 kˆ )
r
Now, r is the position vector of any point P(x, y, z) on the line.
x iˆ + y ˆj + z kˆ = 5 ˆi + 2 ˆj - 4 kˆ + l ( 3 iˆ + 2 ˆj - 8 kˆ)
be T

T herefore,
re
= (5 + 3l) $i + (2 + 2 l) $j + ( - 4 - 8l ) k$
o R

Eliminating l , we get
x -5 y- 2 z + 4
= =
tt E

3 2 -8
which is the equation of the line in Cartesian form.
C

11.3.2 Equation of a line passing through two given points


r r
Let a and b be the position vectors of two
po in ts A ( x1, y1, z1 ) an d B( x 2, y 2, z 2) ,
no N

respectively that are lying on a line (Fig 11.5).


r
Let r be t he positio n vector of an
arbitrary point P(x, y, z), then P is a point on
=-
©

uuur r r
th e lin e if a nd on ly if AP r a an d
uuur r r
AB = b - a are collinear vectors. T herefore,
P is on the line if and only if

- =l -
r r
r a
r r
(b a ) Fig 11.5
470 M ATHE MATI CS

r r r r
or r = a + l ( b - a ) , l Î R. ... (1)
T his is the vector equation of the line.
Derivation of carte sian form from ve ctor form

d
We have
r
=+ + = + + r r
= + +
r x iˆ y jˆ z kˆ , a x1iˆ y1 ˆj z1 kˆ and b x2 iˆ y2 jˆ z 2 kˆ,

he
Substituting these values in (1), we get
x $i + y $j + z k$ = x1 $i + y1 $j + z1 k$ + l [( x2 - x1 ) $i + ( y2 - y1 ) $j + ( z2 - z1 ) k$ ]

Equating the like coefficients of ˆi , ˆj , kˆ , we get

is
x = x1 + l (x2 – x1); y = y1 + l (y2 – y1); z = z1 + l (z2 – z1)
On eliminating l, we obtain

bl
x - x1 y - y1 z - z1
= =
x2 - x1 y2 - y1 z 2 - z1
pu
which is the equation of the line in Cartesian form.
Example 7 Find the vector equation for the line passing through the points (–1, 0, 2)
and (3, 4, 6).
be T

r r
re
Solution Let a and b be the position vectors of the point A(– 1, 0, 2) and B(3, 4, 6).
=-+
o R

r
Then a iˆ 2 kˆ
and
r
=+ +
b 3 iˆ 4 ˆj 6 kˆ
tt E

r r
T herefore b - a = 4 iˆ + 4 ˆj + 4 kˆ
r
Let r be the position vector of any point on the line. Then the vector equation of
C

=-+ +l + +
the line is
r
r iˆ 2 kˆ (4 iˆ 4 ˆj 4 kˆ )
no N

Example 8 T he Cartesian equation of a line is


+= - = +
x 3 y 5 z 6
2 4 2
©

Find the vector equation for the line.


Solution Comparing the given equation with the standard form
x - x1 y - y1 z - z1
= =
a b c
We observe that x1 = – 3, y1 = 5, z1 = – 6; a = 2, b = 4, c = 2.
THREE D IMENSIONAL G EOMETRY 471

T hus, the required line passes through the point (– 3, 5, – 6) and is parallel to the
r
vector 2 iˆ + 4 ˆj + 2 kˆ . Let r be the position vector of any point on the line, then the
vector equation of the line is given by
r
=- + -

d
r ( 3 ˆi 5 ˆj 6 kˆ ) + l (2 iˆ + 4 ˆj + 2 kˆ )

11.4 Angle between Two Lines

he
Let L1 and L2 be two lines passing through the origin
and with direction ratios a 1, b 1, c1 and a 2, b 2, c2,
respectively. Let P be a point on L1 and Q be a point

is
on L2. Consider the directed lines OP and OQ as
given in Fig 11.6. Let q be the acute angle between
OP an d OQ. No w recall that the directed line

bl
segments OP and OQ are vectors with components
a 1, b 1, c1 and a 2, b 2, c2, respectively. T herefore, the
angle q between them is given by Fig 11.6
pu
a1 a2 + b1 b2 + c1 c2
cos q = ... (1)
a12 + b12 + c12 a 22 + b 22 + c22
be T

T he angle between the lines in terms of sin q is given by


re

- q
o R

sin q = 1 cos 2
tt E

( a1a 2 + b1b2 + c1 c2 ) 2
1-
( a12 + b12 + c12 )(a 22 + b22 + c22 )
=
C

(a12 + b12 + c12 )( a22 + b22 + c22 )- ( a1a 2 + b1b2 + c1c2 )2


=
(a12 + b12 + c12 ) ( a22 + b22 + c22 )
no N

( a1 b2 -a b1 ) 2 +(b c -b c ) +(c a -c 2
a1 ) 2
+b +c a +b +c
2 1 2 2 1 1 2 2
= ... (2)
a12 2 2 2 2 2
©

1 1 2 2 2

ANote In case the lines L1 and L2 do not pass through the origin, we may take
lines L¢1 and L¢2 which are parallel to L1 and L2 respectively and pass through
the origin.
472 M ATHE MATI CS

If instead of direction ratios for the lines L1 and L2, direction cosines, namely,
l1, m 1, n 1 for L1 and l2, m 2, n2 for L2 are given, then (1) and (2) takes the following form:
cos q = |l1 l2 + m 1m 2 + n 1n 2 | (as l12 + m12 + n12 =1 = l22 + m22 + n 22 ) ... (3)

d
and sin q = (l1 m2 - l2 m1 )2 - (m1 n 2 - m2 n1 )2 + (n1 l2 - n 2 l1 )2 ... (4)

he
Two lines with direction ratios a 1, b 1, c1 and a 2, b 2, c2 are
(i) perpendicular i.e. if q = 90° by (1)
a 1a 2 + b 1b 2 + c1c2 = 0
(ii) parallel i.e. if q = 0 by (2)

is
a1 b1 c1
=
a2 = b2 c2

bl
Now, we find the angle between two lines when their equations are given. If q is
acute the angle between the lines

+l +m
pu
r ur r r r
r = a1 b1 and r = a2 b2
r r
b1 × b 2
be T

then cosq = r r
b1 b 2
re

In Cartesian form, if q is the angle between the lines


o R

x - x1 y - y1 z - z1
= = ... (1)
tt E

a1 b1 c1

x - x2 y - y2 z - z 2
=
C

and = ... (2)


a2 b2 c2

where, a 1 ,b 1, c1 and a 2, b 2, c2 are the direction ratios of the lines (1) and (2), respectively,
no N

then

a1 a2 + b1 b2 + c1 c2
cos q =
©

a12 + b12 + c12 a22 + b22 + c22

Example 9 Find the angle between the pair of lines given by


r
r = 3 ˆi + 2 ˆj - 4 kˆ + l (iˆ + 2 ˆj + 2 kˆ )
r
and r = 5 iˆ - 2 jˆ + m (3 ˆi + 2 ˆj + 6 kˆ)
THREE D IMENSIONAL G EOMETRY 473

r r
Solution Here b1 = iˆ + 2 jˆ + 2 kˆ and b2 = 3 iˆ + 2 jˆ + 6 kˆ
T he angle q between the two lines is given by
r r
b1 × b2 (iˆ + 2 ˆj + 2kˆ) × (3 iˆ + 2 ˆj + 6 kˆ)
cos q = r r =

d
b1 b2 1 + 4 + 4 9 + 4 + 36

he
3 + 4 + 12 19
= =
3´ 7 21
æ 19 ö
Hence q = cos–1 ç ÷
è 21 ø

is
Example 10 Find the angle between the pair of lines
x+3 y -1 z + 3
= =

bl
3 5 4
x +1 y-4 z-5
and = =
1 1 2
pu
Solution T he direction ratios of the first line are 3, 5, 4 and the direction ratios of the
second line are 1, 1, 2. If q is the angle between them, then
be T

3.1 + 5.1 + 4.2 16 16 8 3


cos q = = = =
re

32 + 52 + 42 12 + 12 + 2 2 50 6 5 2 6 15
o R

æ8 3 ö
Hence, the required angle is cos–1 çç ÷÷ .
tt E

è 15 ø
11.5 S hortest Distance between Two Lines
C

If two lines in space intersect at a point, then the shortest distance between them is
zero. Also, if two lines in space are parallel,
then the shortest distance between them
no N

will be the perpendicular distance, i.e. the


length of the perpendicular drawn from a
point on one line onto the other line.
©

Further, in a space, there are lines which


are neither intersecting nor parallel. In fact,
such pair of lines are non coplanar and
are called skew lines. For example, let us
consider a room of size 1, 3, 2 units along
x, y and z-axes respectively Fig 11.7. Fig 11.7
474 M ATHE MATI CS

T he line GE that goes diagonally across the ceiling and the line DB passes through
one corner of the ceiling directly above A and goes diagonally down the wall. T hese
lines are skew because they are not parallel and also never meet.
By the shortest distance between two lines we mean the join of a point in one line
with one point on the other line so that the length of the segment so obtained is the

d
smallest.
For skew lines, the line of the shortest distance will be perpendicular to both

he
the lines.
11.5.1 Distance between two skew lines
We now determine the shortest distance between two skew lines in the following way:

is
Let l1 and l2 be two skew lines with equations (Fig. 11.8)
r r r
r = a1 + l b1 ... (1)
r r

bl
r
and r = a2 + m b2 ... (2)
r r
Take any point S on l1 with position vector a1 and T on l2, with position vector a 2.
pu
T hen the magnitude of the shortest distance vector
T
will be equal to that of the projection of ST along the Q
direction of the line of shortest distance (See 10.6.2). l2
be T

uuur
If PQ is the shortest distance vector between
re
r
o R

l1 and l2 , then it being perpendicular to both b1 and


r uuur l1
b2 , the unit vector n̂ along PQ would therefore be S P
tt E

r r Fig 11.8
b1 ´ b2
n̂ = r r ... (3)
| b1 ´ b2 |
C

uuur
Then PQ = d n̂
where, d is the magnitude of the shortest distance vector. Let q be the angle between
uur uuur
no N

ST and PQ . T hen
PQ = ST |cos q |
uuur uur
PQ × ST
cos q = uuuur uur
©

But
| PQ | | ST |
r r
d nˆ × (a 2 - a1 ) uur r r
= (since ST = a2 - a1 )
d ST
r r r r
(b1 ´ b2 ) × (a 2 - a1)
= r r [From (3)]
ST b1 ´ b2
THREE D IMENSIONAL G EOMETRY 475

Hence, the required shortest distance is


d = PQ = ST |cos q |
r r r r
( b1 ´ b2 ) . ( a 2 - a 1 )
or d = r r
| b1 ´ b2 |

d
Carte sian form

he
T he shortest distance between the lines
x - x1 y - y1 z - z1
l1 : = =
a1 b1 c1

is
x - x2 y - y2 z - z2
and l2 : = =
a2 b2 c2

bl
x2 - x1 y2 - y1 z 2 - z1
a1 b1 c1
a2 b2 c2
pu
is
( b1 c2 - b2 c1 ) 2 + ( c1 a 2 - c2 a1 ) 2 + ( a1 b2 - a 2 b1 ) 2
be T
re
11.5.2 Distance between parallel lines
o R

If two lines l1 and l2 are parallel, then they are coplanar. Let the lines be given by
r r r
r = a1 + l b ... (1)
tt E

r r r
and r = a2 + m b … (2)

where, ar1 is the position vector of a point S on l1 and


C

r
a 2 is the position vector of a point T on l2 Fig 11.9.
As l1, l2 are coplanar, if the foot of the perpendicular
no N

from T on the line l1 is P, then the distance between the


lines l1 and l2 = |T P |.
uur r Fig 11.9
Let q be the angle between the vectors ST and b .
©

Then
r uur r uur
b ´ ST = ( | b || ST | sin q) nˆ ... (3)

where n̂ is the unit vector perpendicular to the plane of the lines l1 and l2.
But
uur r
ST = a2 a1
r
-
476 M ATHE MATI CS

T herefore, from (3), we get


r r r r
b ´ ( a2 - a1 ) = | b | PT nˆ (since PT = ST sin q)

×
r r r r
i.e., | b ´ ( a 2 - a1 ) | = | b | PT 1 (as | n̂ | = 1)

d
Hence, the distance between the given parallel lines is
r r

he
uuur r
b ´ ( a 2 - a1 )
d = | PT | = r
|b|

Example 11 Find the shortest distance between the lines l1 and l2 whose vector

is
equations are
r
r = ˆi + ˆj + l (2 ˆi - jˆ + kˆ ) ... (1)

bl
r
and r = 2 iˆ + ˆj - kˆ + m (3 iˆ - 5 ˆj + 2 kˆ ) ... (2)
r r r r r
Solution Comparing (1) and (2) with rr = a1 + l b1 and r = a2 + m b 2 respectively,,
pu
r r
we get a1 = ˆi + ˆj , b1 = 2 iˆ - ˆj + kˆ
r r
be T

a2 = 2 î + ĵ – kˆ and b2 = 3 î – 5 ĵ + 2 kˆ
r r
re
T herefore a2 - a1 = î - kˆ
o R

r r
and b1 ´ b2 = ( 2 iˆ - ˆj + kˆ ) ´ ( 3 ˆi - 5 ˆj + 2 kˆ )
tt E

iˆ jˆ kˆ
= 2 -1 1 = 3 iˆ - ˆj - 7 kˆ
C

3 -5 2
r r
So | b1 ´ b2 | = 9 +1 + 49 = 59
no N

Hence, the shortest distance between the given lines is given by


r r r r
( b1 ´ b2 ) . ( a 2 - a1 ) = | 3 - 0 + 7 | = 10
d = r r
©

| b1 ´ b2 | 59 59

Example 12 Find the distance between the lines l1 and l2 given by


r
r = ˆi + 2 ˆj - 4 kˆ + l ( 2 iˆ + 3 jˆ + 6 kˆ )
r
and r = 3 iˆ + 3 jˆ - 5 kˆ + m ( 2 iˆ + 3 ˆj + 6 kˆ )
THREE D IMENSIONAL G EOMETRY 477

Solution T he two lines are parallel (Why? ) We have


r r r
a1 = ˆi + 2 ˆj - 4 kˆ , a2 = 3 ˆi + 3 ˆj - 5 kˆ and b = 2 iˆ + 3 ˆj + 6 kˆ
T herefore, the distance between the lines is given by

d
iˆ ˆj kˆ
r r r
b ´ ( a 2 - a1 ) 2 3 6

he
d= r =
|b | 2 1 -1
4 + 9 + 36

is
| - 9 iˆ + 14 ˆj - 4 kˆ | 293 293
or = = =
49 49 7

bl
EXERCIS E 11.2
1. Showthat the three lines with direction cosines
pu
12 -3 -4 4 12 3 3 -4 12
, , ; , , ; , , are mutually perpendicular..
be T

13 13 13 13 13 13 13 13 13
2. Show that the line through the points (1, – 1, 2), (3, 4, – 2) is perpendicular to the
re
o R

line through the points (0, 3, 2) and (3, 5, 6).


3. Show that the line through the points (4, 7, 8), (2, 3, 4) is parallel to the line
through the points (– 1, – 2, 1), (1, 2, 5).
tt E

4. Find the equation of the line which passes through the point (1, 2, 3) and is
parallel to the vector 3 iˆ + 2 ˆj - 2 kˆ .
C

5. Find the equation of the line in vector and in cartesian form that passes through
the point with position vector 2 iˆ - j + 4 kˆ and is in the direction iˆ + 2 jˆ - kˆ .
no N

6. Find the cartesian equation of the line which passes through the point (– 2, 4, – 5)
x +3 y- 4 z +8
and parallel to the line given by = = .
©

3 5 6
x -5 y+ 4 z -6
7. T he cartesian equation of a line is = = . Write its vector form.
3 7 2
8. Find the vector and the cartesian equations of the lines that passes through the
origin and (5, – 2, 3).
478 M ATHE MATI CS

9. Find the vector and the cartesian equations of the line that passes through the
points (3, – 2, – 5), (3, – 2, 6).
10. Find the angle between the following pairs of lines:
r
(i) r = 2 iˆ - 5 ˆj + kˆ + l (3 ˆi + 2 ˆj + 6 kˆ ) and

d
r
r = 7 iˆ - 6 kˆ + m ( iˆ + 2 ˆj + 2 kˆ )

he
r
(ii) r = 3 iˆ + jˆ - 2 kˆ + l (iˆ - jˆ - 2 kˆ ) and
r
r = 2 iˆ - jˆ - 56 kˆ + m (3 iˆ - 5 ˆj - 4 kˆ )
11. Find the angle between the following pair of lines:
- = -= + += - =-

is
x 2 y 1 z 3 x 2 y 4 z 5
(i)
2 5 - 3
and
1- 8 4
x y z x - 5 y - 2 z -3

bl
(ii) = = and = =
2 2 1 4 1 8
1 - x 7 y - 14 z - 3
12. Find the values of p so that the lines = =
pu
3 2p 2
7 - 7x y - 5 6 - z
and = = are at right angles.
be T

3p 1 5
re
x -5 y+ 2 z x y z
13. Show that the lines = = and = = are perpendicular to
o R

7 -5 1 1 2 3
each other.
14. Find the shortest distance between the lines
tt E

r
r = ( iˆ + 2 jˆ + kˆ ) + l ( iˆ - ˆj + kˆ ) and
r
r = 2 iˆ - jˆ - kˆ + m (2 iˆ + jˆ + 2 kˆ )
C

15. Find the shortest distance between the lines


x +1 y + 1 z + 1 x -3 y-5 z -7
no N

= = and = =
7 - 6 1 1 -2 1
16. Find the shortest distance between the lines whose vector equations are
r
r = ( iˆ + 2 jˆ + 3 kˆ ) + l ( iˆ - 3 ˆj + 2 kˆ )
©

r
and r = 4 iˆ + 5 ˆj + 6 kˆ + m (2 iˆ + 3 ˆj + kˆ )
17. Find the shortest distance between the lines whose vector equations are
r
r = (1 - t ) ˆi + ( t - 2) ˆj + (3 - 2 t) kˆ and
r
r = ( s + 1) iˆ + (2 s - 1) jˆ - (2 s + 1) kˆ
THREE D IMENSIONAL G EOMETRY 479

11.6 Plane
A plane is determined uniquely if any one of the following is known:
(i) the normal to the plane and its distance from the origin is given, i.e., equation of
a plane in normal form.

d
(ii) it passes through a point and is perpendicular to a given direction.
(iii) it passes through three given non collinear points.

he
Now we shall find vector and Cartesian equations of the planes.
11.6.1 Equation of a plane in normal form
Consider a plane whose perpendicular distance from the origin is d (d ¹ 0). Fig 11.10.

is
uuur
If ON is the normal from the origin to the plane, and n̂ is the unit normal vector
uuur uuur Z
along ON . T hen ON = d n̂ . Let P be any

bl
uuur
po int on th e p lan e. T her efo re, NP is
uuur
perpendicular to ON .
uuur uuur
pu
T herefore, NP × ON = 0 ... (1) P( x, y,z )
r
Let r be the position vector of the point P,, r
uuur r
+ =
be T

uuur uuur uuur


then NP = r - d nˆ (as ON NP OP ) d N
Y
re
T herefore, (1) becomes O
o R

r Ù Ù
(r - d n ) × d n = 0 X
tt E

r Ù Ù Fig 11.10
or (r - d n ) × n = 0 (d ¹ 0)
r Ù Ù Ù
C

or r ×n - d n× n = 0
r Ù Ù Ù
i.e., r×n = d (as n × n = 1) … (2)
no N

T his is the vector form of the equation of the plane.


Carte sian form
Equation (2) gives the vector equation of a plane, where n̂ is the unit vector normal to
©

the plane. Let P(x, y, z) be any point on the plane. T hen


uuur r
= + +
OP = r x iˆ y ˆj z kˆ
Let l, m, n be the direction cosines of n̂ . Then
n̂ = l iˆ +m ˆj +n kˆ
480 M ATHE MATI CS

T herefore, (2) gives


( x iˆ + y ˆj + z kˆ ) × (l ˆi + m ˆj + n kˆ ) = d
i.e., lx + my + nz = d ... (3)
T his is the cartesian equation of the plane in the normal form.

d
r
A Note Equation (3) shows that if r × ( a iˆ + b jˆ + c kˆ ) = d is the vector equation

he
of a plane, then ax + by + cz = d is the Cartesian equation of the plane, where a, b
and c are the direction ratios of the normal to the plane.
6
Example 13 Find the vector equation of the plane which is at a distance of
29
- +

is
from the origin and its normal vector from the origin is 2 iˆ 3 ˆj 4 kˆ .
r
Solution Let n = 2 iˆ - 3 ˆj + 4 kˆ . T hen

bl
r
n 2 iˆ - 3 ˆj + 4 kˆ 2 iˆ - 3 ˆj + 4 kˆ
nˆ = r = =
| n| 4 + 9 + 16 29
pu
Hence, the required equation of the plane is
r æ 2 ˆ -3 ˆ 4 ˆö 6
r ×ç i+ j+ k÷ =
be T

è 29 29 29 ø 29
re
Example 14 Find the direction cosines of the unit vector perpendicular to the plane
o R

r
r × (6 iˆ - 3 jˆ - 2 kˆ ) + 1 = 0 passing through the origin.
Solution T he given equation can be written as
tt E

r
r × ( - 6 ˆi + 3 ˆj + 2 kˆ ) = 1 ... (1)
+ + =7
C

No w | - 6 iˆ + 3 ˆj + 2 kˆ | = 36 9 4
T herefore, dividing both sides of (1) by 7, we get
r æ 6 3 ˆ 2 ˆö
no N

1
r × ç - iˆ + j+ k÷ =
è 7 7 7 ø 7
which is the equation of the plane in the form r nˆ
r
× =d .
©

6 ˆ 3 ˆ 2 ˆ
T his shows that nˆ = - i + j+ k is a unit vector perpendicular to the
7 7 7

plane through the origin. Hence, the direction cosines of n̂ are - 6 , 3 , 2 .


7 7 7
THREE D IMENSIONAL G EOMETRY 481

Example 15 Find the distance of the plane 2x – 3y + 4z – 6 = 0 from the origin.


Solution Since the direction ratios of the normal to the plane are 2, –3, 4; the direction
cosines of it are
-3 -3

d
2 4 2 4
, , , i.e., , ,
2 + (- 3) + 4
2 2 2
2 + (- 3) + 4
2 2 2
2 + (- 3) + 4
2 2 2
29 29 29

he
Hence, dividing the equation 2x – 3y + 4z – 6 = 0 i.e., 2x – 3y + 4z = 6 throughout by
29 , we get
2 -3 4 6
x + y + z =

is
29 29 29 29
T his is of the form lx + my + nz = d, where d is the distance of the plane from the

bl
6
origin. So, the distance of the plane from the origin is .
29
Example 16 Find the coordinates of the foot of the perpendicular drawn from the
pu
origin to the plane 2x – 3y + 4z – 6 = 0.
Solution Let the coordinates of the foot of the perpendicular P from the origin to the
be T

planeis(x1, y1, z1) (Fig 11.11). Z


re
T hen, the direction ratios of the line OP are
o R

x1, y1, z1.


P (x1 , y 1, z1 )
Writing the equation of the plane in the normal
form, we have
tt E

2 3 4 6
x- y+ z= O Y
29 29 29 29
C

2 , -3 4
wh er e, , ar e th e dire ct io n X
29 29 29
no N

cosines of the OP. Fig 11.11


Since d.c.’s and direction ratios of a line are proportional, we have
x1 y1 z1
=
©

= =k
2 -3 4
29 29 29

2k -3k 4k
i.e., x1 = , y1 = , z1 =
29 29 29
482 M ATHE MATI CS

6
Substituting these in the equation of the plane, we get k = .
29
æ 12 -18 , 24 ö
Hence, the foot of the perpendicular is ç , ÷.
è 29 29 29 ø

d
ANote If d is the distance from the origin and l, m, n are the direction cosines of

he
the normal to the plane through the origin, then the foot of the perpendicular is
(ld, md, nd).

11.6.2 Equation of a plane perpendicular to a


given vector and passing through a given point

is
In the space, there can be many planes that are
perpendicular to the given vector, but through a given

bl
point P(x1, y1, z1), only one such plane exists (see
Fig 11.12).
Let a plane pass through a point A with ur
position
pu
r Fig 11.12
vector a and perpendicular to the vector N .
r
Let r be the position vector of any point P(x, y, z) in the plane. (Fig 11.13).
be T

T hen the point P lies in the plane if and only if


uuur ur uuur ur
re
o R

AP is perpendicular to N . i.e., AP . N = 0. But


uuur r r r r r
AP = r - a . Therefore, ( r - a ) × N = 0 … (1)
tt E

T his is the vector equation of the plane.


Carte sian form
Let the given point A be (x1, y1, z1), P be (x, y, z)
C

ur
and direction ratios of N are A, B and C. T hen, Fig 11.13
r
= + +
no N

r r
a = x1 iˆ + y1 ˆj + z1 kˆ, r = x iˆ + y jˆ + z kˆ and N A iˆ B ˆj C kˆ

No w
r r r
×
( r – a ) N= 0
©

So é( x - x1 ) iˆ + ( y - y1 ) jˆ + ( z - z1 ) kˆ ù × (A iˆ + B jˆ + C kˆ ) = 0
ë û
i.e. A (x – x 1) + B (y – y1) + C (z – z1) = 0
Example 17 Find the vector and cartesian equations of the plane which passes through
the point (5, 2, – 4) and perpendicular to the line with direction ratios 2, 3, – 1.
THREE D IMENSIONAL G EOMETRY 483

r
Solution We have the position vector of point (5, 2, – 4) as a = 5 iˆ + 2 jˆ - 4kˆ and the
r
normal vector N perpendicular to the plane as N =2 ˆi + 3 ˆj kˆ
r
-
T herefore, the vector equation of the plane is given by ( r a ) .N 0 - =
r r r

d
r
or [ r - (5 iˆ + 2 ˆj - 4 kˆ )] × (2 iˆ + 3 ˆj - kˆ ) = 0 ... (1)

he
T ransforming (1) into Cartesian form, we have
[( x – 5) iˆ + ( y - 2) ˆj + ( z + 4) kˆ ] × (2 iˆ + 3 ˆj - kˆ ) = 0
or -+ -- +=
2( x 5) 3( y 2) 1( z 4) 0

is
i.e. 2x + 3y – z = 20
which is the cartesian equation of the plane.

bl
11.6.3 Equation of a plane passing through three non collinear points
r r
Let R, S and T be three non collinear points on the plane with position vectors a , b and
r
c respectively (Fig 11.14).
pu
Z
be T
re
(RS X RT)
o R

R
P
r a S
tt E

b c T
O
Y
C

X
Fig 11.14
no N

uuur uuur uuur


T he vectors RS and RT are in the given plane. Therefore, the vector RS
r
´RT
uuur

is perpendicular to the plane containing points R, S and T. Let r be the position vector
©

of any point P in the plane. T herefore, the equation of the plane passing through R and
´
uuur uuur
perpendicular to the vector RS RT is
r r uuur uuur
( r - a ) × (RS´ RT) = 0
r r r r r r
or ( r – a ).[( b – a )×(c – a )] = 0 … (1)
484 M ATHE MATI CS

T his is the equation of the plane in vector form passing through three noncollinear
points.

A Note Why was it necessary to say that the three points

d
had to be non collinear? If the three points were on the same
R
line, then there will be many planes that will contain them
(Fig 11.15). S

he
T hese planes will resemble the pages of a book where the T
line containing the points R, Sand T are members in the binding
of the book.

is
Carte sian form Fig 11.15
Let (x1, y1, z1), (x2, y2, z2) and (x3, y3, z3) be the coordinates of the points R, S and T
respectively. Let (x, y, z) be the coordinates of any point P on the plane with position

bl
r
vector r . T hen
uuur
RP = (x – x1) î + (y – y1) ĵ + (z – z1) k̂
pu
uuur
RS = (x2 – x1) î + (y2 – y1) ĵ + (z2 – z1) k̂
uuur
RT = (x3 – x1) î + (y3 – y1) ĵ + (z3 – z1) k̂
be T

Substituting these values in equation (1) of the vector form and expressing it in the
re
form of a determinant, we have
o R

x - x1 y - y1 z - z1
x2 - x1 y2 - y1 z 2 - z1 = 0
tt E

x3 - x1 y3 - y1 z 3 - z1
which is the equation of the plane in Cartesian form passing through three non collinear
C

points (x1, y1, z1), (x2, y2, z2) and (x3, y3, z3).
Example 18 Find the vector equations of the plane passing through the points
no N

R(2, 5, – 3), S(– 2, – 3, 5) and T (5, 3,– 3).


r
=+- r
=+ -
r
Solution Let a 2 iˆ 5 jˆ 3 kˆ , b = - 2 iˆ - 3 ˆj + 5 kˆ , c 5 iˆ 3 ˆj 3 kˆ
r r
T hen the vector equation of the plane passing through ar , b and c and is
©

given by
r r uuur uuur
( r - a ) × (RS´ RT) = 0 (Why?)
r r r r r r
or ( r - a ) × [(b - a ) ´(c - a )] = 0
i.e. - + - ×-- + ´ - =
r
[ r (2 iˆ 5 jˆ 3 kˆ )] [( 4 iˆ 8 jˆ 8 kˆ ) (3 iˆ 2 jˆ )] 0
THREE D IMENSIONAL G EOMETRY 485

11.6.4 Intercept form of the equation of a plane


In this section, we shall deduce the equation of a plane in terms of the intercepts made
by the plane on the coordinate axes. Let the equation of the plane be
Ax + By + Cz + D = 0 (D ¹ 0) ... (1)

d
Let the plane make intercepts a, b, c on x, y and z axes, respectively (Fig 11.16).
Hence, the plane meets x, y and z-axes at (a, 0, 0),

he
(0, b, 0), (0, 0, c), respectively.

T herefore Aa + D = 0 or A =
-D
a
-D

is
Bb + D = 0 or B =
b
-D

bl
Cc + D = 0 or C =
c
Fig 11.16
Substituting these values in the equation (1) of the
pu
plane and simplifying, we get
x y z
+ + =1 ... (1)
be T

a b c
which is the required equation of the plane in the intercept form.
re
o R

Example 19 Find the equation of the plane with intercepts 2, 3 and 4 on the x, y and
z-axis respectively.
Solution Let the equation of the plane be
tt E

x y z
+ + =1 ... (1)
a b c
C

He re a = 2, b = 3, c = 4.
Substituting the values of a, b and c in (1), we get the required equation of the
no N

x y z
plane as + + = 1 or 6x + 4y + 3z = 12.
2 3 4
11.6.5 Plane passing thro ugh the intersection
©

of two given planes


Let p1 an d p 2 be two p lan es with equat io ns
r
× r
×
r nˆ1 = d 1 and r nˆ 2 = d2 respectively. T he position
vector of any point on the line of intersection must
satisfy both the equations (Fig 11.17). Fig 11.17
486 M ATHE MATI CS

r
If t is the position vector of a point on the line, then
r r
t × nˆ1 = d 1 and t × nˆ 2 = d 2
T herefore, for all real values of l, we have
r
t × ( nˆ1 +lnˆ2 ) = d1 +ld 2

d
r
Since t is arbitrary, it satisfies for any point on the line.
r r r

he
Hence, the equation r × ( n1 + ln2 ) = d1 +l d 2 represents a plane p3 which is such
r
that if any vector r satisfies both the equations p1 and p2, it also satisfies the equation
p3 i.e., any plane passing through the intersection of the planes
r r r r
r × n1 = d1 and r × n2 = d 2

is
r r r
has the equation r × ( n1 + ln2 ) = d 1 + ld 2 ... (1)
Carte sian form

bl
In Cartesian system, let
r
n1 = A1 iˆ + B 2 ˆj + C1 kˆ
pu
r
n2 = A2 iˆ + B 2 ˆj + C 2 kˆ
r
and r = x iˆ + y ˆj + z kˆ
be T

T hen (1) becomes


re

x (A1 + lA2) + y (B1 + lB2) + z (C1 + lC2) = d 1 + ld 2


o R

or (A1 x + B 1 y + C 1z – d 1) + l(A2 x + B 2 y + C 2 z – d 2) = 0 ... (2)


which is the required Cartesian form of the equation of the plane passing through the
tt E

intersection of the given planes for each value of l.


Example 20 Find the vector equation of the plane passing through the intersection of
C

r r
the planes r × ( iˆ + jˆ + kˆ ) = 6 and r × (2iˆ + 3 ˆj + 4 kˆ ) = - 5, and the point (1, 1, 1).
r r
no N

Solution Here, n1 = iˆ + jˆ + kˆ and n2 = 2 iˆ + 3 ˆj + 4 kˆ;


and d 1 = 6 and d 2 = –5
r r r
Hence, using the relation r × ( n1 + l n2 ) = d1 + l d 2 , we get
©

r
r ×[ iˆ + jˆ + kˆ +l (2 iˆ + 3 ˆj + 4 kˆ )] = 6 - 5 l
r
or r ×[(1 + 2 l) iˆ + (1 + 3 l) jˆ + (1 + 4 l)kˆ ] = 6 - 5 l … (1)
where, l is some real number.
THREE D IMENSIONAL G EOMETRY 487

Taking
r
=+ +
r x iˆ y jˆ z kˆ , we get

( x iˆ + y ˆj + z kˆ ) ×[(1 + 2l ) iˆ + (1 + 3l ) ˆj + (1+ 4l )kˆ ]= 6 - 5l


or (1 + 2l ) x + (1 + 3l) y + (1 + 4l) z = 6 – 5l

d
or (x + y + z – 6 ) + l (2x + 3y + 4 z + 5) = 0 ... (2)

he
Given that the plane passes through the point (1,1,1), it must satisfy (2), i.e.
(1 + 1 + 1 – 6) + l (2 + 3 + 4 + 5) = 0
3
or l=

is
14
Putting the values of l in (1), we get

r éæ 3 ö æ 9 ö æ 6 ö ù

bl
15
r êç 1 + ÷ iˆ + ç1 + ÷ ˆj + ç1 + ÷ kˆ ú = 6-
ëè 7 ø è 14 ø è 7 ø û 14
pu
r æ 10 23 13 ö 69
or r ç iˆ + jˆ + kˆ ÷ =
è 7 14 7 ø 14
be T

r
or r × (20 iˆ + 23 ˆj + 26 kˆ ) = 69
re
which is the required vector equation of the plane.
o R

11.7 Coplanarity of Two Lines


Let the given lines be
tt E

r r r
r = a1 +l b1 ... (1)
r r r
and r = a2 + mb2 ... (2)
C

r
T he line (1) passes through the point, say A, with position vector a1 and is parallel
r r
to b1 . T he line (2) passes through the point, say B with position vector a2 and is parallel
no N

r
to b2 .
Thus,
uuur r r
AB = a2 a1 - uuur r r
©

T he given lines are coplanar if and only if AB is perpendicular to b1 ´b2 .


uuur r r r r r r
i.e. AB.( b1 ´ b2 ) = 0 or ( a2 - a1 ) × (b1 ´ b2 ) = 0
Carte sian form
Let (x1, y1, z1) and (x2, y2, z2) be the coordinates of the points A and B respectively.
488 M ATHE MATI CS

r r
Let a 1, b 1, c1 and a 2, b 2, c2 be the direction ratios of b1 and b2 , respectively. T hen

= - + - + -
uuur
AB ( x2 x1 ) iˆ ( y2 y1 ) ˆj (z 2 z1 )kˆ
= + +
r
= uuu+ + r
b1 a1 iˆ b1 ˆj c1 kˆ and b2 a 2 iˆ b2 ˆj c2 kˆ
r r r

d
T he given lines are coplanar if and only if AB× ( b1 ´b2 ) = 0 . In the cartesian form,
it can be expressed as
- - -

he
x2 x1 y2 y1 z2 z1
a1 b1 c1 0 = ... (4)
a2 b2 c2

is
Example 21 Showthat the lines
x +3 y -1 z - 5 x +1 y - 2 z - 5
= = = =

bl
and are coplanar..
–3 1 5 –1 2 5
Solution Here, x1 = – 3, y1 = 1, z1 = 5, a 1 = – 3, b 1 = 1, c1 = 5
x2 = – 1, y2 = 2, z2 = 5, a 2 = –1, b 2 = 2, c2 = 5
pu
Now, consider the determinant
x2 -x y2 -y z2 -z 2 1 0
=- =0
be T

1 1 1
a1 b1 c1 3 1 5
-1
re

a2 b2 c2 2 5
o R

T herefore, lines are coplanar.


11.8 Angle between Two Planes
tt E

De finition 2 T he angle between two planes is defined as the angle between their
normals (Fig 11.18 (a)). Observe that if q is an angle between the two planes, then so
is 180 – q (Fig 11.18 (b)). We shall take the acute angle as the angles between
C

two planes.
no N
©

Fig 11.18
THREE D IMENSIONAL G EOMETRY 489

r r
If n1 and n2 are normals to the planes and q be the angle between the planes
r r
× r r
r n1 = d 1 and r . n 2 = d 2 .
T hen q is the angle between the normals to the planes drawn from some common

d
point.
r r
n1 × n2
cos q = r

he
We have, r
| n1 | | n2 |
r r
A
r
Note T he planes are perpendicular to each other if n . n
r
1 2 = 0 and parallel if
n1 is parallel to n2 .

is
Carte sian form Let q be the angle between the planes,
A1 x + B1 y + C1z + D1 = 0 and A2x + B2 y + C2 z + D2 = 0

bl
T he direction ratios of the normal to the planes are A1, B1, C1 and A2, B2, C2
respectively.
pu
A1 A2 + B1 B2 + C1 C2
T herefore, cos q =
A12 + B 12 + C12 A 22 + B 22 + C22
be T

ANote
re
o R

1. If th e p lan es are at right angles, t hen q = 9 0 o a nd so cos q = 0 .


Hence, cos q = A1A2 + B1B2 + C1C2 = 0.

=BB =CC .
tt E

A1 1 1
2. If the planes are parallel, then
A2 2 2
C

Example 22 Find the angle between the two planes 2x + y – 2z = 5 and 3x – 6y – 2z = 7


using vector method.
no N

Solution T he angle between two planes is the angle between their normals. From the
equation of the planes, the normal vectors are
ur ur
N1 = 2 iˆ + ˆj - 2 kˆ and N 2 = 3 iˆ - 6 ˆj - 2 kˆ
©

ur ur
N1 × N2 (2 iˆ + jˆ - 2 kˆ ) × (3 iˆ - 6 jˆ - 2 kˆ ) æ4 ö
T herefore cos q = ur ur = = ç ÷
| N1 | |N 2 | 4 + 1 + 4 9 + 36 + 4 è 21 ø
æ4 ö
Hence q = cos – 1 ç ÷
è 21 ø
490 M ATHE MATI CS

Example 23 Find the angle between the two planes 3x – 6y + 2z = 7 and 2x + 2y – 2z =5.
Solution Comparing the given equations of the planes with the equations
A1 x + B1 y + C1 z + D1 = 0 and A2 x + B2 y + C2 z + D2 = 0
We get A1 = 3, B1 = – 6, C1 = 2

d
A2 = 2, B2 = 2, C2 = – 2
3 ´ 2 + ( -6) (2) + (2) (-2)

he
cos q =
(32 + (- 6)2 + (-2)2 ) (2 2 + 22 + (-2)2 )
-10 5 5 3
= = =

is
7´2 3 7 3 21

æ5 3ö
T herefore, q = cos-1 çç ÷÷

bl
è 21 ø
11.9 Distance of a Point from a Plane
pu
Ve ctor form
r
Consider a point P with positio n vector a and a plane p 1 whose equation is
r
r × nˆ = d (Fig 11.19).
be T

Z Z
re
o R

p1 p2
p2
Q P
tt E

P p1 a
d N’
a N’ N Y
C

d O
O Y N
X X
no N

(a) (b)
Fig 11.19

Consider a plane p2 through P parallel to the plane p1. T he unit vector normal to
r r
p2 is n̂ . Hence, its equation is ( r - a ) × nˆ = 0
©

r r
i.e., r × nˆ = a × nˆ
r
T hus, the distance ON¢ of this plane from the origin is | a × nˆ | . T herefore, the distance
PQ from the plane p1 is (Fig. 11.21 (a))
i.e., ON – ON¢ = |d – a nˆ |
r
×
THREE D IMENSIONAL G EOMETRY 491

which is the length of the perpendicular from a point to the given plane.
We may establish the similar results for (Fig 11.19 (b)).

ANote
× =d , where urN is normal
r ur

d
1. If theequationof theplanep2 is in the form r N

| a ×N -d | .
r ur

he
to the plane, then the perpendicular distance is ur
| N|

2.
r ur
T he length of the perpendicular from origin O to the plane r N × =d is || Nurd ||

is
r
(since a = 0).

bl
Carte sian form
r
Let P(x1, y1, z1) be the given point with position vector a and
pu Ax + By + Cz = D
be the Cartesian equation of the given plane. T hen
r
a = x1 iˆ + y1 jˆ + z1 kˆ
be T

ur
N = A iˆ + B ˆj + C kˆ
re
o R

Hence, from Note 1, the perpendicular from P to the plane is

( x1 iˆ + y1 jˆ + z1 kˆ ) × ( A iˆ + B jˆ + C kˆ ) - D
tt E

A2 + B2 + C 2
C

A x1 +B y +C z -D
A +B +C
1 1
= 2 2 2
no N

Example 24 Find the distance of a point (2, 5, – 3) from the plane


r
r × ( 6 iˆ - 3 ˆj + 2 kˆ ) = 4

= + - = - +
©

r ur
Solution Here, a 2 iˆ 5 ˆj 3 kˆ , N 6 ˆi 3 ˆj 2 kˆ and d = 4.
T herefore, the distance of the point (2, 5, – 3) from the given plane is

| (2 iˆ + 5 ˆj - 3 kˆ ) × (6 ˆi - 3 ˆj + 2 kˆ ) - 4| | 12 - 15 - 6 - 4 | 13
= =
| 6 iˆ - 3 ˆj + 2 kˆ | 36 + 9 + 4 7
492 M ATHE MATI CS

11.10 Angle between a Line and a Plane


De finition 3 T he angle between a line and a plane is
the complement of the angle between the line and
normal to the plane (Fig 11.20).

d
Ve ctor form If th e equation of the line is
r r r
r = a + l b a nd the equation of the pla ne is

he
r r
r × n = d . T hen the angle q between the line and the
normal to the plane is Fig 11.20
r r
b ×n

is
cos q = r r
| b | ×| n |
and so the angle f between the line and the plane is given by 90 – q, i.e.,

bl
sin (90 – q) = cos q
r r
b×n –1 b × n
pu
i.e. sin f = r r or f = sin
|b | | n | b n
be T

Example 25 Find the angle between the line


re

x+1 y z-3
o R

= =
2 3 6
and the plane 10 x + 2y – 11 z = 3.
tt E

Solution Let q be the angle between the line and the normal to the plane. Converting the
given equations into vector form, we have
C

r
r = ( – iˆ + 3 kˆ ) + l ( 2 iˆ + 3 ˆj + 6 kˆ )
r
r × ( 10 ˆi + 2 ˆj - 11 kˆ ) = 3
no N

and
r r
He re b = 2 iˆ + 3 ˆj + 6 kˆ and n = 10 iˆ + 2 ˆj - 11 kˆ

(2 iˆ + 3 ˆj + 6 kˆ ) × (10 ˆi + 2 ˆj - 11 kˆ )
©

sin f =
2 2 + 32 + 6 2 10 2 + 22 + 112

- 40 -8 8 æ 8 ö
= = = or f = sin -1 ç ÷
7 ´ 15 21 21 è 21 ø
THREE D IMENSIONAL G EOMETRY 493

EXERCIS E 11.3
1. In each of the following cases, determine the direction cosines of the normal to
the plane and the distance from the origin.
(a) z = 2 (b) x + y + z = 1

d
(c) 2x + 3y – z = 5 (d) 5y + 8 = 0

he
2. Find the vector equation of a plane which is at a distance of 7 units from the
origin and normal to the vector 3 iˆ + 5 ˆj - 6 kˆ.
3. Find the Cartesian equation of the following planes:

is
r r
(a) r × ( iˆ + jˆ - kˆ ) = 2 (b) r × (2 iˆ + 3 ˆj - 4 kˆ ) = 1
r
(c) r ×[( s - 2 t) iˆ + (3 - t ) ˆj + (2 s + t ) kˆ ] = 15

bl
4. In the following cases, find the coordinates of the foot of the perpendicular
drawn from the origin.
(a) 2x + 3y + 4z – 12 = 0 (b) 3y + 4z – 6 = 0
pu
(c) x + y + z = 1 (d) 5y + 8 = 0
5. Find the vector and cartesian equations of the planes
be T

(a) that passes through the point (1, 0, – 2) and the normal to the plane is
+-
re

iˆ ˆj kˆ .
o R

(b) that passes through the point (1,4, 6) and the normal vector to the plane is
-+
ˆi 2 ˆj kˆ .
tt E

6. Find the equations of the planes that passes through three points.
(a) (1, 1, – 1), (6, 4, – 5), (– 4, – 2, 3)
C

(b) (1, 1, 0), (1, 2, 1), (– 2, 2, – 1)


7. Find the intercepts cut off by the plane 2x + y – z = 5.
8. Find the equation of the plane with intercept 3 on the y-axis and parallel to ZOX
no N

plane.
9. Find the e quatio n of t he pla ne thro ugh th e inte rsection of the planes
3x – y + 2z – 4 = 0 and x + y + z – 2 = 0 and the point (2, 2, 1).
©

10. Find the vector equation of the plane passing through the intersection of the
r r
planes r .( 2 iˆ + 2 jˆ - 3 kˆ ) = 7 , r .( 2 iˆ + 5 ˆj + 3 kˆ ) = 9 and through the point
(2, 1, 3).
11. Find the equation of the plane thro ugh the line of interse ction of the
planes x + y + z = 1 and 2x + 3y + 4z = 5 which is perpendicular to the plane
x – y + z = 0.
494 M ATHE MATI CS

12. Find the angle between the planes whose vector equations are
r r
r × (2 iˆ + 2 jˆ - 3 kˆ ) = 5 and r × (3 iˆ - 3 jˆ + 5 kˆ ) = 3.
13. In the following cases, determine whether the given planes are parallel or

d
perpendicular, and in case they are neither, find the angles between them.
(a) 7x + 5y + 6z + 30 = 0 and 3x – y – 10z + 4 = 0

he
(b) 2x + y + 3z – 2 = 0 and x – 2y + 5 = 0
(c) 2x – 2y + 4z + 5 = 0 and 3x – 3y + 6z – 1 = 0
(d) 2x – y + 3z – 1 = 0 and 2x – y + 3z + 3 = 0

is
(e) 4x + 8y + z – 8 = 0 and y + z – 4 = 0

14. In the following cases, find the distance of each of the given points from the

bl
corresponding given plane.
Point Plane
(a) (0, 0, 0) 3x – 4y + 12 z = 3
pu
(b) (3, – 2, 1) 2x – y + 2z + 3 = 0
(c) (2, 3, – 5) x + 2y – 2z = 9
be T

(d) (– 6, 0, 0) 2x – 3y + 6z – 2 = 0
re
o R

Miscellaneous Examples
Example 26 A line makes angles a, b, g and d with the diagonals of a cube, prove that
tt E

4
cos2 a + cos2 b + cos2 g + cos2 d =
3
C

Solution A cube is a rectangular parallelopiped having equal length, breadth and height.
Let OADBFEGC be the cube with each side of length a units. (Fig 11.21)
no N

T he four diagonals are OE, AF, BG and CD. Z


T he direction cosines of the diagonal OE which
C(0, 0, a)
is the line joining two points O and E are F(0, a, a)
- - - (a, 0, a) G
©

a 0 a 0 a 0
a2 +a +a
2 2
,
a2 +a +a
2 2
,
a2 +a +a
2 2
O
Y
B(0, a, 0)
1 1 1 D(a, a, 0)
i.e., , , X
3 3 3 Fig 11.21
THREE D IMENSIONAL G EOMETRY 495

–1 1 1 1
Similarly, the direction cosines of AF, BG and CD are , , ; ,
3 3 3 3
–1 1 1 1 –1
, and , ,
, respectively..

d
3 3 3 3 3
Let l, m, n be the direction cosines of the given line which makes angles a, b, g, d

he
with OE, AF, BG, CD, respectively. T hen
1 1
cosa = (l + m+ n); cos b = (– l + m + n);
3 3

is
1 1
cosg = (l – m + n); cos d = (l + m – n) (Why?)
3 3
Squaring and adding, we get

bl
cos2a + cos2 b + cos2 g + cos2 d
1
[ (l + m + n ) 2 + (–l + m + n) 2 ] + (l – m + n) 2 + (l + m –n) 2]
pu
=
3
1 4
= [ 4 (l2 + m 2 + n 2 ) ] = (as l2 + m 2 + n 2 = 1)
be T

3 3
re
Example 27 Find the equation of the plane that contains the point (1, – 1, 2) and is
o R

perpendicular to each of the planes 2x + 3y – 2z = 5 and x + 2y – 3z = 8.


Solution T he equation of the plane containing the given point is
tt E

A (x – 1) + B(y + 1) + C (z – 2) = 0 ... (1)


Applying the condition of perpendicularly to the plane given in (1) with the planes
2x + 3y – 2z = 5 and x + 2y – 3z = 8, we have
C

2A + 3B – 2C = 0 and A + 2B – 3C = 0
Solving these equations, we find A = – 5C and B = 4C. Hence, the required
no N

equation is
– 5C (x – 1) + 4 C (y + 1) + C(z – 2) = 0
i.e. 5x – 4y – z = 7
©

Example 28 Find the distance between the point P(6, 5, 9) and the plane determined
by the points A (3, – 1, 2), B (5, 2, 4) and C(– 1, – 1, 6).
Solution Let A, B, C be the three points in the plane. D is the foot of the perpendicular
drawn from a point P to the plane. PD is the required distance to be determined, which
uuur uuur uuur
is the projection of AP on AB AC . ´
496 M ATHE MATI CS

uuur uuur
Hence, PD = the dot product of AP with the unit vector along AB
uuur
´AC
uuur
.
So AP = 3 iˆ + 6 jˆ + 7 kˆ

iˆ ˆj kˆ

d
and
uuur
AB ´ uuur
AC = 2 3 2 = 12 iˆ - 16 ˆj + 12 kˆ

he
-4 0 4

uuur
Unit vector along AB ´AC
uuur
=
3 iˆ - 4 ˆj + 3 kˆ
34

is
3 iˆ - 4 ˆj + 3 kˆ
Hence PD = ( 3 iˆ + 6 jˆ + 7 kˆ ) .
34

bl
3 34
=
17
pu
Alte rnative ly, find the equation of the plane passing through A, B and C and then
compute the distance of the point P from the plane.
Example 29 Showthat the lines
be T

x- a+d y-a z-a-d


re
= =
o R

a-d a a+d
x-b+c y-b z-b-c
and = = are coplanar..
tt E

b- g b b+g
Solution
C

He re x1 = a – d x2 = b–c
y1 = a y2 = b
z1 = a + d z2 = b+c
no N

a1 = a – d a2 = b–g
b1 = a b2 = b
c1 = a + d c2 = b+g
©

Nowconsider the determinant


x2 - x1 y2 - y1 z 2 - z1 b -c - a + d b - a b +c - a - d
a1 b1 c1 a -d a a+d
=
a2 b2 c2 b-g b b+g
THREE D IMENSIONAL G EOMETRY 497

Adding third column to the first column, we get


b-a b - a b +c - a - d
2 a a a +d =0
b b b+g

d
Since the first and second columns are identical. Hence, the given two lines are

he
coplanar.
Example 30 Find the coordinates of the point where the line through the points
A (3, 4, 1) and B(5, 1, 6) crosses the XY-plane.
Solution T he vector equation of the line through the points A and B is

is
r
r = 3 iˆ + 4 ˆj + kˆ + l [ (5 - 3) iˆ + (1 - 4) ˆj + ( 6 - 1) kˆ ]
r
i.e. r = 3 iˆ + 4 ˆj + kˆ + l ( 2 iˆ - 3 ˆj + 5 kˆ ) ... (1)

bl
Let P be the point where the line AB crosses the XY-plane. T hen the position
vector of the point P is of the form x iˆ + y ˆj .
pu
T his point must satisfy the equation (1). (Why ?)
i.e. x iˆ + y ˆj = (3 + 2 l ) iˆ + ( 4 - 3 l) ˆj + ( 1 + 5 l ) kˆ
be T

Equating the like coefficients of ˆi , ˆj and kˆ , we have


x= 3+2l
re
o R

y=4–3l
0= 1 +5 l
tt E

Solving the above equations, we get


13 23
x= and y =
C

5 5
æ 13 23 ö
Hence, the coordinates of the required point are ç , , 0÷ .
è5 5 ø
no N

Miscellaneous Exercise on Chapter 11


1. Show that the line joining the origin to the point (2, 1, 1) is perpendicular to the
©

line determined by the points (3, 5, – 1), (4, 3, – 1).


2. If l1, m 1, n 1 and l2, m 2, n 2 are the direction cosines of two mutually perpendicular
lines, show that the direction cosines of the line perpendicular to both of these
are m1 n 2 -m 2 n1 , n1 l2 -n
2 l1 , l1 m2 -l
2 m1
498 M ATHE MATI CS

3. Find the angle between the lines whose direction ratios are a, b, c a nd
b – c, c – a, a – b.
4. Find the equation of a line parallel to x-axis and passing through the origin.
5. If the coordinates of the points A, B, C, D be (1, 2, 3), (4, 5, 7), (– 4, 3, – 6) and

d
(2, 9, 2) respectively, then find the angle between the lines AB and CD.
x-1 y - 2 z -3 x -1 y - 1 z - 6

he
6. If the lines = = and = = are perpendicular,,
-3 2k 2 3k 1 -5
find the value of k.
7. Find the vector equation of the line passing through (1, 2, 3) and perpendicular to

is
r
the plane r . ( iˆ + 2 jˆ - 5 kˆ ) + 9 = 0 .
8. Find the equation of the plane passing through (a, b, c) and parallel to the plane
r
× ++ =

bl
r ( iˆ jˆ kˆ ) 2.
r
9. Find the shortest distance between lines r = 6 iˆ + 2 jˆ + 2 kˆ + l ( iˆ - 2 jˆ + 2 kˆ )
pu
r
and r = - 4 iˆ - kˆ + m (3 iˆ - 2 ˆj - 2 kˆ ) .
10. Find the coordinates of the point where the line through (5, 1, 6) and (3, 4,1)
be T

crosses the YZ-plane.


11. Find the coordinates of the point where the line through (5, 1, 6) and (3, 4, 1)
re
o R

crosses the ZX-plane.


12. Find the coordinates of the point where the line through (3, – 4, – 5) and
(2, – 3, 1) crosses the plane 2x + y + z = 7.
tt E

13. Find the equation of the plane passing through the point (– 1, 3, 2) and perpendicular
to each of the planes x + 2y + 3z = 5 and 3x + 3y + z = 0.
C

14. If the poin ts ( 1, 1 , p) and (– 3 , 0, 1) be e quidist ant from the pla ne


r
× + - +=
r (3 iˆ 4 jˆ 12 kˆ ) 13 0, then find the value of p.
no N

15. Find the equation of the plane passing through the line of intersection of the
r r
planes r × ( iˆ + jˆ + kˆ ) = 1 and r × (2 iˆ + 3 jˆ - kˆ ) + 4 = 0 and parallel to x-axis.
16. If O be the origin and the coordinates of P be (1, 2, – 3), then find the equation of
©

the plane passing through P and perpendicular to OP.


17. Find the equation of the plane which contains the line of intersection of the planes
r r
r × ( iˆ + 2 jˆ + 3 kˆ) - 4 = 0 , r × (2 iˆ + jˆ - kˆ) + 5 = 0 and which is perpendicular to the
r
plane r × (5 iˆ + 3 ˆj - 6 kˆ ) + 8 = 0 .
THREE D IMENSIONAL G EOMETRY 499

18. Find the distance of the point (– 1, – 5, – 10) from the point of intersection of the
r r
line r = 2 iˆ - jˆ + 2 kˆ + l (3 iˆ + 4 jˆ + 2 kˆ ) and the plane r × ( iˆ - jˆ + kˆ ) = 5 .
19. Find the vector equation of the line passing through (1, 2, 3) and parallel to the
r r
planes r × ( iˆ - jˆ + 2 kˆ ) = 5 and r × (3 iˆ + ˆj + kˆ ) = 6 .

d
20. Find the vector equation of the line passing through the point (1, 2, – 4) and

he
perpendicular to the two lines:
x - 8 y + 19 z -10 x - 15 y - 29 z - 5 .
= = and = =
3 - 16 7 3 8 -5
21. Prove that if a plane has the intercepts a, b, c and is at a distance of p units from

is
1 1 1 1
the origin, then + 2 + 2 = 2.
a 2
b c p

bl
Choose the correct answer in Exercises 22 and 23.
22. Distance between the two planes: 2x + 3y + 4z = 4 and 4x + 6y + 8z = 12 is
pu
2
(A) 2 units (B) 4 units (C) 8 units (D) units
29
be T

23. T he planes: 2x – y + 4z = 5 and 5x – 2.5y + 10z = 6 are


re
(A) Perpendicular (B) Parallel
o R

æ 5ö
(C) intersect y-axis (D) passes through ç 0, 0, ÷
è 4ø
tt E

Summary

® Dire ction cosine s of a line are the cosines of the angles made by the line
C

with the positive directions of the coordinate axes.


® If l, m, n are the direction cosines of a line, then l2 + m 2 + n 2 = 1.
no N

® Direction cosines of a line joining two points P(x1, y1, z1) and Q(x2, y2, z2) are
x2 -x , y -y , z -z
1 2 1 2 1
PQ PQ PQ
©

where PQ = ( x 2 - x1 ) 2 + ( y 2 - y1 ) 2 + ( z 2 - z1 )2
® Dire ction ratios of a line are the numbers which are proportional to the
direction cosines of a line.
® If l, m, n are the direction cosines and a, b, c are the direction ratios of a line
500 M ATHE MATI CS

then
a b c
l= ;m = ;n=
a +b +c
2 2 2
a +b + c
2 2 2
a + b 2 + c2
2

d
® Ske w line s are lines in space which are neither parallel nor intersecting.
T hey lie in different planes.

he
® Angle be twe e n ske w line s is the angle between two intersecting lines
drawn from any point (preferably through the origin) parallel to each of the
skew lines.
® If l1, m 1, n 1 and l2, m 2, n 2 are the direction cosines of two lines; and q is the
acute angle between the two lines; then

is
cosq = |l1l2 + m 1m 2 + n 1n 2 |
® If a 1, b 1, c1 and a 2, b 2, c2 are the direction ratios of two lines and q is the

bl
acute angle between the two lines; then
a1 a 2 + b1 b2 + c1 c2
cosq =
+ b12 + c12 a22 + b22 + c22
pu
a12
® Vector equation of a line that passes through the given point whose position
r r r
r r
vector is a and parallel to a given vector b is r = a + l b .
be T

® Equation of a line through a point (x1, y1,z1) and having direction cosines l, m, n is
re
o R

x - x1 y - y1 z - z1
= =
l m n
® T he vector equation of a line which passes through two points whose position
tt E

r r r r r r
vectors are a and b is r = a + l (b - a ) .
® Cartesian equation of a line that passes through two points (x1, y1, z1) and
C

x - x1 y - y1 z - z1 .
(x2, y2, z2) is = =
x2 - x1 y2 - y1 z 2 - z1
no N

r r r r r r
® If q is th e acute angle be twee n r = a1 + l b1 an d r = a 2 + l b2 , th en
r r
b1 × b2
cos q = r r
| b1 | | b2 |
©

x - x1 y - y1 z - z1 x - x2 y - y2 z - z2
® If = = and = =
l1 m1 n1 l 2 m 2 n2
are the equations of two lines, then the acute angle between the two lines is
given by cos q = |l1 l2 + m 1 m 2 + n 1 n 2 |.
THREE D IMENSIONAL G EOMETRY 501

® Shortest distance between two skew lines is the line segment perpendicular
to both the lines.

®
r r
Shortest distance between r a1 = +l r r r
b1 and r a2 = +m
r
b2 is

d
r r r r
( b1 ´ b2 ) × (a 2 – a1 )
r r
| b1 ´ b2 |

he
x - x1 y - y1 z - z1
® Shortest distance between the lines: = = and
a1 b1 c1

is
x - x2 y - y2 z - z2
= = is
a2 b2 c2

bl
x2 - x1 y2 - y1 z 2 - z1
a1 b1 c1
pu
a2 b2 c2
(b1c2 - b2 c1 )2 + (c1a 2 - c2 a1 )2 + (a1b2 - a 2 b1 )2
be T

®
r r
Distance between parallel lines r a1 = +l r r
b and r =ar +mbr is
re
2
r r r
o R

b ´ ( a 2 - a1 )
r
|b|
tt E

® In the vector form, equation of a plane which is at a distance d from the


origin, and n̂ is the unit vector normal to the plane through the origin is
C

r
r × nˆ = d .
® Equation of a plane which is at a distance of d from the origin and the direction
no N

cosines of the normal to the plane as l, m, n is lx + my + nz = d.


r
® T he equation of a plane through a point whose position vector is a and
ur r r ur
perpendicular to the vector N is ( r - a ) . N = 0 .
©

® Equation of a plane perpendicular to a given line with direction ratiosA, B, C


and passing through a given point (x1, y1, z1) is
A (x – x1) + B (y – y1) + C (z – z1 ) = 0
® Equation of a plane passing through three non collinear points (x1, y1, z1),
502 M ATHE MATI CS

(x2, y2, z2) and (x3, y3, z3) is

x - x1 y - y1 z - z1
x2 - x1 y2 - y1 z 2 - z1
=0

d
x3 - x1 y3 - y1 z 3 - z1

he
Vector equation of a plane that contains three non collinear points having
r r r r r r r r r
position vectors a , b and c is ( r - a ) . [ (b - a ) ´ ( c - a ) ] = 0

® Equation of a plane that cuts the coordinates axes at (a, 0, 0), (0, b, 0) and
(0, 0, c) is

is
x y z
+ + =1
a b c

bl
® Vec tor e quation of a plane t hat p asses thro ugh t he in terse ction of
r r r r r r r
planes r × n1 = d1 and r × n2 = d 2 is r × ( n1 + l n 2 ) = d1 + l d 2 , where l is any
pu
nonzero constant.
® Cartesian equation of a plane that passes through the intersection of two
be T

given planes A1 x + B1 y + C1 z + D1 = 0 and A2 x + B2 y + C2 z + D2 = 0


is (A1 x + B1 y + C1 z + D1) + l(A2 x + B2 y + C2 z + D2) = 0.
re
o R

r r r r r r
® Two lines r = a1 + l b1 and r = a2 + m b2 are coplanar if
r r r r
( a2 - a1 ) × (b1 ´ b2 ) = 0
tt E

® In the cartesian form above lines passing through the points A(x 1, y 1, z1) and
B ( x 2 , y 2 , z2 )
C

x2 - x1 y2 - y1 z 2 - z1
y – y2 z – z 2
no N

= = are coplanar if
a1 b1 c1
= 0.
b2 C2 a2 b2 c2
r r
® In the vector form, if q is the angle between the two planes, r × n1 = d1 and
©

r r
r r | n1 × n2 |
r × n 2 = d 2 , then q = cos r r .
–1

| n1 || n2 |
r r r r
® T he angle f between the line r = a + l b and the plane r × nˆ = d is
THREE D IMENSIONAL G EOMETRY 503

r
b × nˆ
sin f = r
|b | | nˆ |
® T he angle q between the planes A1x + B1y + C1z + D1 = 0 and

d
A2 x + B2 y + C2 z + D2 = 0 is given by

he
A1 A 2 + B1 B 2 + C1 C 2
cos q =
A21 + B21 + C12 A 22 + B 22 + C22
r r
® T he distance of a point whose position vector is a from the plane r × nˆ = d is
r

is
| d - a × nˆ |
® T he distance from a point (x1, y1, z1) to the plane Ax + By + Cz + D = 0 is

bl
Ax1 + By1 + Cz1 + D
.
A 2 + B2 + C2
pu
be T

—v —
re
o R
tt E
C
no N
©
504 MATHEMATICS

Chapter 12
LINEAR PROGRAMMING

he
™ The mathematical experience of the student is incomplete if he never had
the opportunity to solve a problem invented by himself. – G. POLYA ™

is
12.1 Introduction
In earlier classes, we have discussed systems of linear

bl
equations and their applications in day to day problems. In
Class XI, we have studied linear inequalities and systems
of linear inequalities in two variables and their solutions by
pu
graphical method. Many applications in mathematics
involve systems of inequalities/equations. In this chapter,
we shall apply the systems of linear inequalities/equations
be T

to solve some real life problems of the type as given below:


A furniture dealer deals in only two items–tables and
re
o R

chairs. He has Rs 50,000 to invest and has storage space


of at most 60 pieces. A table costs Rs 2500 and a chair
Rs 500. He estimates that from the sale of one table, he
tt E

can make a profit of Rs 250 and that from the sale of one L. Kantorovich
chair a profit of Rs 75. He wants to know how many tables and chairs he should buy
from the available money so as to maximise his total profit, assuming that he can sell all
C

the items which he buys.


Such type of problems which seek to maximise (or, minimise) profit (or, cost) form
a general class of problems called optimisation problems. Thus, an optimisation
no N

problem may involve finding maximum profit, minimum cost, or minimum use of
resources etc.
A special but a very important class of optimisation problems is linear programming
©

problem. The above stated optimisation problem is an example of linear programming


problem. Linear programming problems are of much interest because of their wide
applicability in industry, commerce, management science etc.
In this chapter, we shall study some linear programming problems and their solutions
by graphical method only, though there are many other methods also to solve such
problems.
LINEAR PROGRAMMING 505

12.2 Linear Programming Problem and its Mathematical Formulation


We begin our discussion with the above example of furniture dealer which will further
lead to a mathematical formulation of the problem in two variables. In this example, we
observe
(i) The dealer can invest his money in buying tables or chairs or combination thereof.
Further he would earn different profits by following different investment

he
strategies.
(ii) There are certain overriding conditions or constraints viz., his investment is
limited to a maximum of Rs 50,000 and so is his storage space which is for a
maximum of 60 pieces.

is
Suppose he decides to buy tables only and no chairs, so he can buy 50000 ÷ 2500,
i.e., 20 tables. His profit in this case will be Rs (250 × 20), i.e., Rs 5000.

bl
Suppose he chooses to buy chairs only and no tables. With his capital of Rs 50,000,
he can buy 50000 ÷ 500, i.e. 100 chairs. But he can store only 60 pieces. Therefore, he
is forced to buy only 60 chairs which will give him a total profit of Rs (60 × 75), i.e.,
pu
Rs 4500.
There are many other possibilities, for instance, he may choose to buy 10 tables
and 50 chairs, as he can store only 60 pieces. Total profit in this case would be
be T

Rs (10 × 250 + 50 × 75), i.e., Rs 6250 and so on.


re
We, thus, find that the dealer can invest his money in different ways and he would
o R

earn different profits by following different investment strategies.


Now the problem is : How should he invest his money in order to get maximum
tt E

profit? To answer this question, let us try to formulate the problem mathematically.

12.2.1 Mathematical formulation of the problem


C

Let x be the number of tables and y be the number of chairs that the dealer buys.
Obviously, x and y must be non-negative, i.e.,
no N

x 0 ... (1)
(Non-negative constraints)
y 0 ... (2)
The dealer is constrained by the maximum amount he can invest (Here it is
Rs 50,000) and by the maximum number of items he can store (Here it is 60).
©

Stated mathematically,
2500x + 500y ≤ 50000 (investment constraint)
or 5x + y ≤ 100 ... (3)
and x + y ≤ 60 (storage constraint) ... (4)
506 MATHEMATICS

The dealer wants to invest in such a way so as to maximise his profit, say, Z which
stated as a function of x and y is given by
Z = 250x + 75y (called objective function) ... (5)
Mathematically, the given problems now reduces to:
Maximise Z = 250x + 75y
subject to the constraints:

he
5x + y ≤ 100
x + y ≤ 60
x ≥ 0, y ≥ 0

is
So, we have to maximise the linear function Z subject to certain conditions determined
by a set of linear inequalities with variables as non-negative. There are also some other

bl
problems where we have to minimise a linear function subject to certain conditions
determined by a set of linear inequalities with variables as non-negative. Such problems
are called Linear Programming Problems.
pu
Thus, a Linear Programming Problem is one that is concerned with finding the
optimal value (maximum or minimum value) of a linear function (called objective
be T

function) of several variables (say x and y), subject to the conditions that the variables
are non-negative and satisfy a set of linear inequalities (called linear constraints).
re
The term linear implies that all the mathematical relations used in the problem are
o R

linear relations while the term programming refers to the method of determining a
particular programme or plan of action.
tt E

Before we proceed further, we now formally define some terms (which have been
used above) which we shall be using in the linear programming problems:
Objective function Linear function Z = ax + by, where a, b are constants, which has
C

to be maximised or minimized is called a linear objective function.


In the above example, Z = 250x + 75y is a linear objective function. Variables x and
no N

y are called decision variables.


Constraints The linear inequalities or equations or restrictions on the variables of a
linear programming problem are called constraints. The conditions x ≥ 0, y ≥ 0 are
called non-negative restrictions. In the above example, the set of inequalities (1) to (4)
©

are constraints.
Optimisation problem A problem which seeks to maximise or minimise a linear
function (say of two variables x and y) subject to certain constraints as determined by
a set of linear inequalities is called an optimisation problem. Linear programming
problems are special type of optimisation problems. The above problem of investing a
LINEAR PROGRAMMING 507

given sum by the dealer in purchasing chairs and tables is an example of an optimisation
problem as well as of a linear programming problem.
We will now discuss how to find solutions to a linear programming problem. In this
chapter, we will be concerned only with the graphical method.
12.2.2 Graphical method of solving linear programming problems
In Class XI, we have learnt how to graph a system of linear inequalities involving two

he
variables x and y and to find its solutions graphically. Let us refer to the problem of
investment in tables and chairs discussed in Section 12.2. We will now solve this problem
graphically. Let us graph the constraints stated as linear inequalities:
5x + y ≤ 100 ... (1)

is
x + y ≤ 60 ... (2)
x≥0 ... (3)

bl
y≥0 ... (4)
The graph of this system (shaded region) consists of the points common to all half
planes determined by the inequalities (1) to (4) (Fig 12.1). Each point in this region
pu
represents a feasible choice open to the dealer for investing in tables and chairs. The
region, therefore, is called the feasible region for the problem. Every point of this
be T

region is called a feasible solution to the problem. Thus, we have,


Feasible region The common region determined by all the constraints including
re
non-negative constraints x, y ≥ 0 of a linear programming problem is called the feasible
o R

region (or solution region) for the problem. In Fig 12.1, the region OABC (shaded) is
the feasible region for the problem. The region other than feasible region is called an
tt E

infeasible region.
Feasible solutions Points within and on the
boundary of the feasible region represent
C

feasible solutions of the constraints. In


Fig 12.1, every point within and on the
no N

boundary of the feasible region OABC


represents feasible solution to the problem.
For example, the point (10, 50) is a feasible
solution of the problem and so are the points
©

(0, 60), (20, 0) etc.


Any point outside the feasible region is
called an infeasible solution. For example,
the point (25, 40) is an infeasible solution of
the problem. Fig 12.1
508 MATHEMATICS

Optimal (feasible) solution: Any point in the feasible region that gives the optimal
value (maximum or minimum) of the objective function is called an optimal solution.
Now, we see that every point in the feasible region OABC satisfies all the constraints
as given in (1) to (4), and since there are infinitely many points, it is not evident how
we should go about finding a point that gives a maximum value of the objective function
Z = 250x + 75y. To handle this situation, we use the following theorems which are

he
fundamental in solving linear programming problems. The proofs of these theorems
are beyond the scope of the book.
Theorem 1 Let R be the feasible region (convex polygon) for a linear programming
problem and let Z = ax + by be the objective function. When Z has an optimal value

is
(maximum or minimum), where the variables x and y are subject to constraints described
by linear inequalities, this optimal value must occur at a corner point* (vertex) of the

bl
feasible region.
Theorem 2 Let R be the feasible region for a linear programming problem, and let
Z = ax + by be the objective function. If R is bounded**, then the objective function
pu
Z has both a maximum and a minimum value on R and each of these occurs at a
corner point (vertex) of R.
be T

Remark If R is unbounded, then a maximum or a minimum value of the objective


re
function may not exist. However, if it exists, it must occur at a corner point of R.
o R

(By Theorem 1).


In the above example, the corner points (vertices) of the bounded (feasible) region
tt E

are: O, A, B and C and it is easy to find their coordinates as (0, 0), (20, 0), (10, 50) and
(0, 60) respectively. Let us now compute the values of Z at these points.
C

We have

Vertex of the Corresponding value


no N

Feasible Region of Z (in Rs)


O (0,0) 0
C (0,60) 4500
B (10,50) 6250 ← Maximum
©

A (20,0) 5000

* A corner point of a feasible region is a point in the region which is the intersection of two boundary lines.
** A feasible region of a system of linear inequalities is said to be bounded if it can be enclosed within a
circle. Otherwise, it is called unbounded. Unbounded means that the feasible region does extend
indefinitely in any direction.
LINEAR PROGRAMMING 509

We observe that the maximum profit to the dealer results from the investment
strategy (10, 50), i.e. buying 10 tables and 50 chairs.
This method of solving linear programming problem is referred as Corner Point
Method. The method comprises of the following steps:
1. Find the feasible region of the linear programming problem and determine its

he
corner points (vertices) either by inspection or by solving the two equations of
the lines intersecting at that point.
2. Evaluate the objective function Z = ax + by at each corner point. Let M and m,
respectively denote the largest and smallest values of these points.

is
3. (i) When the feasible region is bounded, M and m are the maximum and
minimum values of Z.

bl
(ii) In case, the feasible region is unbounded, we have:
4. (a) M is the maximum value of Z, if the open half plane determined by
ax + by > M has no point in common with the feasible region. Otherwise, Z
pu has no maximum value.
(b) Similarly, m is the minimum value of Z, if the open half plane determined by
be T

ax + by < m has no point in common with the feasible region. Otherwise, Z


re
has no minimum value.
o R

We will now illustrate these steps of Corner Point Method by considering some
examples:
tt E

Example 1 Solve the following linear programming problem graphically:


Maximise Z = 4x + y ... (1)
C

subject to the constraints:


x + y ≤ 50 ... (2)
no N

3x + y ≤ 90 ... (3)
x ≥ 0, y ≥ 0 ... (4)
©

Solution The shaded region in Fig 12.2 is the feasible region determined by the system
of constraints (2) to (4). We observe that the feasible region OABC is bounded. So,
we now use Corner Point Method to determine the maximum value of Z.
The coordinates of the corner points O, A, B and C are (0, 0), (30, 0), (20, 30) and
(0, 50) respectively. Now we evaluate Z at each corner point.
510 MATHEMATICS

Corner Point Corresponding value


of Z
(0, 0) 0
(30, 0) 120 ← Maximum
(20, 30) 110
(0, 50) 50

he
is
bl
Fig 12.2
Hence, maximum value of Z is 120 at the point (30, 0).
Example 2 Solve the following linear programming problem graphically:
pu
Minimise Z = 200 x + 500 y ... (1)
subject to the constraints:
x + 2y ≥ 10
be T

... (2)
3x + 4y ≤ 24 ... (3)
re
x ≥ 0, y ≥ 0
o R

... (4)
Solution The shaded region in Fig 12.3 is the feasible region ABC determined by the
system of constraints (2) to (4), which is bounded. The coordinates of corner points
tt E

Corner Point Corresponding value


of Z
C

(0, 5) 2500
(4, 3) 2300 ← Minimum
no N

(0, 6) 3000
©

Fig 12.3
LINEAR PROGRAMMING 511

A, B and C are (0,5), (4,3) and (0,6) respectively. Now we evaluate Z = 200x + 500y
at these points.
Hence, minimum value of Z is 2300 attained at the point (4, 3)
Example 3 Solve the following problem graphically:
Minimise and Maximise Z = 3x + 9y ... (1)

he
subject to the constraints: x + 3y ≤ 60 ... (2)
x + y ≥ 10 ... (3)
x ≤y ... (4)
x ≥ 0, y ≥ 0 ... (5)

is
Solution First of all, let us graph the feasible region of the system of linear inequalities
(2) to (5). The feasible region ABCD is shown in the Fig 12.4. Note that the region is

bl
bounded. The coordinates of the corner points A, B, C and D are (0, 10), (5, 5), (15,15)
and (0, 20) respectively.
pu Corner Corresponding value of
Point Z = 3x + 9y
A (0, 10) 90
be T

B (5, 5) 60 ← Minimum
re
C (15, 15) 180
}← Maximum
o R

D (0, 20) 180 (Multiple


optimal
solutions)
tt E
C

Fig 12.4
no N

We now find the minimum and maximum value of Z. From the table, we find that
the minimum value of Z is 60 at the point B (5, 5) of the feasible region.
The maximum value of Z on the feasible region occurs at the two corner points
C (15, 15) and D (0, 20) and it is 180 in each case.
©

Remark Observe that in the above example, the problem has multiple optimal solutions
at the corner points C and D, i.e. the both points produce same maximum value 180. In
such cases, you can see that every point on the line segment CD joining the two corner
points C and D also give the same maximum value. Same is also true in the case if the
two points produce same minimum value.
512 MATHEMATICS

Example 4 Determine graphically the minimum value of the objective function


Z = – 50x + 20y ... (1)
subject to the constraints:
2x – y ≥ – 5 ... (2)
3x + y ≥ 3 ... (3)
2x – 3y ≤ 12 ... (4)

he
x ≥ 0, y ≥ 0 ... (5)
Solution First of all, let us graph the feasible region of the system of inequalities (2) to
(5). The feasible region (shaded) is shown in the Fig 12.5. Observe that the feasible
region is unbounded.

is
We now evaluate Z at the corner points.

bl
pu Corner Point Z = – 50x + 20y
(0, 5) 100
be T

(0, 3) 60
re
(1, 0) –50

o R

(6, 0) – 300 smallest


tt E
C

Fig 12.5
From this table, we find that – 300 is the smallest value of Z at the corner point
(6, 0). Can we say that minimum value of Z is – 300? Note that if the region would
no N

have been bounded, this smallest value of Z is the minimum value of Z (Theorem 2).
But here we see that the feasible region is unbounded. Therefore, – 300 may or may
not be the minimum value of Z. To decide this issue, we graph the inequality
©

– 50x + 20y < – 300 (see Step 3(ii) of corner Point Method.)
i.e., – 5x + 2y < – 30
and check whether the resulting open half plane has points in common with feasible
region or not. If it has common points, then –300 will not be the minimum value of Z.
Otherwise, –300 will be the minimum value of Z.
LINEAR PROGRAMMING 513

As shown in the Fig 12.5, it has common points. Therefore, Z = –50 x + 20 y


has no minimum value subject to the given constraints.
In the above example, can you say whether z = – 50 x + 20 y has the maximum
value 100 at (0,5)? For this, check whether the graph of – 50 x + 20 y > 100 has points
in common with the feasible region. (Why?)

he
Example 5 Minimise Z = 3x + 2y
subject to the constraints:
x+y≥8 ... (1)
3x + 5y ≤ 15 ... (2)

is
x ≥ 0, y ≥ 0 ... (3)
Solution Let us graph the inequalities (1) to (3) (Fig 12.6). Is there any feasible region?

bl
Why is so?
From Fig 12.6, you can see that
pu
there is no point satisfying all the
constraints simultaneously. Thus, the
problem is having no feasible region and
be T

hence no feasible solution.


re
Remarks From the examples which we
o R

have discussed so far, we notice some


general features of linear programming
tt E

problems:
(i) The feasible region is always a
C

convex region.
Fig 12.6
(ii) The maximum (or minimum)
solution of the objective function occurs at the vertex (corner) of the feasible
no N

region. If two corner points produce the same maximum (or minimum) value
of the objective function, then every point on the line segment joining these
points will also give the same maximum (or minimum) value.
©

EXERCISE 12.1
Solve the following Linear Programming Problems graphically:
1. Maximise Z = 3x + 4y
subject to the constraints : x + y ≤ 4, x ≥ 0, y ≥ 0.
514 MATHEMATICS

2. Minimise Z = – 3x + 4 y
subject to x + 2y ≤ 8, 3x + 2y ≤ 12, x ≥ 0, y ≥ 0.
3. Maximise Z = 5x + 3y
subject to 3x + 5y ≤ 15, 5x + 2y ≤ 10, x ≥ 0, y ≥ 0.
4. Minimise Z = 3x + 5y

he
such that x + 3y ≥ 3, x + y ≥ 2, x, y ≥ 0.
5. Maximise Z = 3x + 2y
subject to x + 2y ≤ 10, 3x + y ≤ 15, x, y ≥ 0.

is
6. Minimise Z = x + 2y
subject to 2x + y ≥ 3, x + 2y ≥ 6, x, y ≥ 0.
Show that the minimum of Z occurs at more than two points.

bl
7. Minimise and Maximise Z = 5x + 10 y
subject to x + 2y ≤ 120, x + y ≥ 60, x – 2y ≥ 0, x, y ≥ 0.
pu
8. Minimise and Maximise Z = x + 2y
subject to x + 2y ≥ 100, 2x – y ≤ 0, 2x + y ≤ 200; x, y ≥ 0.
be T

9. Maximise Z = – x + 2y, subject to the constraints:


x ≥ 3, x + y ≥ 5, x + 2y ≥ 6, y ≥ 0.
re
o R

10. Maximise Z = x + y, subject to x – y ≤ –1, –x + y ≤ 0, x, y ≥ 0.

12.3 Different Types of Linear Programming Problems


tt E

A few important linear programming problems are listed below:


1. Manufacturing problems In these problems, we determine the number of units
C

of different products which should be produced and sold by a firm


when each product requires a fixed manpower, machine hours, labour hour per
no N

unit of product, warehouse space per unit of the output etc., in order to make
maximum profit.
2. Diet problems In these problems, we determine the amount of different kinds
of constituents/nutrients which should be included in a diet so as to minimise the
©

cost of the desired diet such that it contains a certain minimum amount of each
constituent/nutrients.
3. Transportation problems In these problems, we determine a transportation
schedule in order to find the cheapest way of transporting a product from
plants/factories situated at different locations to different markets.
LINEAR PROGRAMMING 515

Let us now solve some of these types of linear programming problems:


Example 6 (Diet problem): A dietician wishes to mix two types of foods in such a
way that vitamin contents of the mixture contain atleast 8 units of vitamin A and 10
units of vitamin C. Food ‘I’ contains 2 units/kg of vitamin A and 1 unit/kg of vitamin C.
Food ‘II’ contains 1 unit/kg of vitamin A and 2 units/kg of vitamin C. It costs

he
Rs 50 per kg to purchase Food ‘I’ and Rs 70 per kg to purchase Food ‘II’. Formulate
this problem as a linear programming problem to minimise the cost of such a mixture.
Solution Let the mixture contain x kg of Food ‘I’ and y kg of Food ‘II’. Clearly, x ≥ 0,
y ≥ 0. We make the following table from the given data:

is
Resources Food Requirement
I II

bl
(x) (y)
Vitamin A 2 1 8
(units/kg)
pu Vitamin C 1 2 10
(units/kg)
be T

Cost (Rs/kg) 50 70
re
Since the mixture must contain at least 8 units of vitamin A and 10 units of
o R

vitamin C, we have the constraints:


2x + y ≥ 8
tt E

x + 2y ≥ 10
Total cost Z of purchasing x kg of food ‘I’ and y kg of Food ‘II’ is
C

Z = 50x + 70y
Hence, the mathematical formulation of the problem is:
no N

Minimise Z = 50x + 70y ... (1)


subject to the constraints:
2x + y ≥ 8 ... (2)
©

x + 2y ≥ 10 ... (3)
x, y ≥ 0 ... (4)
Let us graph the inequalities (2) to (4). The feasible region determined by the
system is shown in the Fig 12.7. Here again, observe that the feasible region is
unbounded.
516 MATHEMATICS

Let us evaluate Z at the corner points A(0,8), B(2,4) and C(10,0).

he
Corner Point Z = 50x + 70y
(0,8) 560
(2,4) 380 ← Minimum
(10,0) 500

is
bl
pu Fig 12.7
In the table, we find that smallest value of Z is 380 at the point (2,4). Can we say
that the minimum value of Z is 380? Remember that the feasible region is unbounded.
Therefore, we have to draw the graph of the inequality
be T

50x + 70y < 380 i.e., 5x + 7y < 38


re
to check whether the resulting open half plane has any point common with the feasible
o R

region. From the Fig 12.7, we see that it has no points in common.
Thus, the minimum value of Z is 380 attained at the point (2, 4). Hence, the optimal
tt E

mixing strategy for the dietician would be to mix 2 kg of Food ‘I’ and 4 kg of Food ‘II’,
and with this strategy, the minimum cost of the mixture will be Rs 380.
C

Example 7 (Allocation problem) A cooperative society of farmers has 50 hectare


of land to grow two crops X and Y. The profit from crops X and Y per hectare are
estimated as Rs 10,500 and Rs 9,000 respectively. To control weeds, a liquid herbicide
no N

has to be used for crops X and Y at rates of 20 litres and 10 litres per hectare. Further,
no more than 800 litres of herbicide should be used in order to protect fish and wild life
using a pond which collects drainage from this land. How much land should be allocated
to each crop so as to maximise the total profit of the society?
©

Solution Let x hectare of land be allocated to crop X and y hectare to crop Y. Obviously,
x ≥ 0, y ≥ 0.
Profit per hectare on crop X = Rs 10500
Profit per hectare on crop Y = Rs 9000
Therefore, total profit = Rs (10500x + 9000y)
LINEAR PROGRAMMING 517

The mathematical formulation of the problem is as follows:


Maximise Z = 10500 x + 9000 y
subject to the constraints:
x + y ≤ 50 (constraint related to land) ... (1)
20x + 10y ≤ 800 (constraint related to use of herbicide)

he
i.e. 2x + y ≤ 80 ... (2)
x ≥ 0, y ≥ 0 (non negative constraint) ... (3)
Let us draw the graph of the system of inequalities (1) to (3). The feasible region
OABC is shown (shaded) in the Fig 12.8. Observe that the feasible region is bounded.

is
The coordinates of the corner points O, A, B and C are (0, 0), (40, 0), (30, 20) and
(0, 50) respectively. Let us evaluate the objective function Z = 10500 x + 9000y at

bl
these vertices to find which one gives the maximum profit.

Corner Point Z = 10500x + 9000y


pu O (0, 0) 0
A ( 40, 0) 420000
be T

B (30, 20) 495000 ← Maximum


re
C (0,50) 450000
o R
tt E
C

Fig 12.8
Hence, the society will get the maximum profit of Rs 4,95,000 by allocating 30
hectares for crop X and 20 hectares for crop Y.
no N

Example 8 (Manufacturing problem) A manufacturing company makes two models


A and B of a product. Each piece of Model A requires 9 labour hours for fabricating
and 1 labour hour for finishing. Each piece of Model B requires 12 labour hours for
©

fabricating and 3 labour hours for finishing. For fabricating and finishing, the maximum
labour hours available are 180 and 30 respectively. The company makes a profit of
Rs 8000 on each piece of model A and Rs 12000 on each piece of Model B. How many
pieces of Model A and Model B should be manufactured per week to realise a maximum
profit? What is the maximum profit per week?
518 MATHEMATICS

Solution Suppose x is the number of pieces of Model A and y is the number of pieces
of Model B. Then
Total profit (in Rs) = 8000 x + 12000 y
Let Z = 8000 x + 12000 y
We now have the following mathematical model for the given problem.

he
Maximise Z = 8000 x + 12000 y ... (1)
subject to the constraints:
9x + 12y ≤ 180 (Fabricating constraint)
i.e. 3x + 4y ≤ 60 ... (2)

is
x + 3y ≤ 30 (Finishing constraint) ... (3)
x ≥ 0, y ≥ 0 (non-negative constraint) ... (4)

bl
The feasible region (shaded) OABC determined by the linear inequalities (2) to (4)
is shown in the Fig 12.9. Note that the feasible region is bounded.
pu
be T
re
o R
tt E

Fig 12.9
C

Let us evaluate the objective function Z at each corner point as shown below:
Corner Point Z = 8000 x + 12000 y
no N

0 (0, 0) 0
A (20, 0) 160000

©

B (12, 6) 168000 Maximum


C (0, 10) 120000
We find that maximum value of Z is 1,68,000 at B (12, 6). Hence, the company
should produce 12 pieces of Model A and 6 pieces of Model B to realise maximum
profit and maximum profit then will be Rs 1,68,000.
LINEAR PROGRAMMING 519

EXERCISE 12.2
1. Reshma wishes to mix two types of food P and Q in such a way that the vitamin
contents of the mixture contain at least 8 units of vitamin A and 11 units of
vitamin B. Food P costs Rs 60/kg and Food Q costs Rs 80/kg. Food P contains
3 units/kg of Vitamin A and 5 units / kg of Vitamin B while food Q contains
4 units/kg of Vitamin A and 2 units/kg of vitamin B. Determine the minimum cost

he
of the mixture.
2. One kind of cake requires 200g of flour and 25g of fat, and another kind of cake
requires 100g of flour and 50g of fat. Find the maximum number of cakes which

is
can be made from 5kg of flour and 1 kg of fat assuming that there is no shortage
of the other ingredients used in making the cakes.
3. A factory makes tennis rackets and cricket bats. A tennis racket takes 1.5 hours

bl
of machine time and 3 hours of craftman’s time in its making while a cricket bat
takes 3 hour of machine time and 1 hour of craftman’s time. In a day, the factory
has the availability of not more than 42 hours of machine time and 24 hours of
pu
craftsman’s time.
(i) What number of rackets and bats must be made if the factory is to work
be T

at full capacity?
(ii) If the profit on a racket and on a bat is Rs 20 and Rs 10 respectively, find
re
o R

the maximum profit of the factory when it works at full capacity.


4. A manufacturer produces nuts and bolts. It takes 1 hour of work on machine A
and 3 hours on machine B to produce a package of nuts. It takes 3 hours on
tt E

machine A and 1 hour on machine B to produce a package of bolts. He earns a


profit of Rs17.50 per package on nuts and Rs 7.00 per package on bolts. How
C

many packages of each should be produced each day so as to maximise his


profit, if he operates his machines for at the most 12 hours a day?
5. A factory manufactures two types of screws, A and B. Each type of screw
no N

requires the use of two machines, an automatic and a hand operated. It takes
4 minutes on the automatic and 6 minutes on hand operated machines to
manufacture a package of screws A, while it takes 6 minutes on automatic and
3 minutes on the hand operated machines to manufacture a package of screws
©

B. Each machine is available for at the most 4 hours on any day. The manufacturer
can sell a package of screws A at a profit of Rs 7 and screws B at a profit of
Rs 10. Assuming that he can sell all the screws he manufactures, how many
packages of each type should the factory owner produce in a day in order to
maximise his profit? Determine the maximum profit.
520 MATHEMATICS

6. A cottage industry manufactures pedestal lamps and wooden shades, each


requiring the use of a grinding/cutting machine and a sprayer. It takes 2 hours on
grinding/cutting machine and 3 hours on the sprayer to manufacture a pedestal
lamp. It takes 1 hour on the grinding/cutting machine and 2 hours on the sprayer
to manufacture a shade. On any day, the sprayer is available for at the most 20
hours and the grinding/cutting machine for at the most 12 hours. The profit from
the sale of a lamp is Rs 5 and that from a shade is Rs 3. Assuming that the

he
manufacturer can sell all the lamps and shades that he produces, how should he
schedule his daily production in order to maximise his profit?
7. A company manufactures two types of novelty souvenirs made of plywood.
Souvenirs of type A require 5 minutes each for cutting and 10 minutes each for

is
assembling. Souvenirs of type B require 8 minutes each for cutting and 8 minutes
each for assembling. There are 3 hours 20 minutes available for cutting and 4
hours for assembling. The profit is Rs 5 each for type A and Rs 6 each for type

bl
B souvenirs. How many souvenirs of each type should the company manufacture
in order to maximise the profit?
8. A merchant plans to sell two types of personal computers – a desktop model and
a portable model that will cost Rs 25000 and Rs 40000 respectively. He estimates
pu
that the total monthly demand of computers will not exceed 250 units. Determine
the number of units of each type of computers which the merchant should stock
be T

to get maximum profit if he does not want to invest more than Rs 70 lakhs and if
his profit on the desktop model is Rs 4500 and on portable model is Rs 5000.
re
9. A diet is to contain at least 80 units of vitamin A and 100 units of minerals. Two
o R

foods F1 and F2 are available. Food F1 costs Rs 4 per unit food and F2 costs
Rs 6 per unit. One unit of food F1 contains 3 units of vitamin A and 4 units of
minerals. One unit of food F2 contains 6 units of vitamin A and 3 units of minerals.
tt E

Formulate this as a linear programming problem. Find the minimum cost for diet
that consists of mixture of these two foods and also meets the minimal nutritional
requirements.
C

10. There are two types of fertilisers F1 and F2. F1 consists of 10% nitrogen and 6%
phosphoric acid and F2 consists of 5% nitrogen and 10% phosphoric acid. After
testing the soil conditions, a farmer finds that she needs atleast 14 kg of nitrogen
no N

and 14 kg of phosphoric acid for her crop. If F1 costs Rs 6/kg and F2 costs
Rs 5/kg, determine how much of each type of fertiliser should be used so that
nutrient requirements are met at a minimum cost. What is the minimum cost?
11. The corner points of the feasible region determined by the following system of
©

linear inequalities:
2x + y ≤ 10, x + 3y ≤ 15, x, y ≥ 0 are (0, 0), (5, 0), (3, 4) and (0, 5). Let
Z = px + qy, where p, q > 0. Condition on p and q so that the maximum of Z
occurs at both (3, 4) and (0, 5) is
(A) p = q (B) p = 2q (C) p = 3q (D) q = 3p
LINEAR PROGRAMMING 521

Miscellaneous Examples
Example 9 (Diet problem) A dietician has to develop a special diet using two foods
P and Q. Each packet (containing 30 g) of food P contains 12 units of calcium, 4 units
of iron, 6 units of cholesterol and 6 units of vitamin A. Each packet of the same quantity
of food Q contains 3 units of calcium, 20 units of iron, 4 units of cholesterol and 3 units
of vitamin A. The diet requires atleast 240 units of calcium, atleast 460 units of iron and

he
at most 300 units of cholesterol. How many packets of each food should be used to
minimise the amount of vitamin A in the diet? What is the minimum amount of vitamin A?
Solution Let x and y be the number of packets of food P and Q respectively. Obviously

is
x ≥ 0, y ≥ 0. Mathematical formulation of the given problem is as follows:
Minimise Z = 6x + 3y (vitamin A)

bl
subject to the constraints
12x + 3y ≥ 240 (constraint on calcium), i.e. 4x + y ≥ 80 ... (1)
4x + 20y ≥ 460 (constraint on iron), i.e. x + 5y ≥ 115
pu ... (2)
6x + 4y ≤ 300 (constraint on cholesterol), i.e. 3x + 2y ≤ 150 ... (3)
be T

x ≥ 0, y ≥ 0 ... (4)
re
Let us graph the inequalities (1) to (4).
o R

The feasible region (shaded) determined by the constraints (1) to (4) is shown in
Fig 12.10 and note that it is bounded.
tt E
C
no N
©

Fig 12.10
522 MATHEMATICS

The coordinates of the corner points L, M and N are (2, 72), (15, 20) and (40, 15)
respectively. Let us evaluate Z at these points:

Corner Point Z=6x+3y


(2, 72) 228
(15, 20) 150 ← Minimum

he
(40, 15) 285

From the table, we find that Z is minimum at the point (15, 20). Hence, the amount
of vitamin A under the constraints given in the problem will be minimum, if 15 packets

is
of food P and 20 packets of food Q are used in the special diet. The minimum amount
of vitamin A will be 150 units.

bl
Example 10 (Manufacturing problem) A manufacturer has three machines I, II
and III installed in his factory. Machines I and II are capable of being operated for
at most 12 hours whereas machine III must be operated for atleast 5 hours a day. She
produces only two items M and N each requiring the use of all the three machines.
pu
The number of hours required for producing 1 unit of each of M and N on the three
machines are given in the following table:
be T

Items Number of hours required on machines


re
I II III
o R

M 1 2 1
N 2 1 1.25
tt E

She makes a profit of Rs 600 and Rs 400 on items M and N respectively. How many
of each item should she produce so as to maximise her profit assuming that she can sell
C

all the items that she produced? What will be the maximum profit?
Solution Let x and y be the number of items M and N respectively.
Total profit on the production = Rs (600 x + 400 y)
no N

Mathematical formulation of the given problem is as follows:


Maximise Z = 600 x + 400 y
subject to the constraints:
©

x + 2y ≤ 12 (constraint on Machine I) ... (1)


2x + y ≤ 12 (constraint on Machine II) ... (2)
5
x+ y ≥ 5 (constraint on Machine III) ... (3)
4
x ≥ 0, y ≥ 0 ... (4)
LINEAR PROGRAMMING 523

Let us draw the graph of constraints (1) to (4). ABCDE is the feasible region
(shaded) as shown in Fig 12.11 determined by the constraints (1) to (4). Observe that
the feasible region is bounded, coordinates of the corner points A, B, C, D and E are
(5, 0) (6, 0), (4, 4), (0, 6) and (0, 4) respectively.

he
is
bl
pu
Fig 12.11
be T

Let us evaluate Z = 600 x + 400 y at these corner points.


re
o R

Corner point Z = 600 x + 400 y


(5, 0) 3000
tt E

(6, 0) 3600
(4, 4) 4000 ← Maximum
C

(0, 6) 2400
no N

(0, 4) 1600

We see that the point (4, 4) is giving the maximum value of Z. Hence, the
manufacturer has to produce 4 units of each item to get the maximum profit of Rs 4000.
©

Example 11 (Transportation problem) There are two factories located one at


place P and the other at place Q. From these locations, a certain commodity is to be
delivered to each of the three depots situated at A, B and C. The weekly requirements
of the depots are respectively 5, 5 and 4 units of the commodity while the production
capacity of the factories at P and Q are respectively 8 and 6 units. The cost of
524 MATHEMATICS

transportation per unit is given below:


From/To Cost (in Rs)
A B C
P 160 100 150
Q 100 120 100

he
How many units should be transported from each factory to each depot in order that
the transportation cost is minimum. What will be the minimum transportation cost?
Solution The problem can be explained diagrammatically as follows (Fig 12.12):

is
Let x units and y units of the commodity be transported from the factory at P to
the depots at A and B respectively. Then (8 – x – y) units will be transported to depot
at C (Why?) Factory

bl
P
8 units

8
R

pu x
16
0
y
s1

–x
50

–y
s Rs 100
R
be T

A B C
Depot 5 units 5 units Depot 4 units Depot
re
o R

]
y)
0
10

Rs 100 –
(5
Rs

5–x
5 – y Rs 120 +
x)
tt E


Q [(5

6 units 6
C

Factory
Fig 12.12
no N

Hence, we have x ≥ 0, y ≥ 0 and 8 – x – y ≥ 0


i.e. x ≥ 0, y ≥ 0 and x + y ≤ 8
Now, the weekly requirement of the depot at A is 5 units of the commodity. Since
©

x units are transported from the factory at P, the remaining (5 – x) units need to be
transported from the factory at Q. Obviously, 5 – x ≥ 0, i.e. x ≤ 5.
Similarly, (5 – y) and 6 – (5 – x + 5 – y) = x + y – 4 units are to be transported from
the factory at Q to the depots at B and C respectively.
Thus, 5 – y ≥ 0 , x + y – 4 ≥0
i.e. y≤5 , x+y≥ 4
LINEAR PROGRAMMING 525

Total transportation cost Z is given by


Z = 160 x + 100 y + 100 ( 5 – x) + 120 (5 – y) + 100 (x + y – 4) + 150 (8 – x – y)
= 10 (x – 7 y + 190)
Therefore, the problem reduces to
Minimise Z = 10 (x – 7y + 190)
subject to the constraints:

he
x ≥ 0, y ≥ 0 ... (1)
x+y≤8 ... (2)
x≤5 ... (3)
y≤5

is
... (4)
and x+y≥4 ... (5)
The shaded region ABCDEF

bl
represented by the constraints (1) to
(5) is the feasible region (Fig 12.13). Fig 12.13
Observe that the feasible region is bounded. The coordinates of the corner points
pu
of the feasible region are (0, 4), (0, 5), (3, 5), (5, 3), (5, 0) and (4, 0).
Let us evaluate Z at these points.
be T

Corner Point Z = 10 (x – 7 y + 190)


(0, 4) 1620
re
o R

(0, 5) 1550 ← Minimum


(3, 5) 1580
tt E

(5, 3) 1740
(5, 0) 1950
C

(4, 0) 1940
From the table, we see that the minimum value of Z is 1550 at the point (0, 5).
no N

Hence, the optimal transportation strategy will be to deliver 0, 5 and 3 units from
the factory at P and 5, 0 and 1 units from the factory at Q to the depots at A, B and C
respectively. Corresponding to this strategy, the transportation cost would be minimum,
i.e., Rs 1550.
©

Miscellaneous Exercise on Chapter 12


1. Refer to Example 9. How many packets of each food should be used to maximise
the amount of vitamin A in the diet? What is the maximum amount of vitamin A
in the diet?
526 MATHEMATICS

2. A farmer mixes two brands P and Q of cattle feed. Brand P, costing Rs 250 per
bag, contains 3 units of nutritional element A, 2.5 units of element B and 2 units
of element C. Brand Q costing Rs 200 per bag contains 1.5 units of nutritional
element A, 11.25 units of element B, and 3 units of element C. The minimum
requirements of nutrients A, B and C are 18 units, 45 units and 24 units respectively.
Determine the number of bags of each brand which should be mixed in order to
produce a mixture having a minimum cost per bag? What is the minimum cost of

he
the mixture per bag?
3. A dietician wishes to mix together two kinds of food X and Y in such a way that
the mixture contains at least 10 units of vitamin A, 12 units of vitamin B and
8 units of vitamin C. The vitamin contents of one kg food is given below:

is
Food Vitamin A Vitamin B Vitamin C

bl
X 1 2 3
Y 2 2 1
One kg of food X costs Rs 16 and one kg of food Y costs Rs 20. Find the least
pu
cost of the mixture which will produce the required diet?
4. A manufacturer makes two types of toys A and B. Three machines are needed
be T

for this purpose and the time (in minutes) required for each toy on the machines
re
is given below:
o R

Types of Toys Machines


I II III
tt E

A 12 18 6
B 6 0 9
C

Each machine is available for a maximum of 6 hours per day. If the profit on
each toy of type A is Rs 7.50 and that on each toy of type B is Rs 5, show that 15
no N

toys of type A and 30 of type B should be manufactured in a day to get maximum


profit.
5. An aeroplane can carry a maximum of 200 passengers. A profit of Rs 1000 is
©

made on each executive class ticket and a profit of Rs 600 is made on each
economy class ticket. The airline reserves at least 20 seats for executive class.
However, at least 4 times as many passengers prefer to travel by economy class
than by the executive class. Determine how many tickets of each type must be
sold in order to maximise the profit for the airline. What is the maximum profit?
LINEAR PROGRAMMING 527

6. Two godowns A and B have grain capacity of 100 quintals and 50 quintals
respectively. They supply to 3 ration shops, D, E and F whose requirements are
60, 50 and 40 quintals respectively. The cost of transportation per quintal from
the godowns to the shops are given in the following table:
Transportation cost per quintal (in Rs)

he
From/To A B
D 6 4
E 3 2

is
F 2.50 3
How should the supplies be transported in order that the transportation cost is

bl
minimum? What is the minimum cost?
7. An oil company has two depots A and B with capacities of 7000 L and 4000 L
respectively. The company is to supply oil to three petrol pumps, D, E and F
pu
whose requirements are 4500L, 3000L and 3500L respectively. The distances
(in km) between the depots and the petrol pumps is given in the following table:
be T

Distance in (km.)
re
From / To A B
o R

D 7 3
E 6 4
tt E

F 3 2
C

Assuming that the transportation cost of 10 litres of oil is Re 1 per km, how
should the delivery be scheduled in order that the transportation cost is minimum?
What is the minimum cost?
no N

8. A fruit grower can use two types of fertilizer in his garden, brand P and brand Q.
The amounts (in kg) of nitrogen, phosphoric acid, potash, and chlorine in a bag of
each brand are given in the table. Tests indicate that the garden needs at least
©

240 kg of phosphoric acid, at least 270 kg of potash and at most 310 kg of


chlorine.
If the grower wants to minimise the amount of nitrogen added to the garden,
how many bags of each brand should be used? What is the minimum amount of
nitrogen added in the garden?
528 MATHEMATICS

kg per bag
Brand P Brand Q
Nitrogen 3 3.5
Phosphoric acid 1 2
Potash 3 1.5

he
Chlorine 1.5 2
9. Refer to Question 8. If the grower wants to maximise the amount of nitrogen
added to the garden, how many bags of each brand should be added? What is

is
the maximum amount of nitrogen added?
10. A toy company manufactures two types of dolls, A and B. Market tests and available

bl
resources have indicated that the combined production level should not exceed 1200
dolls per week and the demand for dolls of type B is at most half of that for dolls of
type A. Further, the production level of dolls of type A can exceed three times the
pu
production of dolls of other type by at most 600 units. If the company makes profit of
Rs 12 and Rs 16 per doll respectively on dolls A and B, how many of each should be
produced weekly in order to maximise the profit?
be T
re
Summary
o R

 A linear programming problem is one that is concerned with finding the optimal
value (maximum or minimum) of a linear function of several variables (called
tt E

objective function) subject to the conditions that the variables are


non-negative and satisfy a set of linear inequalities (called linear constraints).
Variables are sometimes called decision variables and are non-negative.
C

 A few important linear programming problems are:


(i) Diet problems
no N

(ii) Manufacturing problems


(iii) Transportation problems
 The common region determined by all the constraints including the non-negative
©

constraints x ≥ 0, y ≥ 0 of a linear programming problem is called the feasible


region (or solution region) for the problem.
 Points within and on the boundary of the feasible region represent feasible
solutions of the constraints.
Any point outside the feasible region is an infeasible solution.
LINEAR PROGRAMMING 529

 Any point in the feasible region that gives the optimal value (maximum or
minimum) of the objective function is called an optimal solution.

 The following Theorems are fundamental in solving linear programming


problems:
Theorem 1 Let R be the feasible region (convex polygon) for a linear

he
programming problem and let Z = ax + by be the objective function. When Z
has an optimal value (maximum or minimum), where the variables x and y
are subject to constraints described by linear inequalities, this optimal value
must occur at a corner point (vertex) of the feasible region.

is
Theorem 2 Let R be the feasible region for a linear programming problem,
and let Z = ax + by be the objective function. If R is bounded, then the
objective function Z has both a maximum and a minimum value on R and

bl
each of these occurs at a corner point (vertex) of R.

 If the feasible region is unbounded, then a maximum or a minimum may not


pu
exist. However, if it exists, it must occur at a corner point of R.

 Corner point method for solving a linear programming problem. The method
be T

comprises of the following steps:


(i) Find the feasible region of the linear programming problem and determine
re
o R

its corner points (vertices).


(ii) Evaluate the objective function Z = ax + by at each corner point. Let M
and m respectively be the largest and smallest values at these points.
tt E

(iii) If the feasible region is bounded, M and m respectively are the maximum
and minimum values of the objective function.
C

If the feasible region is unbounded, then


(i) M is the maximum value of the objective function, if the open half plane
determined by ax + by > M has no point in common with the feasible
no N

region. Otherwise, the objective function has no maximum value.


(ii) m is the minimum value of the objective function, if the open half plane
determined by ax + by < m has no point in common with the feasible
©

region. Otherwise, the objective function has no minimum value.

 If two corner points of the feasible region are both optimal solutions of the
same type, i.e., both produce the same maximum or minimum, then any point
on the line segment joining these two points is also an optimal solution of the
same type.
530 MATHEMATICS

Historical Note
In the World War II, when the war operations had to be planned to economise
expenditure, maximise damage to the enemy, linear programming problems
came to the forefront.
The first problem in linear programming was formulated in 1941 by the Russian

he
mathematician, L. Kantorovich and the American economist, F. L. Hitchcock,
both of whom worked at it independently of each other. This was the well
known transportation problem. In 1945, an English economist, G.Stigler,
described yet another linear programming problem – that of determining an

is
optimal diet.
In 1947, the American economist, G. B. Dantzig suggested an efficient method
known as the simplex method which is an iterative procedure to solve any

bl
linear programming problem in a finite number of steps.
L. Katorovich and American mathematical economist, T. C. Koopmans were
pu
awarded the nobel prize in the year 1975 in economics for their pioneering
work in linear programming. With the advent of computers and the necessary
softwares, it has become possible to apply linear programming model to
be T

increasingly complex problems in many areas.


re
o R

—™—
tt E
C
no N
©
PROBABILITY 531

Chapter 13
PROBABILITY

he
™ The theory of probabilities is simply the Science of logic
quantitatively treated. – C.S. PEIRCE ™

is
13.1 Introduction
In earlier Classes, we have studied the probability as a
measure of uncertainty of events in a random experiment.

bl
We discussed the axiomatic approach formulated by
Russian Mathematician, A.N. Kolmogorov (1903-1987)
and treated probability as a function of outcomes of the
pu
experiment. We have also established equivalence between
the axiomatic theory and the classical theory of probability
in case of equally likely outcomes. On the basis of this
be T

relationship, we obtained probabilities of events associated


with discrete sample spaces. We have also studied the
re
addition rule of probability. In this chapter, we shall discuss
o R

the important concept of conditional probability of an event


given that another event has occurred, which will be helpful
in understanding the Bayes' theorem, multiplication rule of
tt E

Pierre de Fermat
probability and independence of events. We shall also learn (1601-1665)
an important concept of random variable and its probability
distribution and also the mean and variance of a probability distribution. In the last
C

section of the chapter, we shall study an important discrete probability distribution


called Binomial distribution. Throughout this chapter, we shall take up the experiments
having equally likely outcomes, unless stated otherwise.
no N

13.2 Conditional Probability


Uptill now in probability, we have discussed the methods of finding the probability of
events. If we have two events from the same sample space, does the information
about the occurrence of one of the events affect the probability of the other event? Let
©

us try to answer this question by taking up a random experiment in which the outcomes
are equally likely to occur.
Consider the experiment of tossing three fair coins. The sample space of the
experiment is
S = {HHH, HHT, HTH, THH, HTT, THT, TTH, TTT}
532 MATHEMATICS

1
Since the coins are fair, we can assign the probability
to each sample point. Let
8
E be the event ‘at least two heads appear’ and F be the event ‘first coin shows tail’.
Then
E = {HHH, HHT, HTH, THH}
and F = {THH, THT, TTH, TTT}

he
Therefore P(E) = P ({HHH}) + P ({HHT}) + P ({HTH}) + P ({THH})
1 1 1 1 1
= + + + = (Why ?)
8 8 8 8 2

is
and P(F) = P ({THH}) + P ({THT}) + P ({TTH}) + P ({TTT})
1 1 1 1 1
= + + + =

bl
8 8 8 8 2
Also E ∩ F = {THH}
1
with
pu P(E ∩ F) = P({THH}) =
8
Now, suppose we are given that the first coin shows tail, i.e. F occurs, then what is
be T

the probability of occurrence of E? With the information of occurrence of F, we are


sure that the cases in which first coin does not result into a tail should not be considered
re
while finding the probability of E. This information reduces our sample space from the
o R

set S to its subset F for the event E. In other words, the additional information really
amounts to telling us that the situation may be considered as being that of a new
tt E

random experiment for which the sample space consists of all those outcomes only
which are favourable to the occurrence of the event F.
Now, the sample point of F which is favourable to event E is THH.
C

1
Thus, Probability of E considering F as the sample space = ,
4
no N

1
or Probability of E given that the event F has occurred =
4
This probability of the event E is called the conditional probability of E given
©

that F has already occurred, and is denoted by P (E|F).


1
Thus P(E|F) =
4
Note that the elements of F which favour the event E are the common elements of
E and F, i.e. the sample points of E ∩ F.
PROBABILITY 533

Thus, we can also write the conditional probability of E given that F has occurred as
Number of elementary events favourable to E ∩ F
P(E|F) =
Number of elementary events which arefavourable to F

n (E ∩ F)
=
n (F)

he
Dividing the numerator and the denominator by total number of elementary events
of the sample space, we see that P(E|F) can also be written as
n(E ∩ F)

is
n(S) P(E ∩ F)
P(E|F) = = ... (1)
n(F) P(F)

bl
n(S)
Note that (1) is valid only when P(F) ≠ 0 i.e., F ≠ φ (Why?)
Thus, we can define the conditional probability as follows :
pu
Definition 1 If E and F are two events associated with the same sample space of a
random experiment, the conditional probability of the event E given that F has occurred,
be T

i.e. P (E|F) is given by


re
P (E ∩ F)
o R

P(E|F) = provided P(F) ≠ 0


P (F)
tt E

13.2.1 Properties of conditional probability


Let E and F be events of a sample space S of an experiment, then we have
Property 1 P (S|F) = P(F|F) = 1
C

We know that
P (S ∩ F) P (F)
no N

P (S|F) = = =1
P (F) P (F)

P (F ∩ F) P (F)
Also P(F|F) = = =1
P (F) P (F)
©

Thus P(S|F) = P(F|F) = 1


Property 2 If A and B are any two events of a sample space S and F is an event
of S such that P(F) ≠ 0, then
P((A ∪ B)|F) = P (A|F) + P(B|F) – P ((A ∩ B)|F)
534 MATHEMATICS

In particular, if A and B are disjoint events, then


P((A∪B)|F) = P(A|F) + P(B|F)
We have
P[(A ∪ B) ∩ F]
P((A ∪ B)|F) =
P (F)

he
P[(A ∩ F) ∪ (B ∩ F)]
=
P (F)
(by distributive law of union of sets over intersection)
P (A ∩ F) + P (B ∩ F) – P (A ∩ B ∩ F)

is
=
P (F)

bl
P (A ∩ F) P (B ∩ F) P[(A ∩ B) ∩ F]
= + −
P(F) P(F) P(F)
= P (A|F) + P (B|F) – P ((A ∩ B)|F)
pu
When A and B are disjoint events, then
P ((A ∩ B)|F) = 0
be T

⇒ P ((A ∪ B)|F) = P (A|F) + P (B|F)


re
Property 3 P (E′|F) = 1 − P (E|F)
o R

From Property 1, we know that P (S|F) = 1


⇒ P (E ∪ E′|F) = 1 since S = E ∪ E′
tt E

⇒ P (E|F) + P (E′|F) = 1 since E and E′ are disjoint events


Thus, P (E′|F) = 1 − P (E|F)
C

Let us now take up some examples.

7 9 4
no N

Example 1 If P (A) = , P (B) = and P (A ∩ B) = , evaluate P (A|B).


13 13 13

4
P (A ∩ B) 13 4
©

Solution We have P (A|B) = = =


P ( B) 9 9
13

Example 2 A family has two children. What is the probability that both the children are
boys given that at least one of them is a boy ?
PROBABILITY 535

Solution Let b stand for boy and g for girl. The sample space of the experiment is
S = {(b, b), (g, b), (b, g), (g, g)}
Let E and F denote the following events :
E : ‘both the children are boys’
F : ‘at least one of the child is a boy’
Then E = {(b,b)} and F = {(b,b), (g,b), (b,g)}

he
Now E ∩ F = {(b,b)}
3 1
Thus P (F) = and P (E ∩ F )=
4 4

is
1
P (E ∩ F) 4 1
Therefore P (E|F) = = =

bl
P ( F) 3 3
pu 4
Example 3 Ten cards numbered 1 to 10 are placed in a box, mixed up thoroughly and
then one card is drawn randomly. If it is known that the number on the drawn card is
more than 3, what is the probability that it is an even number?
be T

Solution Let A be the event ‘the number on the card drawn is even’ and B be the
re
event ‘the number on the card drawn is greater than 3’. We have to find P(A|B).
o R

Now, the sample space of the experiment is S = {1, 2, 3, 4, 5, 6, 7, 8, 9, 10}


Then A = {2, 4, 6, 8, 10}, B = {4, 5, 6, 7, 8, 9, 10}
tt E

and A ∩ B = {4, 6, 8, 10}


5 7 4
Also P(A) = , P ( B) = and P (A ∩ B) =
C

10 10 10

4
no N

P (A ∩ B) 10 4
Then P(A|B) = = =
P ( B) 7 7
10
©

Example 4 In a school, there are 1000 students, out of which 430 are girls. It is known
that out of 430, 10% of the girls study in class XII. What is the probability that a student
chosen randomly studies in Class XII given that the chosen student is a girl?
Solution Let E denote the event that a student chosen randomly studies in Class XII
and F be the event that the randomly chosen student is a girl. We have to find P (E|F).
536 MATHEMATICS

430 43
Now P(F) = = 0.43 and P ( E F) = 0.043 (Why?)
1000 1000
P (E ∩ F) 0.043
Then P(E|F) = = = 0.1
P ( F) 0.43
Example 5 A die is thrown three times. Events A and B are defined as below:

he
A : 4 on the third throw
B : 6 on the first and 5 on the second throw
Find the probability of A given that B has already occurred.

is
Solution The sample space has 216 outcomes.

⎧(1,1,4) (1,2,4) ... (1,6,4) (2,1,4) (2,2,4) ... (2,6,4) ⎫

bl
⎪ ⎪
Now A = ⎨(3,1,4) (3,2,4) ... (3,6,4) (4,1,4) (4,2,4) ...(4,6,4) ⎬
⎪(5,1,4) (5,2,4) ... (5,6,4) (6,1,4) (6,2,4) ...(6,6,4) ⎪
⎩ ⎭
pu B = {(6,5,1), (6,5,2), (6,5,3), (6,5,4), (6,5,5), (6,5,6)}
and A ∩ B = {(6,5,4)}.
be T

6 1
Now P (B) = and P (A ∩ B) =
216 216
re
o R

1
P (A ∩ B) 216 1
Then P(A|B) = = =
tt E

P (B) 6 6
216
C

Example 6 A die is thrown twice and the sum of the numbers appearing is observed
to be 6. What is the conditional probability that the number 4 has appeared at least
once?
no N

Solution Let E be the event that ‘number 4 appears at least once’ and F be the event
that ‘the sum of the numbers appearing is 6’.
Then, E = {(4,1), (4,2), (4,3), (4,4), (4,5), (4,6), (1,4), (2,4), (3,4), (5,4), (6,4)}
©

and F = {(1,5), (2,4), (3,3), (4,2), (5,1)}


11 5
We have P(E) = and P (F) =
36 36
Also E ∩ F = {(2,4), (4,2)}
PROBABILITY 537

2
Therefore P(E ∩ F) =
36
Hence, the required probability

2
P (E ∩ F) 36 2
P (E|F) = = =

he
P (F) 5 5
36
For the conditional probability discussed above, we have considered the elemen-
tary events of the experiment to be equally likely and the corresponding definition of

is
the probability of an event was used. However, the same definition can also be used in
the general case where the elementary events of the sample space are not equally
likely, the probabilities P (E ∩ F) and P (F) being calculated accordingly. Let us take up

bl
the following example.
Example 7 Consider the experiment of tossing a coin. If the coin shows head, toss it
pu
again but if it shows tail, then throw a die. Find the
conditional probability of the event that ‘the die shows
be T

a number greater than 4’ given that ‘there is at least


one tail’.
re
o R

Solution The outcomes of the experiment can be


represented in following diagrammatic manner called
the ‘tree diagram’.
tt E

The sample space of the experiment may be


Fig 13.1
described as
C

S = {(H,H), (H,T), (T,1), (T,2), (T,3), (T,4), (T,5), (T,6)}


where (H, H) denotes that both the tosses result into
no N

head and (T, i) denote the first toss result into a tail and
the number i appeared on the die for i = 1,2,3,4,5,6.
Thus, the probabilities assigned to the 8 elementary
events
©

(H, H), (H, T), (T, 1), (T, 2), (T, 3) (T, 4), (T, 5), (T, 6)
1 1 1 1 1 1 1 1
are , , , , , , , respectively which is
4 4 12 12 12 12 12 12
clear from the Fig 13.2. Fig 13.2
538 MATHEMATICS

Let F be the event that ‘there is at least one tail’ and E be the event ‘the die shows
a number greater than 4’. Then
F = {(H,T), (T,1), (T,2), (T,3), (T,4), (T,5), (T,6)}
E = {(T,5), (T,6)} and E ∩ F = {(T,5), (T,6)}
Now P (F) = P({(H,T)}) + P ({(T,1)}) + P ({(T,2)}) + P ({(T,3)})
+ P ({(T,4)}) + P({(T,5)}) + P({(T,6)})

he
1 1 1 1 1 1 1 3
=
4 12 12 12 12 12 12 4
1 1 1

is
and P (E ∩ F) = P ({(T,5)}) + P ({(T,6)}) =
12 12 6

bl
P (E ∩ F) 6 2
Hence P(E|F) = = =
P (F) 3 9
4
pu
EXERCISE 13.1
be T

1. Given that E and F are events such that P(E) = 0.6, P(F) = 0.3 and
re
P(E ∩ F) = 0.2, find P (E|F) and P (F|E)
o R

2. Compute P(A|B), if P(B) = 0.5 and P (A ∩ B) = 0.32


3. If P (A) = 0.8, P (B) = 0.5 and P (B|A) = 0.4, find
tt E

(i) P (A ∩ B) (ii) P(A|B) (iii) P(A ∪ B)


5 2
C

4. Evaluate P(A ∪ B), if 2P(A) = P(B) = and P(A|B) =


13 5
6 5 7
no N

5. If P(A) = , P(B) = and P(A ∪ B) , find


11 11 11
(i) P(A∩B) (ii) P(A|B) (iii) P(B|A)
Determine P(E|F) in Exercises 6 to 9.
©

6. A coin is tossed three times, where


(i) E : head on third toss , F : heads on first two tosses
(ii) E : at least two heads , F : at most two heads
(iii) E : at most two tails , F : at least one tail
PROBABILITY 539

7. Two coins are tossed once, where


(i) E : tail appears on one coin, F : one coin shows head
(ii) E : no tail appears, F : no head appears
8. A die is thrown three times,
E : 4 appears on the third toss, F : 6 and 5 appears respectively
on first two tosses

he
9. Mother, father and son line up at random for a family picture
E : son on one end, F : father in middle
10. A black and a red dice are rolled.
(a) Find the conditional probability of obtaining a sum greater than 9, given

is
that the black die resulted in a 5.
(b) Find the conditional probability of obtaining the sum 8, given that the red die

bl
resulted in a number less than 4.
11. A fair die is rolled. Consider events E = {1,3,5}, F = {2,3} and G = {2,3,4,5}
Find
pu (i) P (E|F) and P (F|E) (ii) P (E|G) and P (G|E)
(iii) P ((E ∪ F)|G) and P ((E ∩ F)|G)
be T

12. Assume that each born child is equally likely to be a boy or a girl. If a family has
two children, what is the conditional probability that both are girls given that
re
(i) the youngest is a girl, (ii) at least one is a girl?
o R

13. An instructor has a question bank consisting of 300 easy True / False questions,
200 difficult True / False questions, 500 easy multiple choice questions and 400
difficult multiple choice questions. If a question is selected at random from the
tt E

question bank, what is the probability that it will be an easy question given that it
is a multiple choice question?
C

14. Given that the two numbers appearing on throwing two dice are different. Find
the probability of the event ‘the sum of numbers on the dice is 4’.
15. Consider the experiment of throwing a die, if a multiple of 3 comes up, throw the
no N

die again and if any other number comes, toss a coin. Find the conditional probability
of the event ‘the coin shows a tail’, given that ‘at least one die shows a 3’.
In each of the Exercises 16 and 17 choose the correct answer:
©

1
16. If P (A) = , P (B) = 0, then P (A|B) is
2
1
(A) 0 (B)
2
(C) not defined (D) 1
540 MATHEMATICS

17. If A and B are events such that P(A|B) = P(B|A), then


(A) A ⊂ B but A ≠ B (B) A = B
(C) A ∩ B = φ (D) P(A) = P(B)
13.3 Multiplication Theorem on Probability
Let E and F be two events associated with a sample space S. Clearly, the set E ∩ F

he
denotes the event that both E and F have occurred. In other words, E ∩ F denotes the
simultaneous occurrence of the events E and F. The event E ∩ F is also written as EF.
Very often we need to find the probability of the event EF. For example, in the
experiment of drawing two cards one after the other, we may be interested in finding

is
the probability of the event ‘a king and a queen’. The probability of event EF is obtained
by using the conditional probability as obtained below :
We know that the conditional probability of event E given that F has occurred is

bl
denoted by P(E|F) and is given by
P (E ∩ F)
P(E|F) = , P (F) ≠ 0
P (F)
pu
From this result, we can write
P (E ∩ F) = P (F) . P (E|F) ... (1)
be T

Also, we know that


re
P (F ∩ E)
o R

P (F|E) = , P (E) ≠ 0
P (E)

P (E ∩ F)
tt E

or P (F|E) = (since E ∩ F = F ∩ E)
P (E)
Thus, P(E ∩ F) = P(E). P(F|E) .... (2)
C

Combining (1) and (2), we find that


P (E ∩ F) = P(E) P(F|E)
no N

= P(F) P(E|F) provided P(E) ≠ 0 and P(F) ≠ 0.


The above result is known as the multiplication rule of probability.
Let us now take up an example.
©

Example 8 An urn contains 10 black and 5 white balls. Two balls are drawn from the
urn one after the other without replacement. What is the probability that both drawn
balls are black?
Solution Let E and F denote respectively the events that first and second ball drawn
are black. We have to find P (E ∩ F) or P (EF).
PROBABILITY 541

10
Now P (E) = P (black ball in first draw) =
15
Also given that the first ball drawn is black, i.e., event E has occurred, now there
are 9 black balls and five white balls left in the urn. Therefore, the probability that the
second ball drawn is black, given that the ball in the first draw is black, is nothing but
the conditional probability of F given that E has occurred.

he
9
i.e. P(F|E) =
14
By multiplication rule of probability, we have

is
P (E ∩ F) = P (E) P (F|E)
10 9 3

bl
=
15 14 7
Multiplication rule of probability for more than two events If E, F and G are
three events of sample space, we have
pu
P (E ∩ F ∩ G) = P (E) P (F|E) P (G|(E ∩ F)) = P (E) P (F|E) P (G|EF)
Similarly, the multiplication rule of probability can be extended for four or
be T

more events.
re
The following example illustrates the extension of multiplication rule of probability
o R

for three events.


Example 9 Three cards are drawn successively, without replacement from a pack of
tt E

52 well shuffled cards. What is the probability that first two cards are kings and the
third card drawn is an ace?
C

Solution Let K denote the event that the card drawn is king and A be the event that
the card drawn is an ace. Clearly, we have to find P (KKA)
4
no N

Now P(K) =
52
Also, P (K|K) is the probability of second king with the condition that one king has
already been drawn. Now there are three kings in (52 − 1) = 51 cards.
©

3
Therefore P(K|K) =
51
Lastly, P(A|KK) is the probability of third drawn card to be an ace, with the condition
that two kings have already been drawn. Now there are four aces in left 50 cards.
542 MATHEMATICS

4
Therefore P (A|KK) =
50
By multiplication law of probability, we have
P (KKA) = P(K) P(K|K) P(A|KK)
4 3 4 2
=

he
52 51 50 5525
13.4 Independent Events
Consider the experiment of drawing a card from a deck of 52 playing cards, in which

is
the elementary events are assumed to be equally likely. If E and F denote the events
'the card drawn is a spade' and 'the card drawn is an ace' respectively, then
13 1 4 1

bl
P(E) = and P(F)
52 4 52 13
Also E and F is the event ' the card drawn is the ace of spades' so that
pu P (E ∩ F) =
1
52
1
be T

P (E F) 52 1
Hence P(E|F) =
re
P (F) 1 4
o R

13
1
tt E

Since P(E) = = P (E|F), we can say that the occurrence of event F has not
4
affected the probability of occurrence of the event E.
C

We also have
1
P (E F) 52 1
no N

P(F|E) = P(F)
P(E) 1 13
4
1
©

Again, P (F) = = P (F|E) shows that occurrence of event E has not affected
13
the probability of occurrence of the event F.
Thus, E and F are two events such that the probability of occurrence of one of
them is not affected by occurrence of the other.
Such events are called independent events.
PROBABILITY 543

Definition 2 Two events E and F are said to be independent, if


P (F|E) = P (F) provided P (E) ≠ 0
and P (E|F) = P (E) provided P (F) ≠ 0
Thus, in this definition we need to have P (E) ≠ 0 and P(F) ≠ 0
Now, by the multiplication rule of probability, we have
P (E ∩ F) = P(E) . P (F|E) ... (1)

he
If E and F are independent, then (1) becomes
P (E ∩ F) = P (E) . P (F) ... (2)
Thus, using (2), the independence of two events is also defined as follows:

is
Definition 3 Let E and F be two events associated with the same random experiment,
then E and F are said to be independent if

bl
P (E ∩ F) = P (E) . P (F)
Remarks
(i) Two events E and F are said to be dependent if they are not independent, i.e. if
pu P (E ∩ F ) ≠ P (E) . P (F)
(ii) Sometimes there is a confusion between independent events and mutually
be T

exclusive events. Term ‘independent’ is defined in terms of ‘probability of events’


whereas mutually exclusive is defined in term of events (subset of sample space).
re
o R

Moreover, mutually exclusive events never have an outcome common, but


independent events, may have common outcome. Clearly, ‘independent’ and
‘mutually exclusive’ do not have the same meaning.
tt E

In other words, two independent events having nonzero probabilities of occurrence


can not be mutually exclusive, and conversely, i.e. two mutually exclusive events
C

having nonzero probabilities of occurrence can not be independent.


(iii) Two experiments are said to be independent if for every pair of events E and F,
where E is associated with the first experiment and F with the second experiment,
no N

the probability of the simultaneous occurrence of the events E and F when the
two experiments are performed is the product of P(E) and P(F) calculated
separately on the basis of two experiments, i.e., P (E ∩ F) = P (E) . P(F)
(iv) Three events A, B and C are said to be mutually independent, if
©

P (A ∩ B) = P (A) P (B)
P (A ∩ C) = P (A) P (C)
P (B ∩ C) = P (B) P(C)
and P (A ∩ B ∩ C) = P (A) P (B) P (C)
544 MATHEMATICS

If at least one of the above is not true for three given events, we say that the
events are not independent.
Example 10 A die is thrown. If E is the event ‘the number appearing is a multiple of
3’ and F be the event ‘the number appearing is even’ then find whether E and F are
independent ?
Solution We know that the sample space is S = {1, 2, 3, 4, 5, 6}

he
Now E = { 3, 6}, F = { 2, 4, 6} and E ∩ F = {6}
2 1 3 1 1
Then P (E) = = , P (F) = = and P (E ∩ F) =
6 3 6 2 6

is
Clearly P (E ∩ F) = P(E). P (F)
Hence E and F are independent events.

bl
Example 11 An unbiased die is thrown twice. Let the event A be ‘odd number on the
first throw’ and B the event ‘odd number on the second throw’. Check the independence
of the events A and B.
pu
Solution If all the 36 elementary events of the experiment are considered to be equally
likely, we have
be T

18 1 18 1
P(A) = = and P (B)
re
36 2 36 2
o R

Also P (A ∩ B) = P (odd number on both throws)


9 1
tt E

= =
36 4
C

1 1 1
Now P (A) P(B) = × =
2 2 4
Clearly P (A ∩ B) = P (A) × P (B)
no N

Thus, A and B are independent events


Example 12 Three coins are tossed simultaneously. Consider the event E ‘three heads
or three tails’, F ‘at least two heads’ and G ‘at most two heads’. Of the pairs (E,F),
©

(E,G) and (F,G), which are independent? which are dependent?


Solution The sample space of the experiment is given by
S = {HHH, HHT, HTH, THH, HTT, THT, TTH, TTT}
Clearly E = {HHH, TTT}, F= {HHH, HHT, HTH, THH}
PROBABILITY 545

and G = {HHT, HTH, THH, HTT, THT, TTH, TTT}


Also E ∩ F = {HHH}, E ∩ G = {TTT}, F ∩ G = { HHT, HTH, THH}

2 1 4 1 7
Therefore P (E) = = , P (F) = = , P(G) =
8 4 8 2 8

1 1 3

he
and P (E ∩F) = , P (E ∩ G) = , P(F ∩ G) =
8 8 8

1 1 1 1 7 7
Also P (E) . P (F) = , P(E) P(G)
4 2 8 4 8 32

is
1 7 7
and P (F) . P(G) =
2 8 16

bl
Thus P (E ∩ F) = P(E) . P(F)
P (E ∩ G) ≠ P(E) . P(G)
and
pu P (F ∩ G) ≠ P (F) . P(G)
Hence, the events (E and F) are independent, and the events (E and G) and
be T

(F and G) are dependent.


re
Example 13 Prove that if E and F are independent events, then so are the events
o R

E and F′.
Solution Since E and F are independent, we have
tt E

P (E ∩ F) = P (E) . P (F) ....(1)


From the venn diagram in Fig 13.3, it is clear
C

that E ∩ F and E ∩ F ′ are mutually exclusive events (E ’ ∩ F ’ ) S


E F
and also E =(E ∩ F) ∪ (E ∩ F′).
no N

Therefore P (E) = P (E ∩ F) + P(E ∩ F′)


or P (E ∩ F′) = P(E) − P(E ∩ F)
(E ∩ F ’ ) (E ∩ F) (E ’∩ F)
= P(E) − P(E) . P(F)
©

(by (1)) Fig 13.3


= P(E) (1−P(F))
= P(E). P(F′)
Hence, E and F′ are independent
546 MATHEMATICS

$ Note In a similar manner, it can be shown that if the events E and F are
independent, then
(a) E′ and F are independent,
(b) E′ and F′ are independent

Example 14 If A and B are two independent events, then the probability of occurrence

he
of at least one of A and B is given by 1– P(A′) P(B′)
Solution We have
P (at least one of A and B) = P(A ∪ B)

is
= P(A) + P(B) − P(A ∩ B)
= P(A) + P(B) − P(A) P(B)
= P(A) + P(B) [1−P(A)]

bl
= P(A) + P(B). P(A′)
= 1− P(A′) + P(B) P(A′)
pu = 1− P(A′) [1− P(B)]
= 1− P(A′) P (B′)
be T

EXERCISE 13.2
re
o R

3 1
1. If P(A) and P (B) , find P (A ∩ B) if A and B are independent events.
5 5
2. Two cards are drawn at random and without replacement from a pack of 52
tt E

playing cards. Find the probability that both the cards are black.
3. A box of oranges is inspected by examining three randomly selected oranges
C

drawn without replacement. If all the three oranges are good, the box is approved
for sale, otherwise, it is rejected. Find the probability that a box containing 15
oranges out of which 12 are good and 3 are bad ones will be approved for sale.
no N

4. A fair coin and an unbiased die are tossed. Let A be the event ‘head appears on
the coin’ and B be the event ‘3 on the die’. Check whether A and B are
independent events or not.
5. A die marked 1, 2, 3 in red and 4, 5, 6 in green is tossed. Let A be the event,
©

‘the number is even,’ and B be the event, ‘the number is red’. Are A and B
independent?
3 3 1
6. Let E and F be events with P (E) , P (F) = and P (E ∩ F) = . Are
5 10 5
E and F independent?
PROBABILITY 547

1 3
7. Given that the events A and B are such that P(A) = , P (A ∪ B) = and
2 5
P (B) = p. Find p if they are (i) mutually exclusive (ii) independent.
8. Let A and B be independent events with P (A) = 0.3 and P(B) = 0.4. Find
(i) P (A ∩ B) (ii) P (A ∪ B)
(iii) P (A|B) (iv) P (B|A)

he
1 1 1
9. If A and B are two events such that P (A) = , P (B) = and P (A ∩ B) = ,
4 2 8
find P (not A and not B).

is
1 7 1
10. Events A and B are such that P (A) = , P(B) = and P(not A or not B) = .
2 12 4
State whether A and B are independent ?

bl
11. Given two independent events A and B such that P(A) = 0.3, P(B) = 0.6.
Find
(i) P(A and B) (ii) P(A and not B)
pu
(iii) P(A or B) (iv) P(neither A nor B)
12. A die is tossed thrice. Find the probability of getting an odd number at least once.
be T

13. Two balls are drawn at random with replacement from a box containing 10 black
and 8 red balls. Find the probability that
re
o R

(i) both balls are red.


(ii) first ball is black and second is red.
(iii) one of them is black and other is red.
tt E

1 1
14. Probability of solving specific problem independently by A and B are and
2 3
C

respectively. If both try to solve the problem independently, find the probability
that
(i) the problem is solved (ii) exactly one of them solves the problem.
no N

15. One card is drawn at random from a well shuffled deck of 52 cards. In which of
the following cases are the events E and F independent ?
(i) E : ‘the card drawn is a spade’
©

F : ‘the card drawn is an ace’


(ii) E : ‘the card drawn is black’
F : ‘the card drawn is a king’
(iii) E : ‘the card drawn is a king or queen’
F : ‘the card drawn is a queen or jack’.
548 MATHEMATICS

16. In a hostel, 60% of the students read Hindi news paper, 40% read English news
paper and 20% read both Hindi and English news papers. A student is selected
at random.
(a) Find the probability that she reads neither Hindi nor English news papers.
(b) If she reads Hindi news paper, find the probability that she reads English
news paper.

he
(c) If she reads English news paper, find the probability that she reads Hindi
news paper.
Choose the correct answer in Exercises 17 and 18.
17. The probability of obtaining an even prime number on each die, when a pair of

is
dice is rolled is
1 1 1
(A) 0 (B) (C) (D)
3 12 36

bl
18. Two events A and B will be independent, if
(A) A and B are mutually exclusive
pu
(B) P(A′B′) = [1 – P(A)] [1 – P(B)]
(C) P(A) = P(B)
be T

(D) P(A) + P(B) = 1


re
13.5 Bayes' Theorem
o R

Consider that there are two bags I and II. Bag I contains 2 white and 3 red balls and
Bag II contains 4 white and 5 red balls. One ball is drawn at random from one of the
1
tt E

bags. We can find the probability of selecting any of the bags (i.e. ) or probability of
2
drawing a ball of a particular colour (say white) from a particular bag (say Bag I). In
C

other words, we can find the probability that the ball drawn is of a particular colour, if
we are given the bag from which the ball is drawn. But, can we find the probability that
the ball drawn is from a particular bag (say Bag II), if the colour of the ball drawn is
no N

given? Here, we have to find the reverse probability of Bag II to be selected when an
event occurred after it is known. Famous mathematician, John Bayes' solved the problem
of finding reverse probability by using conditional probability. The formula developed
by him is known as ‘Bayes theorem’ which was published posthumously in 1763.
©

Before stating and proving the Bayes' theorem, let us first take up a definition and
some preliminary results.
13.5.1 Partition of a sample space
A set of events E1, E2, ..., En is said to represent a partition of the sample space S if
(a) Ei ∩ Ej = φ, i ≠ j, i, j = 1, 2, 3, ..., n
PROBABILITY 549

(b) E1 ∪ Ε2 ∪ ... ∪ En= S and


(c) P(Ei) > 0 for all i = 1, 2, ..., n.
In other words, the events E1, E2, ..., En represent a partition of the sample space
S if they are pairwise disjoint, exhaustive and have nonzero probabilities.
As an example, we see that any nonempty event E and its complement E′ form a
partition of the sample space S since they satisfy E ∩ E′ = φ and E ∪ E′ = S.

he
From the Venn diagram in Fig 13.3, one can easily observe that if E and F are any
two events associated with a sample space S, then the set {E ∩ F′, E ∩ F, E′ ∩ F, E′ ∩ F′}
is a partition of the sample space S. It may be mentioned that the partition of a sample
space is not unique. There can be several partitions of the same sample space.

is
We shall now prove a theorem known as Theorem of total probability.

bl
13.5.2 Theorem of total probability
Let {E1, E2,...,En} be a partition of the sample space S, and suppose that each of the
events E1, E2,..., En has nonzero probability of occurrence. Let A be any event associated
pu
with S, then
P(A) = P(E1) P(A|E1) + P(E2) P(A|E2) + ... + P(En) P(A|En)
be T

n
= ∑ P(E j ) P (A|E j )
re
j=1
o R

Proof Given that E1, E2,..., En is a partition of the sample space S (Fig 13.4). Therefore,
S = E1 ∪ E2 ∪ ... ∪ En ... (1)
tt E

and Ei ∩ Ej = φ, i ≠ j, i, j = 1, 2, ..., n
Now, we know that for any event A,
C

A=A∩S
= A ∩ (E1 ∪ E2 ∪ ... ∪ En)
= (A ∩ E1) ∪ (A ∩ E2) ∪ ...∪ (A ∩ En) Fig 13.4
no N

Also A ∩ Ei and A ∩ Ej are respectively the subsets of Ei and Ej . We know that


Ei and Ej are disjoint, for i ≠ j , therefore, A ∩ Ei and A ∩ Ej are also disjoint for all
i ≠ j, i, j = 1, 2, ..., n.
©

Thus, P(A) = P [(A ∩ E1) ∪ (A ∩ E2)∪ .....∪ (A ∩ En)]


= P (A ∩ E1) + P (A ∩ E2) + ... + P (A ∩ En)
Now, by multiplication rule of probability, we have
P(A ∩ Ei) = P(Ei) P(A|Ei) as P (Ei) ≠ 0∀i = 1,2,..., n
550 MATHEMATICS

Therefore, P (A) = P (E1) P (A|E1) + P (E2) P (A|E2) + ... + P (En)P(A|En)


n
or P(A) = ∑ P(E j ) P (A|E j )
j=1

Example 15 A person has undertaken a construction job. The probabilities are 0.65
that there will be strike, 0.80 that the construction job will be completed on time if there
is no strike, and 0.32 that the construction job will be completed on time if there is a

he
strike. Determine the probability that the construction job will be completed on time.
Solution Let A be the event that the construction job will be completed on time, and B
be the event that there will be a strike. We have to find P(A).
We have

is
P(B) = 0.65, P(no strike) = P(B′) = 1 − P(B) = 1 − 0.65 = 0.35
P(A|B) = 0.32, P(A|B′) = 0.80

bl
Since events B and B′ form a partition of the sample space S, therefore, by theorem
on total probability, we have
P(A) = P(B) P(A|B) + P(B′) P(A|B′)
pu = 0.65 × 0.32 + 0.35 × 0.8
= 0.208 + 0.28 = 0.488
be T

Thus, the probability that the construction job will be completed in time is 0.488.
re
We shall now state and prove the Bayes' theorem.
o R

Bayes’ Theorem If E1, E2 ,..., En are n non empty events which constitute a partition
of sample space S, i.e. E1, E2 ,..., En are pairwise disjoint and E1∪ E2∪ ... ∪ En = S and
A is any event of nonzero probability, then
tt E

P (E i ) P (A|E i )
P(Ei|A) = n for any i = 1, 2, 3, ..., n
∑ P(E j ) P (A|E j )
C

j=1

Proof By formula of conditional probability, we know that


no N

P(A ∩ E i )
P(Ei|A) =
P(A)
P (Ei ) P (A|Ei )
©

= (by multiplication rule of probability)


P (A)
P (E i ) P(A|E i )
= n (by the result of theorem of total probability)
∑ P(E j ) P(A|E j )
j=1
PROBABILITY 551

Remark The following terminology is generally used when Bayes' theorem is applied.
The events E1, E2, ..., En are called hypotheses.
The probability P(Ei) is called the priori probability of the hypothesis Ei
The conditional probability P(Ei |A) is called a posteriori probability of the
hypothesis Ei.
Bayes' theorem is also called the formula for the probability of "causes". Since the

he
Ei's are a partition of the sample space S, one and only one of the events Ei occurs (i.e.
one of the events Ei must occur and only one can occur). Hence, the above formula
gives us the probability of a particular Ei (i.e. a "Cause"), given that the event A has
occurred.

is
The Bayes' theorem has its applications in variety of situations, few of which are
illustrated in following examples.

bl
Example 16 Bag I contains 3 red and 4 black balls while another Bag II contains 5 red
and 6 black balls. One ball is drawn at random from one of the bags and it is found to
be red. Find the probability that it was drawn from Bag II.
pu
Solution Let E1 be the event of choosing the bag I, E2 the event of choosing the bag II
and A be the event of drawing a red ball.
be T

1
Then P(E1) = P(E2) =
re
2
o R

3
Also P(A|E1) = P(drawing a red ball from Bag I) =
7
tt E

5
and P(A|E2) = P(drawing a red ball from Bag II) =
11
C

Now, the probability of drawing a ball from Bag II, being given that it is red,
is P(E2|A)
By using Bayes' theorem, we have
no N

1 5
×
P(E 2 ) P(A|E 2 ) 2 11 35
P(E2|A) = = 1 3 1 5 =
P(E1 ) P(A|E1 ) + P(E 2 ) P(A|E 2 ) × + × 68
©

2 7 2 11
Example 17 Given three identical boxes I, II and III, each containing two coins. In
box I, both coins are gold coins, in box II, both are silver coins and in the box III, there
is one gold and one silver coin. A person chooses a box at random and takes out a coin.
If the coin is of gold, what is the probability that the other coin in the box is also of gold?
552 MATHEMATICS

Solution Let E1, E2 and E3 be the events that boxes I, II and III are chosen, respectively.
1
Then P(E1) = P(E2) = P(E3) =
3
Also, let A be the event that ‘the coin drawn is of gold’
2
Then P(A|E1) = P(a gold coin from bag I) = =1

he
2
P(A|E2) = P(a gold coin from bag II) = 0
1
P(A|E3) = P(a gold coin from bag III) =
2

is
Now, the probability that the other coin in the box is of gold
= the probability that gold coin is drawn from the box I.

bl
= P(E1|A)
By Bayes' theorem, we know that
pu P(E1 ) P(A|E1 )
P(E1|A) =
P(E1 ) P(A|E1 ) + P(E 2 ) P(A|E 2 ) + P(E 3 ) P (A|E 3 )
be T

1
×1
2
re
3 =
=
o R

1 1 1 1 3
×1+ × 0 + ×
3 3 3 2
tt E

Example 18 Suppose that the reliability of a HIV test is specified as follows:


Of people having HIV, 90% of the test detect the disease but 10% go undetected. Of
C

people free of HIV, 99% of the test are judged HIV–ive but 1% are diagnosed as
showing HIV+ive. From a large population of which only 0.1% have HIV, one person
is selected at random, given the HIV test, and the pathologist reports him/her as
no N

HIV+ive. What is the probability that the person actually has HIV?
Solution Let E denote the event that the person selected is actually having HIV and A
the event that the person's HIV test is diagnosed as +ive. We need to find P(E|A).
©

Also E′ denotes the event that the person selected is actually not having HIV.
Clearly, {E, E′} is a partition of the sample space of all people in the population.
We are given that
0.1
P(E) = 0.1% 0.001
100
PROBABILITY 553

P(E′) = 1 – P(E) = 0.999


P(A|E) = P(Person tested as HIV+ive given that he/she
is actually having HIV)
90
= 90% 0.9
100
and P(A|E′) = P(Person tested as HIV +ive given that he/she

he
is actually not having HIV)
1
= 1% = = 0.01
100
Now, by Bayes' theorem

is
P(E) P(A|E)
P(E|A) =
P(E) P(A|E) + P(E ) P (A|E )

bl
0.001× 0.9 90
= =
0.001× 0.9 + 0.999 × 0.01 1089
pu = 0.083 approx.
Thus, the probability that a person selected at random is actually having HIV
be T

given that he/she is tested HIV+ive is 0.083.


re
Example 19 In a factory which manufactures bolts, machines A, B and C manufacture
o R

respectively 25%, 35% and 40% of the bolts. Of their outputs, 5, 4 and 2 percent are
respectively defective bolts. A bolt is drawn at random from the product and is found
to be defective. What is the probability that it is manufactured by the machine B?
tt E

Solution Let events B1, B2, B3 be the following :


B1 : the bolt is manufactured by machine A
C

B2 : the bolt is manufactured by machine B


B3 : the bolt is manufactured by machine C
no N

Clearly, B1, B2, B3 are mutually exclusive and exhaustive events and hence, they
represent a partition of the sample space.
Let the event E be ‘the bolt is defective’.
©

The event E occurs with B1 or with B2 or with B3. Given that,


P(B1) = 25% = 0.25, P (B2) = 0.35 and P(B3) = 0.40
Again P(E|B1) = Probability that the bolt drawn is defective given that it is manu-
factured by machine A = 5% = 0.05
Similarly, P(E|B2) = 0.04, P(E|B3) = 0.02.
554 MATHEMATICS

Hence, by Bayes' Theorem, we have


P(B2 ) P (E|B2 )
P(B2|E) =
P(B1 ) P (E|B1 ) + P (B2 ) P(E|B2 ) +P(B 3 ) P (E|B3 )
0.35 × 0.04
=
0.25 × 0.05 + 0.35 × 0.04 + 0.40 × 0.02

he
0.0140 28
= =
0.0345 69
Example 20 A doctor is to visit a patient. From the past experience, it is known that
the probabilities that he will come by train, bus, scooter or by other means of transport

is
3 1 1 2 1 1 1
are respectively , , and . The probabilities that he will be late are , , and ,
10 5 10 5 4 3 12
if he comes by train, bus and scooter respectively, but if he comes by other means of

bl
transport, then he will not be late. When he arrives, he is late. What is the probability
that he comes by train?
pu
Solution Let E be the event that the doctor visits the patient late and let T1, T2, T3, T4
be the events that the doctor comes by train, bus, scooter, and other means of transport
respectively.
be T

3 1 1 2
Then P(T1) = , P (T2 ) = , P (T3 ) = and P (T4 ) = (given)
re
10 5 10 5
o R

1
P(E|T1) = Probability that the doctor arriving late comes by train =
4
tt E

1 1
Similarly, P(E|T2) = , P(E|T3) = and P(E|T4) = 0, since he is not late if he
3 12
comes by other means of transport.
C

Therefore, by Bayes' Theorem, we have


P(T1|E) = Probability that the doctor arriving late comes by train
no N

P(T1 ) P (E|T1 )
=
P(T1 ) P (E|T1 ) + P (T2 ) P(E|T2 ) + P (T3 ) P(E|T3 )+ P (T4 )P (E|T4 )
3 1
©

10 4 3 120 1
= = × =
3 1 1 1 1 1 2 40 18 2
0
10 4 5 3 10 12 5
1
Hence, the required probability is .
2
PROBABILITY 555

Example 21 A man is known to speak truth 3 out of 4 times. He throws a die and
reports that it is a six. Find the probability that it is actually a six.

Solution Let E be the event that the man reports that six occurs in the throwing of the
die and let S1 be the event that six occurs and S2 be the event that six does not occur.
1
Then P(S1) = Probability that six occurs =
6

he
5
P(S2) = Probability that six does not occur =
6
P(E|S1) = Probability that the man reports that six occurs when six has

is
actually occurred on the die
3
= Probability that the man speaks the truth =

bl
4
P(E|S2) = Probability that the man reports that six occurs when six has
not actually occurred on the die
pu = Probability that the man does not speak the truth 1
3 1
4 4
be T

Thus, by Bayes' theorem, we get


P(S1|E) = Probability that the report of the man that six has occurred is
re
actually a six
o R

P(S1 ) P(E |S1 )


=
P(S1 ) P(E|S1 ) + P(S2 ) P (E|S2 )
tt E

1 3
1 24 3
C

6 4
=
1 3 5 1 8 8 8
6 4 6 4
no N

3
Hence, the required probability is .
8
©

EXERCISE 13.3
1. An urn contains 5 red and 5 black balls. A ball is drawn at random, its colour is
noted and is returned to the urn. Moreover, 2 additional balls of the colour drawn
are put in the urn and then a ball is drawn at random. What is the probability that
the second ball is red?
556 MATHEMATICS

2. A bag contains 4 red and 4 black balls, another bag contains 2 red and 6 black
balls. One of the two bags is selected at random and a ball is drawn from the bag
which is found to be red. Find the probability that the ball is drawn from the
first bag.
3. Of the students in a college, it is known that 60% reside in hostel and 40% are
day scholars (not residing in hostel). Previous year results report that 30% of all
students who reside in hostel attain A grade and 20% of day scholars attain A

he
grade in their annual examination. At the end of the year, one student is chosen
at random from the college and he has an A grade, what is the probability that the
student is a hostlier?
4. In answering a question on a multiple choice test, a student either knows the

is
3 1
answer or guesses. Let be the probability that he knows the answer and
4 4
be the probability that he guesses. Assuming that a student who guesses at the

bl
1
answer will be correct with probability . What is the probability that the stu-
4
dent knows the answer given that he answered it correctly?
5.
pu
A laboratory blood test is 99% effective in detecting a certain disease when it is
in fact, present. However, the test also yields a false positive result for 0.5% of
the healthy person tested (i.e. if a healthy person is tested, then, with probability
be T

0.005, the test will imply he has the disease). If 0.1 percent of the population
re
actually has the disease, what is the probability that a person has the disease
o R

given that his test result is positive ?


6. There are three coins. One is a two headed coin (having head on both faces),
another is a biased coin that comes up heads 75% of the time and third is an
tt E

unbiased coin. One of the three coins is chosen at random and tossed, it shows
heads, what is the probability that it was the two headed coin ?
C

7. An insurance company insured 2000 scooter drivers, 4000 car drivers and 6000
truck drivers. The probability of an accidents are 0.01, 0.03 and 0.15 respectively.
One of the insured persons meets with an accident. What is the probability that
no N

he is a scooter driver?
8. A factory has two machines A and B. Past record shows that machine A produced
60% of the items of output and machine B produced 40% of the items. Further,
2% of the items produced by machine A and 1% produced by machine B were
©

defective. All the items are put into one stockpile and then one item is chosen at
random from this and is found to be defective. What is the probability that it was
produced by machine B?
9. Two groups are competing for the position on the Board of directors of a
corporation. The probabilities that the first and the second groups will win are
PROBABILITY 557

0.6 and 0.4 respectively. Further, if the first group wins, the probability of
introducing a new product is 0.7 and the corresponding probability is 0.3 if the
second group wins. Find the probability that the new product introduced was by
the second group.
10. Suppose a girl throws a die. If she gets a 5 or 6, she tosses a coin three times and
notes the number of heads. If she gets 1, 2, 3 or 4, she tosses a coin once and

he
notes whether a head or tail is obtained. If she obtained exactly one head, what
is the probability that she threw 1, 2, 3 or 4 with the die?
11. A manufacturer has three machine operators A, B and C. The first operator A
produces 1% defective items, where as the other two operators B and C pro-

is
duce 5% and 7% defective items respectively. A is on the job for 50% of the
time, B is on the job for 30% of the time and C is on the job for 20% of the time.
A defective item is produced, what is the probability that it was produced by A?

bl
12. A card from a pack of 52 cards is lost. From the remaining cards of the pack,
two cards are drawn and are found to be both diamonds. Find the probability of
pu
the lost card being a diamond.
4
13. Probability that A speaks truth is . A coin is tossed. A reports that a head
be T

5
appears. The probability that actually there was head is
re
o R

4 1 1 2
(A) (B) (C) (D)
5 2 5 5
14. If A and B are two events such that A ⊂ B and P(B) ≠ 0, then which of the
tt E

following is correct?
P (B)
C

(A) P(A | B) (B) P(A|B) < P(A)


P (A)
(C) P(A|B) ≥ P(A) (D) None of these
no N

13.6 Random Variables and its Probability Distributions


We have already learnt about random experiments and formation of sample spaces. In
most of these experiments, we were not only interested in the particular outcome that
©

occurs but rather in some number associated with that outcomes as shown in following
examples/experiments.
(i) In tossing two dice, we may be interested in the sum of the numbers on the
two dice.
(ii) In tossing a coin 50 times, we may want the number of heads obtained.
558 MATHEMATICS

(iii) In the experiment of taking out four articles (one after the other) at random
from a lot of 20 articles in which 6 are defective, we want to know the
number of defectives in the sample of four and not in the particular sequence
of defective and nondefective articles.
In all of the above experiments, we have a rule which assigns to each outcome of
the experiment a single real number. This single real number may vary with different
outcomes of the experiment. Hence, it is a variable. Also its value depends upon the

he
outcome of a random experiment and, hence, is called random variable. A random
variable is usually denoted by X.
If you recall the definition of a function, you will realise that the random variable X
is really speaking a function whose domain is the set of outcomes (or sample space) of

is
a random experiment. A random variable can take any real value, therefore, its
co-domain is the set of real numbers. Hence, a random variable can be defined as

bl
follows :
Definition 4 A random variable is a real valued function whose domain is the sample
space of a random experiment.
pu
For example, let us consider the experiment of tossing a coin two times in succession.
The sample space of the experiment is S = {HH, HT, TH, TT}.
be T

If X denotes the number of heads obtained, then X is a random variable and for
each outcome, its value is as given below :
re
o R

X (HH) = 2, X (HT) = 1, X (TH) = 1, X (TT) = 0.


More than one random variables can be defined on the same sample space. For
example, let Y denote the number of heads minus the number of tails for each outcome
tt E

of the above sample space S.


Then Y (HH) = 2, Y (HT) = 0, Y (TH) = 0, Y (TT) = – 2.
C

Thus, X and Y are two different random variables defined on the same sample
space S.
no N

Example 22 A person plays a game of tossing a coin thrice. For each head, he is
given Rs 2 by the organiser of the game and for each tail, he has to give Rs 1.50 to the
organiser. Let X denote the amount gained or lost by the person. Show that X is a
random variable and exhibit it as a function on the sample space of the experiment.
©

Solution X is a number whose values are defined on the outcomes of a random


experiment. Therefore, X is a random variable.
Now, sample space of the experiment is
S = {HHH, HHT, HTH, THH, HTT, THT, TTH, TTT}
PROBABILITY 559

Then X (HHH) = Rs (2 × 3) = Rs 6
X (HHT) = X (HTH) = X (THH) = Rs (2 × 2 − 1 × 1.50) = Rs 2.50
X (HTT) = X (THT) = (TTH) = Rs (1 × 2) – (2 × 1.50) = – Re 1
and X (TTT) = − Rs (3 × 1.50) = − Rs 4.50
where, minus sign shows the loss to the player. Thus, for each element of the sample

he
space, X takes a unique value, hence, X is a function on the sample space whose range
is
{– 1, 2.50, – 4.50, 6}
Example 23 A bag contains 2 white and 1 red balls. One ball is drawn at random and

is
then put back in the box after noting its colour. The process is repeated again. If X
denotes the number of red balls recorded in the two draws, describe X.

bl
Solution Let the balls in the bag be denoted by w1, w2, r. Then the sample space is
S = {w1 w1, w1 w2, w2 w2, w2 w1, w1 r, w2 r, r w1, r w2, r r}
Now, for ω∈S
pu X (ω) = number of red balls
Therefore
be T

X ({w1 w1}) = X ({w1 w2}) = X ({w2 w2}) = X ({w2 w1}) = 0


re
X ({w1 r}) = X ({w2 r}) = X ({r w1}) = X ({r w2}) = 1 and X ({r r}) = 2
o R

Thus, X is a random variable which can take values 0, 1 or 2.


13.6.1 Probability distribution of a random variable
tt E

Let us look at the experiment of selecting one family out of ten families f1, f2 ,..., f10 in
such a manner that each family is equally likely to be selected. Let the families f1, f2,
C

... , f10 have 3, 4, 3, 2, 5, 4, 3, 6, 4, 5 members, respectively.


Let us select a family and note down the number of members in the family denoting
X. Clearly, X is a random variable defined as below :
no N

X (f1) = 3, X (f2) = 4, X (f3) = 3, X (f4) = 2, X (f5) = 5,


X (f6) = 4, X (f7) = 3, X (f8) = 6, X (f9) = 4, X (f10) = 5
Thus, X can take any value 2,3,4,5 or 6 depending upon which family is selected.
©

Now, X will take the value 2 when the family f4 is selected. X can take the value
3 when any one of the families f1, f3, f7 is selected.
Similarly, X = 4, when family f2, f6 or f9 is selected,
X = 5, when family f5 or f10 is selected
and X = 6, when family f8 is selected.
560 MATHEMATICS

Since we had assumed that each family is equally likely to be selected, the probability
1
that family f4 is selected is .
10
1 1
Thus, the probability that X can take the value 2 is . We write P(X = 2) =
10 10
Also, the probability that any one of the families f1, f3 or f7 is selected is

he
3
P({f1, f3, f7}) =
10
3

is
Thus, the probability that X can take the value 3 =
10
3

bl
We write P(X = 3) =
10
Similarly, we obtain
pu P(X = 4) = P({f2, f6, f9}) =
3
10
be T

2
P(X = 5) = P({f5, f10}) =
re
10
o R

1
and P(X = 6) = P({f8}) =
10
tt E

Such a description giving the values of the random variable along with the
corresponding probabilities is called the probability distribution of the random
C

variable X.
In general, the probability distribution of a random variable X is defined as follows:
Definition 5 The probability distribution of a random variable X is the system of numbers
no N

X : x1 x2 ... xn
P(X) : p1 p2 ... pn
©

n
where, pi 0, pi = 1, i = 1, 2,..., n
i 1

The real numbers x1, x2,..., xn are the possible values of the random variable X and
pi (i = 1,2,..., n) is the probability of the random variable X taking the value xi i.e.,
P(X = xi) = pi
PROBABILITY 561

$ Note If x is one of the possible values of a random variable X, the statement


i
X = x is true only at some point (s) of the sample space. Hence, the probability that
i
X takes value xi is always nonzero, i.e. P(X = xi) ≠ 0.
Also for all possible values of the random variable X, all elements of the sample
space are covered. Hence, the sum of all the probabilities in a probability distribution
must be one.

he
Example 24 Two cards are drawn successively with replacement from a well-shuffled
deck of 52 cards. Find the probability distribution of the number of aces.
Solution The number of aces is a random variable. Let it be denoted by X. Clearly, X

is
can take the values 0, 1, or 2.
Now, since the draws are done with replacement, therefore, the two draws form

bl
independent experiments.
Therefore, P(X = 0) = P(non-ace and non-ace)
= P(non-ace) × P(non-ace)
pu
48 48 144
= × =
52 52 169
be T

P(X = 1) = P(ace and non-ace or non-ace and ace)


re
= P(ace and non-ace) + P(non-ace and ace)
o R

= P(ace). P(non-ace) + P (non-ace) . P(ace)


tt E

4 48 48 4 24
= × + × =
52 52 52 52 169
C

and P(X = 2) = P (ace and ace)


4 4 1
=
52 52 169
no N

Thus, the required probability distribution is

X 0 1 2
©

144 24 1
P(X)
169 169 169

Example 25 Find the probability distribution of number of doublets in three throws of


a pair of dice.
562 MATHEMATICS

Solution Let X denote the number of doublets. Possible doublets are


(1,1) , (2,2), (3,3), (4,4), (5,5), (6,6)
Clearly, X can take the value 0, 1, 2, or 3.
6 1
Probability of getting a doublet
36 6

he
1 5
Probability of not getting a doublet 1
6 6
5 5 5 125
Now P(X = 0) = P (no doublet) =

is
6 6 6 216
P(X = 1) = P (one doublet and two non-doublets)
1 5 5 5 1 5 5 5 1

bl
=
6 6 6 6 6 6 6 6 6

1 52 75
pu =3
6 62 216
P(X = 2) = P (two doublets and one non-doublet)
be T

1 1 5 1 5 1 5 1 1 1 5 15
re
= 3
6 6 6 6 6 6 6 6 6 62 6 216
o R

and P(X = 3) = P (three doublets)


1 1 1 1
tt E

=
6 6 6 216
Thus, the required probability distribution is
C

X 0 1 2 3
125 75 15 1
no N

P(X)
216 216 216 216
Verification Sum of the probabilities
©

n
125 75 15 1
∑ pi =
216 216 216 216
i =1

125 75 15 1 216
= 1
216 216
PROBABILITY 563

Example 26 Let X denote the number of hours you study during a randomly selected
school day. The probability that X can take the values x, has the following form, where
k is some unknown constant.

⎧0.1, if x = 0
⎪kx, if x =1or 2

P(X = x) = ⎨
⎪k (5 − x), if x = 3or 4

he
⎪⎩0, otherwise

(a) Find the value of k.


(b) What is the probability that you study at least two hours ? Exactly two hours? At

is
most two hours?

Solution The probability distribution of X is

bl
X 0 1 2 3 4
P(X) 0.1 k 2k 2k k
pu n
(a) We know that ∑ pi =1
be T

i =1
Therefore 0.1 + k + 2k + 2k + k = 1
re
i.e. k = 0.15
o R

(b) P(you study at least two hours) = P(X ≥ 2)


= P(X = 2) + P (X = 3) + P (X = 4)
tt E

= 2k + 2k + k = 5k = 5 × 0.15 = 0.75
P(you study exactly two hours) = P(X = 2)
C

= 2k = 2 × 0.15 = 0.3
P(you study at most two hours) = P(X ≤ 2)
no N

= P (X = 0) + P(X = 1) + P(X = 2)
= 0.1 + k + 2k = 0.1 + 3k = 0.1 + 3 × 0.15
= 0.55
©

13.6.2 Mean of a random variable


In many problems, it is desirable to describe some feature of the random variable by
means of a single number that can be computed from its probability distribution. Few
such numbers are mean, median and mode. In this section, we shall discuss mean only.
Mean is a measure of location or central tendency in the sense that it roughly locates a
middle or average value of the random variable.
564 MATHEMATICS

Definition 6 Let X be a random variable whose possible values x1, x2, x3, ..., xn occur
with probabilities p1, p2, p3,..., pn , respectively. The mean of X, denoted by μ, is the
n
number ∑ xi pi i.e. the mean of X is the weighted average of the possible values of X,
i =1

each value being weighted by its probability with which it occurs.

he
The mean of a random variable X is also called the expectation of X, denoted by
E(X).
n

Thus, E (X) = μ = xi pi = x p + x p + ... + x p .


1 1 2 2 n n
i 1

is
In other words, the mean or expectation of a random variable X is the sum of the
products of all possible values of X by their respective probabilities.

bl
Example 27 Let a pair of dice be thrown and the random variable X be the sum of the
numbers that appear on the two dice. Find the mean or expectation of X.
Solution The sample space of the experiment consists of 36 elementary events in the
pu
form of ordered pairs (xi , yi), where xi = 1, 2, 3, 4, 5, 6 and yi = 1, 2, 3, 4, 5, 6.
The random variable X i.e. the sum of the numbers on the two dice takes the
be T

values 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 or 12.


re
1
o R

Now P(X = 2) = P({(1,1)})


36
2
tt E

P(X = 3) = P({(1,2), (2,1)})


36
3
C

P(X = 4) = P({(1,3), (2,2), (3,1)})


36
4
no N

P(X = 5) = P({(1,4), (2,3), (3,2), (4,1)})


36
5
P(X = 6) = P({(1,5), (2,4), (3,3), (4,2), (5,1)})
36
©

6
P(X = 7) = P({(1,6), (2,5), (3,4), (4,3), (5,2), (6,1)})
36
5
P(X = 8) = P({(2,6), (3,5), (4,4), (5,3), (6,2)})
36
PROBABILITY 565

4
P(X = 9) = P({(3,6), (4,5), (5,4), (6,3)})
36
3
P(X = 10) = P({(4,6), (5,5), (6,4)})
36
2

he
P(X = 11) = P({(5,6), (6,5)})
36
1
P(X = 12) = P({(6,6)})
36

is
The probability distribution of X is

X or xi 2 3 4 5 6 7 8 9 10 11 12

bl
1 2 3 4 5 6 5 4 3 2 1
P(X) or pi
36 36 36 36 36 36 36 36 36 36 36
pu
Therefore,
be T

n
1 2 3 4
μ = E(X) = ∑ xi pi = 2×
36
+ 3× + 4 × + 5 ×
36 36 36
re
i =1
o R

5 6 5 4 3 2 1
6 7 8 9 10 11 12
36 36 36 36 36 36 36
tt E

2 6 12 20 30 42 40 36 30 22 12
= =7
36
C

Thus, the mean of the sum of the numbers that appear on throwing two fair dice is 7.
13.6.3 Variance of a random variable
no N

The mean of a random variable does not give us information about the variability in the
values of the random variable. In fact, if the variance is small, then the values of the
random variable are close to the mean. Also random variables with different probability
distributions can have equal means, as shown in the following distributions of X and Y.
©

X 1 2 3 4
1 2 3 2
P(X)
8 8 8 8
566 MATHEMATICS

Y –1 0 4 5 6
1 2 3 1 1
P(Y)
8 8 8 8 8
1 2 3 2 22
Clearly E(X) = 1× + 2 × + 3× + 4 × = = 2.75
8 8 8 8 8

he
1 2 3 1 1 22
and E(Y) = −1× + 0 × + 4 × + 5× = 6 × = = 2.75
8 8 8 8 8 8
The variables X and Y are different, however their means are same. It is also

is
easily observable from the diagramatic representation of these distributions (Fig 13.5).
P(X) P(Y)

bl
3 3
8 8
2
pu 2
8 8
be T

1 1
8 8
re
O O
o R

1 2 3 4 –1 1 2 3 4 5 6
(i) (ii)
Fig 13.5
tt E

To distinguish X from Y, we require a measure of the extent to which the values of


the random variables spread out. In Statistics, we have studied that the variance is a
C

measure of the spread or scatter in data. Likewise, the variability or spread in the
values of a random variable may be measured by variance.
Definition 7 Let X be a random variable whose possible values x1, x2,...,xn occur with
no N

probabilities p(x1), p(x2),..., p(xn) respectively.


2
Let μ = E (X) be the mean of X. The variance of X, denoted by Var (X) or x is
defined as
©

n
σ x 2 = Var (X) = ∑ ( xi − μ) 2 p ( xi )
i =1

2
or equivalently x = E(X – μ)2
PROBABILITY 567

The non-negative number


n
σx = Var(X) = ∑ ( xi − μ)2 p( xi )
i =1

is called the standard deviation of the random variable X.


Another formula to find the variance of a random variable. We know that,

he
n
Var (X) = ∑ ( xi − μ)2 p( xi )
i =1

is
n
= ( xi 2 μ2 2μ xi ) p ( xi )
i 1

bl
n n n
= ∑ xi 2 p( xi ) + ∑ μ 2 p( xi ) − ∑ 2μxi p( xi )
i =1 i =1 i =1
pu n n n
= ∑ xi 2 p( xi ) + μ 2 ∑ p( xi ) − 2μ ∑ xi p( xi )
i =1 i =1 i =1
be T

2⎡ ⎤
n n n
∑i i 2
+ 2
− ∑ i ∑ xi p( xi ) ⎥
re
= x p ( x ) μ 2μ ⎢ since p (x ) =1andμ =
o R

i =1 ⎣ i =1 i =1 ⎦
tt E

n
= ∑ xi 2 p( xi ) − μ 2
i =1
C

2
n
⎛ n ⎞
or Var (X) = ∑ xi p( xi ) − ⎜ ∑ xi p( xi ) ⎟
2

i =1 ⎝ i =1 ⎠
no N

n
Var (X) = E(X2) – [E(X)]2, where E(X2) = ∑ xi p( xi )
2
or
i =1
©

Example 28 Find the variance of the number obtained on a throw of an unbiased die.

Solution The sample space of the experiment is S = {1, 2, 3, 4, 5, 6}.


Let X denote the number obtained on the throw. Then X is a random variable
which can take values 1, 2, 3, 4, 5, or 6.
568 MATHEMATICS

1
Also P(1) = P(2) = P(3) = P(4) = P(5) = P(6) =
6
Therefore, the Probability distribution of X is

X 1 2 3 4 5 6

he
1 1 1 1 1 1
P(X)
6 6 6 6 6 6
n
Now E(X) = xi p ( xi )

is
i 1

1 1 1 1 1 1 21
= 1× + 2 × + 3× + 4 × + 5× + 6 × =

bl
6 6 6 6 6 6 6

2 1 2 1 2 1 2 1 2 1 2 1 91
Also E(X2) = 1 × + 2 × + 3 × + 4 × + 5 × + 6 × =
6 6 6 6 6 6 6
pu
Thus, Var (X) = E (X2) – (E(X))2
be T

2
91 ⎛ 21 ⎞ 91 441 35
= −⎜ ⎟ = −
6 ⎝ 6 ⎠ 6 36 12
re
o R

Example 29 Two cards are drawn simultaneously (or successively without replacement)
from a well shuffled pack of 52 cards. Find the mean, variance and standard deviation
tt E

of the number of kings.


Solution Let X denote the number of kings in a draw of two cards. X is a random
C

variable which can assume the values 0, 1 or 2.

48!
48
no N

C2 2!(48 2)! 48 47 188


Now P(X = 0) = P (no king) 52
C2 52! 52 51 221
2!(52 2)!
©

4 48
C1 C1
P(X = 1) = P (one king and one non-king) = 52
C2

4 × 48× 2 32
= =
52 × 51 221
PROBABILITY 569

4
C2 4×3 1
and P(X = 2) = P (two kings) = = =
52
C 2 52 × 51 221
Thus, the probability distribution of X is
X 0 1 2
188 32 1

he
P(X)
221 221 221
n

Now Mean of X = E(X) = xi p ( xi )

is
i 1

188 32 1 34
= 0× +1× + 2× =
221 221 221 221

bl
n
Also E(X2) = ∑ xi2 p( xi )
pu i =1

188 2 32 1 36
= 02 × +1 × + 22 × =
221 221 221 221
be T

Now Var(X) = E(X2) – [E(X)]2


re
o R

2
36 ⎛ 34 ⎞ 6800
= –⎜ ⎟ =
221 ⎝ 221 ⎠ (221)2
tt E

6800
Therefore σx = Var(X) = = 0.37
C

221

EXERCISE 13.4
no N

1. State which of the following are not the probability distributions of a random
variable. Give reasons for your answer.
(i) X 0 1 2
©

P(X) 0.4 0.4 0.2

(ii) X 0 1 2 3 4
P(X) 0.1 0.5 0.2 – 0.1 0.3
570 MATHEMATICS

(iii) Y –1 0 1
P(Y) 0.6 0.1 0.2

(iv) Z 3 2 1 0 –1
P(Z) 0.3 0.2 0.4 0.1 0.05

he
2. An urn contains 5 red and 2 black balls. Two balls are randomly drawn. Let X
represent the number of black balls. What are the possible values of X? Is X a
random variable ?
3. Let X represent the difference between the number of heads and the number of

is
tails obtained when a coin is tossed 6 times. What are possible values of X?
4. Find the probability distribution of

bl
(i) number of heads in two tosses of a coin.
(ii) number of tails in the simultaneous tosses of three coins.
pu
(iii) number of heads in four tosses of a coin.

5. Find the probability distribution of the number of successes in two tosses of a die,
be T

where a success is defined as


re
(i) number greater than 4
o R

(ii) six appears on at least one die


tt E

6. From a lot of 30 bulbs which include 6 defectives, a sample of 4 bulbs is drawn


at random with replacement. Find the probability distribution of the number of
defective bulbs.
C

7. A coin is biased so that the head is 3 times as likely to occur as tail. If the coin is
tossed twice, find the probability distribution of number of tails.
no N

8. A random variable X has the following probability distribution:


X 0 1 2 3 4 5 6 7
©

P(X) 0 k 2k 2k 3k k 2k 7k 2 +k
2 2

Determine
(i) k (ii) P(X < 3)
(iii) P(X > 6) (iv) P(0 < X < 3)
PROBABILITY 571

9. The random variable X has a probability distribution P(X) of the following form,
where k is some number :

⎧k , if x = 0
⎪2k , if x =1

P(X) = ⎨
⎪3k , if x = 2

he
⎪⎩0, otherwise

(a) Determine the value of k.


(b) Find P (X < 2), P (X ≤ 2), P(X ≥ 2).

is
10. Find the mean number of heads in three tosses of a fair coin.
11. Two dice are thrown simultaneously. If X denotes the number of sixes, find the
expectation of X.

bl
12. Two numbers are selected at random (without replacement) from the first six
positive integers. Let X denote the larger of the two numbers obtained. Find
pu
E(X).
13. Let X denote the sum of the numbers obtained when two fair dice are rolled.
Find the variance and standard deviation of X.
be T

14. A class has 15 students whose ages are 14, 17, 15, 14, 21, 17, 19, 20, 16, 18, 20,
re
17, 16, 19 and 20 years. One student is selected in such a manner that each has
o R

the same chance of being chosen and the age X of the selected student is
recorded. What is the probability distribution of the random variable X? Find
mean, variance and standard deviation of X.
tt E

15. In a meeting, 70% of the members favour and 30% oppose a certain proposal.
A member is selected at random and we take X = 0 if he opposed, and X = 1 if
C

he is in favour. Find E(X) and Var (X).


Choose the correct answer in each of the following:
no N

16. The mean of the numbers obtained on throwing a die having written 1 on three
faces, 2 on two faces and 5 on one face is
8
(A) 1 (B) 2 (C) 5 (D)
©

3
17. Suppose that two cards are drawn at random from a deck of cards. Let X be the
number of aces obtained. Then the value of E(X) is
37 5 1 2
(A) (B) (C) (D)
221 13 13 13
572 MATHEMATICS

13.7 Bernoulli Trials and Binomial Distribution


13.7.1 Bernoulli trials
Many experiments are dichotomous in nature. For example, a tossed coin shows a
‘head’ or ‘tail’, a manufactured item can be ‘defective’ or ‘non-defective’, the response
to a question might be ‘yes’ or ‘no’, an egg has ‘hatched’ or ‘not hatched’, the decision
is ‘yes’ or ‘no’ etc. In such cases, it is customary to call one of the outcomes a ‘success’

he
and the other ‘not success’ or ‘failure’. For example, in tossing a coin, if the occurrence
of the head is considered a success, then occurrence of tail is a failure.
Each time we toss a coin or roll a die or perform any other experiment, we call it a
trial. If a coin is tossed, say, 4 times, the number of trials is 4, each having exactly two

is
outcomes, namely, success or failure. The outcome of any trial is independent of the
outcome of any other trial. In each of such trials, the probability of success or failure
remains constant. Such independent trials which have only two outcomes usually

bl
referred as ‘success’ or ‘failure’ are called Bernoulli trials.
Definition 8 Trials of a random experiment are called Bernoulli trials, if they satisfy
the following conditions :
pu
(i) There should be a finite number of trials.
(ii) The trials should be independent.
be T

(iii) Each trial has exactly two outcomes : success or failure.


re
(iv) The probability of success remains the same in each trial.
o R

For example, throwing a die 50 times is a case of 50 Bernoulli trials, in which each
trial results in success (say an even number) or failure (an odd number) and the
tt E

probability of success (p) is same for all 50 throws. Obviously, the successive throws
of the die are independent experiments. If the die is fair and have six numbers 1 to 6
1 1
C

written on six faces, then p = and q = 1 – p = = probability of failure.


2 2
Example 30 Six balls are drawn successively from an urn containing 7 red and 9 black
no N

balls. Tell whether or not the trials of drawing balls are Bernoulli trials when after each
draw the ball drawn is
(i) replaced (ii) not replaced in the urn.
©

Solution
(i) The number of trials is finite. When the drawing is done with replacement, the
7
probability of success (say, red ball) is p = which is same for all six trials
16
(draws). Hence, the drawing of balls with replacements are Bernoulli trials.
PROBABILITY 573

(ii) When the drawing is done without replacement, the probability of success
7 6
(i.e., red ball) in first trial is , in 2nd trial is if the first ball drawn is red or
7 16 15
if the first ball drawn is black and so on. Clearly, the probability of success is
15
not same for all trials, hence the trials are not Bernoulli trials.

he
13.7.2 Binomial distribution
Consider the experiment of tossing a coin in which each trial results in success (say,
heads) or failure (tails). Let S and F denote respectively success and failure in each
trial. Suppose we are interested in finding the ways in which we have one success in
six trials.

is
Clearly, six different cases are there as listed below:
SFFFFF, FSFFFF, FFSFFF, FFFSFF, FFFFSF, FFFFFS.

bl
6!
Similarly, two successes and four failures can have 4! 2! combinations. It will be
pu
lengthy job to list all of these ways. Therefore, calculation of probabilities of 0, 1, 2,...,
n number of successes may be lengthy and time consuming. To avoid the lengthy
calculations and listing of all the possible cases, for the probabilities of number of
be T

successes in n-Bernoulli trials, a formula is derived. For this purpose, let us take the
re
experiment made up of three Bernoulli trials with probabilities p and q = 1 – p for
o R

success and failure respectively in each trial. The sample space of the experiment is
the set
S = {SSS, SSF, SFS, FSS, SFF, FSF, FFS, FFF}
tt E

The number of successes is a random variable X and can take values 0, 1, 2, or 3.


The probability distribution of the number of successes is as below :
C

P(X = 0) = P(no success)


= P({FFF}) = P(F) P(F) P(F)
no N

= q . q . q = q3 since the trials are independent


P(X = 1) = P(one successes)
= P({SFF, FSF, FFS})
= P({SFF}) + P({FSF}) + P({FFS})
©

= P(S) P(F) P(F) + P(F) P(S) P(F) + P(F) P(F) P(S)


= p.q.q + q.p.q + q.q.p = 3pq2
P(X = 2) = P (two successes)
= P({SSF, SFS, FSS})
= P({SSF}) + P ({SFS}) + P({FSS})
574 MATHEMATICS

= P(S) P(S) P(F) + P(S) P(F) P(S) + P(F) P(S) P(S)


= p.p.q. + p.q.p + q.p.p = 3p2q
and P(X = 3) = P(three success) = P ({SSS})
= P(S) . P(S) . P(S) = p3
Thus, the probability distribution of X is
X 0 1 2 3

he
P(X) q3 3q 2p 3qp 2 p3
Also, the binominal expansion of (q + p)3 is

q 3 + 3q 2 p + 3qp 2 + p 3

is
Note that the probabilities of 0, 1, 2 or 3 successes are respectively the 1st, 2nd,
3rd and 4th term in the expansion of (q + p)3.

bl
Also, since q + p = 1, it follows that the sum of these probabilities, as expected, is 1.
Thus, we may conclude that in an experiment of n-Bernoulli trials, the probabilities
pu
of 0, 1, 2,..., n successes can be obtained as 1st, 2nd,...,(n + 1)th terms in the expansion
of (q + p)n. To prove this assertion (result), let us find the probability of x-successes in
an experiment of n-Bernoulli trials.
be T

Clearly, in case of x successes (S), there will be (n – x) failures (F).


re
n!
o R

Now, x successes (S) and (n – x) failures (F) can be obtained in x !(n x)! ways.

In each of these ways, the probability of x successes and (n − x) failures is


tt E

= P(x successes) . P(n–x) failures is

P (S).P (S)...P(S) P (F).P (F)...P(F)


C

= 1442443 1442443 = px qn–x


x times ( n x ) times

n!
no N

Thus, the probability of x successes in n-Bernoulli trials is px qn–x


x !(n − x)!
or nCx px qn–x
Thus P(x successes) = n C x p x q n − x , x = 0, 1, 2,...,n. (q = 1 – p)
©

Clearly, P(x successes), i.e. n C x p x q n − x is the (x + 1)th term in the binomial


expansion of (q + p)n.
Thus, the probability distribution of number of successes in an experiment consisting
of n Bernoulli trials may be obtained by the binomial expansion of (q + p)n. Hence, this
PROBABILITY 575

distribution of number of successes X can be written as


X 0 1 2 ... x ... n
P (X) nC 0 q n nC 1 q n–1 p 1 nC 2 q n–2 p 2 n
C x q n–xp x n
Cn pn
The above probability distribution is known as binomial distribution with parameters
n and p, because for given values of n and p, we can find the complete probability

he
distribution.
The probability of x successes P (X = x) is also denoted by P (x) and is given by
P (x) = nCx qn–xpx, x = 0, 1,..., n. (q = 1 – p)
This P (x) is called the probability function of the binomial distribution.

is
A binomial distribution with n-Bernoulli trials and probability of success in each
trial as p, is denoted by B (n, p).

bl
Let us now take up some examples.
Example 31 If a fair coin is tossed 10 times, find the probability of
pu
(i) exactly six heads
(ii) at least six heads
be T

(iii) at most six heads


re
Solution The repeated tosses of a coin are Bernoulli trials. Let X denote the number
o R

of heads in an experiment of 10 trials.


1
Clearly, X has the binomial distribution with n = 10 and p =
tt E

2
Therefore P(X = x) = nCxqn–xpx, x = 0, 1, 2,...,n
C

1 1
Here n = 10, p ,q=1–p=
2 2
10 − x 10
no N

x
10 ⎛1⎞ ⎛ 1 ⎞ 10 ⎛ 1 ⎞
Therefore P(X = x) = Cx ⎜ ⎟ ⎜ ⎟ = Cx ⎜ ⎟
⎝2⎠ ⎝2⎠ ⎝ 2⎠

10
©

10 ⎛1⎞ 10! 1 105


Now (i) P(X = 6) = C6 ⎜ ⎟ = =
⎝2⎠ 6!× 4! 210 512
(ii) P(at least six heads) = P(X ≥ 6)
= P (X = 6) + P (X = 7) + P (X = 8) + P(X = 9) + P (X = 10)
576 MATHEMATICS

10 10 10 10 10
10 ⎛1⎞ ⎛1⎞ ⎛1⎞ ⎛1⎞ ⎛1⎞
= C6 ⎜ ⎟ + 10 C7 ⎜ ⎟ + 10 C8 ⎜ ⎟ + 10 C9 ⎜ ⎟ + 10 C10 ⎜ ⎟
⎝2⎠ ⎝2⎠ ⎝ 2⎠ ⎝2⎠ ⎝ 2⎠

10! 10! 10! 10! 10! 1 193


= 10
=
6! 4! 7! 3! 8! 2! 9! 1! 10! 2 512
(iii) P(at most six heads) = P(X ≤ 6)

he
= P (X = 0) + P (X = 1) + P (X = 2) + P (X = 3)
+ P (X = 4) + P (X = 5) + P (X = 6)
10 10 10 10
⎛1⎞ ⎛1⎞ ⎛1⎞ ⎛1⎞

is
10 10 10
= ⎜ ⎟ + C1 ⎜ ⎟ + C 2 ⎜ ⎟ + C3 ⎜ ⎟
2
⎝ ⎠ 2
⎝ ⎠ 2
⎝ ⎠ ⎝2⎠
10 10 10

bl
10 ⎛1⎞ ⎛1⎞ ⎛1⎞
+ C 4 ⎜ ⎟ + 10 C5 ⎜ ⎟ + 10 C6 ⎜ ⎟
⎝2⎠ ⎝2⎠ ⎝ 2⎠
848 53
pu = =
1024 64
Example 32 Ten eggs are drawn successively with replacement from a lot containing
be T

10% defective eggs. Find the probability that there is at least one defective egg.
re
Solution Let X denote the number of defective eggs in the 10 eggs drawn. Since the
o R

drawing is done with replacement, the trials are Bernoulli trials. Clearly, X has the
10 1
tt E

binomial distribution with n = 10 and p .


100 10
9
q = 1− p =
C

Therefore
10
Now P(at least one defective egg) = P(X ≥ 1) = 1 – P (X = 0)
no N

10
10 ⎛ 9⎞ 910
= 1 − C0 ⎜ ⎟ = 1−
⎝ 10 ⎠ 1010
©

EXERCISE 13.5
1. A die is thrown 6 times. If ‘getting an odd number’ is a success, what is the
probability of
(i) 5 successes? (ii) at least 5 successes?
(iii) at most 5 successes?
PROBABILITY 577

2. A pair of dice is thrown 4 times. If getting a doublet is considered a success, find


the probability of two successes.
3. There are 5% defective items in a large bulk of items. What is the probability
that a sample of 10 items will include not more than one defective item?
4. Five cards are drawn successively with replacement from a well-shuffled deck
of 52 cards. What is the probability that

he
(i) all the five cards are spades?
(ii) only 3 cards are spades?
(iii) none is a spade?

is
5. The probability that a bulb produced by a factory will fuse after 150 days of use
is 0.05. Find the probability that out of 5 such bulbs
(i) none

bl
(ii) not more than one
(iii) more than one
pu
(iv) at least one
will fuse after 150 days of use.
be T

6. A bag consists of 10 balls each marked with one of the digits 0 to 9. If four balls
are drawn successively with replacement from the bag, what is the probability
re
that none is marked with the digit 0?
o R

7. In an examination, 20 questions of true-false type are asked. Suppose a student


tosses a fair coin to determine his answer to each question. If the coin falls
tt E

heads, he answers 'true'; if it falls tails, he answers 'false'. Find the probability
that he answers at least 12 questions correctly.
C

1
8. Suppose X has a binomial distribution B 6, . Show that X = 3 is the most
2
likely outcome.
no N

(Hint : P(X = 3) is the maximum among all P(xi), xi = 0,1,2,3,4,5,6)


9. On a multiple choice examination with three possible answers for each of the
five questions, what is the probability that a candidate would get four or more
©

correct answers just by guessing ?


10. A person buys a lottery ticket in 50 lotteries, in each of which his chance of
1
winning a prize is . What is the probability that he will win a prize
100
(a) at least once (b) exactly once (c) at least twice?
578 MATHEMATICS

11. Find the probability of getting 5 exactly twice in 7 throws of a die.


12. Find the probability of throwing at most 2 sixes in 6 throws of a single die.
13. It is known that 10% of certain articles manufactured are defective. What is the
probability that in a random sample of 12 such articles, 9 are defective?
In each of the following, choose the correct answer:
14. In a box containing 100 bulbs, 10 are defective. The probability that out of a

he
sample of 5 bulbs, none is defective is
5 5
⎛1⎞ ⎛9⎞ 9
(A) 10–1 (B) ⎜ ⎟ (C) ⎜ ⎟ (D)
⎝2⎠ ⎝ 10 ⎠ 10

is
1.
15. The probability that a student is not a swimmer is Then the probability that
5

bl
out of five students, four are swimmers is
4 4
5 ⎛4⎞ 1 ⎛4⎞ 1
pu
(A) C4 ⎜ ⎟
⎝5⎠ 5
(B) ⎜ ⎟
⎝5⎠ 5
4
1 ⎛4⎞
be T

5
(C) C1 ⎜ ⎟ (D) None of these
5 ⎝5⎠
re
o R

Miscellaneous Examples
Example 33 Coloured balls are distributed in four boxes as shown in the following
tt E

table:

Box Colour
C

Black White Red Blue


no N

I 3 4 5 6
II 2 2 2 2
III 1 2 3 1
©

IV 4 3 1 5

A box is selected at random and then a ball is randomly drawn from the selected
box. The colour of the ball is black, what is the probability that ball drawn is from the
box III?
PROBABILITY 579

Solution Let A, E1, E2, E3 and E4 be the events as defined below :


A : a black ball is selected E1 : box I is selected
E2 : box II is selected E3 : box III is selected
E4 : box IV is selected
Since the boxes are chosen at random,

he
1
Therefore P(E1) = P(E2) = P(E3) = P(E4) =
4
3 2 1 4
Also P(A|E1) = , P(A|E2) = , P(A|E3) = and P(A|E4) =
18 8 7 13

is
P(box III is selected, given that the drawn ball is black) = P(E3|A). By Bayes'
theorem,
P(E3 ) P(A|E 3 )

bl
P(E3|A) =
P(E1 )P(A|E1 ) P(E 2 ) P(A|E 2 ) + P(E 3 ) P (A|E 3 ) P(E 4 ) P(A|E 4 )
1 1
pu ×
4 7
= 1 3 1 1 1 1 1 4 = 0.165
× + × + × + ×
be T

4 18 4 4 4 7 4 13
1
re
Example 34 Find the mean of the Binomial distribution B 4, .
o R

3
1
Solution Let X be the random variable whose probability distribution is B 4, .
tt E

3
1 1 2
Here n = 4, p = and q = 1 − =
3 3 3
C

4− x x
4 ⎛2⎞ ⎛1⎞
We know that P(X = x) = C x ⎜ ⎟ ⎜ ⎟ , x = 0, 1, 2, 3, 4.
⎝3⎠ ⎝ 3⎠
no N

i.e. the distribution of X is


xi P(xi) xi P(xi)
4
©

4 2
0 C0 0
3
3 3
4 2 1 4 2 1
1 C1 C1
3 3 3 3
580 MATHEMATICS

2 2 2 2
2 1 4 2 1
2
4
C2 2 C2
3 3 3 3

3 3
2 1 4 2 1
3
4
C3 3 C3
3 3 3 3

he
4 4
1 4 1
4
4
C4 4 C4
3 3

is
4
Now Mean (μ) = ∑ xi p( xi )

bl
i =1

3 2 2 3 4
⎛ 2⎞ ⎛1⎞ ⎛ 2⎞ ⎛1⎞ 4 2 1 1
= 0 + 4 C1 ⎜ ⎟ ⎜ ⎟ + 2 ⋅ 4 C2 ⎜ ⎟ ⎜ ⎟ + 3 C3 4 4 C4
pu ⎝ 3 ⎠ ⎝ 3⎠ ⎝ 3⎠ ⎝ 3⎠ 3 3 3

23 22 2 1
be T

= 4× 4
+ 2 × 6 × 4
+ 3 × 4 × 4 + 4 × 1× 4
3 3 3 3
re
32 + 48 + 24 + 4 108 4
o R

= = =
34 81 3
3
tt E

Example 35 The probability of a shooter hitting a target is . How many minimum


4
number of times must he/she fire so that the probability of hitting the target at least
C

once is more than 0.99?


Solution Let the shooter fire n times. Obviously, n fires are n Bernoulli trials. In each
no N

3
trial, p = probability of hitting the target = and q = probability of not hitting the
4
n− x x
1 ⎛1⎞ ⎛3⎞ n 3
x
©

n− x x
. Then P(X = x) = C x q p = C x ⎜ ⎟ = C
n n
target = ⎜ ⎟ x n .
4 ⎝4⎠ ⎝4⎠ 4
Now, given that,
P(hitting the target at least once) > 0.99
i.e. P(x ≥ 1) > 0.99
PROBABILITY 581

Therefore, 1 – P (x = 0) > 0.99


1
or 1 − n C0 > 0.99
4n
1 1
or
n
C0 0.01 i.e. < 0.01
4n 4n

he
1
or 4n > = 100 ... (1)
0.01
The minimum value of n to satisfy the inequality (1) is 4.
Thus, the shooter must fire 4 times.

is
Example 36 A and B throw a die alternatively till one of them gets a ‘6’ and wins the
game. Find their respective probabilities of winning, if A starts first.

bl
Solution Let S denote the success (getting a ‘6’) and F denote the failure (not getting
a ‘6’).
1 5
Thus,
pu P(S) =
6
, P(F) =
6
1
be T

P(A wins in the first throw) = P(S) =


6
re
A gets the third throw, when the first throw by A and second throw by B result into
o R

failures.
5 5 1
Therefore, P(A wins in the 3rd throw) = P(FFS) = P(F) P(F) P(S) =
tt E

6 6 6
2
⎛5⎞ 1
=⎜ ⎟ ×
C

⎝6⎠ 6
4
5 1
no N

P(A wins in the 5th throw) = P (FFFFS) and so on.


6 6
2 4
1 5 1 5 1
Hence, P(A wins) = ...
6 6 6 6 6
©

1
6 6
= =
25 11
1−
36
582 MATHEMATICS

6 5
P(B wins) = 1 – P (A wins) = 1
11 11
Remark If a + ar + ar2 + ... + arn–1 + ..., where | r | < 1, then sum of this infinite G.P.
a .
is given by (Refer A.1.3 of Class XI Text book).
1− r

he
Example 37 If a machine is correctly set up, it produces 90% acceptable items. If it is
incorrectly set up, it produces only 40% acceptable items. Past experience shows that
80% of the set ups are correctly done. If after a certain set up, the machine produces
2 acceptable items, find the probability that the machine is correctly setup.

is
Solution Let A be the event that the machine produces 2 acceptable items.
Also let B1 represent the event of correct set up and B2 represent the event of

bl
incorrect setup.
Now P(B1) = 0.8, P(B2) = 0.2
P(A|B1) = 0.9 × 0.9 and P(A|B2) = 0.4 × 0.4
pu P (B1 ) P (A|B 1 )
Therefore P(B1|A) =
P (B1 ) P (A|B 1 ) + P (B2 ) P (A|B 2 )
be T

0.8 × 0.9 × 0.9 648


re
= = = 0.95
o R

0.8 × 0.9 × 0.9 + 0.2 × 0.4 × 0.4 680

Miscellaneous Exercise on Chapter 13


tt E

1. A and B are two events such that P (A) ≠ 0. Find P(B|A), if


(i) A is a subset of B (ii) A ∩ B = φ
C

2. A couple has two children,


(i) Find the probability that both children are males, if it is known that at least
no N

one of the children is male.


(ii) Find the probability that both children are females, if it is known that the
elder child is a female.
©

3. Suppose that 5% of men and 0.25% of women have grey hair. A grey haired
person is selected at random. What is the probability of this person being male?
Assume that there are equal number of males and females.
4. Suppose that 90% of people are right-handed. What is the probability that
at most 6 of a random sample of 10 people are right-handed?
PROBABILITY 583

5. An urn contains 25 balls of which 10 balls bear a mark 'X' and the remaining 15
bear a mark 'Y'. A ball is drawn at random from the urn, its mark is noted down
and it is replaced. If 6 balls are drawn in this way, find the probability that
(i) all will bear 'X' mark.
(ii) not more than 2 will bear 'Y' mark.
(iii) at least one ball will bear 'Y' mark.

he
(iv) the number of balls with 'X' mark and 'Y' mark will be equal.

6. In a hurdle race, a player has to cross 10 hurdles. The probability that he will
5

is
clear each hurdle is . What is the probability that he will knock down fewer
6
than 2 hurdles?

bl
7. A die is thrown again and again until three sixes are obtained. Find the probabil-
ity of obtaining the third six in the sixth throw of the die.
8. If a leap year is selected at random, what is the chance that it will contain 53
pu
tuesdays?
9. An experiment succeeds twice as often as it fails. Find the probability that in the
be T

next six trials, there will be atleast 4 successes.


10. How many times must a man toss a fair coin so that the probability of having
re
o R

at least one head is more than 90%?


11. In a game, a man wins a rupee for a six and loses a rupee for any other number
when a fair die is thrown. The man decided to throw a die thrice but to quit as
tt E

and when he gets a six. Find the expected value of the amount he wins / loses.
12. Suppose we have four boxes A,B,C and D containing coloured marbles as given
C

below:
Box Marble colour
no N

Red White Black


A 1 6 3
B 6 2 2
©

C 8 1 1
D 0 6 4
One of the boxes has been selected at random and a single marble is drawn from
it. If the marble is red, what is the probability that it was drawn from box A?, box B?,
box C?
584 MATHEMATICS

13. Assume that the chances of a patient having a heart attack is 40%. It is also
assumed that a meditation and yoga course reduce the risk of heart attack by
30% and prescription of certain drug reduces its chances by 25%. At a time a
patient can choose any one of the two options with equal probabilities. It is given
that after going through one of the two options the patient selected at random
suffers a heart attack. Find the probability that the patient followed a course of

he
meditation and yoga?
14. If each element of a second order determinant is either zero or one, what is the
probability that the value of the determinant is positive? (Assume that the indi-
vidual entries of the determinant are chosen independently, each value being

is
1
assumed with probability ).
2

bl
15. An electronic assembly consists of two subsystems, say, A and B. From previ-
ous testing procedures, the following probabilities are assumed to be known:
P(A fails) = 0.2
pu P(B fails alone) = 0.15
P(A and B fail) = 0.15
be T

Evaluate the following probabilities


re
(i) P(A fails|B has failed) (ii) P(A fails alone)
o R

16. Bag I contains 3 red and 4 black balls and Bag II contains 4 red and 5 black balls.
One ball is transferred from Bag I to Bag II and then a ball is drawn from Bag II.
tt E

The ball so drawn is found to be red in colour. Find the probability that the
transferred ball is black.
C

Choose the correct answer in each of the following:


17. If A and B are two events such that P(A) ≠ 0 and P(B | A) = 1, then
no N

(A) A ⊂ B (B) B ⊂ A (C) B = φ (D) A = φ


18. If P(A|B) > P(A), then which of the following is correct :
(A) P(B|A) < P(B) (B) P(A ∩ B) < P(A) . P(B)
©

(C) P(B|A) > P(B) (D) P(B|A) = P(B)


19. If A and B are any two events such that P(A) + P(B) – P(A and B) = P(A), then
(A) P(B|A) = 1 (B) P(A|B) = 1
(C) P(B|A) = 0 (D) P(A|B) = 0
PROBABILITY 585

Summary
The salient features of the chapter are –
 The conditional probability of an event E, given the occurrence of the event F
P (E ∩ F)
is given by P (E | F) = , P(F) ≠ 0
P(F)

he
 0 ≤ P (E|F) ≤ 1, P (E′|F) = 1 – P (E|F)
P ((E ∪ F)|G) = P (E|G) + P (F|G) – P ((E ∩ F)|G)
 P (E ∩ F) = P (E) P (F|E), P (E) ≠ 0

is
P (E ∩ F) = P (F) P (E|F), P (F) ≠ 0
 If E and F are independent, then

bl
P (E ∩ F) = P (E) P (F)
P (E|F) = P (E), P (F) ≠ 0
P (F|E) = P (F), P(E) ≠ 0

pu
Theorem of total probability
Let {E1, E2, ...,En) be a partition of a sample space and suppose that each of
be T

E1, E2, ..., En has nonzero probability. Let A be any event associated with S,
then
re
o R

P(A) = P(E1) P (A|E1) + P (E2) P (A|E2) + ... + P (En) P(A|En)


 Bayes' theorem If E1, E2, ..., En are events which constitute a partition of
sample space S, i.e. E1, E2, ..., En are pairwise disjoint and E1 4 E2 4 ... 4 En = S
tt E

and A be any event with nonzero probability, then


P(E i ) P (A|E i )
C

P(E i | A) n
P(E j ) P (A|E j )
j 1
no N

 A random variable is a real valued function whose domain is the sample


space of a random experiment.
 The probability distribution of a random variable X is the system of numbers
©

X : x1 x2 ... xn
P(X) : p1 p2 ... pn
n
where, pi > 0, ∑ pi = 1, i = 1, 2,..., n
i =1
586 MATHEMATICS

 Let X be a random variable whose possible values x1, x2, x3, ..., xn occur with
probabilities p1, p2, p3, ... pn respectively. The mean of X, denoted by μ, is
n

the number xi pi .
i 1
The mean of a random variable X is also called the expectation of X, denoted
by E (X).

he
 Let X be a random variable whose possible values x1, x2, ..., xn occur with
probabilities p(x1), p(x2), ..., p(xn) respectively.
Let μ = E(X) be the mean of X. The variance of X, denoted by Var (X) or

is
n
2
σx2, is defined as x Var (X) = ( xi μ) 2 p ( xi )
i 1

bl
or equivalently σ = E (X – μ)
x
2 2

The non-negative number


pu n

x Va r (X) = ( xi μ) 2 p ( xi )
i 1
be T

is called the standard deviation of the random variable X.


 Var (X) = E (X2) – [E(X)]2
re
o R

 Trials of a random experiment are called Bernoulli trials, if they satisfy the
following conditions :
(i) There should be a finite number of trials.
tt E

(ii) The trials should be independent.


(iii) Each trial has exactly two outcomes : success or failure.
C

(iv) The probability of success remains the same in each trial.


For Binomial distribution B (n, p), P (X = x) = nCx q n–x px, x = 0, 1,..., n
no N

(q = 1 – p)

Historical Note
©

The earliest indication on measurement of chances in game of dice appeared


in 1477 in a commentary on Dante's Divine Comedy. A treatise on gambling
named liber de Ludo Alcae, by Geronimo Carden (1501-1576) was published
posthumously in 1663. In this treatise, he gives the number of favourable cases
for each event when two dice are thrown.
PROBABILITY 587

Galileo (1564-1642) gave casual remarks concerning the correct evaluation


of chance in a game of three dice. Galileo analysed that when three dice are
thrown, the sum of the number that appear is more likely to be 10 than the sum 9,
because the number of cases favourable to 10 are more than the number of
cases for the appearance of number 9.
Apart from these early contributions, it is generally acknowledged that the

he
true origin of the science of probability lies in the correspondence between two
great men of the seventeenth century, Pascal (1623-1662) and Pierre de Fermat
(1601-1665). A French gambler, Chevalier de Metre asked Pascal to explain
some seeming contradiction between his theoretical reasoning and the

is
observation gathered from gambling. In a series of letters written around 1654,
Pascal and Fermat laid the first foundation of science of probability. Pascal solved
the problem in algebraic manner while Fermat used the method of combinations.

bl
Great Dutch Scientist, Huygens (1629-1695), became acquainted with the
content of the correspondence between Pascal and Fermat and published a first
book on probability, "De Ratiociniis in Ludo Aleae" containing solution of many
pu
interesting rather than difficult problems on probability in games of chances.
The next great work on probability theory is by Jacob Bernoulli (1654-1705),
be T

in the form of a great book, "Ars Conjectendi" published posthumously in 1713


by his nephew, Nicholes Bernoulli. To him is due the discovery of one of the most
re
important probability distribution known as Binomial distribution. The next
o R

remarkable work on probability lies in 1993. A. N. Kolmogorov (1903-1987) is


credited with the axiomatic theory of probability. His book, ‘Foundations of
tt E

probability’ published in 1933, introduces probability as a set function and is


considered a ‘classic!’.
C

—™—
no N
©

You might also like